cloverleaf threadripper

AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) and AMD Radeon RX 5700 8GB on Ubuntu 23.04 via the Phoronix Test Suite.

a, b, c (the three runs list the same configuration):

  Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)
  Motherboard: Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 128GB
  Disk: Samsung SSD 970 EVO Plus 500GB
  Graphics: AMD Radeon RX 5700 8GB (1750/875MHz)
  Audio: AMD Navi 10 HDMI Audio
  Monitor: DELL P2415Q
  Network: Intel I211 + Intel Wi-Fi 6 AX200

  OS: Ubuntu 23.04
  Kernel: 6.2.0-34-generic (x86_64)
  Desktop: GNOME Shell 44.3
  Display Server: X Server + Wayland
  OpenGL: 4.6 Mesa 23.0.2 (LLVM 15.0.7 DRM 3.49)
  Compiler: GCC 12.3.0
  File-System: ext4
  Screen Resolution: 3840x2160

HeFFTe - Highly Efficient FFT for Exascale 2.4
GFLOP/s > Higher Is Better

Test  Backend  Precision    X Y Z        a        b        c
c2c   FFTW     float          128    55.82    55.06    59.61
c2c   FFTW     float          256    35.25    35.19    35.01
c2c   FFTW     float          512    23.55    23.53    23.55
r2c   FFTW     float          128    92.24    91.53    91.99
r2c   FFTW     float          256    85.10    83.36    83.14
r2c   FFTW     float          512    44.17    44.20    44.06
c2c   FFTW     double         128    21.00    19.86    23.96
c2c   FFTW     double         256    12.87    12.64    12.66
c2c   FFTW     double         512    12.35    12.34    12.35
c2c   FFTW     float         1024    27.13    27.12    27.15
c2c   Stock    float          128    49.92    49.75    47.11
c2c   Stock    float          256    37.83    37.97    38.30
c2c   Stock    float          512    23.64    23.63    23.63
r2c   FFTW     double         128    26.00    27.28    17.37
r2c   FFTW     double         256    28.35    27.67    28.19
r2c   FFTW     double         512    22.28    22.30    22.32
r2c   FFTW     float         1024    48.48    48.45    48.51
r2c   Stock    float          128    82.37    79.86    80.78
r2c   Stock    float          256    95.75    97.60    93.80
r2c   Stock    float          512    48.16    47.93    47.99
c2c   FFTW     double        1024    (no result reported)
c2c   Stock    double         128    29.11    28.58    28.33
c2c   Stock    double         256    13.06    12.85    12.69
c2c   Stock    double         512    12.40    12.40    12.40
c2c   Stock    float         1024    27.12    27.12    27.13
r2c   FFTW     double        1024    24.91    24.91    24.93
r2c   Stock    double         128    48.30    48.58    48.18
r2c   Stock    double         256    34.41    34.38    34.58
r2c   Stock    double         512    24.09    24.10    24.09
r2c   Stock    float         1024    52.27    52.26    52.31
c2c   Stock    double        1024    (no result reported)
r2c   Stock    double        1024    26.97    26.98    27.01
c2c   FFTW     float-long     128    54.95    55.63    59.47
c2c   FFTW     float-long     256    34.69    34.79    34.77
c2c   FFTW     float-long     512    23.61    23.61    23.57
r2c   FFTW     float-long     128    91.31    91.23    89.81
r2c   FFTW     float-long     256    82.39    82.70    81.13
r2c   FFTW     float-long     512    44.24    44.12    44.28
c2c   FFTW     double-long    128    18.89    19.24    27.27
c2c   FFTW     double-long    256    12.75    12.73    12.64
c2c   FFTW     double-long    512    12.34    12.35    12.34
c2c   FFTW     float-long    1024    27.18    27.18    27.18
c2c   Stock    float-long     128    50.17    50.54    50.73
c2c   Stock    float-long     256    37.75    37.86    37.81
c2c   Stock    float-long     512    23.71    23.66    23.69
r2c   FFTW     double-long    128    25.04    28.27    18.38
r2c   FFTW     double-long    256    28.02    28.08    28.09
r2c   FFTW     double-long    512    22.31    22.32    22.35
r2c   FFTW     float-long    1024    48.57    48.56    48.57
r2c   Stock    float-long     128    81.12    80.87    80.86
r2c   Stock    float-long     256    95.56    96.27    95.47
r2c   Stock    float-long     512    48.30    48.09    48.28
c2c   FFTW     double-long   1024    (no result reported)
c2c   Stock    double-long    128    28.87    28.70    28.73
c2c   Stock    double-long    256    12.83    12.82    12.67
c2c   Stock    double-long    512    12.41    12.40    12.42
c2c   Stock    float-long    1024    27.18    27.20    27.21
r2c   FFTW     double-long   1024    24.92    24.92    24.95
r2c   Stock    double-long    128    47.59    48.51    48.14
r2c   Stock    double-long    256    34.17    34.28    34.48
r2c   Stock    double-long    512    24.12    24.15    24.12
r2c   Stock    float-long    1024    52.35    52.38    52.41
c2c   Stock    double-long   1024    (no result reported)
r2c   Stock    double-long   1024    26.99    26.98    27.01

CloverLeaf 1.3
Seconds < Lower Is Better

Input                      a         b         c
clover_bm              17.54     16.13     17.00
clover_bm16          1253.12   1252.53   1252.61
clover_bm64_short     144.51    144.65    144.69
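
Most results above agree closely across runs a, b and c, but a few of the 128-size HeFFTe tests differ noticeably. As a rough aid for spotting such cases, the sketch below computes the relative spread between the fastest and slowest run for a handful of rows copied from the results above. The helper name `spread_percent` and the choice of (max - min) / min as the measure are assumptions made for illustration; they are not part of the Phoronix Test Suite output.

```python
def spread_percent(values):
    """Return the gap between the largest and smallest run, as a percentage of the smallest."""
    lo, hi = min(values), max(values)
    return (hi - lo) / lo * 100.0


# Run values (a, b, c) copied from the results above; the row selection is illustrative.
results = {
    "HeFFTe r2c FFTW double 128 (GFLOP/s)": [26.00, 27.28, 17.37],
    "HeFFTe c2c FFTW double-long 128 (GFLOP/s)": [18.89, 19.24, 27.27],
    "HeFFTe c2c Stock double 512 (GFLOP/s)": [12.40, 12.40, 12.40],
    "CloverLeaf clover_bm (Seconds)": [17.54, 16.13, 17.00],
}

for name, runs in results.items():
    print(f"{name}: {spread_percent(runs):.1f}% spread across runs")
```

The measure ignores whether higher or lower is better, so it only indicates how far apart the three runs are, not which run performed best.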