heffte 7950x

AMD Ryzen 9 7950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 3060 Ti 8GB on Ubuntu 23.10 via the Phoronix Test Suite.

a, b, c (the listed configuration is identical for all three runs):

  Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)
  Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
  Chipset: AMD Device 14d8
  Memory: 32GB
  Disk: Western Digital WD_BLACK SN850X 1000GB
  Graphics: NVIDIA GeForce RTX 3060 Ti 8GB
  Audio: NVIDIA GA104 HD Audio
  Monitor: ASUS MG28U
  Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

  OS: Ubuntu 23.10
  Kernel: 6.6.0-060600rc5-generic (x86_64)
  Desktop: GNOME Shell 45.0
  Display Server: X Server 1.21.1.7
  Display Driver: NVIDIA 545.23.06
  OpenGL: 4.6.0
  OpenCL: OpenCL 3.0 CUDA 12.3.68
  Compiler: GCC 13.2.0
  File-System: ext4
  Screen Resolution: 3840x2160

HeFFTe - Highly Efficient FFT for Exascale 2.4
GFLOP/s > Higher Is Better

  Test  Backend  Precision    X Y Z       a       b       c
  c2c   FFTW     double-long    512   11.30   11.31   11.32
  c2c   FFTW     double         512   11.32   11.30   11.34
  c2c   Stock    double         512   11.55   11.56   11.57
  c2c   Stock    double-long    512   11.57   11.57   11.57
  r2c   FFTW     double-long    512   20.59   20.49   20.52
  r2c   FFTW     double         512   20.59   20.63   20.56
  c2c   FFTW     float-long     512   22.31   22.27   22.28
  c2c   FFTW     float          512   22.31   22.33   22.29
  r2c   Stock    double-long    512   22.65   22.67   22.65
  r2c   Stock    double         512   22.65   22.64   22.65
  c2c   Stock    float-long     512   22.47   22.45   22.47
  c2c   Stock    float          512   22.47   22.48   22.47
  r2c   FFTW     float-long     512   40.69   40.70   40.68
  r2c   FFTW     float          512   40.74   40.74   40.73
  r2c   Stock    float-long     512   43.95   43.93   43.94
  r2c   Stock    float          512   44.00   44.03   43.97
  c2c   FFTW     double-long    256   10.13   10.14   10.18
  c2c   FFTW     double         256   10.21   10.12   10.16
  c2c   Stock    double         256   10.47   10.47   10.46
  c2c   Stock    double-long    256   10.46   10.46   10.46
  c2c   FFTW     float-long     128   80.70   77.99   77.64
  r2c   FFTW     double         128   44.59   44.33   44.01
  c2c   Stock    double-long    128   17.51   17.30   17.32
  r2c   FFTW     double-long    128   45.23   46.52   45.56
  c2c   Stock    float-long     128   60.58   59.87   60.87
  c2c   Stock    float          128   62.42   61.76   61.26
  r2c   FFTW     double-long    256   19.06   19.13   19.14
  r2c   FFTW     double         256   19.14   19.09   19.15
  r2c   FFTW     float-long     128  137.40  133.32  133.52
  c2c   FFTW     float          256   20.88   20.89   20.83
  r2c   Stock    float          128  115.31  120.99  120.15
  c2c   FFTW     float-long     256   20.86   20.89   20.85
  r2c   Stock    double         256   21.21   21.17   21.15
  r2c   Stock    double-long    256   21.17   21.18   21.20
  c2c   Stock    float-long     256   21.12   21.13   21.10
  c2c   Stock    float          256   21.14   21.12   21.16
  c2c   FFTW     float          128   77.42   76.81   77.22
  r2c   FFTW     float          128  136.40  135.92  135.26
  c2c   FFTW     double-long    128   17.21   16.84   17.07
  c2c   FFTW     double         128   16.97   16.50   17.09
  r2c   Stock    double-long    128   47.41   49.84   48.89
  r2c   Stock    double         128   49.71   49.33   49.18
  r2c   FFTW     float          256   42.09   41.71   41.82
  r2c   FFTW     float-long     256   42.03   41.87   41.65
  r2c   Stock    float-long     256   46.18   46.11   46.80
  r2c   Stock    float          256   46.07   46.16   46.26
  c2c   Stock    double         128   17.52   18.02   17.67
  r2c   Stock    float-long     128  118.58  116.58  120.07
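
Because runs a, b and c use the same configuration, one way to read the results is to look at the mean of the three runs and how far they spread apart. The following is a minimal sketch, not part of the Phoronix Test Suite output: the `results` dictionary and the `spread_percent` helper are hypothetical names introduced here, and the three entries are copied by hand from the results listed above.

```python
from statistics import mean

# GFLOP/s for runs a, b, c, copied from the listing above.
# Only three of the 48 results are included, for illustration.
results = {
    "c2c FFTW double-long 512": (11.30, 11.31, 11.32),
    "c2c FFTW float-long 128": (80.70, 77.99, 77.64),
    "r2c Stock float 128": (115.31, 120.99, 120.15),
}


def spread_percent(values):
    """Return (max - min) expressed as a percentage of the mean."""
    return 100.0 * (max(values) - min(values)) / mean(values)


for name, runs in results.items():
    print(f"{name}: mean {mean(runs):.2f} GFLOP/s, "
          f"spread {spread_percent(runs):.2f}%")
```

The same helper can be applied to any other row by adding its three values to `results`.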