heffte 7950x

AMD Ryzen 9 7950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 3060 Ti 8GB on Ubuntu 23.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310274-PTS-HEFFTE7999.

heffte 7950x system details

Runs: a, b, c

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)
Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)
Chipset: AMD Device 14d8
Memory: 32GB
Disk: Western Digital WD_BLACK SN850X 1000GB
Graphics: NVIDIA GeForce RTX 3060 Ti 8GB
Audio: NVIDIA GA104 HD Audio
Monitor: ASUS MG28U
Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Ubuntu 23.10
Kernel: 6.6.0-060600rc5-generic (x86_64)
Desktop: GNOME Shell 45.0
Display Server: X Server 1.21.1.7
Display Driver: NVIDIA 545.23.06
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.3.68
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xa601203

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Mitigation of safe RET no microcode
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

heffte 7950x results overview (heffte: Test - Backend - Precision - X Y Z; values in GFLOP/s)

Test                             a         b         c
c2c - FFTW - float - 128         77.4191   76.8080   77.2216
c2c - FFTW - float - 256         20.8780   20.8919   20.8252
c2c - FFTW - float - 512         22.3129   22.3309   22.2898
r2c - FFTW - float - 128         136.396   135.919   135.262
r2c - FFTW - float - 256         42.0885   41.7135   41.8229
r2c - FFTW - float - 512         40.7358   40.7427   40.7257
c2c - FFTW - double - 128        16.9742   16.4968   17.0937
c2c - FFTW - double - 256        10.2063   10.1193   10.1553
c2c - FFTW - double - 512        11.3175   11.3006   11.3353
c2c - Stock - float - 128        62.4184   61.7571   61.2619
c2c - Stock - float - 256        21.1409   21.1237   21.1643
c2c - Stock - float - 512        22.4705   22.4766   22.4658
r2c - FFTW - double - 128        44.5940   44.3283   44.0131
r2c - FFTW - double - 256        19.1375   19.0872   19.1488
r2c - FFTW - double - 512        20.5913   20.6314   20.5596
r2c - Stock - float - 128        115.306   120.987   120.153
r2c - Stock - float - 256        46.0696   46.1572   46.2602
r2c - Stock - float - 512        44.0010   44.0299   43.9660
c2c - Stock - double - 128       17.5246   18.0207   17.6702
c2c - Stock - double - 256       10.4724   10.4680   10.4641
c2c - Stock - double - 512       11.5504   11.5585   11.5672
r2c - Stock - double - 128       49.7110   49.3308   49.1807
r2c - Stock - double - 256       21.2099   21.1655   21.1486
r2c - Stock - double - 512       22.6468   22.6361   22.6527
c2c - FFTW - float-long - 128    80.7018   77.9866   77.6373
c2c - FFTW - float-long - 256    20.8630   20.8895   20.8501
c2c - FFTW - float-long - 512    22.3077   22.2746   22.2755
r2c - FFTW - float-long - 128    137.395   133.317   133.515
r2c - FFTW - float-long - 256    42.0270   41.8702   41.6494
r2c - FFTW - float-long - 512    40.6924   40.6980   40.6778
c2c - FFTW - double-long - 128   17.2073   16.8374   17.0719
c2c - FFTW - double-long - 256   10.1326   10.1385   10.1830
c2c - FFTW - double-long - 512   11.3033   11.3054   11.3213
c2c - Stock - float-long - 128   60.5802   59.8700   60.8686
c2c - Stock - float-long - 256   21.1193   21.1267   21.1028
c2c - Stock - float-long - 512   22.4717   22.4482   22.4678
r2c - FFTW - double-long - 128   45.2281   46.5240   45.5619
r2c - FFTW - double-long - 256   19.0626   19.1298   19.1423
r2c - FFTW - double-long - 512   20.5876   20.4946   20.5184
r2c - Stock - float-long - 128   118.581   116.576   120.073
r2c - Stock - float-long - 256   46.1778   46.1111   46.7987
r2c - Stock - float-long - 512   43.9465   43.9320   43.9385
c2c - Stock - double-long - 128  17.5118   17.3012   17.3218
c2c - Stock - double-long - 256  10.4604   10.4640   10.4565
c2c - Stock - double-long - 512  11.5650   11.5733   11.5655
r2c - Stock - double-long - 128  47.4095   49.8414   48.8921
r2c - Stock - double-long - 256  21.1703   21.1834   21.2032
r2c - Stock - double-long - 512  22.6477   22.6688   22.6505
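
One way to read the results overview is to compare runs a, b, and c for the same test, since all three share the system configuration listed earlier. The following is a minimal sketch, not part of the Phoronix Test Suite output: it takes three rows copied from the overview and computes the run-to-run spread as (max - min) / mean. The `RESULTS` dictionary and the `spread_percent` and `report` helpers are hypothetical names introduced only for this illustration.

```python
from statistics import mean

# GFLOP/s for runs a, b, c, copied from the results overview (small subset).
RESULTS = {
    "r2c - Stock - float - 128": [115.306, 120.987, 120.153],
    "c2c - FFTW - float - 128": [77.4191, 76.8080, 77.2216],
    "c2c - Stock - double-long - 512": [11.5650, 11.5733, 11.5655],
}


def spread_percent(values):
    """Return (max - min) / mean of the values, as a percentage."""
    return 100.0 * (max(values) - min(values)) / mean(values)


def report(results):
    """Map each test name to its run-to-run spread, rounded to 2 decimals."""
    return {name: round(spread_percent(v), 2) for name, v in results.items()}


if __name__ == "__main__":
    for name, pct in report(RESULTS).items():
        print(f"{name}: {pct:.2f}% spread across runs a/b/c")
```

In this subset, the two 128-size tests vary more between runs than the 512-size test, which is in line with the larger SE and N values reported for many of the 128-size results.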

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 77.42 (SE +/- 1.05, N = 3)
b: 76.81 (SE +/- 1.96, N = 15)
c: 77.22 (SE +/- 2.22, N = 12)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 20.88 (SE +/- 0.01, N = 3)
b: 20.89 (SE +/- 0.01, N = 3)
c: 20.83 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.31 (SE +/- 0.02, N = 3)
b: 22.33 (SE +/- 0.01, N = 3)
c: 22.29 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 136.40 (SE +/- 0.97, N = 12)
b: 135.92 (SE +/- 1.41, N = 3)
c: 135.26 (SE +/- 1.19, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 42.09 (SE +/- 0.13, N = 3)
b: 41.71 (SE +/- 0.08, N = 3)
c: 41.82 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 40.74 (SE +/- 0.01, N = 3)
b: 40.74 (SE +/- 0.01, N = 3)
c: 40.73 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 16.97 (SE +/- 0.15, N = 15)
b: 16.50 (SE +/- 0.02, N = 3)
c: 17.09 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 10.21 (SE +/- 0.04, N = 3)
b: 10.12 (SE +/- 0.02, N = 3)
c: 10.16 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 11.32 (SE +/- 0.02, N = 3)
b: 11.30 (SE +/- 0.02, N = 3)
c: 11.34 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 62.42 (SE +/- 0.65, N = 15)
b: 61.76 (SE +/- 0.74, N = 3)
c: 61.26 (SE +/- 0.56, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 21.14 (SE +/- 0.01, N = 3)
b: 21.12 (SE +/- 0.00, N = 3)
c: 21.16 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.47 (SE +/- 0.02, N = 3)
b: 22.48 (SE +/- 0.02, N = 3)
c: 22.47 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 44.59 (SE +/- 0.98, N = 15)
b: 44.33 (SE +/- 0.46, N = 15)
c: 44.01 (SE +/- 1.16, N = 12)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 19.14 (SE +/- 0.02, N = 3)
b: 19.09 (SE +/- 0.01, N = 3)
c: 19.15 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 20.59 (SE +/- 0.07, N = 3)
b: 20.63 (SE +/- 0.01, N = 3)
c: 20.56 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 115.31 (SE +/- 0.62, N = 3)
b: 120.99 (SE +/- 1.24, N = 14)
c: 120.15 (SE +/- 1.05, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 46.07 (SE +/- 0.17, N = 3)
b: 46.16 (SE +/- 0.18, N = 3)
c: 46.26 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 44.00 (SE +/- 0.03, N = 3)
b: 44.03 (SE +/- 0.02, N = 3)
c: 43.97 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 17.52 (SE +/- 0.22, N = 4)
b: 18.02 (SE +/- 0.20, N = 4)
c: 17.67 (SE +/- 0.18, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 10.47 (SE +/- 0.01, N = 3)
b: 10.47 (SE +/- 0.01, N = 3)
c: 10.46 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 11.55 (SE +/- 0.01, N = 3)
b: 11.56 (SE +/- 0.00, N = 3)
c: 11.57 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 49.71 (SE +/- 0.50, N = 3)
b: 49.33 (SE +/- 0.36, N = 15)
c: 49.18 (SE +/- 0.58, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 21.21 (SE +/- 0.01, N = 3)
b: 21.17 (SE +/- 0.03, N = 3)
c: 21.15 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.65 (SE +/- 0.01, N = 3)
b: 22.64 (SE +/- 0.02, N = 3)
c: 22.65 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 80.70 (SE +/- 1.13, N = 15)
b: 77.99 (SE +/- 1.24, N = 15)
c: 77.64 (SE +/- 2.02, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 20.86 (SE +/- 0.04, N = 3)
b: 20.89 (SE +/- 0.01, N = 3)
c: 20.85 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.31 (SE +/- 0.01, N = 3)
b: 22.27 (SE +/- 0.01, N = 3)
c: 22.28 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 137.40 (SE +/- 1.34, N = 15)
b: 133.32 (SE +/- 1.70, N = 3)
c: 133.52 (SE +/- 1.45, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 42.03 (SE +/- 0.12, N = 3)
b: 41.87 (SE +/- 0.12, N = 3)
c: 41.65 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 40.69 (SE +/- 0.02, N = 3)
b: 40.70 (SE +/- 0.04, N = 3)
c: 40.68 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 17.21 (SE +/- 0.24, N = 3)
b: 16.84 (SE +/- 0.05, N = 3)
c: 17.07 (SE +/- 0.12, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 10.13 (SE +/- 0.02, N = 3)
b: 10.14 (SE +/- 0.01, N = 3)
c: 10.18 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 11.30 (SE +/- 0.02, N = 3)
b: 11.31 (SE +/- 0.03, N = 3)
c: 11.32 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 60.58 (SE +/- 0.82, N = 15)
b: 59.87 (SE +/- 0.80, N = 3)
c: 60.87 (SE +/- 0.69, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 21.12 (SE +/- 0.02, N = 3)
b: 21.13 (SE +/- 0.01, N = 3)
c: 21.10 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.47 (SE +/- 0.01, N = 3)
b: 22.45 (SE +/- 0.03, N = 3)
c: 22.47 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 45.23 (SE +/- 0.67, N = 15)
b: 46.52 (SE +/- 0.37, N = 3)
c: 45.56 (SE +/- 0.54, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 19.06 (SE +/- 0.05, N = 3)
b: 19.13 (SE +/- 0.03, N = 3)
c: 19.14 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 20.59 (SE +/- 0.05, N = 3)
b: 20.49 (SE +/- 0.02, N = 3)
c: 20.52 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 118.58 (SE +/- 0.75, N = 3)
b: 116.58 (SE +/- 0.49, N = 3)
c: 120.07 (SE +/- 1.65, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 46.18 (SE +/- 0.01, N = 3)
b: 46.11 (SE +/- 0.23, N = 3)
c: 46.80 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 43.95 (SE +/- 0.02, N = 3)
b: 43.93 (SE +/- 0.03, N = 3)
c: 43.94 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 17.51 (SE +/- 0.24, N = 3)
b: 17.30 (SE +/- 0.14, N = 15)
c: 17.32 (SE +/- 0.13, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 10.46 (SE +/- 0.00, N = 3)
b: 10.46 (SE +/- 0.00, N = 3)
c: 10.46 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 11.57 (SE +/- 0.00, N = 3)
b: 11.57 (SE +/- 0.00, N = 3)
c: 11.57 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

HeFFTe 2.4, GFLOP/s, more is better
a: 47.41 (SE +/- 0.53, N = 3)
b: 49.84 (SE +/- 0.46, N = 6)
c: 48.89 (SE +/- 0.58, N = 15)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

HeFFTe 2.4, GFLOP/s, more is better
a: 21.17 (SE +/- 0.02, N = 3)
b: 21.18 (SE +/- 0.01, N = 3)
c: 21.20 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

HeFFTe 2.4, GFLOP/s, more is better
a: 22.65 (SE +/- 0.01, N = 3)
b: 22.67 (SE +/- 0.02, N = 3)
c: 22.65 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.4