heffte 7950x

AMD Ryzen 9 7950X 16-Core testing with a ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 3060 Ti 8GB on Ubuntu 23.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310274-PTS-HEFFTE7999&grr.

heffte 7950xProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionabcAMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)AMD Device 14d832GBWestern Digital WD_BLACK SN850X 1000GBNVIDIA GeForce RTX 3060 Ti 8GBNVIDIA GA104 HD AudioASUS MG28UIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.106.6.0-060600rc5-generic (x86_64)GNOME Shell 45.0X Server 1.21.1.7NVIDIA 545.23.064.6.0OpenCL 3.0 CUDA 12.3.68GCC 13.2.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203 Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

heffte 7950xheffte: c2c - FFTW - double-long - 512heffte: c2c - FFTW - double - 512heffte: c2c - Stock - double - 512heffte: c2c - Stock - double-long - 512heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - double - 512heffte: c2c - FFTW - float-long - 512heffte: c2c - FFTW - float - 512heffte: r2c - Stock - double-long - 512heffte: r2c - Stock - double - 512heffte: c2c - Stock - float-long - 512heffte: c2c - Stock - float - 512heffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - float - 512heffte: r2c - Stock - float-long - 512heffte: r2c - Stock - float - 512heffte: c2c - FFTW - double-long - 256heffte: c2c - FFTW - double - 256heffte: c2c - Stock - double - 256heffte: c2c - Stock - double-long - 256heffte: c2c - FFTW - float-long - 128heffte: r2c - FFTW - double - 128heffte: c2c - Stock - double-long - 128heffte: r2c - FFTW - double-long - 128heffte: c2c - Stock - float-long - 128heffte: c2c - Stock - float - 128heffte: r2c - FFTW - double-long - 256heffte: r2c - FFTW - double - 256heffte: r2c - FFTW - float-long - 128heffte: c2c - FFTW - float - 256heffte: r2c - Stock - float - 128heffte: c2c - FFTW - float-long - 256heffte: r2c - Stock - double - 256heffte: r2c - Stock - double-long - 256heffte: c2c - Stock - float-long - 256heffte: c2c - Stock - float - 256heffte: c2c - FFTW - float - 128heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - double-long - 128heffte: c2c - FFTW - double - 128heffte: r2c - Stock - double-long - 128heffte: r2c - Stock - double - 128heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float-long - 256heffte: r2c - Stock - float-long - 256heffte: r2c - Stock - float - 256heffte: c2c - Stock - double - 128heffte: r2c - Stock - float-long - 128abc11.303311.317511.550411.565020.587620.591322.307722.312922.647722.646822.471722.470540.692440.735843.946544.001010.132610.206310.472410.460480.701844.594017.511845.228160.580262.418419.062619.1375137.39520.8780115.30620.863021.209921.170321.119321.140977.4191136.39617.207316.974247.409549.711042.088542.027046.177846.069617.5246118.58111.305411.300611.558511.573320.494620.631422.274622.330922.668822.636122.448222.476640.698040.742743.932044.029910.138510.119310.468010.464077.986644.328317.301246.524059.870061.757119.129819.0872133.31720.8919120.98720.889521.165521.183421.126721.123776.8080135.91916.837416.496849.841449.330841.713541.870246.111146.157218.0207116.57611.321311.335311.567211.565520.518420.559622.275522.289822.650522.652722.467822.465840.677840.725743.938543.966010.183010.155310.464110.456577.637344.013117.321845.561960.868661.261919.142319.1488133.51520.8252120.15320.850121.148621.203221.102821.164377.2216135.26217.071917.093748.892149.180741.822941.649446.798746.260217.6702120.073OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512abc3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 311.3011.3111.321. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512abc3691215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 311.3211.3011.341. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512abc3691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.5511.5611.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512abc3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.5711.5711.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512abc510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 320.5920.4920.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512abc510152025SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 320.5920.6320.561. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512abc510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 322.3122.2722.281. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512abc510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 322.3122.3322.291. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512abc510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 322.6522.6722.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512abc510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 322.6522.6422.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512abc510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 322.4722.4522.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512abc510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 322.4722.4822.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512abc918273645SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 340.6940.7040.681. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512abc918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 340.7440.7440.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512abc1020304050SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 343.9543.9343.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512abc1020304050SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 344.0044.0343.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256abc3691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 310.1310.1410.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256abc3691215SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 310.2110.1210.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256abc3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.4710.4710.461. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256abc3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.4610.4610.461. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128abc20406080100SE +/- 1.13, N = 15SE +/- 1.24, N = 15SE +/- 2.02, N = 1580.7077.9977.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128abc1020304050SE +/- 0.98, N = 15SE +/- 0.46, N = 15SE +/- 1.16, N = 1244.5944.3344.011. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128abc48121620SE +/- 0.24, N = 3SE +/- 0.14, N = 15SE +/- 0.13, N = 1517.5117.3017.321. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128abc1122334455SE +/- 0.67, N = 15SE +/- 0.37, N = 3SE +/- 0.54, N = 1545.2346.5245.561. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128abc1428425670SE +/- 0.82, N = 15SE +/- 0.80, N = 3SE +/- 0.69, N = 1560.5859.8760.871. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128abc1428425670SE +/- 0.65, N = 15SE +/- 0.74, N = 3SE +/- 0.56, N = 1562.4261.7661.261. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256abc510152025SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 319.0619.1319.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256abc510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 319.1419.0919.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128abc306090120150SE +/- 1.34, N = 15SE +/- 1.70, N = 3SE +/- 1.45, N = 15137.40133.32133.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256abc510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 320.8820.8920.831. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128abc306090120150SE +/- 0.62, N = 3SE +/- 1.24, N = 14SE +/- 1.05, N = 15115.31120.99120.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256abc510152025SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 320.8620.8920.851. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256abc510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 321.2121.1721.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256abc510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 321.1721.1821.201. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256abc510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 321.1221.1321.101. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256abc510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 321.1421.1221.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128abc20406080100SE +/- 1.05, N = 3SE +/- 1.96, N = 15SE +/- 2.22, N = 1277.4276.8177.221. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128abc306090120150SE +/- 0.97, N = 12SE +/- 1.41, N = 3SE +/- 1.19, N = 15136.40135.92135.261. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128abc48121620SE +/- 0.24, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 1517.2116.8417.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128abc48121620SE +/- 0.15, N = 15SE +/- 0.02, N = 3SE +/- 0.10, N = 316.9716.5017.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128abc1122334455SE +/- 0.53, N = 3SE +/- 0.46, N = 6SE +/- 0.58, N = 1547.4149.8448.891. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128abc1122334455SE +/- 0.50, N = 3SE +/- 0.36, N = 15SE +/- 0.58, N = 349.7149.3349.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256abc1020304050SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 342.0941.7141.821. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256abc1020304050SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 342.0341.8741.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256abc1122334455SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 0.14, N = 346.1846.1146.801. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256abc1020304050SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.03, N = 346.0746.1646.261. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128abc48121620SE +/- 0.22, N = 4SE +/- 0.20, N = 4SE +/- 0.18, N = 317.5218.0217.671. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128abc306090120150SE +/- 0.75, N = 3SE +/- 0.49, N = 3SE +/- 1.65, N = 3118.58116.58120.071. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.4