cloverleaf threadripper

AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) and AMD Radeon RX 5700 8GB on Ubuntu 23.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310271-PTS-CLOVERLE91&grw&rdt&rro.

cloverleaf threadripperProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionabcAMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS)AMD Starship/Matisse128GBSamsung SSD 970 EVO Plus 500GBAMD Radeon RX 5700 8GB (1750/875MHz)AMD Navi 10 HDMI AudioDELL P2415QIntel I211 + Intel Wi-Fi 6 AX200Ubuntu 23.046.2.0-34-generic (x86_64)GNOME Shell 44.3X Server + Wayland4.6 Mesa 23.0.2 (LLVM 15.0.7 DRM 3.49)GCC 12.3.0ext43840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107aSecurity Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

cloverleaf threadripperheffte: r2c - FFTW - float - 512cloverleaf: clover_bm16heffte: r2c - Stock - double-long - 512heffte: c2c - FFTW - double - 128heffte: c2c - Stock - float-long - 1024heffte: c2c - Stock - float-long - 512heffte: r2c - FFTW - float-long - 1024heffte: r2c - FFTW - float-long - 256heffte: c2c - FFTW - double-long - 512heffte: r2c - FFTW - float - 256heffte: c2c - FFTW - float-long - 128heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - float - 512heffte: c2c - FFTW - float - 256heffte: c2c - FFTW - float - 128cloverleaf: clover_bm64_shortheffte: c2c - FFTW - float-long - 512heffte: c2c - FFTW - double - 256heffte: c2c - FFTW - double-long - 128heffte: c2c - FFTW - double - 512heffte: c2c - Stock - float-long - 128heffte: c2c - FFTW - float - 1024heffte: r2c - FFTW - double-long - 256heffte: c2c - Stock - float - 128heffte: r2c - Stock - float-long - 256heffte: c2c - Stock - float - 256heffte: c2c - Stock - double-long - 256heffte: c2c - Stock - float - 512heffte: r2c - Stock - double-long - 128heffte: r2c - FFTW - double - 128heffte: r2c - FFTW - double - 256heffte: r2c - Stock - double - 1024heffte: r2c - FFTW - double - 512heffte: c2c - FFTW - float-long - 256heffte: r2c - FFTW - float - 1024heffte: r2c - FFTW - float-long - 128heffte: r2c - Stock - float - 128heffte: r2c - FFTW - float-long - 512heffte: r2c - Stock - float - 256heffte: c2c - FFTW - double-long - 256heffte: r2c - Stock - float - 512heffte: c2c - FFTW - float-long - 1024heffte: c2c - Stock - float-long - 256heffte: c2c - Stock - double - 128heffte: r2c - FFTW - double-long - 128heffte: c2c - Stock - double - 256heffte: r2c - FFTW - double-long - 512heffte: c2c - Stock - double - 512heffte: r2c - Stock - float-long - 128heffte: c2c - Stock - float - 1024heffte: r2c - Stock - float-long - 512heffte: r2c - FFTW - double - 1024heffte: c2c - Stock - double-long - 128heffte: r2c - Stock - double - 128heffte: c2c - Stock - double-long - 512heffte: r2c - Stock - double - 256heffte: r2c - FFTW - double-long - 1024heffte: r2c - Stock - double - 512heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - float - 1024heffte: r2c - Stock - float-long - 1024heffte: r2c - Stock - double-long - 1024cloverleaf: clover_bmabc44.16531253.1224.119220.999227.181923.709448.574482.387212.342185.100954.952492.239123.548635.247755.8176144.5123.606212.874718.890712.349250.166427.134128.023449.923395.564337.826212.834623.644947.589726.004628.352426.974722.280634.687148.477691.311582.367244.238795.752012.746748.163927.181037.750529.105225.036413.060822.310812.402081.123427.123548.299824.906728.866248.295312.405934.414824.924824.092934.168252.272652.346726.987517.5444.19771252.5324.146819.863127.204823.658548.564782.704412.348983.358055.626591.529523.527035.186855.0641144.6523.609312.638719.242412.337950.538027.117728.075349.746696.268437.974012.824623.633948.514427.279927.672026.977422.297534.793148.445391.227579.860444.119897.598112.731347.928127.178537.856628.577428.266012.845022.318812.403080.866927.116048.093324.914628.699148.582412.403534.379524.920424.098834.284452.263452.383026.976816.1344.06281252.6124.119923.963927.210123.68948.57181.127312.342783.14159.471591.99223.548535.013759.611144.6923.566512.656627.269312.349250.732627.149328.094447.113795.468938.300812.666723.628448.1417.374428.192927.008122.315434.774148.512589.813680.783744.277593.802612.644847.994827.177737.80828.325718.377112.688122.348912.40180.859827.129248.278324.930328.727848.177312.42134.582924.947324.08934.475352.308652.406927.011417.00OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512cba1020304050SE +/- 0.08, N = 3SE +/- 0.05, N = 344.0644.2044.171. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16cba30060090012001500SE +/- 0.09, N = 3SE +/- 1.27, N = 31252.611252.531253.121. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512cba612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 324.1224.1524.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128cba612182430SE +/- 1.31, N = 15SE +/- 1.55, N = 1223.9619.8621.001. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.02, N = 327.2127.2027.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512cba612182430SE +/- 0.01, N = 3SE +/- 0.02, N = 323.6923.6623.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024cba1122334455SE +/- 0.01, N = 3SE +/- 0.01, N = 348.5748.5648.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256cba20406080100SE +/- 0.40, N = 3SE +/- 0.20, N = 381.1382.7082.391. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512cba3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 312.3412.3512.341. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256cba20406080100SE +/- 0.64, N = 3SE +/- 0.47, N = 383.1483.3685.101. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128cba1326395265SE +/- 0.45, N = 9SE +/- 0.42, N = 1059.4755.6354.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128cba20406080100SE +/- 0.58, N = 3SE +/- 0.34, N = 391.9991.5392.241. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512cba612182430SE +/- 0.02, N = 3SE +/- 0.01, N = 323.5523.5323.551. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256cba816243240SE +/- 0.20, N = 3SE +/- 0.04, N = 335.0135.1935.251. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128cba1326395265SE +/- 0.41, N = 13SE +/- 0.34, N = 1559.6155.0655.821. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortcba306090120150SE +/- 0.02, N = 3SE +/- 0.02, N = 3144.69144.65144.511. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512cba612182430SE +/- 0.00, N = 3SE +/- 0.02, N = 323.5723.6123.611. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256cba3691215SE +/- 0.03, N = 3SE +/- 0.06, N = 312.6612.6412.871. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128cba612182430SE +/- 1.41, N = 15SE +/- 1.04, N = 1527.2719.2418.891. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512cba3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 312.3512.3412.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128cba1122334455SE +/- 0.12, N = 3SE +/- 0.11, N = 350.7350.5450.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 327.1527.1227.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256cba714212835SE +/- 0.06, N = 3SE +/- 0.09, N = 328.0928.0828.021. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128cba1122334455SE +/- 0.34, N = 3SE +/- 0.22, N = 347.1149.7549.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256cba20406080100SE +/- 0.91, N = 3SE +/- 0.68, N = 1295.4796.2795.561. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256cba918273645SE +/- 0.35, N = 3SE +/- 0.29, N = 338.3037.9737.831. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256cba3691215SE +/- 0.12, N = 3SE +/- 0.17, N = 312.6712.8212.831. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512cba612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 323.6323.6323.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128cba1122334455SE +/- 0.40, N = 3SE +/- 0.69, N = 348.1448.5147.591. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128cba612182430SE +/- 2.33, N = 15SE +/- 2.70, N = 1517.3727.2826.001. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256cba714212835SE +/- 0.25, N = 3SE +/- 0.14, N = 328.1927.6728.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 327.0126.9826.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512cba510152025SE +/- 0.02, N = 3SE +/- 0.00, N = 322.3222.3022.281. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256cba816243240SE +/- 0.47, N = 3SE +/- 0.16, N = 334.7734.7934.691. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024cba1122334455SE +/- 0.05, N = 3SE +/- 0.01, N = 348.5148.4548.481. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128cba20406080100SE +/- 0.16, N = 3SE +/- 0.39, N = 389.8191.2391.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128cba20406080100SE +/- 0.45, N = 3SE +/- 0.06, N = 380.7879.8682.371. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512cba1020304050SE +/- 0.03, N = 3SE +/- 0.01, N = 344.2844.1244.241. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256cba20406080100SE +/- 0.29, N = 3SE +/- 0.22, N = 393.8097.6095.751. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256cba3691215SE +/- 0.10, N = 3SE +/- 0.05, N = 312.6412.7312.751. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512cba1122334455SE +/- 0.07, N = 3SE +/- 0.14, N = 347.9947.9348.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 327.1827.1827.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256cba918273645SE +/- 0.04, N = 3SE +/- 0.23, N = 337.8137.8637.751. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128cba714212835SE +/- 0.41, N = 3SE +/- 0.31, N = 328.3328.5829.111. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128cba714212835SE +/- 2.45, N = 15SE +/- 3.07, N = 1218.3828.2725.041. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256cba3691215SE +/- 0.14, N = 3SE +/- 0.13, N = 312.6912.8513.061. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512cba510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 322.3522.3222.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512cba3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 312.4012.4012.401. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128cba20406080100SE +/- 0.24, N = 3SE +/- 0.29, N = 380.8680.8781.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 327.1327.1227.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512cba1122334455SE +/- 0.05, N = 3SE +/- 0.23, N = 348.2848.0948.301. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024cba612182430SE +/- 0.00, N = 3SE +/- 0.01, N = 324.9324.9124.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128cba714212835SE +/- 0.34, N = 3SE +/- 0.12, N = 328.7328.7028.871. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128cba1122334455SE +/- 0.52, N = 3SE +/- 0.44, N = 348.1848.5848.301. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512cba3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 312.4212.4012.411. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256cba816243240SE +/- 0.11, N = 3SE +/- 0.12, N = 334.5834.3834.411. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024cba612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 324.9524.9224.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512cba612182430SE +/- 0.01, N = 3SE +/- 0.02, N = 324.0924.1024.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256cba816243240SE +/- 0.17, N = 3SE +/- 0.09, N = 334.4834.2834.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024cba1224364860SE +/- 0.02, N = 3SE +/- 0.02, N = 352.3152.2652.271. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024cba1224364860SE +/- 0.02, N = 3SE +/- 0.02, N = 352.4152.3852.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024cba612182430SE +/- 0.02, N = 3SE +/- 0.01, N = 327.0126.9826.991. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmcba48121620SE +/- 0.25, N = 15SE +/- 1.44, N = 1217.0016.1317.541. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp


Phoronix Test Suite v10.8.5