cloverleaf threadripper

AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS) and AMD Radeon RX 5700 8GB on Ubuntu 23.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310271-PTS-CLOVERLE91

Run Management

Run a: October 27 2023, test duration 2 Hours, 14 Minutes
Run b: October 27 2023, test duration 2 Hours, 15 Minutes
Run c: October 27 2023, test duration 44 Minutes
Average test duration: 1 Hour, 44 Minutes


Cloverleaf Threadripper Benchmarks (OpenBenchmarking.org, Phoronix Test Suite)

Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads)
Motherboard: Gigabyte TRX40 AORUS PRO WIFI (F6 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: Samsung SSD 970 EVO Plus 500GB
Graphics: AMD Radeon RX 5700 8GB (1750/875MHz)
Audio: AMD Navi 10 HDMI Audio
Monitor: DELL P2415Q
Network: Intel I211 + Intel Wi-Fi 6 AX200
OS: Ubuntu 23.04
Kernel: 6.2.0-34-generic (x86_64)
Desktop: GNOME Shell 44.3
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 23.0.2 (LLVM 15.0.7 DRM 3.49)
Compiler: GCC 12.3.0
File-System: ext4
Screen Resolution: 3840x2160

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-DAPbBt/gcc-12-12.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0x830107a
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): chart of relative performance for runs a, b, and c across the HeFFTe and CloverLeaf tests, with an axis running from 100% to 143%.

cloverleaf threadripper: results by run (OpenBenchmarking.org)

Test                                      a         b         c
heffte: c2c - FFTW - float - 128          55.8176   55.0641   59.611
heffte: c2c - FFTW - float-long - 128     54.9524   55.6265   59.4715
heffte: c2c - Stock - float - 128         49.9233   49.7466   47.1137
heffte: r2c - Stock - float - 256         95.7520   97.5981   93.8026
heffte: r2c - Stock - float - 128         82.3672   79.8604   80.7837
heffte: c2c - Stock - double - 256        13.0608   12.8450   12.6881
heffte: c2c - Stock - double - 128        29.1052   28.5774   28.3257
heffte: r2c - FFTW - double - 256         28.3524   27.6720   28.1929
heffte: r2c - FFTW - float - 256          85.1009   83.3580   83.141
heffte: r2c - FFTW - float-long - 256     82.3872   82.7044   81.1273
heffte: r2c - Stock - double-long - 128   47.5897   48.5144   48.14
heffte: c2c - FFTW - double - 256         12.8747   12.6387   12.6566
heffte: r2c - FFTW - float-long - 128     91.3115   91.2275   89.8136
heffte: c2c - Stock - double-long - 256   12.8346   12.8246   12.6667
heffte: c2c - Stock - float - 256         37.8262   37.9740   38.3008
heffte: c2c - Stock - float-long - 128    50.1664   50.5380   50.7326
heffte: r2c - Stock - double-long - 256   34.1682   34.2844   34.4753
heffte: r2c - Stock - double - 128        48.2953   48.5824   48.1773
heffte: r2c - Stock - float-long - 256    95.5643   96.2684   95.4689
heffte: c2c - FFTW - double-long - 256    12.7467   12.7313   12.6448
heffte: r2c - FFTW - float - 128          92.2391   91.5295   91.992
heffte: c2c - FFTW - float - 256          35.2477   35.1868   35.0137
heffte: r2c - Stock - double - 256        34.4148   34.3795   34.5829
heffte: c2c - Stock - double-long - 128   28.8662   28.6991   28.7278
heffte: r2c - Stock - float - 512         48.1639   47.9281   47.9948
heffte: r2c - Stock - float-long - 512    48.2998   48.0933   48.2783
heffte: r2c - FFTW - float-long - 512     44.2387   44.1198   44.2775
heffte: r2c - Stock - float-long - 128    81.1234   80.8669   80.8598
heffte: r2c - FFTW - float - 512          44.1653   44.1977   44.0628
heffte: c2c - FFTW - float-long - 256     34.6871   34.7931   34.7741
heffte: c2c - Stock - float-long - 256    37.7505   37.8566   37.808
heffte: r2c - FFTW - double-long - 256    28.0234   28.0753   28.0944
heffte: c2c - Stock - float-long - 512    23.7094   23.6585   23.689
heffte: c2c - FFTW - float-long - 512     23.6062   23.6093   23.5665
heffte: r2c - FFTW - double-long - 512    22.3108   22.3188   22.3489
heffte: r2c - FFTW - double - 512         22.2806   22.2975   22.3154
heffte: c2c - Stock - double-long - 512   12.4059   12.4035   12.421
heffte: r2c - FFTW - float - 1024         48.4776   48.4453   48.5125
heffte: r2c - Stock - double-long - 1024  26.9875   26.9768   27.0114
cloverleaf: clover_bm64_short             144.51    144.65    144.69
heffte: r2c - Stock - double - 1024       26.9747   26.9774   27.0081
heffte: c2c - FFTW - float - 1024         27.1341   27.1177   27.1493
heffte: r2c - Stock - float-long - 1024   52.3467   52.3830   52.4069
heffte: r2c - Stock - double-long - 512   24.1192   24.1468   24.1199
heffte: r2c - FFTW - double-long - 1024   24.9248   24.9204   24.9473
heffte: c2c - Stock - float-long - 1024   27.1819   27.2048   27.2101
heffte: r2c - FFTW - double - 1024        24.9067   24.9146   24.9303
heffte: c2c - FFTW - float - 512          23.5486   23.5270   23.5485
heffte: c2c - FFTW - double - 512         12.3492   12.3379   12.3492
heffte: r2c - Stock - float - 1024        52.2726   52.2634   52.3086
heffte: c2c - Stock - float - 512         23.6449   23.6339   23.6284
heffte: c2c - FFTW - double-long - 512    12.3421   12.3489   12.3427
heffte: c2c - Stock - float - 1024        27.1235   27.1160   27.1292
cloverleaf: clover_bm16                   1253.12   1252.53   1252.61
heffte: r2c - Stock - double - 512        24.0929   24.0988   24.089
heffte: r2c - FFTW - float-long - 1024    48.5744   48.5647   48.571
heffte: c2c - Stock - double - 512        12.4020   12.4030   12.401
heffte: c2c - FFTW - float-long - 1024    27.1810   27.1785   27.1777
heffte: r2c - FFTW - double-long - 128    25.0364   28.2660   18.3771
heffte: c2c - FFTW - double-long - 128    18.8907   19.2424   27.2693
heffte: r2c - FFTW - double - 128         26.0046   27.2799   17.3744
heffte: c2c - FFTW - double - 128         20.9992   19.8631   23.9639
cloverleaf: clover_bm                     17.54     16.13     17.00

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better; (CXX) g++ options: -O3)

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128
a: 55.82, b: 55.06, c: 59.61 (SE +/- 0.34, N = 15; SE +/- 0.41, N = 13)

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128
a: 54.95, b: 55.63, c: 59.47 (SE +/- 0.42, N = 10; SE +/- 0.45, N = 9)

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128
a: 49.92, b: 49.75, c: 47.11 (SE +/- 0.22, N = 3; SE +/- 0.34, N = 3)

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256
a: 95.75, b: 97.60, c: 93.80 (SE +/- 0.22, N = 3; SE +/- 0.29, N = 3)

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128
a: 82.37, b: 79.86, c: 80.78 (SE +/- 0.06, N = 3; SE +/- 0.45, N = 3)

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256
a: 13.06, b: 12.85, c: 12.69 (SE +/- 0.13, N = 3; SE +/- 0.14, N = 3)

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128
a: 29.11, b: 28.58, c: 28.33 (SE +/- 0.31, N = 3; SE +/- 0.41, N = 3)

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256
a: 28.35, b: 27.67, c: 28.19 (SE +/- 0.14, N = 3; SE +/- 0.25, N = 3)

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256
a: 85.10, b: 83.36, c: 83.14 (SE +/- 0.47, N = 3; SE +/- 0.64, N = 3)

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256
a: 82.39, b: 82.70, c: 81.13 (SE +/- 0.20, N = 3; SE +/- 0.40, N = 3)

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128
a: 47.59, b: 48.51, c: 48.14 (SE +/- 0.69, N = 3; SE +/- 0.40, N = 3)

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256
a: 12.87, b: 12.64, c: 12.66 (SE +/- 0.06, N = 3; SE +/- 0.03, N = 3)

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128
a: 91.31, b: 91.23, c: 89.81 (SE +/- 0.39, N = 3; SE +/- 0.16, N = 3)

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256
a: 12.83, b: 12.82, c: 12.67 (SE +/- 0.17, N = 3; SE +/- 0.12, N = 3)

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256
a: 37.83, b: 37.97, c: 38.30 (SE +/- 0.29, N = 3; SE +/- 0.35, N = 3)

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128
a: 50.17, b: 50.54, c: 50.73 (SE +/- 0.11, N = 3; SE +/- 0.12, N = 3)

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256
a: 34.17, b: 34.28, c: 34.48 (SE +/- 0.09, N = 3; SE +/- 0.17, N = 3)

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128
a: 48.30, b: 48.58, c: 48.18 (SE +/- 0.44, N = 3; SE +/- 0.52, N = 3)

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256
a: 95.56, b: 96.27, c: 95.47 (SE +/- 0.68, N = 12; SE +/- 0.91, N = 3)

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256
a: 12.75, b: 12.73, c: 12.64 (SE +/- 0.05, N = 3; SE +/- 0.10, N = 3)

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128
a: 92.24, b: 91.53, c: 91.99 (SE +/- 0.34, N = 3; SE +/- 0.58, N = 3)

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256
a: 35.25, b: 35.19, c: 35.01 (SE +/- 0.04, N = 3; SE +/- 0.20, N = 3)

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256
a: 34.41, b: 34.38, c: 34.58 (SE +/- 0.12, N = 3; SE +/- 0.11, N = 3)

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128
a: 28.87, b: 28.70, c: 28.73 (SE +/- 0.12, N = 3; SE +/- 0.34, N = 3)

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512
a: 48.16, b: 47.93, c: 47.99 (SE +/- 0.14, N = 3; SE +/- 0.07, N = 3)

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512
a: 48.30, b: 48.09, c: 48.28 (SE +/- 0.23, N = 3; SE +/- 0.05, N = 3)

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512
a: 44.24, b: 44.12, c: 44.28 (SE +/- 0.01, N = 3; SE +/- 0.03, N = 3)

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128
a: 81.12, b: 80.87, c: 80.86 (SE +/- 0.29, N = 3; SE +/- 0.24, N = 3)

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512
a: 44.17, b: 44.20, c: 44.06 (SE +/- 0.05, N = 3; SE +/- 0.08, N = 3)

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256
a: 34.69, b: 34.79, c: 34.77 (SE +/- 0.16, N = 3; SE +/- 0.47, N = 3)

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256
a: 37.75, b: 37.86, c: 37.81 (SE +/- 0.23, N = 3; SE +/- 0.04, N = 3)

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256
a: 28.02, b: 28.08, c: 28.09 (SE +/- 0.09, N = 3; SE +/- 0.06, N = 3)

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512
a: 23.71, b: 23.66, c: 23.69 (SE +/- 0.02, N = 3; SE +/- 0.01, N = 3)

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512
a: 23.61, b: 23.61, c: 23.57 (SE +/- 0.02, N = 3; SE +/- 0.00, N = 3)

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512
a: 22.31, b: 22.32, c: 22.35 (SE +/- 0.02, N = 3; SE +/- 0.01, N = 3)

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512
a: 22.28, b: 22.30, c: 22.32 (SE +/- 0.00, N = 3; SE +/- 0.02, N = 3)

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512
a: 12.41, b: 12.40, c: 12.42 (SE +/- 0.01, N = 3; SE +/- 0.00, N = 3)

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024
a: 48.48, b: 48.45, c: 48.51 (SE +/- 0.01, N = 3; SE +/- 0.05, N = 3)

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024
a: 26.99, b: 26.98, c: 27.01 (SE +/- 0.01, N = 3; SE +/- 0.02, N = 3)
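
The SE and N figures reported with each result describe how much the repeated trials varied. As a minimal sketch, assuming the conventional definition of standard error of the mean (sample standard deviation divided by the square root of N), the figure could be reproduced as below. The function name and the sample values are hypothetical and are not taken from this result file, and the Phoronix Test Suite may compute its figure differently.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical GFLOP/s readings from three trials (illustrative only).
trials = [95.4, 95.8, 96.1]
mean = statistics.mean(trials)
se = standard_error(trials)
print(f"{mean:.2f} GFLOP/s, SE +/- {se:.2f}, N = {len(trials)}")
```

A small SE relative to the mean suggests the trials agreed closely; results with a large SE and a high N are the ones where the trial count was raised because of run-to-run variance.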

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.

CloverLeaf 1.3 - Input: clover_bm64_short (Seconds, Fewer Is Better)
a: 144.51, b: 144.65, c: 144.69 (SE +/- 0.02, N = 3; SE +/- 0.02, N = 3)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better; (CXX) g++ options: -O3)

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024
a: 26.97, b: 26.98, c: 27.01 (SE +/- 0.01, N = 3; SE +/- 0.01, N = 3)

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024
a: 27.13, b: 27.12, c: 27.15 (SE +/- 0.01, N = 3; SE +/- 0.01, N = 3)

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024
a: 52.35, b: 52.38, c: 52.41 (SE +/- 0.02, N = 3; SE +/- 0.02, N = 3)

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512
a: 24.12, b: 24.15, c: 24.12 (SE +/- 0.01, N = 3; SE +/- 0.04, N = 3)

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024
a: 24.92, b: 24.92, c: 24.95 (SE +/- 0.00, N = 3; SE +/- 0.01, N = 3)

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024
a: 27.18, b: 27.20, c: 27.21 (SE +/- 0.02, N = 3; SE +/- 0.01, N = 3)

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024
a: 24.91, b: 24.91, c: 24.93 (SE +/- 0.01, N = 3; SE +/- 0.00, N = 3)

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512
a: 23.55, b: 23.53, c: 23.55 (SE +/- 0.01, N = 3; SE +/- 0.02, N = 3)

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512
a: 12.35, b: 12.34, c: 12.35 (SE +/- 0.01, N = 3; SE +/- 0.00, N = 3)

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024
a: 52.27, b: 52.26, c: 52.31 (SE +/- 0.02, N = 3; SE +/- 0.02, N = 3)

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512
a: 23.64, b: 23.63, c: 23.63 (SE +/- 0.01, N = 3; SE +/- 0.04, N = 3)

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512
a: 12.34, b: 12.35, c: 12.34 (SE +/- 0.00, N = 3; SE +/- 0.00, N = 3)

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024
a: 27.12, b: 27.12, c: 27.13 (SE +/- 0.01, N = 3; SE +/- 0.01, N = 3)

CloverLeaf

CloverLeaf 1.3 - Input: clover_bm16 (Seconds, Fewer Is Better)
a: 1253.12, b: 1252.53, c: 1252.61 (SE +/- 1.27, N = 3; SE +/- 0.09, N = 3)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better; (CXX) g++ options: -O3)

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512
a: 24.09, b: 24.10, c: 24.09 (SE +/- 0.02, N = 3; SE +/- 0.01, N = 3)

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024
a: 48.57, b: 48.56, c: 48.57 (SE +/- 0.01, N = 3; SE +/- 0.01, N = 3)

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512
a: 12.40, b: 12.40, c: 12.40 (SE +/- 0.01, N = 3; SE +/- 0.00, N = 3)

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024
a: 27.18, b: 27.18, c: 27.18 (SE +/- 0.01, N = 3; SE +/- 0.01, N = 3)

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

a: The test quit with a non-zero exit status. E: mpirun noticed that process rank 46 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

b: The test quit with a non-zero exit status. E: mpirun noticed that process rank 3 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

c: The test quit with a non-zero exit status. E: mpirun noticed that process rank 42 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

a: The test quit with a non-zero exit status. E: mpirun noticed that process rank 23 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

b: The test quit with a non-zero exit status. E: mpirun noticed that process rank 37 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

c: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better; (CXX) g++ options: -O3)

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128
a: 25.04, b: 28.27, c: 18.38 (SE +/- 3.07, N = 12; SE +/- 2.45, N = 15)

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128
a: 18.89, b: 19.24, c: 27.27 (SE +/- 1.04, N = 15; SE +/- 1.41, N = 15)

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

a: The test quit with a non-zero exit status. E: mpirun noticed that process rank 3 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

b: The test quit with a non-zero exit status. E: mpirun noticed that process rank 3 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

c: The test quit with a non-zero exit status. E: mpirun noticed that process rank 50 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

a: The test quit with a non-zero exit status. E: mpirun noticed that process rank 62 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

b: The test quit with a non-zero exit status. E: mpirun noticed that process rank 39 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).

c: The test quit with a non-zero exit status. E: mpirun noticed that process rank 31 with PID 0 on node phoronix-TRX40-AORUS-PRO-WIFI exited on signal 9 (Killed).
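
The failing configurations are all c2c runs at double or double-long precision with X Y Z set to 1024, and each ended on signal 9 (SIGKILL). The log does not state the cause. One common source of SIGKILL on Linux is the kernel out-of-memory killer, so as a rough sketch under that assumption, the size of a single 1024 x 1024 x 1024 complex array can be estimated as below. The helper name is hypothetical, and HeFFTe's real footprint (work buffers, MPI communication buffers, per-rank copies) is not modeled here.

```python
GIB = 1024 ** 3


def grid_bytes(n, bytes_per_element):
    """Bytes needed to hold one n x n x n array."""
    return n ** 3 * bytes_per_element


# A complex value is two floats (8 bytes) or two doubles (16 bytes).
for label, size in (("complex float", 8), ("complex double", 16)):
    gib = grid_bytes(1024, size) / GIB
    print(f"1024^3 {label}: {gib:.0f} GiB per array")
```

At 16 GiB per double-precision complex array, a handful of input, output, and scratch arrays would approach the 128GB installed in this system, which would be consistent with the float-precision c2c runs at 1024 completing while the double-precision ones did not. Checking the kernel log on the test machine would be needed to confirm this.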

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better; (CXX) g++ options: -O3)

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128
a: 26.00, b: 27.28, c: 17.37 (SE +/- 2.70, N = 15; SE +/- 2.33, N = 15)

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128
a: 21.00, b: 19.86, c: 23.96 (SE +/- 1.55, N = 12; SE +/- 1.31, N = 15)

CloverLeaf

CloverLeaf 1.3 - Input: clover_bm (Seconds, Fewer Is Better)
a: 17.54, b: 16.13, c: 17.00 (SE +/- 1.44, N = 12; SE +/- 0.25, N = 15)
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
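
Since runs a, b, and c used the same hardware and software, the gap between them is a rough indicator of run-to-run noise. The sketch below computes that gap for two of the noisier results in this file, using the rounded values reported for each. The `spread_percent` helper is a hypothetical name, and measuring the gap relative to the smallest value is one assumption among several reasonable choices.

```python
def spread_percent(values):
    """Percent gap between the largest and smallest value, relative to the smallest."""
    values = list(values)
    lo, hi = min(values), max(values)
    return (hi - lo) / lo * 100.0


# Rounded per-run values copied from the results in this file.
results = {
    "HeFFTe r2c - FFTW - double-long - 128 (GFLOP/s)": {"a": 25.04, "b": 28.27, "c": 18.38},
    "CloverLeaf clover_bm (Seconds)": {"a": 17.54, "b": 16.13, "c": 17.00},
}

for name, runs in results.items():
    print(f"{name}: {spread_percent(runs.values()):.1f}% spread across runs")
```

Because the measure only uses the minimum and maximum, it applies equally to "More Is Better" and "Fewer Is Better" results.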

63 Results Shown
