extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310294-NE-EXTRATEST01&rdt&grs.

extra tests 3ProcessorMotherboardMemoryDiskGraphicsMonitorOSKernelCompilerFile-SystemScreen ResolutionAMD EPYC 9334 32-CorebcdAMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)Supermicro H13SSW (1.1 BIOS)12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbDELL E207WFPAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41680x1050OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3heffte: c2c - Stock - double-long - 256heffte: r2c - FFTW - float-long - 256heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double-long - 256heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - double-long - 256heffte: c2c - Stock - double - 128heffte: c2c - FFTW - float - 256heffte: c2c - Stock - float - 256heffte: c2c - FFTW - float-long - 256heffte: r2c - Stock - double-long - 128heffte: c2c - FFTW - float-long - 128heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - double - 128heffte: c2c - FFTW - double - 256heffte: r2c - Stock - float-long - 128heffte: c2c - Stock - double-long - 128cpuminer-opt: Magiheffte: c2c - Stock - float-long - 256heffte: r2c - FFTW - double-long - 128heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double - 256heffte: c2c - FFTW - double - 512heffte: r2c - Stock - float - 256heffte: c2c - Stock - float-long - 512heffte: c2c - Stock - float - 512heffte: c2c - FFTW - float - 512heffte: r2c - Stock - float-long - 512heffte: c2c - Stock - float-long - 128heffte: r2c - Stock - float-long - 256heffte: r2c - Stock - double - 128heffte: r2c - Stock - float - 512heffte: r2c - FFTW - double-long - 1024heffte: r2c - Stock - double-long - 512heffte: c2c - FFTW - float - 128heffte: r2c - Stock - double - 512heffte: r2c - FFTW - float - 512heffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float-long - 128heffte: r2c - FFTW - double - 256cpuminer-opt: Garlicoincloverleaf: clover_bmheffte: r2c - Stock - float - 128heffte: c2c - FFTW - float-long - 512heffte: c2c - FFTW - double-long - 512cpuminer-opt: scryptheffte: r2c - FFTW - double - 1024cloverleaf: clover_bm16heffte: c2c - Stock - float - 1024heffte: c2c - FFTW - double-long - 1024heffte: c2c - Stock - double - 512heffte: c2c - Stock - double-long - 512heffte: c2c - Stock - float - 128heffte: c2c - Stock - float-long - 1024cpuminer-opt: LBC, LBRY Creditsheffte: c2c - FFTW - float - 1024heffte: c2c - FFTW - float-long - 1024heffte: r2c - FFTW - float - 1024heffte: r2c - Stock - float - 1024cpuminer-opt: Ringcoincpuminer-opt: Myriad-Groestlheffte: c2c - Stock - double - 1024heffte: r2c - Stock - double-long - 1024heffte: r2c - FFTW - float-long - 1024duckdb: TPC-H Parquetheffte: c2c - FFTW - double - 1024cpuminer-opt: Skeincoinduckdb: IMDBcloverleaf: clover_bm64_shortheffte: r2c - Stock - float-long - 1024heffte: r2c - Stock - double - 1024cpuminer-opt: Deepcoincpuminer-opt: Quad SHA-256, Pyriteheffte: c2c - Stock - double-long - 1024cpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Blake-2 Sheffte: c2c - FFTW - double-long - 128heffte: c2c - Stock - double - 256AMD EPYC 9334 32-Corebcd36.2576157.87851.690837.2632150.134154.21968.662643.898173.598573.747171.348687.5342120.07572.152374.7756111.75935.5072154.41844.54791066.1770.0539103.26273.640876.544133.3927160.22247.84348.15351.5089146.15591.3719162.37588.0159150.33594.736378.2354118.84279.6518156.255186.529187.01271.42832049.7212.20158.53851.282933.2567439.793.743261.7293.554252.459831.639431.871191.97994.45872573092.799793.66695.837199.73375575.431944052.392100.53895.2984127.64652.65531079.51030.15100.144100.328139209805052.280614179021007051.330137.094840.2624141.25352.517440.0132137.587154.55966.917847.204773.511468.977975.742890.1002122.83872.337373.0679107.94835.1332161.08445.97791028.1171.6261106.54374.601175.91933.2644165.24649.429348.143951.1926146.27893.4461166.33887.1142148.08594.595679.8568116.89579.7769155.27190.581190.48571.66552086.5612.11156.98651.399433.1688439.3493.8516261.8493.970252.483431.733831.820292.389694.27192594093.545593.338695.258199.45625608.21946052.2198100.15295.4879127.90952.43315532079.55230.21100.01100.308139109791052.289314182021010053.954536.833641.5801153.63157.751536.9363145.746142.14372.213744.242570.59172.842872.334185.058116.32276.170476.9645112.00236.9202161.53546.70271073.1970.7121103.11776.711375.943132.1206159.01647.676249.898349.7359151.27694.4809165.42289.5721148.73392.502480.113117.91878.2561156.209186.801186.50770.36482070.3112.29158.62451.895233.1583438.7992.7438264.7694.518452.539131.930632.11392.184593.63262573093.16693.458595.371999.13315573.851943052.4271100.26695.553752.50715547030.16100.202100.137139009794052.252714179021011056.994338.003538.3486149.88454.359336.1576151.351151.13470.403746.595075.882571.087773.914588.2343117.17172.954774.7140106.49836.5444162.02945.11961025.6573.0896107.52474.384473.572832.3093164.51948.726849.460850.2743147.50392.1790161.46588.9655146.58293.732679.6484119.64278.0711152.917186.863189.95271.85612065.8912.10156.17451.135833.5619444.0593.7082262.7794.437852.039931.823631.942391.573494.12462581392.807792.936395.132399.76695578.971953352.1658100.20995.2054128.05452.45625548379.39130.19100.1439100.318139139800052.261014177721008353.470838.0871OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.55, N = 1236.2640.2641.5838.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.98, N = 3157.88141.25153.63149.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.72, N = 1551.6952.5257.7554.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.49, N = 337.2640.0136.9436.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.31, N = 8150.13137.59145.75151.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.58, N = 15154.22154.56142.14151.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.71, N = 668.6666.9272.2170.401. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.64, N = 343.9047.2044.2446.601. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.86, N = 373.6073.5170.5975.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.66, N = 373.7568.9872.8471.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.73, N = 671.3575.7472.3373.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 387.5390.1085.0688.231. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.90, N = 3120.08122.84116.32117.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.50, N = 1372.1572.3476.1772.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.93, N = 1574.7873.0776.9674.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.27, N = 3111.76107.95112.00106.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd816243240SE +/- 0.21, N = 335.5135.1336.9236.541. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.39, N = 3154.42161.08161.54162.031. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.27, N = 344.5545.9846.7045.121. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: MagiAMD EPYC 9334 32-Corebcd2004006008001000SE +/- 0.51, N = 31066.171028.111073.191025.651. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.43, N = 370.0571.6370.7173.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.13, N = 3103.26106.54103.12107.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.83, N = 373.6474.6076.7174.381. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.01, N = 376.5475.9275.9473.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.10, N = 333.3933.2632.1232.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.70, N = 3160.22165.25159.02164.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.49, N = 347.8449.4347.6848.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.57, N = 448.1548.1449.9049.461. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.60, N = 451.5151.1949.7450.271. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.83, N = 4146.16146.28151.28147.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.35, N = 391.3793.4594.4892.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.78, N = 15162.38166.34165.42161.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.15, N = 388.0287.1189.5788.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.28, N = 3150.34148.09148.73146.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.31, N = 394.7494.6092.5093.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.62, N = 378.2479.8680.1179.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.32, N = 3118.84116.90117.92119.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.37, N = 379.6579.7878.2678.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.20, N = 15156.26155.27156.21152.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.18, N = 3186.53190.58186.80186.861. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.76, N = 3187.01190.49186.51189.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.38, N = 371.4371.6770.3671.861. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: GarlicoinAMD EPYC 9334 32-Corebcd400800120016002000SE +/- 9.13, N = 32049.722086.562070.312065.891. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmAMD EPYC 9334 32-Corebcd3691215SE +/- 0.11, N = 312.2012.1112.2912.101. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.35, N = 3158.54156.99158.62156.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.32, N = 351.2851.4051.9051.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.08, N = 333.2633.1733.1633.561. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: scryptAMD EPYC 9334 32-Corebcd100200300400500SE +/- 5.54, N = 4439.70439.34438.79444.051. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.40, N = 393.7493.8592.7493.711. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16AMD EPYC 9334 32-Corebcd60120180240300SE +/- 1.17, N = 3261.72261.84264.76262.771. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.25, N = 393.5593.9794.5294.441. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.16, N = 352.4652.4852.5452.041. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.16, N = 331.6431.7331.9331.821. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.13, N = 331.8731.8232.1131.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.59, N = 391.9892.3992.1891.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.18, N = 394.4694.2793.6394.121. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: LBC, LBRY CreditsAMD EPYC 9334 32-Corebcd6K12K18K24K30KSE +/- 38.44, N = 3257302594025730258131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.76, N = 392.8093.5593.1792.811. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.13, N = 393.6793.3493.4692.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.15, N = 395.8495.2695.3795.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.08, N = 399.7399.4699.1399.771. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: RingcoinAMD EPYC 9334 32-Corebcd12002400360048006000SE +/- 15.94, N = 35575.435608.205573.855578.971. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Myriad-GroestlAMD EPYC 9334 32-Corebcd4K8K12K16K20KSE +/- 58.12, N = 3194401946019430195331. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.06, N = 352.3952.2252.4352.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.16, N = 3100.54100.15100.27100.211. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.20, N = 395.3095.4995.5595.211. (CXX) g++ options: -O3

DuckDB

Benchmark: TPC-H Parquet

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetAMD EPYC 9334 32-Corebd306090120150SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3127.65127.91128.051. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.08, N = 352.6052.4352.5152.461. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: SkeincoinAMD EPYC 9334 32-Corebcd12K24K36K48K60KSE +/- 82.12, N = 3553105532055470554831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

DuckDB

Benchmark: IMDB

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBAMD EPYC 9334 32-Corebd20406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 379.5179.5579.391. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortAMD EPYC 9334 32-Corebcd714212835SE +/- 0.02, N = 330.1530.2130.1630.191. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.23, N = 3100.14100.01100.20100.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 3100.33100.31100.14100.321. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: DeepcoinAMD EPYC 9334 32-Corebcd3K6K9K12K15KSE +/- 3.33, N = 3139201391013900139131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Quad SHA-256, PyriteAMD EPYC 9334 32-Corebcd20K40K60K80K100KSE +/- 15.28, N = 3980509791097940980001. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.03, N = 352.2852.2952.2552.261. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Triple SHA-256, OnecoinAMD EPYC 9334 32-Corebcd30K60K90K120K150KSE +/- 8.82, N = 31417901418201417901417771. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Blake-2 SAMD EPYC 9334 32-Corebcd40K80K120K160K200KSE +/- 12.02, N = 32100702101002101102100831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.91, N = 1551.3353.9556.9953.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.64, N = 1537.0936.8338.0038.091. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5