extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310294-NE-EXTRATEST01&grr.

extra tests 3ProcessorMotherboardMemoryDiskGraphicsMonitorOSKernelCompilerFile-SystemScreen ResolutionAMD EPYC 9334 32-CorebcdAMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)Supermicro H13SSW (1.1 BIOS)12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbDELL E207WFPAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41680x1050OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3duckdb: TPC-H Parquetcloverleaf: clover_bm16duckdb: IMDBheffte: c2c - FFTW - double-long - 1024heffte: c2c - Stock - double-long - 1024heffte: c2c - Stock - double - 1024heffte: c2c - FFTW - double - 1024cpuminer-opt: scryptcloverleaf: clover_bm64_shortcpuminer-opt: LBC, LBRY Creditscpuminer-opt: Blake-2 Scpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Deepcoincpuminer-opt: Garlicoincpuminer-opt: Myriad-Groestlcpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Skeincoincpuminer-opt: Magicpuminer-opt: Ringcoinheffte: c2c - FFTW - float - 1024heffte: c2c - FFTW - float-long - 1024heffte: c2c - Stock - float-long - 1024heffte: c2c - Stock - float - 1024heffte: r2c - FFTW - double-long - 1024heffte: r2c - FFTW - double - 1024heffte: r2c - FFTW - float-long - 1024heffte: r2c - FFTW - float - 1024heffte: r2c - Stock - double-long - 1024heffte: r2c - Stock - double - 1024heffte: r2c - Stock - float - 1024heffte: r2c - Stock - float-long - 1024heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - double-long - 512cloverleaf: clover_bmheffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - float - 512heffte: c2c - Stock - double - 512heffte: c2c - Stock - double-long - 512heffte: c2c - FFTW - double - 512heffte: c2c - FFTW - double-long - 512heffte: c2c - Stock - float - 512heffte: c2c - FFTW - float - 512heffte: c2c - Stock - double - 256heffte: c2c - Stock - float-long - 512heffte: c2c - FFTW - float-long - 512heffte: c2c - Stock - double-long - 256heffte: r2c - Stock - float-long - 256heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double-long - 128heffte: r2c - Stock - double - 512heffte: r2c - Stock - double-long - 512heffte: r2c - Stock - float-long - 512heffte: r2c - FFTW - float - 256heffte: r2c - Stock - float - 512heffte: r2c - FFTW - double-long - 256heffte: c2c - FFTW - float-long - 256heffte: c2c - FFTW - double - 256heffte: c2c - FFTW - double-long - 256heffte: r2c - FFTW - double - 256heffte: c2c - Stock - float - 256heffte: c2c - Stock - float-long - 256heffte: c2c - FFTW - float - 256heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double - 256heffte: r2c - FFTW - float-long - 256heffte: r2c - Stock - float - 256heffte: c2c - Stock - double-long - 128heffte: c2c - Stock - double - 128heffte: c2c - Stock - float - 128heffte: r2c - Stock - double-long - 128heffte: c2c - Stock - float-long - 128heffte: r2c - Stock - double - 128heffte: r2c - FFTW - double - 128heffte: r2c - FFTW - double-long - 128heffte: r2c - Stock - float - 128heffte: r2c - FFTW - float-long - 128heffte: c2c - FFTW - float-long - 128heffte: c2c - FFTW - float - 128heffte: r2c - Stock - float-long - 128heffte: r2c - FFTW - float - 128ospray-studio: 3 - 4K - 1 - Path Tracer - CPUAMD EPYC 9334 32-Corebcd127.646261.7279.51052.459852.280652.39252.6439.730.1525730210070141790139202049.721944098050553101066.175575.4392.799793.66694.458793.554294.736393.74395.298495.8371100.538100.32899.7337100.14474.775672.152312.20154.219156.25531.639431.871133.392733.256748.15351.508937.094847.84351.282936.2576162.37551.690851.330179.651878.2354146.155150.134150.33568.662671.348635.507237.263271.428373.747170.053973.598573.640876.5441157.878160.22244.547943.898191.97987.534291.371988.0159111.759103.262158.538187.012120.075118.842154.418186.529127.909261.8479.55252.483452.289352.219852.4331439.3430.2125940210100141820139102086.561946097910553201028.115608.293.545593.338694.271993.970294.595693.851695.487995.2581100.152100.30899.4562100.0173.067972.337312.11154.559155.2731.733831.820233.264433.168848.143951.192636.833649.429351.399440.2624166.33852.517453.954579.776979.8568146.278137.587148.08566.917875.742835.133240.013271.665568.977971.626173.511474.601175.919141.253165.24645.977947.204792.389690.100293.446187.1142107.948106.543156.986190.485122.838116.895161.084190.581264.7652.539152.252752.427152.5071438.7930.1625730210110141790139002070.311943097940554701073.195573.8593.16693.458593.632694.518492.502492.743895.553795.3719100.266100.13799.1331100.20276.964576.170412.29142.143156.20931.930632.11332.120633.158349.898349.735938.003547.676251.895241.5801165.42257.751556.994378.256180.113151.276145.746148.73372.213772.334136.920236.936370.364872.842870.712170.59176.711375.9431153.631159.01646.702744.242592.184585.05894.480989.5721112.002103.117158.624186.507116.322117.918161.535186.801128.054262.7779.39152.039952.261052.165852.4562444.0530.1925813210083141777139132065.891953398000554831025.655578.9792.807792.936394.124694.437893.732693.708295.205495.1323100.209100.31899.7669100.143974.714072.954712.10151.134152.91731.823631.942332.309333.561949.460850.274338.087148.726851.135838.3486161.46554.359353.470878.071179.6484147.503151.351146.58270.403773.914536.544436.157671.856171.087773.089675.882574.384473.5728149.884164.51945.119646.595091.573488.234392.179088.9655106.498107.524156.174189.952117.171119.642162.029186.863OpenBenchmarking.org

DuckDB

Benchmark: TPC-H Parquet

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetAMD EPYC 9334 32-Corebd306090120150SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3127.65127.91128.051. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16AMD EPYC 9334 32-Corebcd60120180240300SE +/- 1.17, N = 3261.72261.84264.76262.771. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

DuckDB

Benchmark: IMDB

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBAMD EPYC 9334 32-Corebd20406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 379.5179.5579.391. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.16, N = 352.4652.4852.5452.041. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.03, N = 352.2852.2952.2552.261. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.06, N = 352.3952.2252.4352.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.08, N = 352.6052.4352.5152.461. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: scryptAMD EPYC 9334 32-Corebcd100200300400500SE +/- 5.54, N = 4439.70439.34438.79444.051. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortAMD EPYC 9334 32-Corebcd714212835SE +/- 0.02, N = 330.1530.2130.1630.191. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: LBC, LBRY CreditsAMD EPYC 9334 32-Corebcd6K12K18K24K30KSE +/- 38.44, N = 3257302594025730258131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Blake-2 SAMD EPYC 9334 32-Corebcd40K80K120K160K200KSE +/- 12.02, N = 32100702101002101102100831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Triple SHA-256, OnecoinAMD EPYC 9334 32-Corebcd30K60K90K120K150KSE +/- 8.82, N = 31417901418201417901417771. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: DeepcoinAMD EPYC 9334 32-Corebcd3K6K9K12K15KSE +/- 3.33, N = 3139201391013900139131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: GarlicoinAMD EPYC 9334 32-Corebcd400800120016002000SE +/- 9.13, N = 32049.722086.562070.312065.891. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Myriad-GroestlAMD EPYC 9334 32-Corebcd4K8K12K16K20KSE +/- 58.12, N = 3194401946019430195331. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Quad SHA-256, PyriteAMD EPYC 9334 32-Corebcd20K40K60K80K100KSE +/- 15.28, N = 3980509791097940980001. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: SkeincoinAMD EPYC 9334 32-Corebcd12K24K36K48K60KSE +/- 82.12, N = 3553105532055470554831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: MagiAMD EPYC 9334 32-Corebcd2004006008001000SE +/- 0.51, N = 31066.171028.111073.191025.651. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: RingcoinAMD EPYC 9334 32-Corebcd12002400360048006000SE +/- 15.94, N = 35575.435608.205573.855578.971. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.76, N = 392.8093.5593.1792.811. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.13, N = 393.6793.3493.4692.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.18, N = 394.4694.2793.6394.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.25, N = 393.5593.9794.5294.441. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.31, N = 394.7494.6092.5093.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.40, N = 393.7493.8592.7493.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.20, N = 395.3095.4995.5595.211. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.15, N = 395.8495.2695.3795.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.16, N = 3100.54100.15100.27100.211. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 3100.33100.31100.14100.321. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.08, N = 399.7399.4699.1399.771. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.23, N = 3100.14100.01100.20100.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.93, N = 1574.7873.0776.9674.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.50, N = 1372.1572.3476.1772.951. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmAMD EPYC 9334 32-Corebcd3691215SE +/- 0.11, N = 312.2012.1112.2912.101. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.58, N = 15154.22154.56142.14151.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.20, N = 15156.26155.27156.21152.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.16, N = 331.6431.7331.9331.821. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.13, N = 331.8731.8232.1131.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.10, N = 333.3933.2632.1232.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.08, N = 333.2633.1733.1633.561. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.57, N = 448.1548.1449.9049.461. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.60, N = 451.5151.1949.7450.271. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.64, N = 1537.0936.8338.0038.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.49, N = 347.8449.4347.6848.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.32, N = 351.2851.4051.9051.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.55, N = 1236.2640.2641.5838.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.78, N = 15162.38166.34165.42161.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.72, N = 1551.6952.5257.7554.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.91, N = 1551.3353.9556.9953.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.37, N = 379.6579.7878.2678.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.62, N = 378.2479.8680.1179.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.83, N = 4146.16146.28151.28147.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.31, N = 8150.13137.59145.75151.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.28, N = 3150.34148.09148.73146.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.71, N = 668.6666.9272.2170.401. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.73, N = 671.3575.7472.3373.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd816243240SE +/- 0.21, N = 335.5135.1336.9236.541. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.49, N = 337.2640.0136.9436.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.38, N = 371.4371.6770.3671.861. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.66, N = 373.7568.9872.8471.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.43, N = 370.0571.6370.7173.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.86, N = 373.6073.5170.5975.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.83, N = 373.6474.6076.7174.381. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.01, N = 376.5475.9275.9473.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.98, N = 3157.88141.25153.63149.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.70, N = 3160.22165.25159.02164.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.27, N = 344.5545.9846.7045.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.64, N = 343.9047.2044.2446.601. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.59, N = 391.9892.3992.1891.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 387.5390.1085.0688.231. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.35, N = 391.3793.4594.4892.181. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.15, N = 388.0287.1189.5788.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.27, N = 3111.76107.95112.00106.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.13, N = 3103.26106.54103.12107.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.35, N = 3158.54156.99158.62156.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.76, N = 3187.01190.49186.51189.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.90, N = 3120.08122.84116.32117.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.32, N = 3118.84116.90117.92119.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.39, N = 3154.42161.08161.54162.031. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.18, N = 3186.53190.58186.80186.861. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.4