extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310294-NE-EXTRATEST01&grr&sor.

extra tests 3

Configurations: AMD EPYC 9334 32-Core, b, c, d

Processor: AMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)
Motherboard: Supermicro H13SSW (1.1 BIOS)
Memory: 12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N
Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07
Graphics: astdrmfb
Monitor: DELL E207WFP
OS: AlmaLinux 9.2
Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64)
Compiler: GCC 11.3.1 20221121
File-System: ext4
Screen Resolution: 1680x1050

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl

Processor Details
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xa101111

Security Details
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3: result overview for configurations AMD EPYC 9334 32-Core, b, c, and d. The per-test results are listed individually in the sections that follow.

DuckDB

Benchmark: TPC-H Parquet

DuckDB 0.9.1 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 127.65; b: 127.91; d: 128.05
SE +/- 0.12, N = 3; SE +/- 0.11, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
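
When reading results like these, it can help to express the gap between the best and worst configuration as a percentage. The following is a minimal sketch, not part of the Phoronix Test Suite output; the function name `spread_percent` is a hypothetical helper, and the input values are the TPC-H Parquet times reported above.

```python
def spread_percent(results):
    """Gap between the highest and lowest value, as a percentage of the lowest."""
    lo = min(results.values())
    hi = max(results.values())
    return (hi - lo) / lo * 100.0


# DuckDB 0.9.1, TPC-H Parquet, in seconds (fewer is better), from the result above.
tpch_seconds = {"AMD EPYC 9334 32-Core": 127.65, "b": 127.91, "d": 128.05}

spread = spread_percent(tpch_seconds)
print(f"best-to-worst spread: {spread:.2f}%")
```

For this test the spread is roughly 0.3%, so the three runs are close to one another.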

CloverLeaf

Input: clover_bm16

CloverLeaf 1.3 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 261.72; b: 261.84; d: 262.77; c: 264.76
SE +/- 1.17, N = 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

DuckDB

Benchmark: IMDB

DuckDB 0.9.1 (Seconds, Fewer Is Better)
d: 79.39; AMD EPYC 9334 32-Core: 79.51; b: 79.55
SE +/- 0.14, N = 3; SE +/- 0.07, N = 3; SE +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 52.54; b: 52.48; AMD EPYC 9334 32-Core: 52.46; d: 52.04
SE +/- 0.16, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 52.29; AMD EPYC 9334 32-Core: 52.28; d: 52.26; c: 52.25
SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 52.43; AMD EPYC 9334 32-Core: 52.39; b: 52.22; d: 52.17
SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 52.60; c: 52.51; d: 52.46; b: 52.43
SE +/- 0.08, N = 3
1. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 444.05; AMD EPYC 9334 32-Core: 439.70; b: 439.34; c: 438.79
SE +/- 5.54, N = 4
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm64_short

CloverLeaf 1.3 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 30.15; c: 30.16; d: 30.19; b: 30.21
SE +/- 0.02, N = 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 25940; d: 25813; c: 25730; AMD EPYC 9334 32-Core: 25730
SE +/- 38.44, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

Cpuminer-Opt 23.5 (kH/s, More Is Better)
c: 210110; b: 210100; d: 210083; AMD EPYC 9334 32-Core: 210070
SE +/- 12.02, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 141820; c: 141790; AMD EPYC 9334 32-Core: 141790; d: 141777
SE +/- 8.82, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
AMD EPYC 9334 32-Core: 13920; d: 13913; b: 13910; c: 13900
SE +/- 3.33, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 2086.56; c: 2070.31; d: 2065.89; AMD EPYC 9334 32-Core: 2049.72
SE +/- 9.13, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 19533; b: 19460; AMD EPYC 9334 32-Core: 19440; c: 19430
SE +/- 58.12, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

Cpuminer-Opt 23.5 (kH/s, More Is Better)
AMD EPYC 9334 32-Core: 98050; d: 98000; c: 97940; b: 97910
SE +/- 15.28, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 55483; c: 55470; b: 55320; AMD EPYC 9334 32-Core: 55310
SE +/- 82.12, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Magi

Cpuminer-Opt 23.5 (kH/s, More Is Better)
c: 1073.19; AMD EPYC 9334 32-Core: 1066.17; b: 1028.11; d: 1025.65
SE +/- 0.51, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 5608.20; d: 5578.97; AMD EPYC 9334 32-Core: 5575.43; c: 5573.85
SE +/- 15.94, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 93.55; c: 93.17; d: 92.81; AMD EPYC 9334 32-Core: 92.80
SE +/- 0.76, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 93.67; c: 93.46; b: 93.34; d: 92.94
SE +/- 0.13, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 94.46; b: 94.27; d: 94.12; c: 93.63
SE +/- 0.18, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 94.52; d: 94.44; b: 93.97; AMD EPYC 9334 32-Core: 93.55
SE +/- 0.25, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 94.74; b: 94.60; d: 93.73; c: 92.50
SE +/- 0.31, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 93.85; AMD EPYC 9334 32-Core: 93.74; d: 93.71; c: 92.74
SE +/- 0.40, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 95.55; b: 95.49; AMD EPYC 9334 32-Core: 95.30; d: 95.21
SE +/- 0.20, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 95.84; c: 95.37; b: 95.26; d: 95.13
SE +/- 0.15, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 100.54; c: 100.27; d: 100.21; b: 100.15
SE +/- 0.16, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 100.33; d: 100.32; b: 100.31; c: 100.14
SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 99.77; AMD EPYC 9334 32-Core: 99.73; b: 99.46; c: 99.13
SE +/- 0.08, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 100.20; AMD EPYC 9334 32-Core: 100.14; d: 100.14; b: 100.01
SE +/- 0.23, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 76.96; AMD EPYC 9334 32-Core: 74.78; d: 74.71; b: 73.07
SE +/- 0.93, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 76.17; d: 72.95; b: 72.34; AMD EPYC 9334 32-Core: 72.15
SE +/- 0.50, N = 13
1. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm

CloverLeaf 1.3 (Seconds, Fewer Is Better)
d: 12.10; b: 12.11; AMD EPYC 9334 32-Core: 12.20; c: 12.29
SE +/- 0.11, N = 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 154.56; AMD EPYC 9334 32-Core: 154.22; d: 151.13; c: 142.14
SE +/- 1.58, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 156.26; c: 156.21; b: 155.27; d: 152.92
SE +/- 1.20, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 31.93; d: 31.82; b: 31.73; AMD EPYC 9334 32-Core: 31.64
SE +/- 0.16, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 32.11; d: 31.94; AMD EPYC 9334 32-Core: 31.87; b: 31.82
SE +/- 0.13, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 33.39; b: 33.26; d: 32.31; c: 32.12
SE +/- 0.10, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 33.56; AMD EPYC 9334 32-Core: 33.26; b: 33.17; c: 33.16
SE +/- 0.08, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 49.90; d: 49.46; AMD EPYC 9334 32-Core: 48.15; b: 48.14
SE +/- 0.57, N = 4
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 51.51; b: 51.19; d: 50.27; c: 49.74
SE +/- 0.60, N = 4
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 38.09; c: 38.00; AMD EPYC 9334 32-Core: 37.09; b: 36.83
SE +/- 0.64, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 49.43; d: 48.73; AMD EPYC 9334 32-Core: 47.84; c: 47.68
SE +/- 0.49, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 51.90; b: 51.40; AMD EPYC 9334 32-Core: 51.28; d: 51.14
SE +/- 0.32, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 41.58; b: 40.26; d: 38.35; AMD EPYC 9334 32-Core: 36.26
SE +/- 0.55, N = 12
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 166.34; c: 165.42; AMD EPYC 9334 32-Core: 162.38; d: 161.47
SE +/- 1.78, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 57.75; d: 54.36; b: 52.52; AMD EPYC 9334 32-Core: 51.69
SE +/- 0.72, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 56.99; b: 53.95; d: 53.47; AMD EPYC 9334 32-Core: 51.33
SE +/- 0.91, N = 15
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 79.78; AMD EPYC 9334 32-Core: 79.65; c: 78.26; d: 78.07
SE +/- 0.37, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 80.11; b: 79.86; d: 79.65; AMD EPYC 9334 32-Core: 78.24
SE +/- 0.62, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 151.28; d: 147.50; b: 146.28; AMD EPYC 9334 32-Core: 146.16
SE +/- 1.83, N = 4
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 151.35; AMD EPYC 9334 32-Core: 150.13; c: 145.75; b: 137.59
SE +/- 1.31, N = 8
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 150.34; c: 148.73; b: 148.09; d: 146.58
SE +/- 1.28, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 72.21; d: 70.40; AMD EPYC 9334 32-Core: 68.66; b: 66.92
SE +/- 0.71, N = 6
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 75.74; d: 73.91; c: 72.33; AMD EPYC 9334 32-Core: 71.35
SE +/- 0.73, N = 6
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 36.92; d: 36.54; AMD EPYC 9334 32-Core: 35.51; b: 35.13
SE +/- 0.21, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 40.01; AMD EPYC 9334 32-Core: 37.26; c: 36.94; d: 36.16
SE +/- 0.49, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 71.86; b: 71.67; AMD EPYC 9334 32-Core: 71.43; c: 70.36
SE +/- 0.38, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 73.75; c: 72.84; d: 71.09; b: 68.98
SE +/- 0.66, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 73.09; b: 71.63; c: 70.71; AMD EPYC 9334 32-Core: 70.05
SE +/- 0.43, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 75.88; AMD EPYC 9334 32-Core: 73.60; b: 73.51; c: 70.59
SE +/- 0.86, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 76.71; b: 74.60; d: 74.38; AMD EPYC 9334 32-Core: 73.64
SE +/- 0.83, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 76.54; c: 75.94; b: 75.92; d: 73.57
SE +/- 1.01, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 157.88; c: 153.63; d: 149.88; b: 141.25
SE +/- 0.98, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 165.25; d: 164.52; AMD EPYC 9334 32-Core: 160.22; c: 159.02
SE +/- 1.70, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 46.70; b: 45.98; d: 45.12; AMD EPYC 9334 32-Core: 44.55
SE +/- 0.27, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 47.20; d: 46.60; c: 44.24; AMD EPYC 9334 32-Core: 43.90
SE +/- 0.64, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 92.39; c: 92.18; AMD EPYC 9334 32-Core: 91.98; d: 91.57
SE +/- 0.59, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 90.10; d: 88.23; AMD EPYC 9334 32-Core: 87.53; c: 85.06
SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 94.48; b: 93.45; d: 92.18; AMD EPYC 9334 32-Core: 91.37
SE +/- 0.35, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 89.57; d: 88.97; AMD EPYC 9334 32-Core: 88.02; b: 87.11
SE +/- 1.15, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 112.00; AMD EPYC 9334 32-Core: 111.76; b: 107.95; d: 106.50
SE +/- 0.27, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 107.52; b: 106.54; AMD EPYC 9334 32-Core: 103.26; c: 103.12
SE +/- 1.13, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 158.62; AMD EPYC 9334 32-Core: 158.54; b: 156.99; d: 156.17
SE +/- 1.35, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 190.49; d: 189.95; AMD EPYC 9334 32-Core: 187.01; c: 186.51
SE +/- 1.76, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 122.84; AMD EPYC 9334 32-Core: 120.08; d: 117.17; c: 116.32
SE +/- 0.90, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 119.64; AMD EPYC 9334 32-Core: 118.84; c: 117.92; b: 116.90
SE +/- 0.32, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
d: 162.03; c: 161.54; b: 161.08; AMD EPYC 9334 32-Core: 154.42
SE +/- 1.39, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 190.58; d: 186.86; c: 186.80; AMD EPYC 9334 32-Core: 186.53
SE +/- 1.18, N = 3
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.4