extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310300-NE-EXTRATEST84&grs&sor.

extra tests 3ProcessorMotherboardMemoryDiskGraphicsMonitorOSKernelCompilerFile-SystemScreen ResolutionAMD EPYC 9334 32-CorebcdAMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)Supermicro H13SSW (1.1 BIOS)12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbDELL E207WFPAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41680x1050OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3heffte: c2c - Stock - double-long - 256heffte: r2c - FFTW - float-long - 256heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double-long - 256heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - double-long - 256heffte: c2c - Stock - double - 128heffte: c2c - FFTW - float - 256heffte: c2c - Stock - float - 256heffte: c2c - FFTW - float-long - 256heffte: r2c - Stock - double-long - 128heffte: c2c - FFTW - float-long - 128heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - double - 128heffte: c2c - FFTW - double - 256heffte: r2c - Stock - float-long - 128heffte: c2c - Stock - double-long - 128cpuminer-opt: Magiheffte: c2c - Stock - float-long - 256heffte: r2c - FFTW - double-long - 128heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double - 256heffte: c2c - FFTW - double - 512heffte: r2c - Stock - float - 256heffte: c2c - Stock - float-long - 512heffte: c2c - Stock - float - 512heffte: c2c - FFTW - float - 512heffte: r2c - Stock - float-long - 512heffte: c2c - Stock - float-long - 128openvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUheffte: r2c - Stock - float-long - 256heffte: r2c - Stock - double - 128heffte: r2c - Stock - float - 512heffte: r2c - FFTW - double-long - 1024heffte: r2c - Stock - double-long - 512heffte: c2c - FFTW - float - 128openvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUheffte: r2c - Stock - double - 512heffte: r2c - FFTW - float - 512heffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float-long - 128heffte: r2c - FFTW - double - 256cpuminer-opt: Garlicoincloverleaf: clover_bmheffte: r2c - Stock - float - 128heffte: c2c - FFTW - float-long - 512openvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Face Detection Retail FP16 - CPUheffte: c2c - FFTW - double-long - 512cpuminer-opt: scryptopenvino: Vehicle Detection FP16 - CPUheffte: r2c - FFTW - double - 1024cloverleaf: clover_bm16openvino: Vehicle Detection FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUheffte: c2c - Stock - float - 1024openvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUheffte: c2c - FFTW - double-long - 1024openvino: Person Vehicle Bike Detection FP16 - CPUheffte: c2c - Stock - double - 512heffte: c2c - Stock - double-long - 512heffte: c2c - Stock - float - 128heffte: c2c - Stock - float-long - 1024cpuminer-opt: LBC, LBRY Creditsheffte: c2c - FFTW - float - 1024openvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUheffte: c2c - FFTW - float-long - 1024heffte: r2c - FFTW - float - 1024heffte: r2c - Stock - float - 1024cpuminer-opt: Ringcoincpuminer-opt: Myriad-Groestlheffte: c2c - Stock - double - 1024openvino: Age Gender Recognition Retail 0013 FP16 - CPUheffte: r2c - Stock - double-long - 1024heffte: r2c - FFTW - float-long - 1024duckdb: TPC-H Parquetheffte: c2c - FFTW - double - 1024cpuminer-opt: Skeincoinduckdb: IMDBcloverleaf: clover_bm64_shortheffte: r2c - Stock - float-long - 1024heffte: r2c - Stock - double - 1024openvino: Machine Translation EN To DE FP16 - CPUcpuminer-opt: Deepcoinopenvino: Machine Translation EN To DE FP16 - CPUcpuminer-opt: Quad SHA-256, Pyriteopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Face Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUheffte: c2c - Stock - double-long - 1024openvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUcpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Blake-2 Sopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUheffte: c2c - FFTW - double-long - 128heffte: c2c - Stock - double - 256AMD EPYC 9334 32-Corebcd36.2576157.87851.690837.2632150.134154.21968.662643.898173.598573.747171.348687.5342120.07572.152374.7756111.75935.5072154.41844.54791066.1770.0539103.26273.640876.544133.3927160.22247.84348.15351.5089146.15591.371931.131027.29162.37588.0159150.33594.736378.2354118.842194.2782.2979.6518156.255186.529187.01271.42832049.7212.20158.53851.282982.1194.663.3233.2567439.710.8793.743261.721468.6670376.534811.892254.3393.55427.092124.1552.45987.5231.639431.871191.97994.45872573092.799722.71703.9493.66695.837199.73375575.431944052.39259085.24100.53895.2984127.64652.65531079.51030.15100.144100.328229.011392069.7998050812.6939.358.473773.8519.73640.7437.3724.961961.2352.280616.29808.35427.316644.751417902100700.360.534.8151.330137.094840.2624141.25352.517440.0132137.587154.55966.917847.204773.511468.977975.742890.1002122.83872.337373.0679107.94835.1332161.08445.97791028.1171.6261106.54374.601175.91933.2644165.24649.429348.143951.1926146.27893.446131.811005.48166.33887.1142148.08594.595679.8568116.89519084.1379.7769155.27190.581190.48571.66552086.5612.11156.98651.399483.13192.273.3133.1688439.341193.8516261.841451.8270549.684822.012272.2493.97027.032133.0352.48347.4931.733831.820292.389694.27192594093.545522.76702.5393.338695.258199.45625608.21946052.219859230.03100.15295.4879127.90952.43315532079.55230.21100.01100.308228.751391069.8897910811.5539.48.473773.9319.75641.2637.3724.941962.2752.289316.28808.67427.236645.771418202101000.360.534.8153.954536.833641.5801153.63157.751536.9363145.746142.14372.213744.242570.59172.842872.334185.058116.32276.170476.9645112.00236.9202161.53546.70271073.1970.7121103.11776.711375.943132.1206159.01647.676249.898349.7359151.27694.480930.771039.35165.42289.5721148.73392.502480.113117.918193.5082.6178.2561156.209186.801186.50770.36482070.3112.29158.62451.895282.39194.063.2833.1583438.7910.9092.7438264.761465.3971175.964863.332248.6894.51847.102145.0552.53917.4531.930632.11392.184593.63262573093.16622.58708.0593.458595.371999.13315573.851943052.427159341.84100.26695.553752.50715547030.16100.202100.137229.081390069.7897940811.7139.408.463778.2819.75641.2537.3424.941962.8052.252716.28808.18427.466642.931417902101100.360.534.8156.994338.003538.3486149.88454.359336.1576151.351151.13470.403746.595075.882571.087773.914588.2343117.17172.954774.7140106.49836.5444162.02945.11961025.6573.0896107.52474.384473.572832.3093164.51948.726849.460850.2743147.50392.1790161.46588.9655146.58293.732679.6484119.64278.0711152.917186.863189.95271.85612065.8912.10156.17451.135833.5619444.0593.7082262.7794.437852.039931.823631.942391.573494.12462581392.807792.936395.132399.76695578.971953352.1658100.20995.2054128.05452.45625548379.39130.19100.1439100.318139139800052.261014177721008353.470838.0871OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256cbdAMD EPYC 9334 32-Core918273645SE +/- 0.55, N = 1241.5840.2638.3536.261. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corecdb306090120150SE +/- 0.98, N = 3157.88153.63149.88141.251. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128cdbAMD EPYC 9334 32-Core1326395265SE +/- 0.72, N = 1557.7554.3652.5251.691. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256bAMD EPYC 9334 32-Corecd918273645SE +/- 0.49, N = 340.0137.2636.9436.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256dAMD EPYC 9334 32-Corecb306090120150SE +/- 1.31, N = 8151.35150.13145.75137.591. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512bAMD EPYC 9334 32-Coredc306090120150SE +/- 1.58, N = 15154.56154.22151.13142.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256cdAMD EPYC 9334 32-Coreb1632486480SE +/- 0.71, N = 672.2170.4068.6666.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128bdcAMD EPYC 9334 32-Core1122334455SE +/- 0.64, N = 347.2046.6044.2443.901. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256dAMD EPYC 9334 32-Corebc20406080100SE +/- 0.86, N = 375.8873.6073.5170.591. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corecdb1632486480SE +/- 0.66, N = 373.7572.8471.0968.981. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256bdcAMD EPYC 9334 32-Core20406080100SE +/- 0.73, N = 675.7473.9172.3371.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128bdAMD EPYC 9334 32-Corec20406080100SE +/- 0.17, N = 390.1088.2387.5385.061. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128bAMD EPYC 9334 32-Coredc306090120150SE +/- 0.90, N = 3122.84120.08117.17116.321. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512cdbAMD EPYC 9334 32-Core20406080100SE +/- 0.50, N = 1376.1772.9572.3472.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512cAMD EPYC 9334 32-Coredb20406080100SE +/- 0.93, N = 1576.9674.7874.7173.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128cAMD EPYC 9334 32-Corebd306090120150SE +/- 0.27, N = 3112.00111.76107.95106.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256cdAMD EPYC 9334 32-Coreb816243240SE +/- 0.21, N = 336.9236.5435.5135.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128dcbAMD EPYC 9334 32-Core4080120160200SE +/- 1.39, N = 3162.03161.54161.08154.421. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128cbdAMD EPYC 9334 32-Core1122334455SE +/- 0.27, N = 346.7045.9845.1244.551. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: MagicAMD EPYC 9334 32-Corebd2004006008001000SE +/- 0.51, N = 31073.191066.171028.111025.651. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256dbcAMD EPYC 9334 32-Core1632486480SE +/- 0.43, N = 373.0971.6370.7170.051. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128dbAMD EPYC 9334 32-Corec20406080100SE +/- 1.13, N = 3107.52106.54103.26103.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256cbdAMD EPYC 9334 32-Core20406080100SE +/- 0.83, N = 376.7174.6074.3873.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corecbd20406080100SE +/- 1.01, N = 376.5475.9475.9273.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebdc816243240SE +/- 0.10, N = 333.3933.2632.3132.121. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256bdAMD EPYC 9334 32-Corec4080120160200SE +/- 1.70, N = 3165.25164.52160.22159.021. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512bdAMD EPYC 9334 32-Corec1122334455SE +/- 0.49, N = 349.4348.7347.8447.681. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512cdAMD EPYC 9334 32-Coreb1122334455SE +/- 0.57, N = 449.9049.4648.1548.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebdc1224364860SE +/- 0.60, N = 451.5151.1950.2749.741. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512cdbAMD EPYC 9334 32-Core306090120150SE +/- 1.83, N = 4151.28147.50146.28146.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128cbdAMD EPYC 9334 32-Core20406080100SE +/- 0.35, N = 394.4893.4592.1891.371. (CXX) g++ options: -O3

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUcAMD EPYC 9334 32-Coreb714212835SE +/- 0.09, N = 330.7731.1331.81MIN: 20.73 / MAX: 39.96MIN: 20.64 / MAX: 42.52MIN: 27.4 / MAX: 39.741. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUcAMD EPYC 9334 32-Coreb2004006008001000SE +/- 3.19, N = 31039.351027.291005.481. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256bcAMD EPYC 9334 32-Cored4080120160200SE +/- 1.78, N = 15166.34165.42162.38161.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128cdAMD EPYC 9334 32-Coreb20406080100SE +/- 1.15, N = 389.5788.9788.0287.111. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corecbd306090120150SE +/- 1.28, N = 3150.34148.73148.09146.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebdc20406080100SE +/- 0.31, N = 394.7494.6093.7392.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512cbdAMD EPYC 9334 32-Core20406080100SE +/- 0.62, N = 380.1179.8679.6578.241. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128dAMD EPYC 9334 32-Corecb306090120150SE +/- 0.32, N = 3119.64118.84117.92116.901. (CXX) g++ options: -O3

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAMD EPYC 9334 32-Corecb4080120160200SE +/- 0.80, N = 3194.27193.50190.001. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAMD EPYC 9334 32-Corecb20406080100SE +/- 0.34, N = 382.2982.6184.13MIN: 68.08 / MAX: 101.21MIN: 40.53 / MAX: 94.79MIN: 67.92 / MAX: 100.991. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512bAMD EPYC 9334 32-Corecd20406080100SE +/- 0.37, N = 379.7879.6578.2678.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corecbd306090120150SE +/- 1.20, N = 15156.26156.21155.27152.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128bdcAMD EPYC 9334 32-Core4080120160200SE +/- 1.18, N = 3190.58186.86186.80186.531. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128bdAMD EPYC 9334 32-Corec4080120160200SE +/- 1.76, N = 3190.49189.95187.01186.511. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256dbAMD EPYC 9334 32-Corec1632486480SE +/- 0.38, N = 371.8671.6771.4370.361. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: GarlicoinbcdAMD EPYC 9334 32-Core400800120016002000SE +/- 9.13, N = 32086.562070.312065.892049.721. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmdbAMD EPYC 9334 32-Corec3691215SE +/- 0.11, N = 312.1012.1112.2012.291. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128cAMD EPYC 9334 32-Corebd4080120160200SE +/- 1.35, N = 3158.62158.54156.99156.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512cbAMD EPYC 9334 32-Cored1224364860SE +/- 0.32, N = 351.9051.4051.2851.141. (CXX) g++ options: -O3

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAMD EPYC 9334 32-Corecb20406080100SE +/- 0.10, N = 382.1082.3983.13MIN: 68.73 / MAX: 93.7MIN: 65.37 / MAX: 93.63MIN: 41.84 / MAX: 94.381. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAMD EPYC 9334 32-Corecb4080120160200SE +/- 0.25, N = 3194.66194.06192.271. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUcbAMD EPYC 9334 32-Core0.7471.4942.2412.9883.735SE +/- 0.01, N = 33.283.313.32MIN: 2.03 / MAX: 13.53MIN: 2.14 / MAX: 12.63MIN: 2.1 / MAX: 13.611. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512dAMD EPYC 9334 32-Corebc816243240SE +/- 0.08, N = 333.5633.2633.1733.161. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: scryptdAMD EPYC 9334 32-Corebc100200300400500SE +/- 5.54, N = 4444.05439.70439.34438.791. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAMD EPYC 9334 32-Corecb3691215SE +/- 0.01, N = 310.8710.9011.00MIN: 5.72 / MAX: 20.47MIN: 5.57 / MAX: 21.27MIN: 5.6 / MAX: 20.881. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024bAMD EPYC 9334 32-Coredc20406080100SE +/- 0.40, N = 393.8593.7493.7192.741. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16AMD EPYC 9334 32-Corebdc60120180240300SE +/- 1.17, N = 3261.72261.84262.77264.761. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAMD EPYC 9334 32-Corecb30060090012001500SE +/- 0.81, N = 31468.661465.391451.821. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUcbAMD EPYC 9334 32-Core15K30K45K60K75KSE +/- 728.07, N = 371175.9670549.6870376.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUcbAMD EPYC 9334 32-Core10002000300040005000SE +/- 18.44, N = 34863.334822.014811.891. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUbAMD EPYC 9334 32-Corec5001000150020002500SE +/- 5.81, N = 32272.242254.332248.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024cdbAMD EPYC 9334 32-Core20406080100SE +/- 0.25, N = 394.5294.4493.9793.551. (CXX) g++ options: -O3

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUbAMD EPYC 9334 32-Corec246810SE +/- 0.02, N = 37.037.097.10MIN: 4.2 / MAX: 17.05MIN: 4.12 / MAX: 16.39MIN: 4.4 / MAX: 21.661. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUcbAMD EPYC 9334 32-Core5001000150020002500SE +/- 19.67, N = 72145.052133.032124.151. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024cbAMD EPYC 9334 32-Cored1224364860SE +/- 0.16, N = 352.5452.4852.4652.041. (CXX) g++ options: -O3

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUcbAMD EPYC 9334 32-Core246810SE +/- 0.07, N = 77.457.497.52MIN: 4.93 / MAX: 16.71MIN: 5.49 / MAX: 16.91MIN: 5.54 / MAX: 16.241. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512cdbAMD EPYC 9334 32-Core714212835SE +/- 0.16, N = 331.9331.8231.7331.641. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512cdAMD EPYC 9334 32-Coreb714212835SE +/- 0.13, N = 332.1131.9431.8731.821. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128bcAMD EPYC 9334 32-Cored20406080100SE +/- 0.59, N = 392.3992.1891.9891.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebdc20406080100SE +/- 0.18, N = 394.4694.2794.1293.631. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: LBC, LBRY CreditsbdcAMD EPYC 9334 32-Core6K12K18K24K30KSE +/- 38.44, N = 3259402581325730257301. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024bcdAMD EPYC 9334 32-Core20406080100SE +/- 0.76, N = 393.5593.1792.8192.801. (CXX) g++ options: -O3

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUcAMD EPYC 9334 32-Coreb510152025SE +/- 0.04, N = 322.5822.7122.76MIN: 14.13 / MAX: 31.83MIN: 13.69 / MAX: 31.43MIN: 14.14 / MAX: 29.871. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUcAMD EPYC 9334 32-Coreb150300450600750SE +/- 1.40, N = 3708.05703.94702.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corecbd20406080100SE +/- 0.13, N = 393.6793.4693.3492.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corecbd20406080100SE +/- 0.15, N = 395.8495.3795.2695.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024dAMD EPYC 9334 32-Corebc20406080100SE +/- 0.08, N = 399.7799.7399.4699.131. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: RingcoinbdAMD EPYC 9334 32-Corec12002400360048006000SE +/- 15.94, N = 35608.205578.975575.435573.851. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Myriad-GroestldbAMD EPYC 9334 32-Corec4K8K12K16K20KSE +/- 58.12, N = 3195331946019440194301. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024cAMD EPYC 9334 32-Corebd1224364860SE +/- 0.06, N = 352.4352.3952.2252.171. (CXX) g++ options: -O3

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUcbAMD EPYC 9334 32-Core13K26K39K52K65KSE +/- 18.50, N = 359341.8459230.0359085.241. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corecdb20406080100SE +/- 0.16, N = 3100.54100.27100.21100.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024cbAMD EPYC 9334 32-Cored20406080100SE +/- 0.20, N = 395.5595.4995.3095.211. (CXX) g++ options: -O3

DuckDB

Benchmark: TPC-H Parquet

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetAMD EPYC 9334 32-Corebd306090120150SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3127.65127.91128.051. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corecdb1224364860SE +/- 0.08, N = 352.6052.5152.4652.431. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: SkeincoindcbAMD EPYC 9334 32-Core12K24K36K48K60KSE +/- 82.12, N = 3554835547055320553101. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

DuckDB

Benchmark: IMDB

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBdAMD EPYC 9334 32-Coreb20406080100SE +/- 0.14, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 379.3979.5179.551. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortAMD EPYC 9334 32-Corecdb714212835SE +/- 0.02, N = 330.1530.1630.1930.211. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024cAMD EPYC 9334 32-Coredb20406080100SE +/- 0.23, N = 3100.20100.14100.14100.011. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Coredbc20406080100SE +/- 0.17, N = 3100.33100.32100.31100.141. (CXX) g++ options: -O3

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUcAMD EPYC 9334 32-Coreb50100150200250SE +/- 0.31, N = 3229.08229.01228.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: DeepcoinAMD EPYC 9334 32-Coredbc3K6K9K12K15KSE +/- 3.33, N = 3139201391313910139001. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUcAMD EPYC 9334 32-Coreb1632486480SE +/- 0.09, N = 369.7869.7969.88MIN: 55.65 / MAX: 74.84MIN: 58.06 / MAX: 75.49MIN: 40.08 / MAX: 76.271. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Quad SHA-256, PyriteAMD EPYC 9334 32-Coredcb20K40K60K80K100KSE +/- 15.28, N = 3980509800097940979101. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corecb2004006008001000SE +/- 2.73, N = 3812.69811.71811.551. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corecb918273645SE +/- 0.13, N = 339.3539.4039.40MIN: 33.85 / MAX: 46.73MIN: 24.93 / MAX: 45.2MIN: 25.61 / MAX: 46.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUcAMD EPYC 9334 32-Coreb246810SE +/- 0.00, N = 38.468.478.47MIN: 4.5 / MAX: 17.91MIN: 4.66 / MAX: 18.23MIN: 4.56 / MAX: 17.511. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUcbAMD EPYC 9334 32-Core8001600240032004000SE +/- 1.21, N = 33778.283773.933773.851. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUbcAMD EPYC 9334 32-Core510152025SE +/- 0.01, N = 319.7519.7519.731. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUbcAMD EPYC 9334 32-Core140280420560700SE +/- 0.98, N = 3641.26641.25640.741. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUbAMD EPYC 9334 32-Corec918273645SE +/- 0.01, N = 337.3737.3737.341. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUcbAMD EPYC 9334 32-Core612182430SE +/- 0.04, N = 324.9424.9424.96MIN: 16.06 / MAX: 41.08MIN: 17.3 / MAX: 33.61MIN: 16 / MAX: 33.241. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUcbAMD EPYC 9334 32-Core400800120016002000SE +/- 0.27, N = 31962.801962.271961.231. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024bAMD EPYC 9334 32-Coredc1224364860SE +/- 0.03, N = 352.2952.2852.2652.251. (CXX) g++ options: -O3

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUcbAMD EPYC 9334 32-Core48121620SE +/- 0.00, N = 316.2816.2816.29MIN: 8.42 / MAX: 26.01MIN: 8.52 / MAX: 25.67MIN: 8.63 / MAX: 25.521. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUcAMD EPYC 9334 32-Coreb2004006008001000SE +/- 0.19, N = 3808.18808.35808.67MIN: 774.6 / MAX: 821.97MIN: 751.2 / MAX: 822.48MIN: 785.84 / MAX: 820.81. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUbAMD EPYC 9334 32-Corec90180270360450SE +/- 0.13, N = 3427.23427.31427.46MIN: 406.18 / MAX: 436.27MIN: 406.06 / MAX: 433.09MIN: 403.85 / MAX: 437.321. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUbAMD EPYC 9334 32-Corec14002800420056007000SE +/- 7.91, N = 36645.776644.756642.931. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Triple SHA-256, OnecoinbcAMD EPYC 9334 32-Cored30K60K90K120K150KSE +/- 8.82, N = 31418201417901417901417771. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Blake-2 ScbdAMD EPYC 9334 32-Core40K80K120K160K200KSE +/- 12.02, N = 32101102101002100832100701. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUcAMD EPYC 9334 32-Coreb0.0810.1620.2430.3240.405SE +/- 0.00, N = 30.360.360.36MIN: 0.22 / MAX: 8.19MIN: 0.22 / MAX: 9.63MIN: 0.22 / MAX: 8.931. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUcAMD EPYC 9334 32-Coreb0.11930.23860.35790.47720.5965SE +/- 0.00, N = 30.530.530.53MIN: 0.28 / MAX: 9.96MIN: 0.32 / MAX: 9.28MIN: 0.3 / MAX: 10.551. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUcAMD EPYC 9334 32-Coreb1.08232.16463.24694.32925.4115SE +/- 0.01, N = 34.814.814.81MIN: 3.23 / MAX: 32.04MIN: 3.24 / MAX: 14.25MIN: 3.08 / MAX: 14.221. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128cbdAMD EPYC 9334 32-Core1326395265SE +/- 0.91, N = 1556.9953.9553.4751.331. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256dcAMD EPYC 9334 32-Coreb918273645SE +/- 0.64, N = 1538.0938.0037.0936.831. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5