extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310300-NE-EXTRATEST84&sro&grs.

extra tests 3ProcessorMotherboardMemoryDiskGraphicsMonitorOSKernelCompilerFile-SystemScreen ResolutionAMD EPYC 9334 32-CorebcdAMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)Supermicro H13SSW (1.1 BIOS)12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbDELL E207WFPAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41680x1050OpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3heffte: c2c - Stock - double-long - 256heffte: r2c - FFTW - float-long - 256heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double-long - 256heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float-long - 512heffte: r2c - FFTW - double-long - 256heffte: c2c - Stock - double - 128heffte: c2c - FFTW - float - 256heffte: c2c - Stock - float - 256heffte: c2c - FFTW - float-long - 256heffte: r2c - Stock - double-long - 128heffte: c2c - FFTW - float-long - 128heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - double - 128heffte: c2c - FFTW - double - 256heffte: r2c - Stock - float-long - 128heffte: c2c - Stock - double-long - 128cpuminer-opt: Magiheffte: c2c - Stock - float-long - 256heffte: r2c - FFTW - double-long - 128heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double - 256heffte: c2c - FFTW - double - 512heffte: r2c - Stock - float - 256heffte: c2c - Stock - float-long - 512heffte: c2c - Stock - float - 512heffte: c2c - FFTW - float - 512heffte: r2c - Stock - float-long - 512heffte: c2c - Stock - float-long - 128openvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUheffte: r2c - Stock - float-long - 256heffte: r2c - Stock - double - 128heffte: r2c - Stock - float - 512heffte: r2c - FFTW - double-long - 1024heffte: r2c - Stock - double-long - 512heffte: c2c - FFTW - float - 128openvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUheffte: r2c - Stock - double - 512heffte: r2c - FFTW - float - 512heffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float-long - 128heffte: r2c - FFTW - double - 256cpuminer-opt: Garlicoincloverleaf: clover_bmheffte: r2c - Stock - float - 128heffte: c2c - FFTW - float-long - 512openvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Face Detection Retail FP16 - CPUheffte: c2c - FFTW - double-long - 512cpuminer-opt: scryptopenvino: Vehicle Detection FP16 - CPUheffte: r2c - FFTW - double - 1024cloverleaf: clover_bm16openvino: Vehicle Detection FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUheffte: c2c - Stock - float - 1024openvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUheffte: c2c - FFTW - double-long - 1024openvino: Person Vehicle Bike Detection FP16 - CPUheffte: c2c - Stock - double - 512heffte: c2c - Stock - double-long - 512heffte: c2c - Stock - float - 128heffte: c2c - Stock - float-long - 1024cpuminer-opt: LBC, LBRY Creditsheffte: c2c - FFTW - float - 1024openvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUheffte: c2c - FFTW - float-long - 1024heffte: r2c - FFTW - float - 1024heffte: r2c - Stock - float - 1024cpuminer-opt: Ringcoincpuminer-opt: Myriad-Groestlheffte: c2c - Stock - double - 1024openvino: Age Gender Recognition Retail 0013 FP16 - CPUheffte: r2c - Stock - double-long - 1024heffte: r2c - FFTW - float-long - 1024duckdb: TPC-H Parquetheffte: c2c - FFTW - double - 1024cpuminer-opt: Skeincoinduckdb: IMDBcloverleaf: clover_bm64_shortheffte: r2c - Stock - float-long - 1024heffte: r2c - Stock - double - 1024openvino: Machine Translation EN To DE FP16 - CPUcpuminer-opt: Deepcoinopenvino: Machine Translation EN To DE FP16 - CPUcpuminer-opt: Quad SHA-256, Pyriteopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Face Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUheffte: c2c - Stock - double-long - 1024openvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUcpuminer-opt: Triple SHA-256, Onecoincpuminer-opt: Blake-2 Sopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUheffte: c2c - FFTW - double-long - 128heffte: c2c - Stock - double - 256AMD EPYC 9334 32-Corebcd36.2576157.87851.690837.2632150.134154.21968.662643.898173.598573.747171.348687.5342120.07572.152374.7756111.75935.5072154.41844.54791066.1770.0539103.26273.640876.544133.3927160.22247.84348.15351.5089146.15591.371931.131027.29162.37588.0159150.33594.736378.2354118.842194.2782.2979.6518156.255186.529187.01271.42832049.7212.20158.53851.282982.1194.663.3233.2567439.710.8793.743261.721468.6670376.534811.892254.3393.55427.092124.1552.45987.5231.639431.871191.97994.45872573092.799722.71703.9493.66695.837199.73375575.431944052.39259085.24100.53895.2984127.64652.65531079.51030.15100.144100.328229.011392069.7998050812.6939.358.473773.8519.73640.7437.3724.961961.2352.280616.29808.35427.316644.751417902100700.360.534.8151.330137.094840.2624141.25352.517440.0132137.587154.55966.917847.204773.511468.977975.742890.1002122.83872.337373.0679107.94835.1332161.08445.97791028.1171.6261106.54374.601175.91933.2644165.24649.429348.143951.1926146.27893.446131.811005.48166.33887.1142148.08594.595679.8568116.89519084.1379.7769155.27190.581190.48571.66552086.5612.11156.98651.399483.13192.273.3133.1688439.341193.8516261.841451.8270549.684822.012272.2493.97027.032133.0352.48347.4931.733831.820292.389694.27192594093.545522.76702.5393.338695.258199.45625608.21946052.219859230.03100.15295.4879127.90952.43315532079.55230.21100.01100.308228.751391069.8897910811.5539.48.473773.9319.75641.2637.3724.941962.2752.289316.28808.67427.236645.771418202101000.360.534.8153.954536.833641.5801153.63157.751536.9363145.746142.14372.213744.242570.59172.842872.334185.058116.32276.170476.9645112.00236.9202161.53546.70271073.1970.7121103.11776.711375.943132.1206159.01647.676249.898349.7359151.27694.480930.771039.35165.42289.5721148.73392.502480.113117.918193.5082.6178.2561156.209186.801186.50770.36482070.3112.29158.62451.895282.39194.063.2833.1583438.7910.9092.7438264.761465.3971175.964863.332248.6894.51847.102145.0552.53917.4531.930632.11392.184593.63262573093.16622.58708.0593.458595.371999.13315573.851943052.427159341.84100.26695.553752.50715547030.16100.202100.137229.081390069.7897940811.7139.408.463778.2819.75641.2537.3424.941962.8052.252716.28808.18427.466642.931417902101100.360.534.8156.994338.003538.3486149.88454.359336.1576151.351151.13470.403746.595075.882571.087773.914588.2343117.17172.954774.7140106.49836.5444162.02945.11961025.6573.0896107.52474.384473.572832.3093164.51948.726849.460850.2743147.50392.1790161.46588.9655146.58293.732679.6484119.64278.0711152.917186.863189.95271.85612065.8912.10156.17451.135833.5619444.0593.7082262.7794.437852.039931.823631.942391.573494.12462581392.807792.936395.132399.76695578.971953352.1658100.20995.2054128.05452.45625548379.39130.19100.1439100.318139139800052.261014177721008353.470838.0871OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.55, N = 1236.2640.2641.5838.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.98, N = 3157.88141.25153.63149.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.72, N = 1551.6952.5257.7554.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.49, N = 337.2640.0136.9436.161. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.31, N = 8150.13137.59145.75151.351. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.58, N = 15154.22154.56142.14151.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.71, N = 668.6666.9272.2170.401. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.64, N = 343.9047.2044.2446.601. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.86, N = 373.6073.5170.5975.881. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.66, N = 373.7568.9872.8471.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.73, N = 671.3575.7472.3373.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 387.5390.1085.0688.231. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.90, N = 3120.08122.84116.32117.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.50, N = 1372.1572.3476.1772.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.93, N = 1574.7873.0776.9674.711. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.27, N = 3111.76107.95112.00106.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd816243240SE +/- 0.21, N = 335.5135.1336.9236.541. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.39, N = 3154.42161.08161.54162.031. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.27, N = 344.5545.9846.7045.121. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: MagiAMD EPYC 9334 32-Corebcd2004006008001000SE +/- 0.51, N = 31066.171028.111073.191025.651. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.43, N = 370.0571.6370.7173.091. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.13, N = 3103.26106.54103.12107.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.83, N = 373.6474.6076.7174.381. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.01, N = 376.5475.9275.9473.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.10, N = 333.3933.2632.1232.311. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.70, N = 3160.22165.25159.02164.521. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.49, N = 347.8449.4347.6848.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1122334455SE +/- 0.57, N = 448.1548.1449.9049.461. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.60, N = 451.5151.1949.7450.271. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.83, N = 4146.16146.28151.28147.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.35, N = 391.3793.4594.4892.181. (CXX) g++ options: -O3

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAMD EPYC 9334 32-Corebc714212835SE +/- 0.09, N = 331.1331.8130.77MIN: 20.64 / MAX: 42.52MIN: 27.4 / MAX: 39.74MIN: 20.73 / MAX: 39.961. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16 - Device: CPUAMD EPYC 9334 32-Corebc2004006008001000SE +/- 3.19, N = 31027.291005.481039.351. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.78, N = 15162.38166.34165.42161.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 1.15, N = 388.0287.1189.5788.971. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.28, N = 3150.34148.09148.73146.581. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.31, N = 394.7494.6092.5093.731. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.62, N = 378.2479.8680.1179.651. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd306090120150SE +/- 0.32, N = 3118.84116.90117.92119.641. (CXX) g++ options: -O3

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc4080120160200SE +/- 0.80, N = 3194.27190.00193.501. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc20406080100SE +/- 0.34, N = 382.2984.1382.61MIN: 68.08 / MAX: 101.21MIN: 67.92 / MAX: 100.99MIN: 40.53 / MAX: 94.791. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.37, N = 379.6579.7878.2678.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512AMD EPYC 9334 32-Corebcd306090120150SE +/- 1.20, N = 15156.26155.27156.21152.921. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.18, N = 3186.53190.58186.80186.861. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.76, N = 3187.01190.49186.51189.951. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd1632486480SE +/- 0.38, N = 371.4371.6770.3671.861. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: GarlicoinAMD EPYC 9334 32-Corebcd400800120016002000SE +/- 9.13, N = 32049.722086.562070.312065.891. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bmAMD EPYC 9334 32-Corebcd3691215SE +/- 0.11, N = 312.2012.1112.2912.101. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd4080120160200SE +/- 1.35, N = 3158.54156.99158.62156.171. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.32, N = 351.2851.4051.9051.141. (CXX) g++ options: -O3

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAMD EPYC 9334 32-Corebc20406080100SE +/- 0.10, N = 382.1083.1382.39MIN: 68.73 / MAX: 93.7MIN: 41.84 / MAX: 94.38MIN: 65.37 / MAX: 93.631. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP32 - Device: CPUAMD EPYC 9334 32-Corebc4080120160200SE +/- 0.25, N = 3194.66192.27194.061. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAMD EPYC 9334 32-Corebc0.7471.4942.2412.9883.735SE +/- 0.01, N = 33.323.313.28MIN: 2.1 / MAX: 13.61MIN: 2.14 / MAX: 12.63MIN: 2.03 / MAX: 13.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd816243240SE +/- 0.08, N = 333.2633.1733.1633.561. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: scryptAMD EPYC 9334 32-Corebcd100200300400500SE +/- 5.54, N = 4439.70439.34438.79444.051. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc3691215SE +/- 0.01, N = 310.8711.0010.90MIN: 5.72 / MAX: 20.47MIN: 5.6 / MAX: 20.88MIN: 5.57 / MAX: 21.271. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.40, N = 393.7493.8592.7493.711. (CXX) g++ options: -O3

CloverLeaf

Input: clover_bm16

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm16AMD EPYC 9334 32-Corebcd60120180240300SE +/- 1.17, N = 3261.72261.84264.76262.771. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc30060090012001500SE +/- 0.81, N = 31468.661451.821465.391. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc15K30K45K60K75KSE +/- 728.07, N = 370376.5370549.6871175.961. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16 - Device: CPUAMD EPYC 9334 32-Corebc10002000300040005000SE +/- 18.44, N = 34811.894822.014863.331. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc5001000150020002500SE +/- 5.81, N = 32254.332272.242248.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.25, N = 393.5593.9794.5294.441. (CXX) g++ options: -O3

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc246810SE +/- 0.02, N = 37.097.037.10MIN: 4.12 / MAX: 16.39MIN: 4.2 / MAX: 17.05MIN: 4.4 / MAX: 21.661. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc5001000150020002500SE +/- 19.67, N = 72124.152133.032145.051. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.16, N = 352.4652.4852.5452.041. (CXX) g++ options: -O3

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc246810SE +/- 0.07, N = 77.527.497.45MIN: 5.54 / MAX: 16.24MIN: 5.49 / MAX: 16.91MIN: 4.93 / MAX: 16.711. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.16, N = 331.6431.7331.9331.821. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512AMD EPYC 9334 32-Corebcd714212835SE +/- 0.13, N = 331.8731.8232.1131.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.59, N = 391.9892.3992.1891.571. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.18, N = 394.4694.2793.6394.121. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: LBC, LBRY CreditsAMD EPYC 9334 32-Corebcd6K12K18K24K30KSE +/- 38.44, N = 3257302594025730258131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.76, N = 392.8093.5593.1792.811. (CXX) g++ options: -O3

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc510152025SE +/- 0.04, N = 322.7122.7622.58MIN: 13.69 / MAX: 31.43MIN: 14.14 / MAX: 29.87MIN: 14.13 / MAX: 31.831. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc150300450600750SE +/- 1.40, N = 3703.94702.53708.051. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.13, N = 393.6793.3493.4692.941. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.15, N = 395.8495.2695.3795.131. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.08, N = 399.7399.4699.1399.771. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: RingcoinAMD EPYC 9334 32-Corebcd12002400360048006000SE +/- 15.94, N = 35575.435608.205573.855578.971. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Myriad-GroestlAMD EPYC 9334 32-Corebcd4K8K12K16K20KSE +/- 58.12, N = 3194401946019430195331. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.06, N = 352.3952.2252.4352.171. (CXX) g++ options: -O3

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAMD EPYC 9334 32-Corebc13K26K39K52K65KSE +/- 18.50, N = 359085.2459230.0359341.841. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.16, N = 3100.54100.15100.27100.211. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.20, N = 395.3095.4995.5595.211. (CXX) g++ options: -O3

DuckDB

Benchmark: TPC-H Parquet

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetAMD EPYC 9334 32-Corebd306090120150SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3127.65127.91128.051. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.08, N = 352.6052.4352.5152.461. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: SkeincoinAMD EPYC 9334 32-Corebcd12K24K36K48K60KSE +/- 82.12, N = 3553105532055470554831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

DuckDB

Benchmark: IMDB

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBAMD EPYC 9334 32-Corebd20406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 379.5179.5579.391. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

CloverLeaf

Input: clover_bm64_short

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeaf 1.3Input: clover_bm64_shortAMD EPYC 9334 32-Corebcd714212835SE +/- 0.02, N = 330.1530.2130.1630.191. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.23, N = 3100.14100.01100.20100.141. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024AMD EPYC 9334 32-Corebcd20406080100SE +/- 0.17, N = 3100.33100.31100.14100.321. (CXX) g++ options: -O3

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAMD EPYC 9334 32-Corebc50100150200250SE +/- 0.31, N = 3229.01228.75229.081. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: DeepcoinAMD EPYC 9334 32-Corebcd3K6K9K12K15KSE +/- 3.33, N = 3139201391013900139131. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAMD EPYC 9334 32-Corebc1632486480SE +/- 0.09, N = 369.7969.8869.78MIN: 58.06 / MAX: 75.49MIN: 40.08 / MAX: 76.27MIN: 55.65 / MAX: 74.841. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Quad SHA-256, PyriteAMD EPYC 9334 32-Corebcd20K40K60K80K100KSE +/- 15.28, N = 3980509791097940980001. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc2004006008001000SE +/- 2.73, N = 3812.69811.55811.711. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc918273645SE +/- 0.13, N = 339.3539.4039.40MIN: 33.85 / MAX: 46.73MIN: 25.61 / MAX: 46.53MIN: 24.93 / MAX: 45.21. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc246810SE +/- 0.00, N = 38.478.478.46MIN: 4.66 / MAX: 18.23MIN: 4.56 / MAX: 17.51MIN: 4.5 / MAX: 17.911. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc8001600240032004000SE +/- 1.21, N = 33773.853773.933778.281. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc510152025SE +/- 0.01, N = 319.7319.7519.751. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAMD EPYC 9334 32-Corebc140280420560700SE +/- 0.98, N = 3640.74641.26641.251. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc918273645SE +/- 0.01, N = 337.3737.3737.341. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16 - Device: CPUAMD EPYC 9334 32-Corebc612182430SE +/- 0.04, N = 324.9624.9424.94MIN: 16 / MAX: 33.24MIN: 17.3 / MAX: 33.61MIN: 16.06 / MAX: 41.081. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc400800120016002000SE +/- 0.27, N = 31961.231962.271962.801. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024AMD EPYC 9334 32-Corebcd1224364860SE +/- 0.03, N = 352.2852.2952.2552.261. (CXX) g++ options: -O3

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc48121620SE +/- 0.00, N = 316.2916.2816.28MIN: 8.63 / MAX: 25.52MIN: 8.52 / MAX: 25.67MIN: 8.42 / MAX: 26.011. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16 - Device: CPUAMD EPYC 9334 32-Corebc2004006008001000SE +/- 0.19, N = 3808.35808.67808.18MIN: 751.2 / MAX: 822.48MIN: 785.84 / MAX: 820.8MIN: 774.6 / MAX: 821.971. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc90180270360450SE +/- 0.13, N = 3427.31427.23427.46MIN: 406.06 / MAX: 433.09MIN: 406.18 / MAX: 436.27MIN: 403.85 / MAX: 437.321. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc14002800420056007000SE +/- 7.91, N = 36644.756645.776642.931. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Triple SHA-256, OnecoinAMD EPYC 9334 32-Corebcd30K60K90K120K150KSE +/- 8.82, N = 31417901418201417901417771. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 23.5Algorithm: Blake-2 SAMD EPYC 9334 32-Corebcd40K80K120K160K200KSE +/- 12.02, N = 32100702101002101102100831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc0.0810.1620.2430.3240.405SE +/- 0.00, N = 30.360.360.36MIN: 0.22 / MAX: 9.63MIN: 0.22 / MAX: 8.93MIN: 0.22 / MAX: 8.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAMD EPYC 9334 32-Corebc0.11930.23860.35790.47720.5965SE +/- 0.00, N = 30.530.530.53MIN: 0.32 / MAX: 9.28MIN: 0.3 / MAX: 10.55MIN: 0.28 / MAX: 9.961. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUAMD EPYC 9334 32-Corebc1.08232.16463.24694.32925.4115SE +/- 0.01, N = 34.814.814.81MIN: 3.24 / MAX: 14.25MIN: 3.08 / MAX: 14.22MIN: 3.23 / MAX: 32.041. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128AMD EPYC 9334 32-Corebcd1326395265SE +/- 0.91, N = 1551.3353.9556.9953.471. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256AMD EPYC 9334 32-Corebcd918273645SE +/- 0.64, N = 1537.0936.8338.0038.091. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5