extra tests 3

AMD EPYC 9334 32-Core testing with a Supermicro H13SSW (1.1 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2310300-NE-EXTRATEST84&sor&grr.

Runs: AMD EPYC 9334 32-Core, b, c, d

Processor: AMD EPYC 9334 32-Core @ 2.70GHz (32 Cores / 64 Threads)
Motherboard: Supermicro H13SSW (1.1 BIOS)
Memory: 12 x 64 GB DDR5-4800MT/s HMCG94MEBRA123N
Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07
Graphics: astdrmfb
Monitor: DELL E207WFP
OS: AlmaLinux 9.2
Kernel: 5.14.0-284.25.1.el9_2.x86_64 (x86_64)
Compiler: GCC 11.3.1 20221121
File-System: ext4
Screen Resolution: 1680x1050

Kernel Details: Transparent Huge Pages: always

Compiler Details: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl

Processor Details: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101111

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra tests 3: result overview table for runs AMD EPYC 9334 32-Core, b, c, d, covering the duckdb, cloverleaf, openvino, heffte, cpuminer-opt and ospray-studio tests. Individual results are listed per test below.

DuckDB

Benchmark: TPC-H Parquet

DuckDB 0.9.1 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 127.65
b: 127.91
d: 128.05
SE +/- 0.12, N = 3; SE +/- 0.11, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

CloverLeaf

Input: clover_bm16

CloverLeaf 1.3 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 261.72
b: 261.84
d: 262.77
c: 264.76
SE +/- 1.17, N = 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

DuckDB

Benchmark: IMDB

DuckDB 0.9.1 (Seconds, Fewer Is Better)
d: 79.39
AMD EPYC 9334 32-Core: 79.51
b: 79.55
SE +/- 0.14, N = 3; SE +/- 0.07, N = 3; SE +/- 0.11, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 7.45 (MIN: 4.93 / MAX: 16.71)
b: 7.49 (MIN: 5.49 / MAX: 16.91)
AMD EPYC 9334 32-Core: 7.52 (MIN: 5.54 / MAX: 16.24)
SE +/- 0.07, N = 7
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 2145.05
b: 2133.03
AMD EPYC 9334 32-Core: 2124.15
SE +/- 19.67, N = 7
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 808.18 (MIN: 774.6 / MAX: 821.97)
AMD EPYC 9334 32-Core: 808.35 (MIN: 751.2 / MAX: 822.48)
b: 808.67 (MIN: 785.84 / MAX: 820.8)
SE +/- 0.19, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
b: 19.75
c: 19.75
AMD EPYC 9334 32-Core: 19.73
SE +/- 0.01, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
b: 427.23 (MIN: 406.18 / MAX: 436.27)
AMD EPYC 9334 32-Core: 427.31 (MIN: 406.06 / MAX: 433.09)
c: 427.46 (MIN: 403.85 / MAX: 437.32)
SE +/- 0.13, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
b: 37.37
AMD EPYC 9334 32-Core: 37.37
c: 37.34
SE +/- 0.01, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
AMD EPYC 9334 32-Core: 82.29 (MIN: 68.08 / MAX: 101.21)
c: 82.61 (MIN: 40.53 / MAX: 94.79)
b: 84.13 (MIN: 67.92 / MAX: 100.99)
SE +/- 0.34, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
AMD EPYC 9334 32-Core: 194.27
c: 193.50
b: 190.00
SE +/- 0.80, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
AMD EPYC 9334 32-Core: 82.10 (MIN: 68.73 / MAX: 93.7)
c: 82.39 (MIN: 65.37 / MAX: 93.63)
b: 83.13 (MIN: 41.84 / MAX: 94.38)
SE +/- 0.10, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
AMD EPYC 9334 32-Core: 194.66
c: 194.06
b: 192.27
SE +/- 0.25, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 69.78 (MIN: 55.65 / MAX: 74.84)
AMD EPYC 9334 32-Core: 69.79 (MIN: 58.06 / MAX: 75.49)
b: 69.88 (MIN: 40.08 / MAX: 76.27)
SE +/- 0.09, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 229.08
AMD EPYC 9334 32-Core: 229.01
b: 228.75
SE +/- 0.31, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 22.58 (MIN: 14.13 / MAX: 31.83)
AMD EPYC 9334 32-Core: 22.71 (MIN: 13.69 / MAX: 31.43)
b: 22.76 (MIN: 14.14 / MAX: 29.87)
SE +/- 0.04, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 708.05
AMD EPYC 9334 32-Core: 703.94
b: 702.53
SE +/- 1.40, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 24.94 (MIN: 16.06 / MAX: 41.08)
b: 24.94 (MIN: 17.3 / MAX: 33.61)
AMD EPYC 9334 32-Core: 24.96 (MIN: 16 / MAX: 33.24)
SE +/- 0.04, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
b: 641.26
c: 641.25
AMD EPYC 9334 32-Core: 640.74
SE +/- 0.98, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 4.81 (MIN: 3.23 / MAX: 32.04)
AMD EPYC 9334 32-Core: 4.81 (MIN: 3.24 / MAX: 14.25)
b: 4.81 (MIN: 3.08 / MAX: 14.22)
SE +/- 0.01, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
b: 6645.77
AMD EPYC 9334 32-Core: 6644.75
c: 6642.93
SE +/- 7.91, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 0.36 (MIN: 0.22 / MAX: 8.19)
AMD EPYC 9334 32-Core: 0.36 (MIN: 0.22 / MAX: 9.63)
b: 0.36 (MIN: 0.22 / MAX: 8.93)
SE +/- 0.00, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 71175.96
b: 70549.68
AMD EPYC 9334 32-Core: 70376.53
SE +/- 728.07, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
AMD EPYC 9334 32-Core: 39.35 (MIN: 33.85 / MAX: 46.73)
c: 39.40 (MIN: 24.93 / MAX: 45.2)
b: 39.40 (MIN: 25.61 / MAX: 46.53)
SE +/- 0.13, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
AMD EPYC 9334 32-Core: 812.69
c: 811.71
b: 811.55
SE +/- 2.73, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
b: 7.03 (MIN: 4.2 / MAX: 17.05)
AMD EPYC 9334 32-Core: 7.09 (MIN: 4.12 / MAX: 16.39)
c: 7.10 (MIN: 4.4 / MAX: 21.66)
SE +/- 0.02, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
b: 2272.24
AMD EPYC 9334 32-Core: 2254.33
c: 2248.68
SE +/- 5.81, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 0.53 (MIN: 0.28 / MAX: 9.96)
AMD EPYC 9334 32-Core: 0.53 (MIN: 0.32 / MAX: 9.28)
b: 0.53 (MIN: 0.3 / MAX: 10.55)
SE +/- 0.00, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 59341.84
b: 59230.03
AMD EPYC 9334 32-Core: 59085.24
SE +/- 18.50, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 30.77 (MIN: 20.73 / MAX: 39.96)
AMD EPYC 9334 32-Core: 31.13 (MIN: 20.64 / MAX: 42.52)
b: 31.81 (MIN: 27.4 / MAX: 39.74)
SE +/- 0.09, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 1039.35
AMD EPYC 9334 32-Core: 1027.29
b: 1005.48
SE +/- 3.19, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
AMD EPYC 9334 32-Core: 10.87 (MIN: 5.72 / MAX: 20.47)
c: 10.90 (MIN: 5.57 / MAX: 21.27)
b: 11.00 (MIN: 5.6 / MAX: 20.88)
SE +/- 0.01, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
AMD EPYC 9334 32-Core: 1468.66
c: 1465.39
b: 1451.82
SE +/- 0.81, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 3.28 (MIN: 2.03 / MAX: 13.53)
b: 3.31 (MIN: 2.14 / MAX: 12.63)
AMD EPYC 9334 32-Core: 3.32 (MIN: 2.1 / MAX: 13.61)
SE +/- 0.01, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 4863.33
b: 4822.01
AMD EPYC 9334 32-Core: 4811.89
SE +/- 18.44, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 16.28 (MIN: 8.42 / MAX: 26.01)
b: 16.28 (MIN: 8.52 / MAX: 25.67)
AMD EPYC 9334 32-Core: 16.29 (MIN: 8.63 / MAX: 25.52)
SE +/- 0.00, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 1962.80
b: 1962.27
AMD EPYC 9334 32-Core: 1961.23
SE +/- 0.27, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (ms, Fewer Is Better)
c: 8.46 (MIN: 4.5 / MAX: 17.91)
AMD EPYC 9334 32-Core: 8.47 (MIN: 4.66 / MAX: 18.23)
b: 8.47 (MIN: 4.56 / MAX: 17.51)
SE +/- 0.00, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev (FPS, More Is Better)
c: 3778.28
b: 3773.93
AMD EPYC 9334 32-Core: 3773.85
SE +/- 1.21, N = 3
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 52.54
b: 52.48
AMD EPYC 9334 32-Core: 52.46
d: 52.04
SE +/- 0.16, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 52.29
AMD EPYC 9334 32-Core: 52.28
d: 52.26
c: 52.25
SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 52.43
AMD EPYC 9334 32-Core: 52.39
b: 52.22
d: 52.17
SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 52.60
c: 52.51
d: 52.46
b: 52.43
SE +/- 0.08, N = 3
1. (CXX) g++ options: -O3

Cpuminer-Opt

Algorithm: scrypt

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 444.05
AMD EPYC 9334 32-Core: 439.70
b: 439.34
c: 438.79
SE +/- 5.54, N = 4
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

CloverLeaf

Input: clover_bm64_short

CloverLeaf 1.3 (Seconds, Fewer Is Better)
AMD EPYC 9334 32-Core: 30.15
c: 30.16
d: 30.19
b: 30.21
SE +/- 0.02, N = 3
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 25940
d: 25813
c: 25730
AMD EPYC 9334 32-Core: 25730
SE +/- 38.44, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

Cpuminer-Opt 23.5 (kH/s, More Is Better)
c: 210110
b: 210100
d: 210083
AMD EPYC 9334 32-Core: 210070
SE +/- 12.02, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 141820
c: 141790
AMD EPYC 9334 32-Core: 141790
d: 141777
SE +/- 8.82, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
AMD EPYC 9334 32-Core: 13920
d: 13913
b: 13910
c: 13900
SE +/- 3.33, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 2086.56
c: 2070.31
d: 2065.89
AMD EPYC 9334 32-Core: 2049.72
SE +/- 9.13, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 19533
b: 19460
AMD EPYC 9334 32-Core: 19440
c: 19430
SE +/- 58.12, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

Cpuminer-Opt 23.5 (kH/s, More Is Better)
AMD EPYC 9334 32-Core: 98050
d: 98000
c: 97940
b: 97910
SE +/- 15.28, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
d: 55483
c: 55470
b: 55320
AMD EPYC 9334 32-Core: 55310
SE +/- 82.12, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Magi

Cpuminer-Opt 23.5 (kH/s, More Is Better)
c: 1073.19
AMD EPYC 9334 32-Core: 1066.17
b: 1028.11
d: 1025.65
SE +/- 0.51, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

Cpuminer-Opt 23.5 (kH/s, More Is Better)
b: 5608.20
d: 5578.97
AMD EPYC 9334 32-Core: 5575.43
c: 5573.85
SE +/- 15.94, N = 3
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 93.55
c: 93.17
d: 92.81
AMD EPYC 9334 32-Core: 92.80
SE +/- 0.76, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 93.67
c: 93.46
b: 93.34
d: 92.94
SE +/- 0.13, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 94.46
b: 94.27
d: 94.12
c: 93.63
SE +/- 0.18, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 94.52
d: 94.44
b: 93.97
AMD EPYC 9334 32-Core: 93.55
SE +/- 0.25, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 94.74
b: 94.60
d: 93.73
c: 92.50
SE +/- 0.31, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
b: 93.85
AMD EPYC 9334 32-Core: 93.74
d: 93.71
c: 92.74
SE +/- 0.40, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
c: 95.55
b: 95.49
AMD EPYC 9334 32-Core: 95.30
d: 95.21
SE +/- 0.20, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 95.84
c: 95.37
b: 95.26
d: 95.13
SE +/- 0.15, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 100.54
c: 100.27
d: 100.21
b: 100.15
SE +/- 0.16, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024

HeFFTe - Highly Efficient FFT for Exascale 2.4 (GFLOP/s, More Is Better)
AMD EPYC 9334 32-Core: 100.33
d: 100.32
b: 100.31
c: 100.14
SE +/- 0.17, N = 3
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 99.77; AMD EPYC 9334 32-Core: 99.73; b: 99.46; c: 99.13
SE +/- 0.08, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 100.20; AMD EPYC 9334 32-Core: 100.14; d: 100.14; b: 100.01
SE +/- 0.23, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 76.96; AMD EPYC 9334 32-Core: 74.78; d: 74.71; b: 73.07
SE +/- 0.93, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 76.17; d: 72.95; b: 72.34; AMD EPYC 9334 32-Core: 72.15
SE +/- 0.50, N = 13
(CXX) g++ options: -O3

CloverLeaf

Input: clover_bm

Seconds, Fewer Is Better (CloverLeaf 1.3)
d: 12.10; b: 12.11; AMD EPYC 9334 32-Core: 12.20; c: 12.29
SE +/- 0.11, N = 3
(F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 154.56; AMD EPYC 9334 32-Core: 154.22; d: 151.13; c: 142.14
SE +/- 1.58, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 156.26; c: 156.21; b: 155.27; d: 152.92
SE +/- 1.20, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 31.93; d: 31.82; b: 31.73; AMD EPYC 9334 32-Core: 31.64
SE +/- 0.16, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 32.11; d: 31.94; AMD EPYC 9334 32-Core: 31.87; b: 31.82
SE +/- 0.13, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 33.39; b: 33.26; d: 32.31; c: 32.12
SE +/- 0.10, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 33.56; AMD EPYC 9334 32-Core: 33.26; b: 33.17; c: 33.16
SE +/- 0.08, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 49.90; d: 49.46; AMD EPYC 9334 32-Core: 48.15; b: 48.14
SE +/- 0.57, N = 4
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 51.51; b: 51.19; d: 50.27; c: 49.74
SE +/- 0.60, N = 4
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 38.09; c: 38.00; AMD EPYC 9334 32-Core: 37.09; b: 36.83
SE +/- 0.64, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 49.43; d: 48.73; AMD EPYC 9334 32-Core: 47.84; c: 47.68
SE +/- 0.49, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 51.90; b: 51.40; AMD EPYC 9334 32-Core: 51.28; d: 51.14
SE +/- 0.32, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 41.58; b: 40.26; d: 38.35; AMD EPYC 9334 32-Core: 36.26
SE +/- 0.55, N = 12
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 166.34; c: 165.42; AMD EPYC 9334 32-Core: 162.38; d: 161.47
SE +/- 1.78, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 57.75; d: 54.36; b: 52.52; AMD EPYC 9334 32-Core: 51.69
SE +/- 0.72, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 56.99; b: 53.95; d: 53.47; AMD EPYC 9334 32-Core: 51.33
SE +/- 0.91, N = 15
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 79.78; AMD EPYC 9334 32-Core: 79.65; c: 78.26; d: 78.07
SE +/- 0.37, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 80.11; b: 79.86; d: 79.65; AMD EPYC 9334 32-Core: 78.24
SE +/- 0.62, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 151.28; d: 147.50; b: 146.28; AMD EPYC 9334 32-Core: 146.16
SE +/- 1.83, N = 4
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 151.35; AMD EPYC 9334 32-Core: 150.13; c: 145.75; b: 137.59
SE +/- 1.31, N = 8
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 512

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 150.34; c: 148.73; b: 148.09; d: 146.58
SE +/- 1.28, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 72.21; d: 70.40; AMD EPYC 9334 32-Core: 68.66; b: 66.92
SE +/- 0.71, N = 6
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 75.74; d: 73.91; c: 72.33; AMD EPYC 9334 32-Core: 71.35
SE +/- 0.73, N = 6
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 36.92; d: 36.54; AMD EPYC 9334 32-Core: 35.51; b: 35.13
SE +/- 0.21, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 40.01; AMD EPYC 9334 32-Core: 37.26; c: 36.94; d: 36.16
SE +/- 0.49, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 71.86; b: 71.67; AMD EPYC 9334 32-Core: 71.43; c: 70.36
SE +/- 0.38, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 73.75; c: 72.84; d: 71.09; b: 68.98
SE +/- 0.66, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 73.09; b: 71.63; c: 70.71; AMD EPYC 9334 32-Core: 70.05
SE +/- 0.43, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 75.88; AMD EPYC 9334 32-Core: 73.60; b: 73.51; c: 70.59
SE +/- 0.86, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 76.71; b: 74.60; d: 74.38; AMD EPYC 9334 32-Core: 73.64
SE +/- 0.83, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 76.54; c: 75.94; b: 75.92; d: 73.57
SE +/- 1.01, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
AMD EPYC 9334 32-Core: 157.88; c: 153.63; d: 149.88; b: 141.25
SE +/- 0.98, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 256

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 165.25; d: 164.52; AMD EPYC 9334 32-Core: 160.22; c: 159.02
SE +/- 1.70, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 46.70; b: 45.98; d: 45.12; AMD EPYC 9334 32-Core: 44.55
SE +/- 0.27, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: double - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 47.20; d: 46.60; c: 44.24; AMD EPYC 9334 32-Core: 43.90
SE +/- 0.64, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 92.39; c: 92.18; AMD EPYC 9334 32-Core: 91.98; d: 91.57
SE +/- 0.59, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 90.10; d: 88.23; AMD EPYC 9334 32-Core: 87.53; c: 85.06
SE +/- 0.17, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 94.48; b: 93.45; d: 92.18; AMD EPYC 9334 32-Core: 91.37
SE +/- 0.35, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: double - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 89.57; d: 88.97; AMD EPYC 9334 32-Core: 88.02; b: 87.11
SE +/- 1.15, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 112.00; AMD EPYC 9334 32-Core: 111.76; b: 107.95; d: 106.50
SE +/- 0.27, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 107.52; b: 106.54; AMD EPYC 9334 32-Core: 103.26; c: 103.12
SE +/- 1.13, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
c: 158.62; AMD EPYC 9334 32-Core: 158.54; b: 156.99; d: 156.17
SE +/- 1.35, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 190.49; d: 189.95; AMD EPYC 9334 32-Core: 187.01; c: 186.51
SE +/- 1.76, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 122.84; AMD EPYC 9334 32-Core: 120.08; d: 117.17; c: 116.32
SE +/- 0.90, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 119.64; AMD EPYC 9334 32-Core: 118.84; c: 117.92; b: 116.90
SE +/- 0.32, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
d: 162.03; c: 161.54; b: 161.08; AMD EPYC 9334 32-Core: 154.42
SE +/- 1.39, N = 3
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

GFLOP/s, More Is Better (HeFFTe 2.4)
b: 190.58; d: 186.86; c: 186.80; AMD EPYC 9334 32-Core: 186.53
SE +/- 1.18, N = 3
(CXX) g++ options: -O3


Phoronix Test Suite v10.8.4