GPTshop.ai NVIDIA GH200 Linux Benchmarks

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2402184-NE-GH200THRE73

Run Management

GPTshop.ai GH200: run February 05, test duration 9 Hours, 5 Minutes
HP Z6 G5 A - Threadripper PRO 7995WX: run February 17, test duration 12 Hours, 44 Minutes
Average test duration: 10 Hours, 54 Minutes


HTML result view exported from: https://openbenchmarking.org/result/2402184-NE-GH200THRE73&gru&sro.

GPTshop.ai GH200
Processor: ARMv8 Neoverse-V2 @ 3.39GHz (72 Cores)
Motherboard: Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS)
Memory: 1 x 480GB DRAM-6400MT/s
Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07 + 1920GB SAMSUNG MZTL21T9
Graphics: ASPEED
Network: 2 x Mellanox MT2910 + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
OS: Ubuntu 23.10
Kernel: 6.5.0-15-generic (aarch64)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200

HP Z6 G5 A - Threadripper PRO 7995WX
Processor: AMD Ryzen Threadripper PRO 7995WX 96-Cores @ 6.44GHz (96 Cores / 192 Threads)
Motherboard: HP Z6 G5 A Workstation 8B24 (U65 Ver. 01.01.04 BIOS)
Chipset: AMD Device 14a4
Memory: 8 x 16GB DRAM-5200MT/s Hynix HMCG78AGBRA190N
Disk: 2 x 1024GB SAMSUNG MZVL21T0HCLR-00BH1
Graphics: NVIDIA RTX A4000 16GB
Audio: NVIDIA GA104 HD Audio
Monitor: ASUS VP28U
Network: Realtek RTL8111/8168/8411
Kernel: 6.5.0-17-generic (x86_64)
Desktop: GNOME Shell 45.2
Display Server: X Server 1.21.1.7
Display Driver: NVIDIA 535.154.05
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.2.148
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- GPTshop.ai GH200: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- HP Z6 G5 A - Threadripper PRO 7995WX: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- GPTshop.ai GH200: Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
- HP Z6 G5 A - Threadripper PRO 7995WX: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa108105

Python Details
- Python 3.11.6

Security Details
- GPTshop.ai GH200: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- HP Z6 G5 A - Threadripper PRO 7995WX: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

OpenCL Details
- HP Z6 G5 A - Threadripper PRO 7995WX: GPU Compute Cores: 6144

GPTshop.ai NVIDIA GH200 Linux Benchmarks: Result Overview

Systems: GPTshop.ai GH200; HP Z6 G5 A - Threadripper PRO 7995WX

Tests covered:
- graph500: 26 (four results)
- minibude: OpenMP - BM2 (two results)
- stress-ng: CPU Stress; Matrix Math; Vector Math; AVX-512 VNNI; Floating Point; Matrix 3D Math; Memory Copying; Wide Vector Math; Fused Multiply-Add; Vector Floating Point
- amg
- openvino (Device: CPU, each model listed twice): Face Detection FP16; Person Detection FP16; Person Detection FP32; Vehicle Detection FP16; Face Detection FP16-INT8; Face Detection Retail FP16; Road Segmentation ADAS FP16; Vehicle Detection FP16-INT8; Weld Porosity Detection FP16; Face Detection Retail FP16-INT8; Road Segmentation ADAS FP16-INT8; Machine Translation EN To DE FP16; Weld Porosity Detection FP16-INT8; Person Vehicle Bike Detection FP16; Handwritten English Recognition FP16; Age Gender Recognition Retail 0013 FP16; Handwritten English Recognition FP16-INT8; Age Gender Recognition Retail 0013 FP16-INT8
- svt-av1: Preset 4 - Bosphorus 4K; Preset 8 - Bosphorus 4K; Preset 12 - Bosphorus 4K; Preset 13 - Bosphorus 4K
- hpcg: 144 144 144 - 60
- mt-dgemm: Sustained Floating-Point Rate
- libxsmm: 128; 256
- xmrig: Monero - 1M; Wownero - 1M
- graphics-magick: Sharpen; Enhanced
- coremark: CoreMark Size 666 - Iterations Per Second
- cpuminer-opt: Deepcoin; Blake-2 S; Myriad-Groestl; Triple SHA-256, Onecoin
- compress-7zip: Compression Rating; Decompression Rating
- askap: tConvolve MPI - Degridding; tConvolve MPI - Gridding
- rays1bench: Large Scene
- astcenc: Medium; Thorough; Exhaustive
- stockfish: Total Time
- asmfish: 1024 Hash Memory, 26 Depth
- gromacs: MPI CPU - water_GMX50_bare
- john-the-ripper: bcrypt; WPA PSK; Blowfish; MD5
- liquid-dsp: 128 - 256 - 512; 240 - 256 - 512
- npb: BT.C; CG.C; FT.C; IS.D; LU.C; MG.C; SP.C
- pgbench: 100 - 1000 - Read Write; 100 - 1000 - Read Write - Average Latency
- lulesh
- onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU; Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
- rodinia: OpenMP LavaMD
- nwchem: C240 Buckyball
- incompact3d: X3D-benchmarking input.i3d; input.i3d 193 Cells Per Direction
- build-godot: Time To Compile
- build-linux-kernel: defconfig; allmodconfig
- build-llvm: Ninja
- build-nodejs: Time To Compile
- primesieve: 1e13
- helsing: 14 digit
- tachyon: Total Time
- duckdb: IMDB; TPC-H Parquet
- rawtherapee: Total Benchmark Time
- build-gem5: Time To Compile

Graph500

Scale: 26

Graph500 3.0 - bfs max_TEPS, More Is Better
GPTshop.ai GH200: 1315650000
HP Z6 G5 A - Threadripper PRO 7995WX: 647711000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 - bfs median_TEPS, More Is Better
GPTshop.ai GH200: 1249790000
HP Z6 G5 A - Threadripper PRO 7995WX: 634702000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - Billion Interactions/s, More Is Better
GPTshop.ai GH200: 47.73 (SE +/- 0.12, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 178.89 (SE +/- 0.20, N = 3)
Additional per-system options: -mcpu=native; -march=native
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -lm
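
The SE +/- figures in these results accompany a mean over N runs. As a minimal sketch, assuming they follow the usual definition of the standard error of the mean (sample standard deviation divided by the square root of N), the calculation looks like this in Python. The three per-run values are hypothetical, since the page reports only the aggregate, and `standard_error` is an illustrative helper, not part of the Phoronix Test Suite.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run results (Billion Interactions/s); not taken from the source.
runs = [47.5, 47.8, 47.9]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs were consistent; results with N above 3 indicate that extra runs were performed for that test.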

Stress-NG

Test: CPU Stress

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 30484.77 (SE +/- 24.74, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 212486.91 (SE +/- 1093.61, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Matrix Math

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 512759.08 (SE +/- 3363.22, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 430429.52 (SE +/- 1114.33, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Math

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 359058.52 (SE +/- 38.26, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 619071.09 (SE +/- 183.75, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: AVX-512 VNNI

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 4173580.13 (SE +/- 21528.43, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 9099613.65 (SE +/- 4488.69, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Floating Point

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 20137.35 (SE +/- 4.82, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 30460.61 (SE +/- 30.90, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Matrix 3D Math

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 17483.02 (SE +/- 12.40, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 9100.74 (SE +/- 2.39, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Memory Copying

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 27182.72 (SE +/- 78.88, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 27883.51 (SE +/- 21.26, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Wide Vector Math

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 2002466.67 (SE +/- 16548.38, N = 15)
HP Z6 G5 A - Threadripper PRO 7995WX: 2721964.70 (SE +/- 3634.23, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Fused Multiply-Add

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 139525267.41 (SE +/- 1701013.93, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 206576021.93 (SE +/- 157257.42, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Floating Point

Stress-NG 0.16.04 - Bogo Ops/s, More Is Better
GPTshop.ai GH200: 103967.93 (SE +/- 87.06, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 219092.94 (SE +/- 135.17, N = 3)
1. (CXX) g++ options: -O2 -std=gnu99 -lc
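
One way to summarize the ten Stress-NG results above is the geometric mean of the per-test ratios between the two systems, which keeps a single large gap such as CPU Stress from dominating the summary. The sketch below uses the Bogo Ops/s figures reported above. The helper name `geometric_mean` and the choice of HP-over-GH200 ratios are illustrative assumptions, not necessarily how the result page computes its own means.

```python
import math

# Bogo Ops/s from the Stress-NG 0.16.04 results above:
# (GPTshop.ai GH200, HP Z6 G5 A - Threadripper PRO 7995WX)
STRESS_NG = {
    "CPU Stress": (30484.77, 212486.91),
    "Matrix Math": (512759.08, 430429.52),
    "Vector Math": (359058.52, 619071.09),
    "AVX-512 VNNI": (4173580.13, 9099613.65),
    "Floating Point": (20137.35, 30460.61),
    "Matrix 3D Math": (17483.02, 9100.74),
    "Memory Copying": (27182.72, 27883.51),
    "Wide Vector Math": (2002466.67, 2721964.70),
    "Fused Multiply-Add": (139525267.41, 206576021.93),
    "Vector Floating Point": (103967.93, 219092.94),
}


def geometric_mean(values):
    """Geometric mean computed via logarithms to avoid overflow."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Ratio > 1 means the Threadripper PRO 7995WX system scored higher.
ratios = {name: hp / gh200 for name, (gh200, hp) in STRESS_NG.items()}
geo_mean_ratio = geometric_mean(list(ratios.values()))
gh200_wins = sorted(name for name, r in ratios.items() if r < 1.0)

for name, r in ratios.items():
    print(f"{name}: {r:.2f}x")
print(f"Geometric mean (HP / GH200): {geo_mean_ratio:.2f}x")
print("GH200 ahead in:", ", ".join(gh200_wins))
```

Because all ten tests are "More Is Better", no inversion is needed; a mixed set with "Fewer Is Better" results would require inverting those ratios first.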

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 - Figure Of Merit, More Is Better
GPTshop.ai GH200: 1997929111 (SE +/- 16287263.84, N = 9)
HP Z6 G5 A - Threadripper PRO 7995WX: 1669939667 (SE +/- 2196017.33, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 7.46 (SE +/- 0.05, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 48.01 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 19.06 (SE +/- 0.19, N = 15)
HP Z6 G5 A - Threadripper PRO 7995WX: 340.49 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 19.16 (SE +/- 0.22, N = 4)
HP Z6 G5 A - Threadripper PRO 7995WX: 341.45 (SE +/- 0.34, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 124.47 (SE +/- 0.97, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 3237.40 (SE +/- 4.78, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 1.01 (SE +/- 0.00, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 94.94 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 184.36 (SE +/- 0.31, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 11647.89 (SE +/- 20.00, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 44.58 (SE +/- 0.39, N = 15)
HP Z6 G5 A - Threadripper PRO 7995WX: 1712.72 (SE +/- 1.75, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 9.94 (SE +/- 0.08, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 5727.47 (SE +/- 16.25, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 285.00 (SE +/- 1.58, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 4793.80 (SE +/- 6.25, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 26.53 (SE +/- 0.29, N = 5)
HP Z6 G5 A - Threadripper PRO 7995WX: 16968.10 (SE +/- 38.34, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 3.24 (SE +/- 0.00, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 2039.24 (SE +/- 2.05, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 23.78 (SE +/- 0.26, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 555.21 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 76.44 (SE +/- 0.58, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 9585.34 (SE +/- 38.08, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 26.68 (SE +/- 0.33, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 5467.67 (SE +/- 13.90, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 4.33 (SE +/- 0.04, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 2541.51 (SE +/- 9.61, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 781.77 (SE +/- 8.02, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 83726.28 (SE +/- 300.34, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 4.16 (SE +/- 0.01, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 2141.63 (SE +/- 35.73, N = 12)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev - FPS, More Is Better
GPTshop.ai GH200: 595.73 (SE +/- 3.96, N = 15)
HP Z6 G5 A - Threadripper PRO 7995WX: 107003.12 (SE +/- 398.68, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

SVT-AV1 1.7 - Frames Per Second, More Is Better
GPTshop.ai GH200: 0.826 (SE +/- 0.002, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 7.016 (SE +/- 0.053, N = 3)
Additional per-system options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
1. (CXX) g++ options: -march=native

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

SVT-AV1 1.7 - Frames Per Second, More Is Better
GPTshop.ai GH200: 13.91 (SE +/- 0.03, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 119.98 (SE +/- 0.98, N = 12)
Additional per-system options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
1. (CXX) g++ options: -march=native

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 1.7 - Frames Per Second, More Is Better
GPTshop.ai GH200: 31.47 (SE +/- 0.00, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 206.13 (SE +/- 1.23, N = 3)
Additional per-system options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
1. (CXX) g++ options: -march=native

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

SVT-AV1 1.7 - Frames Per Second, More Is Better
GPTshop.ai GH200: 31.50 (SE +/- 0.00, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 202.28 (SE +/- 2.01, N = 3)
Additional per-system options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
1. (CXX) g++ options: -march=native

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 - GFInst/s, More Is Better
GPTshop.ai GH200: 1193.12 (SE +/- 2.95, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 4472.18 (SE +/- 5.10, N = 3)
Additional per-system options: -mcpu=native; -march=native
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -lm

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1 - GFLOP/s, More Is Better
GPTshop.ai GH200: 41.69 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 - GFLOP/s, More Is Better
GPTshop.ai GH200: 17.94 (SE +/- 0.27, N = 12)
HP Z6 G5 A - Threadripper PRO 7995WX: 43.46 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

libxsmm

M N K: 128

libxsmm 2-1.17-3645 - GFLOPS/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 2039.8 (SE +/- 4.04, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 256

libxsmm 2-1.17-3645 - GFLOPS/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 2582.5 (SE +/- 1.59, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.18.1 - H/s, More Is Better
GPTshop.ai GH200: 17253.0 (SE +/- 9.66, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 56612.2 (SE +/- 161.30, N = 3)
Additional per-system options: -maes
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.18.1 - H/s, More Is Better
GPTshop.ai GH200: 21924.1 (SE +/- 16.77, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 71115.4 (SE +/- 250.10, N = 3)
Additional per-system options: -maes
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.38 - Iterations Per Minute, More Is Better
GPTshop.ai GH200: 1363 (SE +/- 10.90, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 824 (SE +/- 1.76, N = 3)
Additional per-system options: -ljbig -lwebp -lwebpmux -ltiff -lfreetype -lSM -lICE -lbz2 -lzstd
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lX11 -llzma -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.38 - Iterations Per Minute, More Is Better
GPTshop.ai GH200: 1761 (SE +/- 0.33, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 1380 (SE +/- 6.64, N = 3)
Additional per-system options: -ljbig -lwebp -lwebpmux -ltiff -lfreetype -lSM -lICE -lbz2 -lzstd
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lX11 -llzma -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 - Iterations/Sec, More Is Better
GPTshop.ai GH200: 2263279.94 (SE +/- 9376.30, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 3998440.41 (SE +/- 5264.15, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt
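
Since the two systems differ in core count (72 cores versus 96 cores / 192 threads per the system details), a per-core view can complement the raw Coremark totals. The following is a rough sketch using the figures above; dividing by physical cores is an assumption made here for illustration, and dividing the 7995WX result by its 192 threads instead would give a different picture.

```python
GH200 = "GPTshop.ai GH200"
HP = "HP Z6 G5 A - Threadripper PRO 7995WX"

# Physical core counts from the system details of this result file.
CORES = {GH200: 72, HP: 96}

# Coremark 1.0 results (Iterations/Sec, more is better) reported above.
COREMARK = {GH200: 2263279.94, HP: 3998440.41}

# Assumption: normalize by physical cores, ignoring SMT threads and clock speed.
per_core = {name: COREMARK[name] / CORES[name] for name in COREMARK}

for name, value in per_core.items():
    print(f"{name}: {value:.2f} iterations/sec per core")
```

This kind of normalization ignores clock speed, memory configuration, and SMT, so it is best read as a coarse indicator rather than a measure of per-core efficiency.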

Cpuminer-Opt

Algorithm: Deepcoin

Cpuminer-Opt 23.5 - kH/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 33632 (SE +/- 357.43, N = 5)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

Cpuminer-Opt 23.5 - kH/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 560763 (SE +/- 1705.31, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

Cpuminer-Opt 23.5 - kH/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 44940 (SE +/- 64.29, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

Cpuminer-Opt 23.5 - kH/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 401393 (SE +/- 2385.23, N = 3)
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01 - MIPS, More Is Better
GPTshop.ai GH200: 345295 (SE +/- 220.18, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 547165 (SE +/- 2297.47, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 - MIPS, More Is Better
GPTshop.ai GH200: 389055 (SE +/- 229.99, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 658991 (SE +/- 3067.65, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0 - Mpix/sec, More Is Better
GPTshop.ai GH200: 12407.6 (SE +/- 21.70, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 40906.5 (SE +/- 524.43, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 - Mpix/sec, More Is Better
GPTshop.ai GH200: 9652.19 (SE +/- 13.17, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 43543.90 (SE +/- 527.54, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

rays1bench

Large Scene

rays1bench 2020-01-09 - mrays/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 582.32 (SE +/- 2.65, N = 3)

ASTC Encoder

Preset: Medium

ASTC Encoder 4.0 - MT/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 443.46 (SE +/- 1.12, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 4.0 - MT/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 62.64 (SE +/- 0.18, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 4.0 - MT/s, More Is Better
HP Z6 G5 A - Threadripper PRO 7995WX: 6.6196 (SE +/- 0.0214, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Stockfish

Total Time

Stockfish 15 - Nodes Per Second, More Is Better
GPTshop.ai GH200: 153826682 (SE +/- 3694553.52, N = 12)
HP Z6 G5 A - Threadripper PRO 7995WX: 285651359 (SE +/- 3424910.11, N = 3)
Additional per-system options: -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

asmFish 2018-07-23 - Nodes/second, More Is Better
GPTshop.ai GH200: 150936379 (SE +/- 653999.42, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 248304686 (SE +/- 3281525.65, N = 3)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
GPTshop.ai GH200: 5.194 (SE +/- 0.002, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 10.314 (SE +/- 0.389, N = 9)
1. (CXX) g++ options: -O3

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14, Test: bcrypt (Real C/S, More Is Better)
GPTshop.ai GH200: 68817 (SE +/- 7.67, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 172723 (SE +/- 69.15, N = 3)
Per-result flags: -lgmp -lbz2; -m64
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14, Test: WPA PSK (Real C/S, More Is Better)
GPTshop.ai GH200: 70310 (SE +/- 28.67, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 612830 (SE +/- 2936.87, N = 3)
Per-result flags: -m64
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14, Test: Blowfish (Real C/S, More Is Better)
GPTshop.ai GH200: 69811 (SE +/- 24.83, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 172819 (SE +/- 163.99, N = 3)
Per-result flags: -lgmp -lbz2; -m64
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: MD5

John The Ripper 2023.03.14, Test: MD5 (Real C/S, More Is Better)
GPTshop.ai GH200: 1900667 (SE +/- 12251.98, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 14692667 (SE +/- 110105.00, N = 3)
Per-result flags: -m64
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6, Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GPTshop.ai GH200: 229866667 (SE +/- 271804.67, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 1306966667 (SE +/- 6716728.70, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 240 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6, Threads: 240 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
GPTshop.ai GH200: 237746667 (SE +/- 501741.41, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 1530466667 (SE +/- 7410877.89, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Graph500

Scale: 26

Graph500 3.0, Scale: 26 (sssp max_TEPS, More Is Better)
GPTshop.ai GH200: 467012000
HP Z6 G5 A - Threadripper PRO 7995WX: 399994000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0, Scale: 26 (sssp median_TEPS, More Is Better)
GPTshop.ai GH200: 299027000
HP Z6 G5 A - Threadripper PRO 7995WX: 317931000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 49381.68 (SE +/- 150.57, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 213489.54 (SE +/- 492.77, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 24046.25 (SE +/- 54.20, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 50606.21 (SE +/- 557.33, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 48109.28 (SE +/- 82.39, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 100501.03 (SE +/- 62.59, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
GPTshop.ai GH200: 1748.55 (SE +/- 1.10, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 4351.68 (SE +/- 18.52, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 39739.62 (SE +/- 430.26, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 251214.23 (SE +/- 1429.28, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 58334.53 (SE +/- 39.48, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 94796.95 (SE +/- 555.59, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
GPTshop.ai GH200: 21268.70 (SE +/- 249.16, N = 4)
HP Z6 G5 A - Threadripper PRO 7995WX: 88821.77 (SE +/- 97.08, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
GPTshop.ai GH200: 54975 (SE +/- 783.91, N = 11)
HP Z6 G5 A - Threadripper PRO 7995WX: 16124 (SE +/- 176.81, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

LULESH

LULESH 2.0.3 (z/s, More Is Better)
GPTshop.ai GH200: 23185.18 (SE +/- 15.38, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 23830.87 (SE +/- 128.12, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.3, Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
HP Z6 G5 A - Threadripper PRO 7995WX: 0.358902 (SE +/- 0.001912, N = 3; MIN: 0.32)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.3, Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
HP Z6 G5 A - Threadripper PRO 7995WX: 1.25835 (SE +/- 0.01004, N = 3; MIN: 1.01)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
GPTshop.ai GH200: 18.23 (SE +/- 0.28, N = 11)
HP Z6 G5 A - Threadripper PRO 7995WX: 62.03 (SE +/- 0.68, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Face Detection FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 134.06 (SE +/- 0.82, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 993.05 (SE +/- 0.98, N = 3; MIN: 482.47 / MAX: 1144.59)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 52.51 (SE +/- 0.52, N = 15; MIN: 45.64 / MAX: 88.61)
HP Z6 G5 A - Threadripper PRO 7995WX: 140.80 (SE +/- 0.12, N = 3; MIN: 52.25 / MAX: 220.24)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2023.2.dev, Model: Person Detection FP32 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 52.20 (SE +/- 0.59, N = 4; MIN: 46.86 / MAX: 83.61)
HP Z6 G5 A - Threadripper PRO 7995WX: 140.45 (SE +/- 0.13, N = 3; MIN: 52.13 / MAX: 230.35)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 8.02 (SE +/- 0.06, N = 3; MIN: 6.08 / MAX: 18.7)
HP Z6 G5 A - Threadripper PRO 7995WX: 14.81 (SE +/- 0.02, N = 3; MIN: 6.08 / MAX: 45.26)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 993.52 (SE +/- 4.42, N = 3; MIN: 966.61 / MAX: 1035.71)
HP Z6 G5 A - Threadripper PRO 7995WX: 503.91 (SE +/- 0.20, N = 3; MIN: 253.56 / MAX: 608.11)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Face Detection Retail FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 5.41 (SE +/- 0.01, N = 3; MIN: 3.66 / MAX: 12.71)
HP Z6 G5 A - Threadripper PRO 7995WX: 4.11 (SE +/- 0.01, N = 3; MIN: 2.29 / MAX: 22.1)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Road Segmentation ADAS FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 22.44 (SE +/- 0.19, N = 15; MIN: 18.38 / MAX: 33.33)
HP Z6 G5 A - Threadripper PRO 7995WX: 28.00 (SE +/- 0.03, N = 3; MIN: 18.2 / MAX: 67.59)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 100.62 (SE +/- 0.77, N = 3; MIN: 95.54 / MAX: 114.29)
HP Z6 G5 A - Threadripper PRO 7995WX: 8.37 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 3.50 (SE +/- 0.02, N = 3; MIN: 2.11 / MAX: 13.96)
HP Z6 G5 A - Threadripper PRO 7995WX: 20.01 (SE +/- 0.03, N = 3; MIN: 10.21 / MAX: 40.65)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 37.70 (SE +/- 0.42, N = 5; MIN: 34.56 / MAX: 47.33)
HP Z6 G5 A - Threadripper PRO 7995WX: 5.64 (SE +/- 0.01, N = 3; MIN: 3.25 / MAX: 24.09)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 308.42 (SE +/- 0.47, N = 3; MIN: 299.67 / MAX: 324.14)
HP Z6 G5 A - Threadripper PRO 7995WX: 23.52 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 42.05 (SE +/- 0.47, N = 3; MIN: 37.47 / MAX: 166.36)
HP Z6 G5 A - Threadripper PRO 7995WX: 86.38 (SE +/- 0.05, N = 3; MIN: 51.34 / MAX: 158.07)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 13.07 (SE +/- 0.10, N = 3; MIN: 11.41 / MAX: 24.95)
HP Z6 G5 A - Threadripper PRO 7995WX: 10.00 (SE +/- 0.04, N = 3; MIN: 5.34 / MAX: 28.56)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 37.48 (SE +/- 0.46, N = 3; MIN: 33.64 / MAX: 48.73)
HP Z6 G5 A - Threadripper PRO 7995WX: 8.76 (SE +/- 0.02, N = 3; MIN: 6.07 / MAX: 28.47)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Handwritten English Recognition FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 231.18 (SE +/- 2.13, N = 3; MIN: 220.77 / MAX: 317.5)
HP Z6 G5 A - Threadripper PRO 7995WX: 37.75 (SE +/- 0.14, N = 3; MIN: 23.66 / MAX: 98.37)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2023.2.dev, Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 1.27 (SE +/- 0.01, N = 3; MIN: 0.64 / MAX: 3.24)
HP Z6 G5 A - Threadripper PRO 7995WX: 0.90 (SE +/- 0.00, N = 3; MIN: 0.25 / MAX: 18.19)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 240.51 (SE +/- 0.76, N = 3; MIN: 230.79 / MAX: 323.15)
HP Z6 G5 A - Threadripper PRO 7995WX: 44.96 (SE +/- 0.90, N = 12; MIN: 30.33 / MAX: 84.49)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2023.2.dev, Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better)
GPTshop.ai GH200: 1.67 (SE +/- 0.01, N = 15; MIN: 1.08 / MAX: 4.63)
HP Z6 G5 A - Threadripper PRO 7995WX: 0.65 (SE +/- 0.00, N = 3; MIN: 0.21 / MAX: 20.43)
1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
GPTshop.ai GH200: 30.31 (SE +/- 0.08, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 26.81 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

NWChem

Input: C240 Buckyball

NWChem 7.0.2, Input: C240 Buckyball (Seconds, Fewer Is Better)
GPTshop.ai GH200: 1403.5
HP Z6 G5 A - Threadripper PRO 7995WX: 1601.1
Per-result flags: -m64
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11, Input: X3D-benchmarking input.i3d (Seconds, Fewer Is Better)
GPTshop.ai GH200: 254.49 (SE +/- 0.50, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 383.99 (SE +/- 0.96, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
GPTshop.ai GH200: 9.81172053 (SE +/- 0.02630858, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 10.76054920 (SE +/- 0.08460429, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0, Time To Compile (Seconds, Fewer Is Better)
GPTshop.ai GH200: 139.10 (SE +/- 1.36, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 86.96 (SE +/- 0.83, N = 3)

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.1, Build: defconfig (Seconds, Fewer Is Better)
GPTshop.ai GH200: 66.89 (SE +/- 0.40, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 30.98 (SE +/- 0.30, N = 3)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.1, Build: allmodconfig (Seconds, Fewer Is Better)
GPTshop.ai GH200: 282.29 (SE +/- 1.16, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 261.44 (SE +/- 0.89, N = 3)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0, Build System: Ninja (Seconds, Fewer Is Better)
GPTshop.ai GH200: 195.98 (SE +/- 0.44, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 122.39 (SE +/- 0.30, N = 3)

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 19.8.1, Time To Compile (Seconds, Fewer Is Better)
GPTshop.ai GH200: 173.88 (SE +/- 0.05, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 109.23 (SE +/- 0.62, N = 3)

Primesieve

Length: 1e13

Primesieve 8.0, Length: 1e13 (Seconds, Fewer Is Better)
GPTshop.ai GH200: 35.49 (SE +/- 0.33, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 23.29 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -O3

Helsing

Digit Range: 14 digit

Helsing 1.0-beta, Digit Range: 14 digit (Seconds, Fewer Is Better)
GPTshop.ai GH200: 67.61 (SE +/- 0.52, N = 10)
HP Z6 G5 A - Threadripper PRO 7995WX: 55.90 (SE +/- 0.11, N = 3)
1. (CC) gcc options: -O2 -pthread

Tachyon

Total Time

Tachyon 0.99.2, Total Time (Seconds, Fewer Is Better)
HP Z6 G5 A - Threadripper PRO 7995WX: 16.05 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

DuckDB

Benchmark: IMDB

DuckDB 0.9.1, Benchmark: IMDB (Seconds, Fewer Is Better)
GPTshop.ai GH200: 92.08 (SE +/- 0.70, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 104.28 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

DuckDB

Benchmark: TPC-H Parquet

DuckDB 0.9.1, Benchmark: TPC-H Parquet (Seconds, Fewer Is Better)
GPTshop.ai GH200: 148.76 (SE +/- 1.06, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 120.76 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

RawTherapee

Total Benchmark Time

RawTherapee, Total Benchmark Time (Seconds, Fewer Is Better)
GPTshop.ai GH200: 46.72 (SE +/- 0.24, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 57.89 (SE +/- 0.31, N = 3)
1. RawTherapee, version 5.9, command line.

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1, Time To Compile (Seconds, Fewer Is Better)
GPTshop.ai GH200: 180.62 (SE +/- 2.14, N = 3)
HP Z6 G5 A - Threadripper PRO 7995WX: 156.37 (SE +/- 1.07, N = 3)


Phoronix Test Suite v10.8.4