GPTshop.ai NVIDIA GH200 Linux Benchmarks
Benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2402184-NE-GH200THRE73&sro&grr
GROMACS
Implementation: MPI CPU - Input: water_GMX50_bare
NWChem
Input: C240 Buckyball
Xcompact3d Incompact3d
Input: X3D-benchmarking input.i3d
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
Timed Linux Kernel Compilation
Build: allmodconfig
Graph500
Scale: 26
Graph500
Scale: 26
Graph500
Scale: 26
Graph500
Scale: 26
libxsmm
M N K: 128
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16 - Device: CPU
DuckDB
Benchmark: TPC-H Parquet
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
High Performance Conjugate Gradient
X Y Z: 144 144 144 - RT: 60
Timed Gem5 Compilation
Time To Compile
Timed LLVM Compilation
Build System: Ninja
DuckDB
Benchmark: IMDB
OpenVINO
Model: Handwritten English Recognition FP16-INT8 - Device: CPU
OpenVINO
Model: Handwritten English Recognition FP16-INT8 - Device: CPU
Timed Node.js Compilation
Time To Compile
Helsing
Digit Range: 14 digit
Stockfish
Total Time
Timed Godot Game Engine Compilation
Time To Compile
SVT-AV1
Encoder Mode: Preset 4 - Input: Bosphorus 4K
asmFish
1024 Hash Memory, 26 Depth
Stress-NG
Test: Wide Vector Math
OpenVINO
Model: Face Detection Retail FP16-INT8 - Device: CPU
OpenVINO
Model: Face Detection Retail FP16-INT8 - Device: CPU
libxsmm
M N K: 256
OpenVINO
Model: Person Detection FP32 - Device: CPU
OpenVINO
Model: Person Detection FP32 - Device: CPU
miniBUDE
Implementation: OpenMP - Input Deck: BM2
miniBUDE
Implementation: OpenMP - Input Deck: BM2
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Face Detection FP16 - Device: CPU
OpenVINO
Model: Face Detection FP16 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Handwritten English Recognition FP16 - Device: CPU
OpenVINO
Model: Handwritten English Recognition FP16 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16 - Device: CPU
OpenVINO
Model: Face Detection Retail FP16 - Device: CPU
OpenVINO
Model: Face Detection Retail FP16 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
John The Ripper
Test: MD5
GraphicsMagick
Operation: Sharpen
GraphicsMagick
Operation: Enhanced
NAS Parallel Benchmarks
Test / Class: SP.C
Algebraic Multi-Grid Benchmark
RawTherapee
Total Benchmark Time
Timed Linux Kernel Compilation
Build: defconfig
ASKAP
Test: tConvolve MPI - Gridding
ASKAP
Test: tConvolve MPI - Degridding
Xmrig
Variant: Monero - Hash Count: 1M
NAS Parallel Benchmarks
Test / Class: BT.C
SVT-AV1
Encoder Mode: Preset 8 - Input: Bosphorus 4K
Liquid-DSP
Threads: 240 - Buffer Length: 256 - Filter Length: 512
Xmrig
Variant: Wownero - Hash Count: 1M
NAS Parallel Benchmarks
Test / Class: LU.C
John The Ripper
Test: WPA PSK
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 512
Stress-NG
Test: Matrix 3D Math
Stress-NG
Test: AVX-512 VNNI
Stress-NG
Test: CPU Stress
Stress-NG
Test: Matrix Math
Stress-NG
Test: Vector Math
Stress-NG
Test: Memory Copying
Stress-NG
Test: Vector Floating Point
Stress-NG
Test: Fused Multiply-Add
Stress-NG
Test: Floating Point
John The Ripper
Test: bcrypt
John The Ripper
Test: Blowfish
Cpuminer-Opt
Algorithm: Triple SHA-256, Onecoin
Cpuminer-Opt
Algorithm: Blake-2 S
Cpuminer-Opt
Algorithm: Myriad-Groestl
Primesieve
Length: 1e13
7-Zip Compression
Test: Decompression Rating
7-Zip Compression
Test: Compression Rating
ACES DGEMM
Sustained Floating-Point Rate
Rodinia
Test: OpenMP LavaMD
Coremark
CoreMark Size 666 - Iterations Per Second
Cpuminer-Opt
Algorithm: Deepcoin
LULESH
NAS Parallel Benchmarks
Test / Class: IS.D
SVT-AV1
Encoder Mode: Preset 13 - Input: Bosphorus 4K
SVT-AV1
Encoder Mode: Preset 12 - Input: Bosphorus 4K
Xcompact3d Incompact3d
Input: input.i3d 193 Cells Per Direction
oneDNN
Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU
ASTC Encoder
Preset: Exhaustive
Tachyon
Total Time
NAS Parallel Benchmarks
Test / Class: FT.C
ASTC Encoder
Preset: Thorough
NAS Parallel Benchmarks
Test / Class: CG.C
NAS Parallel Benchmarks
Test / Class: MG.C
oneDNN
Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU
ASTC Encoder
Preset: Medium
rays1bench
Large Scene
Phoronix Test Suite v10.8.4