GPTshop.ai NVIDIA GH200 Linux Benchmarks

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:

    phoronix-test-suite benchmark 2402184-NE-GH200THRE73

Test categories in this result file:

Chess Test Suite: 2 Tests
Timed Code Compilation: 5 Tests
C/C++ Compiler Tests: 9 Tests
CPU Massive: 17 Tests
Creator Workloads: 9 Tests
Cryptocurrency Benchmarks, CPU Mining Tests: 2 Tests
Cryptography: 3 Tests
Fortran Tests: 4 Tests
Game Development: 2 Tests
HPC - High Performance Computing: 14 Tests
Imaging: 2 Tests
Common Kernel Benchmarks: 2 Tests
Linear Algebra: 2 Tests
Machine Learning: 2 Tests
Molecular Dynamics: 4 Tests
MPI Benchmarks: 5 Tests
Multi-Core: 25 Tests
NVIDIA GPU Compute: 2 Tests
Intel oneAPI: 2 Tests
OpenMPI Tests: 10 Tests
Programmer / Developer System Benchmarks: 7 Tests
Python Tests: 6 Tests
Raytracing: 2 Tests
Renderers: 2 Tests
Scientific Computing: 6 Tests
Server: 2 Tests
Server CPU Tests: 12 Tests

Run Management

Result Identifier                       Date Run       Test Duration
GPTshop.ai GH200                        February 05    9 Hours, 5 Minutes
HP Z6 G5 A - Threadripper PRO 7995WX    February 17    12 Hours, 44 Minutes
Average                                                10 Hours, 54 Minutes
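
The 10 Hours, 54 Minutes figure listed with the runs is consistent with the mean of the two test durations. The page itself does not label it, so treating it as an average is an assumption. A minimal Python sketch of that arithmetic follows; the helper names `mean_duration` and `format_hm` are hypothetical and only serve this illustration.

```python
from datetime import timedelta

# Durations copied from the run listing above.
runs = {
    "GPTshop.ai GH200": timedelta(hours=9, minutes=5),
    "HP Z6 G5 A - Threadripper PRO 7995WX": timedelta(hours=12, minutes=44),
}


def mean_duration(durations):
    """Return the arithmetic mean of a list of timedelta values."""
    total = sum(durations, timedelta())
    return total / len(durations)


def format_hm(duration):
    """Format a timedelta as 'H Hours, M Minutes', dropping leftover seconds."""
    total_minutes = int(duration.total_seconds() // 60)
    hours, minutes = divmod(total_minutes, 60)
    return f"{hours} Hours, {minutes} Minutes"


average = mean_duration(list(runs.values()))
print(format_hm(average))  # prints "10 Hours, 54 Minutes"
```

The exact mean is 10 hours, 54.5 minutes; the sketch truncates to whole minutes, which matches the figure shown.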


GPTshop.ai NVIDIA GH200 Linux Benchmarks Suite 1.0.0 System
Test suite extracted from GPTshop.ai NVIDIA GH200 Linux Benchmarks.

pts/hpcg-1.3.0 --nx=144 --ny=144 --nz=144 --rt=60 X Y Z: 144 144 144 - RT: 60
pts/npb-1.4.5 bt.C Test / Class: BT.C
pts/npb-1.4.5 cg.C Test / Class: CG.C
pts/npb-1.4.5 ft.C Test / Class: FT.C
pts/npb-1.4.5 is.D Test / Class: IS.D
pts/npb-1.4.5 lu.C Test / Class: LU.C
pts/npb-1.4.5 mg.C Test / Class: MG.C
pts/npb-1.4.5 sp.C Test / Class: SP.C
pts/minibude-1.0.0 --deck ../data/bm2 --iterations 10 Implementation: OpenMP - Input Deck: BM2
pts/rodinia-1.3.2 OMP_LAVAMD Test: OpenMP LavaMD
pts/amg-1.1.0
pts/libxsmm-1.0.1 128 128 128 M N K: 128
pts/libxsmm-1.0.1 256 256 256 M N K: 256
pts/nwchem-1.1.1 Input: C240 Buckyball
pts/incompact3d-2.0.2 input.i3d Input: X3D-benchmarking input.i3d
pts/incompact3d-2.0.2 input_193_nodes.i3d Input: input.i3d 193 Cells Per Direction
pts/lulesh-1.1.1
pts/xmrig-1.1.0 --bench=1M Variant: Monero - Hash Count: 1M
pts/xmrig-1.1.0 -a rx/wow --bench=1M Variant: Wownero - Hash Count: 1M
pts/john-the-ripper-1.8.0 --format=bcrypt Test: bcrypt
pts/john-the-ripper-1.8.0 --format=wpapsk Test: WPA PSK
pts/john-the-ripper-1.8.0 --format=bcrypt Test: Blowfish
pts/john-the-ripper-1.8.0 --format=md5crypt Test: MD5
pts/graphics-magick-2.1.0 -sharpen 0x2.0 Operation: Sharpen
pts/graphics-magick-2.1.0 -enhance Operation: Enhanced
pts/svt-av1-2.10.0 --preset 4 -n 160 -i Bosphorus_3840x2160.y4m -w 3840 -h 2160 Encoder Mode: Preset 4 - Input: Bosphorus 4K
pts/svt-av1-2.10.0 --preset 8 -i Bosphorus_3840x2160.y4m -w 3840 -h 2160 Encoder Mode: Preset 8 - Input: Bosphorus 4K
pts/svt-av1-2.10.0 --preset 12 -i Bosphorus_3840x2160.y4m -w 3840 -h 2160 Encoder Mode: Preset 12 - Input: Bosphorus 4K
pts/svt-av1-2.10.0 --preset 13 -i Bosphorus_3840x2160.y4m -w 3840 -h 2160 Encoder Mode: Preset 13 - Input: Bosphorus 4K
pts/mt-dgemm-1.2.0 Sustained Floating-Point Rate
pts/coremark-1.0.1 CoreMark Size 666 - Iterations Per Second
pts/compress-7zip-1.10.0 Test: Compression Rating
pts/compress-7zip-1.10.0 Test: Decompression Rating
pts/stockfish-1.4.0 Total Time
pts/asmfish-1.1.2 1024 Hash Memory, 26 Depth
pts/build-godot-4.0.0 Time To Compile
pts/build-linux-kernel-1.15.0 defconfig Build: defconfig
pts/build-linux-kernel-1.15.0 allmodconfig Build: allmodconfig
pts/build-llvm-1.5.0 Ninja Build System: Ninja
pts/build-nodejs-1.3.0 Time To Compile
pts/primesieve-1.9.0 1e13 Length: 1e13
pts/rays1bench-1.0.0 Large Scene
pts/onednn-3.3.0 --conv --batch=inputs/conv/shapes_auto --cfg=bf16bf16bf16 --engine=cpu Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU
pts/onednn-3.3.0 --deconv --batch=inputs/deconv/shapes_1d --cfg=bf16bf16bf16 --engine=cpu Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU
pts/helsing-1.0.2 10000000000000 99999999999999 Digit Range: 14 digit
pts/tachyon-1.3.0 Total Time
pts/cpuminer-opt-1.7.0 -a deep Algorithm: Deepcoin
pts/cpuminer-opt-1.7.0 -a blake2s Algorithm: Blake-2 S
pts/cpuminer-opt-1.7.0 -a myr-gr Algorithm: Myriad-Groestl
pts/cpuminer-opt-1.7.0 -a sha256t Algorithm: Triple SHA-256, Onecoin
pts/liquid-dsp-1.6.0 -n 128 -b 256 -f 512 Threads: 128 - Buffer Length: 256 - Filter Length: 512
pts/liquid-dsp-1.6.0 -n 240 -b 256 -f 512 Threads: 240 - Buffer Length: 256 - Filter Length: 512
pts/askap-2.1.0 tConvolveMPI Test: tConvolve MPI - Degridding
pts/askap-2.1.0 tConvolveMPI Test: tConvolve MPI - Gridding
pts/astcenc-1.4.0 -medium -repeats 20 Preset: Medium
pts/astcenc-1.4.0 -thorough -repeats 10 Preset: Thorough
pts/astcenc-1.4.0 -exhaustive -repeats 2 Preset: Exhaustive
pts/graph500-1.0.1 26 Scale: 26
pts/gromacs-1.8.0 mpi-build water-cut1.0_GMX50_bare/1536 Implementation: MPI CPU - Input: water_GMX50_bare
pts/duckdb-1.0.0 benchmark/imdb Benchmark: IMDB
pts/duckdb-1.0.0 benchmark/tpch/parquet Benchmark: TPC-H Parquet
pts/pgbench-1.14.0 -s 100 -c 1000 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
pts/pgbench-1.14.0 -s 100 -c 1000 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency
system/rawtherapee-1.0.1 Total Benchmark Time
pts/stress-ng-1.11.0 --cpu -1 --cpu-method all --no-rand-seed Test: CPU Stress
pts/stress-ng-1.11.0 --matrix -1 --no-rand-seed Test: Matrix Math
pts/stress-ng-1.11.0 --vecmath -1 --no-rand-seed Test: Vector Math
pts/stress-ng-1.11.0 --vnni -1 Test: AVX-512 VNNI
pts/stress-ng-1.11.0 --fp -1 --no-rand-seed Test: Floating Point
pts/stress-ng-1.11.0 --matrix-3d -1 --no-rand-seed Test: Matrix 3D Math
pts/stress-ng-1.11.0 --memcpy -1 --no-rand-seed Test: Memory Copying
pts/stress-ng-1.11.0 --vecwide -1 --no-rand-seed Test: Wide Vector Math
pts/stress-ng-1.11.0 --fma -1 --no-rand-seed Test: Fused Multiply-Add
pts/stress-ng-1.11.0 --vecfp -1 --no-rand-seed Test: Vector Floating Point
pts/build-gem5-1.1.0 Time To Compile
pts/openvino-1.4.0 -m models/intel/face-detection-0206/FP16/face-detection-0206.xml -d CPU Model: Face Detection FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/person-detection-0303/FP16/person-detection-0303.xml -d CPU Model: Person Detection FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/person-detection-0303/FP32/person-detection-0303.xml -d CPU Model: Person Detection FP32 - Device: CPU
pts/openvino-1.4.0 -m models/intel/vehicle-detection-0202/FP16/vehicle-detection-0202.xml -d CPU Model: Vehicle Detection FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/face-detection-0206/FP16-INT8/face-detection-0206.xml -d CPU Model: Face Detection FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/face-detection-retail-0005/FP16/face-detection-retail-0005.xml -d CPU Model: Face Detection Retail FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/road-segmentation-adas-0001/FP16/road-segmentation-adas-0001.xml -d CPU Model: Road Segmentation ADAS FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/vehicle-detection-0202/FP16-INT8/vehicle-detection-0202.xml -d CPU Model: Vehicle Detection FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/weld-porosity-detection-0001/FP16/weld-porosity-detection-0001.xml -d CPU Model: Weld Porosity Detection FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/face-detection-retail-0005/FP16-INT8/face-detection-retail-0005.xml -d CPU Model: Face Detection Retail FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/road-segmentation-adas-0001/FP16-INT8/road-segmentation-adas-0001.xml -d CPU Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/machine-translation-nar-en-de-0002/FP16/machine-translation-nar-en-de-0002.xml -d CPU Model: Machine Translation EN To DE FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/weld-porosity-detection-0001/FP16-INT8/weld-porosity-detection-0001.xml -d CPU Model: Weld Porosity Detection FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/person-vehicle-bike-detection-2004/FP16/person-vehicle-bike-detection-2004.xml -d CPU Model: Person Vehicle Bike Detection FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/handwritten-english-recognition-0001/FP16/handwritten-english-recognition-0001.xml -d CPU Model: Handwritten English Recognition FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/age-gender-recognition-retail-0013/FP16/age-gender-recognition-retail-0013.xml -d CPU Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
pts/openvino-1.4.0 -m models/intel/handwritten-english-recognition-0001/FP16-INT8/handwritten-english-recognition-0001.xml -d CPU Model: Handwritten English Recognition FP16-INT8 - Device: CPU
pts/openvino-1.4.0 -m models/intel/age-gender-recognition-retail-0013/FP16-INT8/age-gender-recognition-retail-0013.xml -d CPU Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
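
Every entry in the suite listing above starts with a test profile identifier that looks like `repository/name-version` (for example `pts/npb-1.4.5` or `system/rawtherapee-1.0.1`), followed by optional arguments and a description. The following Python sketch splits that identifier apart and counts entries per profile. The regular expression and the names `PROFILE_RE` and `parse_entry` are assumptions inferred from the entries shown here, not part of the Phoronix Test Suite. The remainder of each line is kept as one string because the listing does not mark where arguments end and the description begins.

```python
import re
from collections import Counter

# Assumed shape, inferred from the listing: repository/name-MAJOR.MINOR.PATCH,
# then an optional free-form remainder (arguments plus description).
PROFILE_RE = re.compile(
    r"^(?P<repo>[a-z]+)/(?P<name>[a-z0-9-]+)-(?P<version>\d+\.\d+\.\d+)"
    r"(?:\s+(?P<rest>.*))?$"
)


def parse_entry(line):
    """Split one suite entry into repo, name, version and the remaining text."""
    match = PROFILE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised suite entry: {line!r}")
    return match.groupdict()


# A few entries copied verbatim from the listing above.
sample = [
    "pts/npb-1.4.5 bt.C Test / Class: BT.C",
    "pts/npb-1.4.5 cg.C Test / Class: CG.C",
    "pts/john-the-ripper-1.8.0 --format=bcrypt Test: bcrypt",
    "pts/amg-1.1.0",
    "system/rawtherapee-1.0.1 Total Benchmark Time",
]

entries = [parse_entry(line) for line in sample]
per_profile = Counter(entry["name"] for entry in entries)

for name, count in sorted(per_profile.items()):
    print(f"{name}: {count}")
```

Feeding the full listing through `parse_entry` in the same way would give a per-profile count for the whole suite.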