NVIDIA GH200 Compilers

Clang and GCC benchmarks by Michael Larabel for a future article. ARMv8 Neoverse-V2 testing with a Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2402098-NE-NVIDIAGH291
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Audio Encoding 3 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 8 Tests
Creator Workloads 7 Tests
Encoding 4 Tests
HPC - High Performance Computing 3 Tests
Imaging 3 Tests
Molecular Dynamics 2 Tests
Multi-Core 6 Tests
OpenMPI Tests 2 Tests
Scientific Computing 2 Tests
Server CPU Tests 3 Tests
Single-Threaded 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 13
February 09
  2 Hours, 27 Minutes
Clang 17
February 09
  2 Hours, 31 Minutes
Invert Hiding All Results Option
  2 Hours, 29 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA GH200 Compilers Suite 1.0.0 System Test suite extracted from NVIDIA GH200 Compilers. pts/c-ray-1.2.0 Total Time - 4K, 16 Rays Per Pixel pts/encode-flac-1.8.1 WAV To FLAC pts/graphics-magick-2.1.0 -swirl 90 Operation: Swirl pts/graphics-magick-2.1.0 -rotate 90 Operation: Rotate pts/graphics-magick-2.1.0 -sharpen 0x2.0 Operation: Sharpen pts/graphics-magick-2.1.0 -enhance Operation: Enhanced pts/graphics-magick-2.1.0 -resize 50% Operation: Resizing pts/graphics-magick-2.1.0 -operator all Noise-Gaussian 30% Operation: Noise-Gaussian pts/graphics-magick-2.1.0 -colorspace HWB Operation: HWB Color Space pts/helsing-1.0.2 10000000000000 99999999999999 Digit Range: 14 digit pts/encode-mp3-1.7.4 WAV To MP3 pts/lammps-1.4.0 benchmark_20k_atoms.in Model: 20k Atoms pts/lammps-1.4.0 in.rhodo Model: Rhodopsin Protein pts/avifenc-1.4.0 -s 0 Encoder Speed: 0 pts/avifenc-1.4.0 -s 2 Encoder Speed: 2 pts/avifenc-1.4.0 -s 6 -l Encoder Speed: 6, Lossless pts/avifenc-1.4.0 -s 10 -l Encoder Speed: 10, Lossless pts/liquid-dsp-1.6.0 -n 1 -b 256 -f 32 Threads: 1 - Buffer Length: 256 - Filter Length: 32 pts/liquid-dsp-1.6.0 -n 1 -b 256 -f 57 Threads: 1 - Buffer Length: 256 - Filter Length: 57 pts/liquid-dsp-1.6.0 -n 1 -b 256 -f 512 Threads: 1 - Buffer Length: 256 - Filter Length: 512 pts/liquid-dsp-1.6.0 -n 32 -b 256 -f 32 Threads: 32 - Buffer Length: 256 - Filter Length: 32 pts/liquid-dsp-1.6.0 -n 32 -b 256 -f 57 Threads: 32 - Buffer Length: 256 - Filter Length: 57 pts/liquid-dsp-1.6.0 -n 64 -b 256 -f 32 Threads: 64 - Buffer Length: 256 - Filter Length: 32 pts/liquid-dsp-1.6.0 -n 64 -b 256 -f 57 Threads: 64 - Buffer Length: 256 - Filter Length: 57 pts/liquid-dsp-1.6.0 -n 72 -b 256 -f 32 Threads: 72 - Buffer Length: 256 - Filter Length: 32 pts/liquid-dsp-1.6.0 -n 72 -b 256 -f 57 Threads: 72 - Buffer Length: 256 - Filter Length: 57 pts/liquid-dsp-1.6.0 -n 32 -b 256 -f 512 Threads: 32 - Buffer Length: 256 - Filter Length: 512 pts/liquid-dsp-1.6.0 -n 64 -b 256 -f 512 Threads: 64 - Buffer Length: 256 - Filter Length: 512 pts/liquid-dsp-1.6.0 -n 72 -b 256 -f 512 Threads: 72 - Buffer Length: 256 - Filter Length: 512 pts/lulesh-1.1.1 pts/minibude-1.0.0 --deck ../data/bm1 --iterations 500 Implementation: OpenMP - Input Deck: BM1 pts/minibude-1.0.0 --deck ../data/bm2 --iterations 10 Implementation: OpenMP - Input Deck: BM2 pts/encode-opus-1.4.0 WAV To Opus Encode pts/primesieve-1.9.0 1e12 Length: 1e12 pts/primesieve-1.9.0 1e13 Length: 1e13 pts/quantlib-1.2.0 --mp Configuration: Multi-Threaded pts/quantlib-1.2.0 Configuration: Single-Threaded pts/securemark-1.0.0 Benchmark: SecureMark-TLS pts/stress-ng-1.11.0 --cache -1 --no-rand-seed Test: CPU Cache pts/stress-ng-1.11.0 --matrix -1 --no-rand-seed Test: Matrix Math pts/stress-ng-1.11.0 --vecmath -1 --no-rand-seed Test: Vector Math pts/stress-ng-1.11.0 --fp -1 --no-rand-seed Test: Floating Point pts/stress-ng-1.11.0 --vecshuf -1 --no-rand-seed Test: Vector Shuffle pts/stress-ng-1.11.0 --fma -1 --no-rand-seed Test: Fused Multiply-Add pts/stress-ng-1.11.0 --vecfp -1 --no-rand-seed Test: Vector Floating Point pts/tscp-1.2.2 AI Chess Performance pts/webp-1.2.0 Encode Settings: Default pts/webp-1.2.0 -q 100 Encode Settings: Quality 100 pts/webp-1.2.0 -q 100 -lossless Encode Settings: Quality 100, Lossless pts/webp-1.2.0 -q 100 -m 6 Encode Settings: Quality 100, Highest Compression pts/webp-1.2.0 -q 100 -lossless -m 6 Encode Settings: Quality 100, Lossless, Highest Compression pts/compress-zstd-1.6.0 -b19 Compression Level: 19 - Compression Speed pts/compress-zstd-1.6.0 -b19 Compression Level: 19 - Decompression Speed pts/compress-zstd-1.6.0 -b19 --long Compression Level: 19, Long Mode - Compression Speed pts/compress-zstd-1.6.0 -b19 --long Compression Level: 19, Long Mode - Decompression Speed