NVIDIA GH200 GPU

ARMv8 Neoverse-V2 testing with a Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS) and NVIDIA GH100 [GH200 120GB] on Ubuntu 23.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2401285-NE-NVIDIAGH229

Jump To Table - Results

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -

Processor: ARMv8 Neoverse-V2 @ 3.39GHz (72 Cores), Motherboard: Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS), Memory: 1 x 480 GB DRAM-6400MT/s, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07 + 1920GB SAMSUNG MZTL21T9, Graphics: NVIDIA GH100 [GH200 120GB], Network: 2 x Mellanox MT2910 + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE

OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (aarch64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Processor Notes: Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -: The test quit with a non-zero exit status. E: AssertionError: Torch not compiled with CUDA enabled

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

16 Results Shown

PyTorch:
CPU - 1 - ResNet-50
CPU - 1 - ResNet-152
CPU - 512 - ResNet-50
CPU - 512 - ResNet-152
CPU - 1 - Efficientnet_v2_l
CPU - 512 - Efficientnet_v2_l
Blender:
BMW27 - CUDA
BMW27 - CPU-Only
Classroom - CUDA
Fishy Cat - CUDA
Barbershop - CUDA
Classroom - CPU-Only
Fishy Cat - CPU-Only
Barbershop - CPU-Only
Pabellon Barcelona - CUDA
Pabellon Barcelona - CPU-Only

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -

Processor: ARMv8 Neoverse-V2 @ 3.39GHz (72 Cores), Motherboard: Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS), Memory: 1 x 480 GB DRAM-6400MT/s, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07 + 1920GB SAMSUNG MZTL21T9, Graphics: NVIDIA GH100 [GH200 120GB], Network: 2 x Mellanox MT2910 + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE

OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (aarch64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Processor Notes: Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 27 January 2024 18:20 by user x.