NVIDIA GH200 GPU

ARMv8 Neoverse-V2 testing with a Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS) and NVIDIA GH100 [GH200 120GB] on Ubuntu 23.10 via the Phoronix Test Suite.

ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] -:

  Processor: ARMv8 Neoverse-V2 @ 3.39GHz (72 Cores), Motherboard: Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS), Memory: 1 x 480 GB DRAM-6400MT/s, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07 + 1920GB SAMSUNG MZTL21T9, Graphics: NVIDIA GH100 [GH200 120GB], Network: 2 x Mellanox MT2910 + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE

  OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (aarch64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200

PyTorch 2.1
Device: CPU - Batch Size: 1 - Model: ResNet-50
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 10.66 |=====================

PyTorch 2.1
Device: CPU - Batch Size: 1 - Model: ResNet-152
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 3.93 |======================

PyTorch 2.1
Device: CPU - Batch Size: 512 - Model: ResNet-50
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 10.28 |=====================

PyTorch 2.1
Device: CPU - Batch Size: 512 - Model: ResNet-152
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 3.77 |======================

PyTorch 2.1
Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 0.44 |======================

PyTorch 2.1
Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 2.54 |======================

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50
batches/sec > Higher Is Better

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152
batches/sec > Higher Is Better

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50
batches/sec > Higher Is Better

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152
batches/sec > Higher Is Better

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better

PyTorch 2.1
Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l
batches/sec > Higher Is Better

Blender 3.6.2
Blend File: BMW27 - Compute: CUDA
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 40.43 |=====================
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 40.72 |=====================

Blender 3.6.2
Blend File: BMW27 - Compute: CPU-Only
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 40.98 |=====================
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 41.60 |=====================

Blender 3.6.2
Blend File: Classroom - Compute: CUDA
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 86.88 |=====================

Blender 3.6.2
Blend File: Fishy Cat - Compute: CUDA
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 83.30 |=====================

Blender 3.6.2
Blend File: Barbershop - Compute: CUDA
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 455.69 |====================

Blender 3.6.2
Blend File: Classroom - Compute: CPU-Only
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 87.25 |=====================

Blender 3.6.2
Blend File: Fishy Cat - Compute: CPU-Only
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 82.72 |=====================

Blender 3.6.2
Blend File: Barbershop - Compute: CPU-Only
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 456.61 |====================

Blender 3.6.2
Blend File: Pabellon Barcelona - Compute: CUDA
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 172.36 |====================

Blender 3.6.2
Blend File: Pabellon Barcelona - Compute: CPU-Only
Seconds < Lower Is Better
ARMv8 Neoverse-V2 - NVIDIA GH100 [GH200 120GB] - . 171.76 |====================