bandwidth

ARMv8 Neoverse-V2 testing with a Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS) and NVIDIA GH200 480GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402279-NE-BANDWIDTH66&gru&rdt.

bandwidthProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelDisplay DriverOpenCLVulkanCompilerFile-SystemScreen ResolutionARMv8 Neoverse-V2bcdefARMv8 Neoverse-V2 @ 3.39GHz (72 Cores)Quanta Cloud QuantaGrid S74G-2U 1S7GZ9Z0000 S7G MB (CG1) (3A06 BIOS)1 x 480GB DRAM-6400MT/s960GB SAMSUNG MZ1L2960HCJR-00A07 + 1920GB SAMSUNG MZTL21T9NVIDIA GH200 480GB2 x Mellanox MT2910 + 2 x QLogic FastLinQ QL41000 10/25/40/50GbEUbuntu 22.046.5.0-1007-NVIDIA-64k (aarch64)NVIDIAOpenCL 3.0 CUDA 12.4.891.3.277GCC 11.4.0 + CUDA 11.5ext41920x1200OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

bandwidthgraph500: 26graph500: 26hpcg: 104 104 104 - 60hpcg: 144 144 144 - 60hpcg: 160 160 160 - 60hpcg: 192 192 192 - 60gromacs: MPI CPU - water_GMX50_baregraph500: 26graph500: 26ARMv8 Neoverse-V2bcdef1573470000150557000044.705941.919139.915838.73125.4295119090003339050001574260000150351000039.571838.964438.77538.48435.525013800003220510001571860000149867000038.948338.233338.512738.10515.5294924130003274510001562170000148927000038.634638.01738.172538.05685.534986660003227040001547980000146972000038.686437.991238.000437.98255.5154915010003228830001559480000148140000038.705838.108238.029737.85385.508483974000318469000OpenBenchmarking.org

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26ARMv8 Neoverse-V2bcdef300M600M900M1200M1500M1573470000157426000015718600001562170000154798000015594800001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26ARMv8 Neoverse-V2bcdef300M600M900M1200M1500M1505570000150351000014986700001489270000146972000014814000001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60ARMv8 Neoverse-V2bcdef1020304050SE +/- 0.02, N = 344.7139.5738.9538.6338.6938.711. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60ARMv8 Neoverse-V2bcdef1020304050SE +/- 0.21, N = 341.9238.9638.2338.0237.9938.111. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 60ARMv8 Neoverse-V2bcdef918273645SE +/- 0.15, N = 339.9238.7838.5138.1738.0038.031. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 192 192 192 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 192 192 192 - RT: 60ARMv8 Neoverse-V2bcdef918273645SE +/- 0.18, N = 338.7338.4838.1138.0637.9837.851. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareARMv8 Neoverse-V2bcdef1.24432.48863.73294.97726.2215SE +/- 0.033, N = 35.4295.5205.5295.5305.5155.508

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26ARMv8 Neoverse-V2bcdef110M220M330M440M550M5119090005013800004924130004986660004915010004839740001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26ARMv8 Neoverse-V2bcdef70M140M210M280M350M3339050003220510003274510003227040003228830003184690001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi


Phoronix Test Suite v10.8.5