a40-ml

KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2412124-NE-A40ML481010


NVIDIA A40 - 80 x Intel Xeon

Processor: 80 x Intel Xeon (Icelake) (80 Cores), Motherboard: Nutanix AHV (0.0.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 8 x 16 GB RAM Red Hat, Disk: 8796GB VDISK, Graphics: NVIDIA A40 48GB, Network: Red Hat Virtio device

OS: Ubuntu 22.04, Kernel: 6.5.0-45-generic (x86_64), Display Driver: NVIDIA, Vulkan: 1.3.255, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1280x1024, System Layer: KVM

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0x1
Vbios Version Notes: ??.??.??.??.??
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Unknown: Dependent on hypervisor status + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: SW sequence; BHI: Syscall hardening KVM: SW loop + srbds: Not affected + tsx_async_abort: Not affected
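The Security Notes line packs one status per vulnerability into a single string, with entries separated by " + " and each name separated from its status by the first ": ". A minimal sketch of splitting it into a lookup table follows; the function name `parse_security_notes` and the shortened `sample` string are illustrative assumptions, not part of the Phoronix Test Suite.

```python
def parse_security_notes(notes: str) -> dict[str, str]:
    """Split a 'name: status + name: status' string into a dict.

    Assumes entries are joined by ' + ' and that the first ': ' in each
    entry separates the vulnerability name from its status text.
    """
    result = {}
    for entry in notes.split(" + "):
        name, _, status = entry.partition(": ")
        result[name.strip()] = status.strip()
    return result


# Shortened sample taken from the Security Notes line above.
sample = (
    "itlb_multihit: Not affected + "
    "mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; "
    "SMT Host state unknown + "
    "srbds: Not affected"
)

parsed = parse_security_notes(sample)

# List every entry whose status text starts with "Vulnerable".
vulnerable = [name for name, status in parsed.items()
              if status.startswith("Vulnerable")]
print(vulnerable)
```

Partitioning on the first ": " only keeps statuses that themselves contain a colon, such as the mmio_stale_data entry, intact.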

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL versions of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

Backend: BLAS

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status.

oneDNN

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three-minute audio recording. Learn more via the OpenBenchmarking.org test page.

R Benchmark

This test is a quick-running survey of general R performance. Learn more via the OpenBenchmarking.org test page.

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26-minute, 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

LiteRT

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.

Device: CPU - Batch Size: 1 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 1 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 16 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 32 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 64 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 16 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 256 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 32 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 512 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 64 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 256 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 512 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 16 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 32 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: CPU - Batch Size: 512 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 1 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 16 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 32 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 64 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 256 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: Efficientnet_v2_l

NVIDIA A40 - 80 x Intel Xeon: The test quit with a non-zero exit status. E: ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory
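Every PyTorch configuration above stopped on the same ImportError for libnvJitLink.so.12. As a rough first check, the sketch below asks the dynamic loader whether it can open that library by name. The helper name `can_load` is an illustrative assumption. The check covers only the loader's default search path, so PyTorch may look in other locations that this check does not, and a negative result here does not by itself identify the cause of the failures.

```python
import ctypes


def can_load(library_name: str) -> bool:
    """Return True if the dynamic loader can open the named shared library."""
    try:
        ctypes.CDLL(library_name)
        return True
    except OSError:
        # Raised when the library cannot be found or opened.
        return False


# Library name copied from the ImportError message above.
name = "libnvJitLink.so.12"
status = "loadable" if can_load(name) else "not found by the dynamic loader"
print(f"{name}: {status}")
```

Running this inside the same Python environment that the benchmark uses gives the most relevant answer, since the loader's search path can differ between environments.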

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). The Phoronix Test Suite also provides pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if complementary metrics are desired. Learn more via the OpenBenchmarking.org test page.

Device: CPU - Batch Size: 1 - Model: VGG-16