Nvidia A2 Benchmarks - OpenBenchmarking.org

2 x AMD EPYC 9334 32-Core testing with a Giga Computing MZ73-LM1-000 v01000100 (F10 BIOS) and Gigabyte NVIDIA A2 15GB on Ubuntu 22.04 via the Phoronix Test Suite.

Genoa 9334

Processor: 2 x AMD EPYC 9334 32-Core @ 2.70GHz (64 Cores / 127 Threads), Motherboard: Giga Computing MZ73-LM1-000 v01000100 (F10 BIOS), Chipset: AMD Device 14a4, Memory: 8 x 32 GB DDR5-4800MT/s Micron MTC20F1045S1RC48BA2, Disk: 1920GB KINGSTON SEDC600, Graphics: Gigabyte NVIDIA A2 15GB, Network: 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA

OS: Ubuntu 22.04, Kernel: 6.8.0-45-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.183.01, OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 12.2, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa10113e
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Genoa Eypc 9334

OS: Ubuntu 22.04, Kernel: 6.8.0-45-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.183.01, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1~22.04.2 (LLVM 15.0.7 256 bits), OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 12.2, File-System: ext4, Screen Resolution: 1920x1200

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa10113e
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL

Genoa 9334: The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

18 Results Shown

NCNN:
Vulkan GPU - FastestDet
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet

Genoa 9334

Testing initiated at 26 September 2024 10:17 by user siddheshapical.

Genoa Eypc 9334

OS: Ubuntu 22.04, Kernel: 6.8.0-45-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.183.01, OpenGL: 4.5 Mesa 23.2.1-1ubuntu3.1~22.04.2 (LLVM 15.0.7 256 bits), OpenCL: OpenCL 3.0 CUDA 12.2.148, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 12.2, File-System: ext4, Screen Resolution: 1920x1200

Testing initiated at 26 September 2024 10:28 by user siddheshapical.

nvidia A2

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

Genoa 9334

Genoa Eypc 9334

NCNN

PlaidML

18 Results Shown

Genoa 9334

Genoa Eypc 9334