AMD EPYC 9754 Bergamo AVX-512: AMD EPYC 9754 1P benchmarks run first with AVX-512 enabled and then with AVX-512 disabled. Tests by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2307197-NE-AMDBERGAM43

AVX512 On
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa0010b
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AVX512 Off
Processor: AMD EPYC 9754 128-Core @ 2.25GHz (128 Cores / 256 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 768GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 22.04, Kernel: 5.19.0-41-generic (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.4, Vulkan: 1.3.224, Compiler: GCC 11.3.0, File-System: ext4, Screen Resolution: 1024x768
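Since the only intended difference between the AVX512 On and AVX512 Off configurations is AVX-512 availability, it can be worth confirming whether AVX-512 is actually exposed to user space before reproducing this comparison on another system. The C snippet below is an editor-added illustration, not part of the original result file and not the mechanism used to toggle AVX-512 for these runs; it relies on GCC/Clang's __builtin_cpu_supports().

    /* avx512_check.c - build with: gcc -O2 avx512_check.c -o avx512_check */
    /* Reports whether common AVX-512 subsets are usable from user space.  */
    #include <stdio.h>

    int main(void)
    {
        __builtin_cpu_init();  /* initialize the compiler's CPU feature cache */
        printf("avx512f : %s\n", __builtin_cpu_supports("avx512f")  ? "yes" : "no");
        printf("avx512dq: %s\n", __builtin_cpu_supports("avx512dq") ? "yes" : "no");
        printf("avx512bw: %s\n", __builtin_cpu_supports("avx512bw") ? "yes" : "no");
        printf("avx512vl: %s\n", __builtin_cpu_supports("avx512vl") ? "yes" : "no");
        return 0;
    }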
AVX512 On vs. AVX512 Off Comparison (percent advantage for the AVX512 On run; tests reported with two metrics show both deltas):

TensorFlow: CPU - 512 - AlexNet 1385.6%; CPU - 256 - AlexNet 1238.3%; CPU - 256 - GoogLeNet 986.6%; CPU - 512 - GoogLeNet 789.3%; CPU - 64 - AlexNet 778.7%; CPU - 32 - AlexNet 562.1%; CPU - 256 - ResNet-50 499.9%; CPU - 64 - GoogLeNet 499%; CPU - 512 - ResNet-50 491%; CPU - 64 - ResNet-50 423.7%; CPU - 16 - AlexNet 389.1%; CPU - 32 - ResNet-50 309.7%; CPU - 32 - GoogLeNet 306.6%; CPU - 16 - ResNet-50 171%; CPU - 16 - GoogLeNet 164.3%
OpenVINO: Weld Porosity Detection FP16 - CPU 138.2% / 138%; Face Detection FP16 - CPU 132% / 131.2%; Machine Translation EN To DE FP16 - CPU 130.3% / 130.2%; Weld Porosity Detection FP16-INT8 - CPU 107.7% / 107.6%; Face Detection FP16-INT8 - CPU 103.6% / 102.6%; Person Vehicle Bike Detection FP16 - CPU 100.2% / 100.1%; Age Gender Recognition Retail 0013 FP16 - CPU 92.9% / 76.2%; Person Detection FP16/FP32 - CPU 80.2% / 79.8% / 78.2% / 77.9%; Vehicle Detection FP16-INT8 - CPU 43.9% / 43.6%; Vehicle Detection FP16 - CPU 20.1% / 20%; Age Gender Recognition Retail 0013 FP16-INT8 - CPU 11.4% / 10.6%
Cpuminer-Opt: LBC, LBRY Credits 98.6%; Quad SHA-256, Pyrite 71.1%; x25x 65.4%; Blake-2 S 58.9%; scrypt 47.2%; Garlicoin 34.5%; Skeincoin 25.3%; Myriad-Groestl 20.7%
Neural Magic DeepSparse (Asynchronous Multi-Stream): NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased 92.3% / 92.2%; CV Segmentation, 90% Pruned YOLACT Pruned 83.3% / 81.9%; NLP Text Classification, BERT base uncased SST2 21.2% / 21.1%; NLP Document Classification, oBERT base uncased on IMDB 19.8% / 19.2%; NLP Text Classification, DistilBERT mnli 19.7% / 19.6%; NLP Token Classification, BERT base uncased conll2003 19.6% / 19.3%; NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 16.1% / 14.8%; CV Classification, ResNet-50 ImageNet 11.4% / 11.3%
OSPRay: gravity_spheres_volume/dim_512/scivis/real_time 75.8%; gravity_spheres_volume/dim_512/ao/real_time 70.8%; gravity_spheres_volume/dim_512/pathtracer/real_time 43.1%
libxsmm: 256 38.4%; 128 4.6%
miniBUDE: OpenMP - BM2 31.9%; OpenMP - BM1 26.7%
OpenVKL: vklBenchmark ISPC 18.6%
Embree: Pathtracer ISPC - Asian Dragon 18.6%; Pathtracer ISPC - Asian Dragon Obj 17%; Pathtracer ISPC - Crown 11.9%
oneDNN: Recurrent Neural Network Training - bf16bf16bf16 - CPU 12.2%
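As a worked example of how these deltas relate to the raw figures in the table below: Weld Porosity Detection FP16 on the CPU runs at 6073.22 FPS with AVX-512 versus 2551.70 FPS without, and 6073.22 / 2551.70 is roughly 2.38, i.e. the approximately +138% advantage listed above; likewise miniBUDE BM1 at 5925.67 vs. 4677.68 GFInst/s works out to about +26.7%.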
AMD EPYC 9754 Bergamo AVX-512 - Detailed Results (AVX512 On / AVX512 Off; where a test appears twice, the first value of the pair corresponds to the metric graphed further below and the second to the test's other reported metric):

minibude: OpenMP - BM1: 5925.670 / 4677.682
minibude: OpenMP - BM1: 237.027 / 187.107
minibude: OpenMP - BM2: 5972.187 / 4528.305
minibude: OpenMP - BM2: 238.887 / 181.132
libxsmm: 128: 2690.7 / 2573.3
libxsmm: 256: 3342.5 / 2415.3
embree: Pathtracer ISPC - Crown: 125.5414 / 112.2274
embree: Pathtracer ISPC - Asian Dragon: 157.6450 / 132.9504
embree: Pathtracer ISPC - Asian Dragon Obj: 134.8396 / 115.2682
openvkl: vklBenchmark ISPC: 1398 / 1179
ospray: gravity_spheres_volume/dim_512/ao/real_time: 32.7557 / 19.1731
ospray: gravity_spheres_volume/dim_512/scivis/real_time: 31.6753 / 18.0203
ospray: gravity_spheres_volume/dim_512/pathtracer/real_time: 27.9715 / 19.5463
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU: 1174.75 / 1317.57
cpuminer-opt: x25x: 4977.89 / 3010.37
cpuminer-opt: scrypt: 2993.21 / 2033.15
cpuminer-opt: Blake-2 S: 7238650 / 4555580
cpuminer-opt: Garlicoin: 53090 / 39473
cpuminer-opt: Skeincoin: 1174953 / 937977
cpuminer-opt: Myriad-Groestl: 8628.76 / 7149.95
cpuminer-opt: LBC, LBRY Credits: 660660 / 332667
cpuminer-opt: Quad SHA-256, Pyrite: 1498937 / 876137
tensorflow: CPU - 16 - AlexNet: 342.88 / 70.10
tensorflow: CPU - 32 - AlexNet: 562.48 / 84.96
tensorflow: CPU - 64 - AlexNet: 857.55 / 97.59
tensorflow: CPU - 256 - AlexNet: 1422.36 / 106.28
tensorflow: CPU - 512 - AlexNet: 1632.40 / 109.88
tensorflow: CPU - 16 - GoogLeNet: 104.77 / 39.64
tensorflow: CPU - 16 - ResNet-50: 43.11 / 15.91
tensorflow: CPU - 32 - GoogLeNet: 180.77 / 44.46
tensorflow: CPU - 32 - ResNet-50: 71.62 / 17.48
tensorflow: CPU - 64 - GoogLeNet: 277.24 / 46.28
tensorflow: CPU - 64 - ResNet-50: 96.63 / 18.45
tensorflow: CPU - 256 - GoogLeNet: 501.67 / 46.17
tensorflow: CPU - 256 - ResNet-50: 119.32 / 19.89
tensorflow: CPU - 512 - GoogLeNet: 417.25 / 46.92
tensorflow: CPU - 512 - ResNet-50: 122.81 / 20.78
deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream: 73.5393 / 61.3689
deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream: 858.4029 / 1023.1690
deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream: 1381.5669 / 718.3032
deepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream: 46.2608 / 88.9093
deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream: 247.9653 / 213.6393
deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream: 259.6468 / 298.0628
deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream: 970.0568 / 870.9541
deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream: 65.8909 / 73.3502
deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream: 624.7370 / 522.1104
deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream: 102.2238 / 122.2295
deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream: 127.0611 / 69.3341
deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream: 498.4779 / 906.9414
deepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream: 316.1679 / 260.7671
deepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream: 201.5964 / 244.1513
deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream: 73.1459 / 61.1755
deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream: 859.7119 / 1025.4806
openvino: Face Detection FP16 - CPU: 60.73 / 26.18
openvino: Face Detection FP16 - CPU: 1048.37 / 2423.39
openvino: Person Detection FP16 - CPU: 27.08 / 15.06
openvino: Person Detection FP16 - CPU: 2334.29 / 4153.59
openvino: Person Detection FP32 - CPU: 27.01 / 14.99
openvino: Person Detection FP32 - CPU: 2339.81 / 4170.49
openvino: Vehicle Detection FP16 - CPU: 1430.45 / 1190.91
openvino: Vehicle Detection FP16 - CPU: 44.84 / 53.79
openvino: Face Detection FP16-INT8 - CPU: 118.00 / 57.95
openvino: Face Detection FP16-INT8 - CPU: 540.35 / 1094.52
openvino: Vehicle Detection FP16-INT8 - CPU: 5690.34 / 3954.90
openvino: Vehicle Detection FP16-INT8 - CPU: 11.26 / 16.17
openvino: Weld Porosity Detection FP16 - CPU: 6073.22 / 2551.70
openvino: Weld Porosity Detection FP16 - CPU: 10.52 / 25.06
openvino: Machine Translation EN To DE FP16 - CPU: 580.40 / 251.97
openvino: Machine Translation EN To DE FP16 - CPU: 110.35 / 254.01
openvino: Weld Porosity Detection FP16-INT8 - CPU: 11818.33 / 5692.99
openvino: Weld Porosity Detection FP16-INT8 - CPU: 10.82 / 22.47
openvino: Person Vehicle Bike Detection FP16 - CPU: 6638.71 / 3317.34
openvino: Person Vehicle Bike Detection FP16 - CPU: 9.63 / 19.28
openvino: Age Gender Recognition Retail 0013 FP16 - CPU: 110240.89 / 62564.16
openvino: Age Gender Recognition Retail 0013 FP16 - CPU: 0.99 / 1.91
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU: 73970.12 / 66895.49
openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU: 1.58 / 1.76
miniBUDE
MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
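To make the AVX-512 sensitivity concrete, here is a minimal OpenMP sketch added by the editor; it is not code from miniBUDE itself, only an illustration of the kind of data-parallel loop that a -std=c99 -Ofast -ffast-math -fopenmp -march=native build (the flags shown below) lets the compiler vectorize with the widest SIMD the host exposes, AVX-512 included.

    /* Illustrative only - not miniBUDE source.  A toy pairwise scoring loop
     * in the spirit of docking workloads; with -fopenmp -Ofast -march=native
     * the inner loop is a straightforward auto-vectorization target. */
    #include <stddef.h>

    void score_poses(size_t n,
                     const float *restrict dx, const float *restrict dy,
                     const float *restrict dz, float *restrict energy)
    {
        #pragma omp parallel for simd
        for (size_t i = 0; i < n; i++) {
            float r2 = dx[i] * dx[i] + dy[i] * dy[i] + dz[i] * dz[i];
            energy[i] = 1.0f / (r2 + 1.0f);   /* hypothetical scoring term */
        }
    }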
miniBUDE 20210901 (GFInst/s, more is better):
Implementation: OpenMP - Input Deck: BM1: AVX512 On 5925.67 (SE +/- 2.27, N = 9), AVX512 Off 4677.68 (SE +/- 18.03, N = 8)
Implementation: OpenMP - Input Deck: BM2: AVX512 On 5972.19 (SE +/- 0.44, N = 3), AVX512 Off 4528.31 (SE +/- 15.49, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
libxsmm
Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
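For context on what the libxsmm 128 and 256 results in the overview table above measure (presumably the matrix size used by the test profile), the core primitive libxsmm accelerates is the small dense matrix multiply. The reference loop below is an editor-added illustration in plain C, not libxsmm code or its API; libxsmm instead JIT-generates architecture-specific kernels for such shapes, which is where AVX-512 comes into play.

    /* Reference small GEMM, C += A * B, column-major storage.
     * Purely illustrative of the dense primitive libxsmm specializes in;
     * libxsmm replaces loops like this with JIT-generated SIMD kernels. */
    void small_gemm_ref(int m, int n, int k,
                        const double *A,   /* m x k, leading dimension m */
                        const double *B,   /* k x n, leading dimension k */
                        double *C)         /* m x n, leading dimension m */
    {
        for (int j = 0; j < n; j++)
            for (int p = 0; p < k; p++)
                for (int i = 0; i < m; i++)
                    C[i + j * m] += A[i + p * m] * B[p + j * k];
    }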
Embree
Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OSPRay
Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
oneDNN
This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
Cpuminer-Opt
Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the processor across a variety of cryptocurrencies. The benchmark reports the hash speed for CPU mining of the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.
TensorFlow
This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that the Phoronix Test Suite also offers pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if complementary metrics are desired. Learn more via the OpenBenchmarking.org test page.
Neural Magic DeepSparse
This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.
Neural Magic DeepSparse 1.5 (items/sec, more is better; Scenario: Asynchronous Multi-Stream):
Model: NLP Document Classification, oBERT base uncased on IMDB: AVX512 On 73.54 (SE +/- 0.14, N = 3), AVX512 Off 61.37 (SE +/- 0.02, N = 3)
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased: AVX512 On 1381.57 (SE +/- 1.58, N = 3), AVX512 Off 718.30 (SE +/- 5.33, N = 3)
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90: AVX512 On 247.97 (SE +/- 7.49, N = 15), AVX512 Off 213.64 (SE +/- 0.14, N = 3)
Model: CV Classification, ResNet-50 ImageNet: AVX512 On 970.06 (SE +/- 0.42, N = 3), AVX512 Off 870.95 (SE +/- 0.45, N = 3)
Model: NLP Text Classification, DistilBERT mnli: AVX512 On 624.74 (SE +/- 0.52, N = 3), AVX512 Off 522.11 (SE +/- 0.58, N = 3)
Model: CV Segmentation, 90% Pruned YOLACT Pruned: AVX512 On 127.06 (SE +/- 0.01, N = 3), AVX512 Off 69.33 (SE +/- 0.03, N = 3)
Model: NLP Text Classification, BERT base uncased SST2: AVX512 On 316.17 (SE +/- 0.78, N = 3), AVX512 Off 260.77 (SE +/- 0.63, N = 3)
Model: NLP Token Classification, BERT base uncased conll2003: AVX512 On 73.15 (SE +/- 0.19, N = 3), AVX512 Off 61.18 (SE +/- 0.13, N = 3)
OpenVINO
This is a test of Intel OpenVINO, a toolkit for neural network inference, using its built-in benchmarking support and analyzing the throughput and latency of various models. Learn more via the OpenBenchmarking.org test page.
OpenVINO 2022.3 (FPS, more is better; Device: CPU):
Model: Face Detection FP16: AVX512 On 60.73 (SE +/- 0.06, N = 3), AVX512 Off 26.18 (SE +/- 0.36, N = 3)
Model: Person Detection FP16: AVX512 On 27.08 (SE +/- 0.30, N = 12), AVX512 Off 15.06 (SE +/- 0.21, N = 3)
Model: Person Detection FP32: AVX512 On 27.01 (SE +/- 0.18, N = 12), AVX512 Off 14.99 (SE +/- 0.16, N = 5)
Model: Vehicle Detection FP16: AVX512 On 1430.45 (SE +/- 22.93, N = 15), AVX512 Off 1190.91 (SE +/- 15.59, N = 14)
Model: Face Detection FP16-INT8: AVX512 On 118.00 (SE +/- 0.02, N = 3), AVX512 Off 57.95 (SE +/- 0.03, N = 3)
Model: Vehicle Detection FP16-INT8: AVX512 On 5690.34 (SE +/- 89.26, N = 15), AVX512 Off 3954.90 (SE +/- 2.12, N = 3)
Model: Weld Porosity Detection FP16: AVX512 On 6073.22 (SE +/- 1.32, N = 3), AVX512 Off 2551.70 (SE +/- 0.56, N = 3)
Model: Machine Translation EN To DE FP16: AVX512 On 580.40 (SE +/- 7.10, N = 15), AVX512 Off 251.97 (SE +/- 3.15, N = 15)
Model: Weld Porosity Detection FP16-INT8: AVX512 On 11818.33 (SE +/- 1.17, N = 3), AVX512 Off 5692.99 (SE +/- 1.65, N = 3)
Model: Person Vehicle Bike Detection FP16: AVX512 On 6638.71 (SE +/- 13.51, N = 3), AVX512 Off 3317.34 (SE +/- 26.07, N = 15)
Model: Age Gender Recognition Retail 0013 FP16: AVX512 On 110240.89 (SE +/- 314.35, N = 3), AVX512 Off 62564.16 (SE +/- 278.13, N = 3)
Model: Age Gender Recognition Retail 0013 FP16-INT8: AVX512 On 73970.12 (SE +/- 95.74, N = 3), AVX512 Off 66895.49 (SE +/- 20.10, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF
CPU Peak Freq (Highest CPU Core Frequency) Monitor (Megahertz): AVX512 On Min 2250 / Avg 2918.06 / Max 3532, AVX512 Off Min 2203 / Avg 2979.69 / Max 3559
CPU Power Consumption Monitor (Watts): AVX512 On Min 10.25 / Avg 231.36 / Max 398.39, AVX512 Off Min 10.15 / Avg 179.15 / Max 378.14
CPU Temperature Monitor (Celsius): AVX512 On Min 23.25 / Avg 51.4 / Max 74.25, AVX512 Off Min 20.75 / Avg 44.22 / Max 76.13
AVX512 On
Testing initiated at 16 July 2023 06:38 by user phoronix.
AVX512 Off
Testing initiated at 16 July 2023 14:04 by user phoronix.