Microsoft Azure EPYC 7003 HBv3 Benchmarks

Azure HBv3 vs. Azure HBv2 benchmarks.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104116-PTS-AZURE71063
The tests in this comparison fall within the following categories:

Bioinformatics 2 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 7 Tests
CPU Massive 16 Tests
Creator Workloads 4 Tests
Encoding 3 Tests
Finance 2 Tests
Fortran Tests 4 Tests
HPC - High Performance Computing 19 Tests
Machine Learning 6 Tests
Molecular Dynamics 7 Tests
MPI Benchmarks 6 Tests
Multi-Core 14 Tests
NVIDIA GPU Compute 5 Tests
OpenMPI Tests 9 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 3 Tests
Scientific Computing 10 Tests
Server CPU Tests 10 Tests
Single-Threaded 2 Tests
Video Encoding 3 Tests

Test Runs

Result Identifier    Date Run          Test Duration
Azure HBv3           April 09 2021     11 Hours, 37 Minutes
Azure HBv2           April 10 2021     14 Hours, 14 Minutes


Microsoft Azure EPYC 7003 HBv3 Benchmarks - System Details

Azure HBv3:
  Processor: 2 x AMD EPYC 7V13 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.1 BIOS)
  Memory: 442GB
  Disk: 2 x 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk
  Graphics: hyperv_fb
  OS: CentOS Linux 8
  Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64)
  Compiler: GCC 8.3.1 20190507
  File-System: nfs
  Screen Resolution: 1152x864
  System Layer: microsoft

Azure HBv2:
  Processor: 2 x AMD EPYC 7V12 64-Core (120 Cores)
  Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.0 BIOS)
  Memory: 450GB
  Disk: 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk
  (Remaining components identical to Azure HBv3)

Kernel Details: Transparent Huge Pages: always
Compiler Details: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Details: CPU Microcode: 0xffffffff
Python Details: Python 3.6.8
Security Details:
  Azure HBv3: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected
  Azure HBv2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

Azure HBv3 vs. Azure HBv2 Comparison (Phoronix Test Suite percentage-advantage chart): Azure HBv3 leads in every test shown, with per-test advantages ranging from roughly +3.3% (Pennant sedovbig) up to +201.5% (oneDNN IP Shapes 3D - f32 - CPU). Other large gains include NCNN CPU vgg16 (+180.6%), Mobile Neural Network MobileNetV2_224 (+162%), and SVT-VP9 VMAF Optimized - Bosphorus 1080p (+154.5%).

Microsoft Azure EPYC 7003 HBv3 Benchmarks result overview: a condensed side-by-side table of every test result for Azure HBv3 and Azure HBv2 (SVT-VP9, oneDNN, Mobile Neural Network, PlaidML, Zstd, Botan, QuantLib, Rodinia, SVT-AV1, FinanceBench, SVT-HEVC, LULESH, TNN, Timed HMMer Search, GNU GMP GMPbench, Timed MAFFT Alignment, Xcompact3d Incompact3d, NAMD, timed code compilation, GROMACS, NAS Parallel Benchmarks, HPCG, Pennant, Kripke, NCNN, TensorFlow Lite, CloverLeaf, miniFE, and others). The individual results with error bars follow below.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3, Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, more is better)
Azure HBv3: 337.71 (SE +/- 4.36, N = 3; min 329.01 / max 342.57)
Azure HBv2: 140.59 (SE +/- 0.20, N = 3; min 140.32 / max 140.98)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.
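
The tests below drive oneDNN through its benchdnn harness; as a rough illustration of the library layer being exercised, here is a minimal sketch of the oneDNN 2.x C++ primitive API running a ReLU on the CPU engine. This is not the benchmark's own code, and the tensor shape and values are arbitrary.

```cpp
// Minimal oneDNN 2.x sketch: CPU engine, one eltwise (ReLU) primitive.
// Build (assumed): g++ -std=c++11 relu.cpp -ldnnl
#include <dnnl.hpp>
#include <vector>
#include <iostream>

int main() {
    dnnl::engine eng(dnnl::engine::kind::cpu, 0);   // CPU engine, as in these tests
    dnnl::stream strm(eng);

    // A tiny 1x8 f32 tensor in plain "nc" layout.
    dnnl::memory::desc md({1, 8}, dnnl::memory::data_type::f32,
                          dnnl::memory::format_tag::nc);
    std::vector<float> data = {-3, -2, -1, 0, 1, 2, 3, 4};
    dnnl::memory src(md, eng, data.data());
    dnnl::memory dst(md, eng);

    // Describe, create, and execute a forward-inference ReLU primitive.
    dnnl::eltwise_forward::desc d(dnnl::prop_kind::forward_inference,
                                  dnnl::algorithm::eltwise_relu, md, 0.f);
    dnnl::eltwise_forward::primitive_desc pd(d, eng);
    dnnl::eltwise_forward(pd).execute(strm, {{DNNL_ARG_SRC, src},
                                             {DNNL_ARG_DST, dst}});
    strm.wait();

    const float *out = static_cast<const float *>(dst.get_data_handle());
    for (int i = 0; i < 8; ++i) std::cout << out[i] << " ";
    std::cout << "\n";
    return 0;
}
```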

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.455682 (SE +/- 0.002442, N = 3; min 0.45 / max 0.46; MIN: 0.38)
Azure HBv2: 0.732130 (SE +/- 0.002363, N = 3; min 0.73 / max 0.74; MIN: 0.65)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.
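
As context for the MNN model results below, this is a hedged sketch of how a converted MNN model (e.g. inception-v3) is loaded and run through the C++ Interpreter API; "model.mnn" is a placeholder path, preprocessing and error handling are omitted, and the thread count is illustrative.

```cpp
// Rough MNN inference sketch; not the test profile's own harness.
#include <MNN/Interpreter.hpp>
#include <memory>

int main() {
    std::shared_ptr<MNN::Interpreter> net(
        MNN::Interpreter::createFromFile("model.mnn"));  // converted .mnn model (placeholder)

    MNN::ScheduleConfig config;
    config.numThread = 4;                  // CPU threads used for inference
    MNN::Session *session = net->createSession(config);

    MNN::Tensor *input = net->getSessionInput(session, nullptr);
    (void)input;                           // ... fill with preprocessed image data ...

    net->runSession(session);              // one timed inference pass

    MNN::Tensor *output = net->getSessionOutput(session, nullptr);
    (void)output;                          // post-process scores as needed
    return 0;
}
```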

Mobile Neural Network 1.1.3, Model: inception-v3 (ms, fewer is better)
Azure HBv3: 34.71 (SE +/- 0.24, N = 15; min 33.47 / max 36.67; MIN: 31.09 / MAX: 427.77)
Azure HBv2: 55.36 (SE +/- 0.95, N = 12; min 50 / max 62.27; MIN: 47.56 / MAX: 509.74)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: VGG16 - Device: CPU (FPS, more is better)
Azure HBv3: 38.39 (SE +/- 0.39, N = 3; min 37.7 / max 39.06)
Azure HBv2: 25.57 (SE +/- 0.21, N = 3; min 25.32 / max 25.98)

PlaidML, FP16: No - Mode: Inference - Network: VGG19 - Device: CPU (FPS, more is better)
Azure HBv3: 34.53 (SE +/- 0.42, N = 4; min 33.43 / max 35.48)
Azure HBv2: 23.12 (SE +/- 0.31, N = 15; min 20.74 / max 25.41)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, fewer is better)
Azure HBv3: 540.19 (SE +/- 8.06, N = 15; min 480.13 / max 601.19; MIN: 462.7)
Azure HBv2: 796.82 (SE +/- 6.35, N = 15; min 756.99 / max 825.98; MIN: 726.18)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
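
The compression-level results below correspond to zstd's one-shot and long-distance-matching modes; here is a hedged sketch of the zstd C API at level 19, one of the levels tested. The 1 MiB input buffer is a stand-in for the FreeBSD image the test profile actually uses, and the "long mode" runs would instead use the advanced ZSTD_CCtx_setParameter API.

```cpp
// Minimal zstd one-shot compress/decompress sketch at level 19.
// Build (assumed): g++ zstd_demo.cpp -lzstd
#include <zstd.h>
#include <vector>
#include <string>
#include <cstdio>

int main() {
    std::string input(1 << 20, 'A');                       // 1 MiB of sample data
    std::vector<char> dst(ZSTD_compressBound(input.size()));

    size_t csize = ZSTD_compress(dst.data(), dst.size(),
                                 input.data(), input.size(),
                                 19 /* compression level */);
    if (ZSTD_isError(csize)) {
        std::fprintf(stderr, "zstd error: %s\n", ZSTD_getErrorName(csize));
        return 1;
    }
    std::printf("compressed %zu -> %zu bytes\n", input.size(), csize);

    std::vector<char> out(input.size());
    size_t dsize = ZSTD_decompress(out.data(), out.size(), dst.data(), csize);
    std::printf("decompressed back to %zu bytes\n", dsize);
    return 0;
}
```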

Zstd Compression 1.4.9, Compression Level: 19 - Decompression Speed (MB/s, more is better)
Azure HBv3: 3717.5 (SE +/- 6.14, N = 15; min 3679.6 / max 3752)
Azure HBv2: 2631.0 (SE +/- 1.17, N = 15; min 2621.5 / max 2637.8)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression 1.4.9, Compression Level: 8, Long Mode - Decompression Speed (MB/s, more is better)
Azure HBv3: 4256.8 (SE +/- 4.89, N = 7; min 4238.7 / max 4275.7)
Azure HBv2: 3037.5 (SE +/- 4.57, N = 3; min 3028.9 / max 3044.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression 1.4.9, Compression Level: 19, Long Mode - Decompression Speed (MB/s, more is better)
Azure HBv3: 3727.5 (SE +/- 12.70, N = 15; min 3565.1 / max 3764.5)
Azure HBv2: 2674.2 (SE +/- 2.70, N = 15; min 2640.1 / max 2684.6)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.
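
The cipher throughput results below exercise Botan's block-cipher interface; as an illustrative, hedged sketch (not the benchmark's own code), this shows the Botan 2 API for AES-256 with placeholder all-zero key and block data.

```cpp
// Botan 2 block-cipher sketch. Build (assumed): g++ aes.cpp -lbotan-2
#include <botan/block_cipher.h>
#include <memory>
#include <vector>
#include <iostream>

int main() {
    std::unique_ptr<Botan::BlockCipher> aes = Botan::BlockCipher::create("AES-256");
    if (!aes) return 1;                        // algorithm not available in this build

    std::vector<uint8_t> key(32, 0x00);        // 256-bit key (placeholder value)
    aes->set_key(key.data(), key.size());

    std::vector<uint8_t> block(aes->block_size(), 0x00);   // one 16-byte block
    aes->encrypt(block.data());                // in-place single-block encryption
    aes->decrypt(block.data());                // and the matching decryption

    std::cout << "block size: " << aes->block_size() << " bytes\n";
    return 0;
}
```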

Botan 2.17.3, Test: AES-256 (MiB/s, more is better)
Azure HBv3: 5412.13 (SE +/- 3.73, N = 3; min 5405.03 / max 5417.66)
Azure HBv2: 3934.40 (SE +/- 2.68, N = 3; min 3929.32 / max 3938.41)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: AES-256 - Decrypt (MiB/s, more is better)
Azure HBv3: 5407.48 (SE +/- 7.85, N = 3; min 5393.16 / max 5420.23)
Azure HBv2: 3938.44 (SE +/- 3.26, N = 3; min 3931.93 / max 3941.77)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

QuantLib

QuantLib is an open-source library/framework for quantitative finance, covering modeling, trading, and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.
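
To make the kind of work QuantLib performs concrete, here is a hedged sketch of pricing a European call with the library's analytic Black-Scholes engine. The dates, rates, and volatility are arbitrary illustration values and this is not the QuantLib benchmark's own code.

```cpp
// QuantLib European-call pricing sketch. Build (assumed): g++ option.cpp -lQuantLib
#include <ql/quantlib.hpp>
#include <iostream>

using namespace QuantLib;

int main() {
    Calendar calendar = TARGET();
    Date today(9, April, 2021);
    Settings::instance().evaluationDate() = today;

    // Flat market data: spot 100, 1% risk-free rate, no dividends, 20% vol.
    Handle<Quote> spot(ext::shared_ptr<Quote>(new SimpleQuote(100.0)));
    Handle<YieldTermStructure> riskFree(ext::shared_ptr<YieldTermStructure>(
        new FlatForward(today, 0.01, Actual365Fixed())));
    Handle<YieldTermStructure> dividends(ext::shared_ptr<YieldTermStructure>(
        new FlatForward(today, 0.00, Actual365Fixed())));
    Handle<BlackVolTermStructure> vol(ext::shared_ptr<BlackVolTermStructure>(
        new BlackConstantVol(today, calendar, 0.20, Actual365Fixed())));

    // One-year European call struck at 100.
    ext::shared_ptr<StrikedTypePayoff> payoff(new PlainVanillaPayoff(Option::Call, 100.0));
    ext::shared_ptr<Exercise> exercise(new EuropeanExercise(Date(9, April, 2022)));
    VanillaOption option(payoff, exercise);

    // Black-Scholes-Merton process priced with the analytic European engine.
    ext::shared_ptr<BlackScholesMertonProcess> process(
        new BlackScholesMertonProcess(spot, dividends, riskFree, vol));
    option.setPricingEngine(ext::shared_ptr<PricingEngine>(
        new AnalyticEuropeanEngine(process)));

    std::cout << "European call NPV: " << option.NPV() << std::endl;
    return 0;
}
```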

QuantLib 1.21 (MFLOPS, more is better)
Azure HBv3: 2299.3 (SE +/- 4.80, N = 3; min 2290.1 / max 2306.3)
Azure HBv2: 1725.2 (SE +/- 0.77, N = 3; min 1724.1 / max 1726.7)
1. (CXX) g++ options: -O3 -march=native -O2 -rdynamic -lboost_timer -lboost_system -lboost_chrono

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 8, Long Mode - Compression Speed (MB/s, more is better)
Azure HBv3: 771.9 (SE +/- 6.99, N = 7; min 748.6 / max 801.2)
Azure HBv2: 588.0 (SE +/- 0.81, N = 3; min 586.7 / max 589.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP HotSpot3D (Seconds, fewer is better)
Azure HBv3: 82.49 (SE +/- 1.18, N = 15; min 77.17 / max 91.09)
Azure HBv2: 107.98 (SE +/- 0.89, N = 3; min 106.46 / max 109.52)
1. (CXX) g++ options: -O2 -lOpenCL

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, more is better)
Azure HBv3: 0.161 (SE +/- 0.000, N = 3; min 0.16 / max 0.16)
Azure HBv2: 0.124 (SE +/- 0.001, N = 3; min 0.12 / max 0.13)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.881307 (SE +/- 0.012358, N = 3; min 0.86 / max 0.9; MIN: 0.76)
Azure HBv2: 1.124840 (SE +/- 0.011740, N = 15; min 1.07 / max 1.23; MIN: 1.01)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), fixed-rate bonds with a flat forward curve, and repo securities repurchase agreements. FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.
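
The results shown here are the Bonds and Repo OpenMP cases, but since the suite's first test case is built around Black-Scholes-Merton analytic pricing, here is the closed-form European call formula it relies on, as a small self-contained illustration (not FinanceBench code; the sample parameters are arbitrary).

```cpp
// Black-Scholes-Merton closed-form European call price.
#include <cmath>
#include <cstdio>

// Standard normal CDF via the complementary error function.
static double norm_cdf(double x) { return 0.5 * std::erfc(-x / std::sqrt(2.0)); }

// S: spot, K: strike, r: risk-free rate, q: dividend yield,
// sigma: volatility, T: time to expiry in years.
static double bsm_call(double S, double K, double r, double q,
                       double sigma, double T) {
    double d1 = (std::log(S / K) + (r - q + 0.5 * sigma * sigma) * T)
                / (sigma * std::sqrt(T));
    double d2 = d1 - sigma * std::sqrt(T);
    return S * std::exp(-q * T) * norm_cdf(d1)
         - K * std::exp(-r * T) * norm_cdf(d2);
}

int main() {
    // An at-the-money one-year call, 1% rate, 20% vol -> roughly 8.4.
    std::printf("call price: %.4f\n", bsm_call(100, 100, 0.01, 0.0, 0.20, 1.0));
    return 0;
}
```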

FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, fewer is better)
Azure HBv3: 104884.29 (SE +/- 454.05, N = 3; min 104078.39 / max 105649.7)
Azure HBv2: 133719.33 (SE +/- 476.22, N = 3; min 132784.52 / max 134344.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 9.50445 (SE +/- 0.09389, N = 3; min 9.33 / max 9.66; MIN: 4.19)
Azure HBv2: 12.10510 (SE +/- 0.11811, N = 6; min 11.71 / max 12.52; MIN: 5.8)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU with OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), fixed-rate bonds with a flat forward curve, and repo securities repurchase agreements. FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, fewer is better)
Azure HBv3: 59196.55 (SE +/- 228.72, N = 3; min 58815.6 / max 59606.33)
Azure HBv2: 75154.26 (SE +/- 166.77, N = 3; min 74919.02 / max 75476.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0, Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, more is better)
Azure HBv3: 45.39 (SE +/- 0.60, N = 3; min 44.65 / max 46.57)
Azure HBv2: 36.12 (SE +/- 0.13, N = 3; min 35.86 / max 36.3)
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: Twofish (MiB/s, more is better)
Azure HBv3: 346.32 (SE +/- 1.10, N = 3; min 344.13 / max 347.56)
Azure HBv2: 280.90 (SE +/- 0.15, N = 3; min 280.75 / max 281.2)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Twofish - Decrypt (MiB/s, more is better)
Azure HBv3: 345.78 (SE +/- 0.77, N = 3; min 344.23 / max 346.56)
Azure HBv2: 283.26 (SE +/- 0.22, N = 3; min 282.85 / max 283.6)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Blowfish (MiB/s, more is better)
Azure HBv3: 424.74 (SE +/- 0.47, N = 3; min 423.81 / max 425.22)
Azure HBv2: 348.43 (SE +/- 0.42, N = 3; min 347.64 / max 349.09)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: Blowfish - Decrypt (MiB/s, more is better)
Azure HBv3: 425.16 (SE +/- 0.42, N = 3; min 424.32 / max 425.59)
Azure HBv2: 349.15 (SE +/- 0.56, N = 3; min 348.3 / max 350.2)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8, Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, more is better)
Azure HBv3: 94.03 (SE +/- 0.34, N = 3; min 93.43 / max 94.59)
Azure HBv2: 78.10 (SE +/- 0.65, N = 3; min 76.81 / max 78.78)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 8 - Compression Speed (MB/s, more is better)
Azure HBv3: 3184.2 (SE +/- 45.10, N = 3; min 3127 / max 3273.2)
Azure HBv2: 2653.1 (SE +/- 33.03, N = 15; min 2446.8 / max 2850.1)
1. (CC) gcc options: -O3 -pthread -lz -llzma

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, more is better)
Azure HBv3: 41636.60 (SE +/- 476.10, N = 3; min 40950.33 / max 42551.38)
Azure HBv2: 34803.54 (SE +/- 44.77, N = 3; min 34743.07 / max 34890.96)
1. (CXX) g++ options: -O3 -fopenmp -lm -fexceptions -pthread -lmpi_cxx -lmpi

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: KASUMI (MiB/s, more is better)
Azure HBv3: 87.48 (SE +/- 0.04, N = 3; min 87.41 / max 87.53)
Azure HBv2: 73.18 (SE +/- 0.07, N = 3; min 73.08 / max 73.31)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.2.3, Target: CPU - Model: SqueezeNet v1.1 (ms, fewer is better)
Azure HBv3: 272.61 (SE +/- 0.15, N = 3; min 272.47 / max 272.91; MIN: 272.1 / MAX: 273.5)
Azure HBv2: 323.62 (SE +/- 0.19, N = 3; min 323.35 / max 323.97; MIN: 322.04 / MAX: 326.05)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1, Pfam Database Search (Seconds, fewer is better)
Azure HBv3: 175.24 (SE +/- 1.00, N = 3; min 173.26 / max 176.52)
Azure HBv2: 207.87 (SE +/- 1.08, N = 3; min 205.71 / max 209.04)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: KASUMI - Decrypt (MiB/s, more is better)
Azure HBv3: 84.06 (SE +/- 0.02, N = 3; min 84.03 / max 84.08)
Azure HBv2: 71.10 (SE +/- 0.04, N = 3; min 71.04 / max 71.18)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.445223 (SE +/- 0.004912, N = 4; min 0.43 / max 0.45; MIN: 0.39)
Azure HBv2: 0.524737 (SE +/- 0.004967, N = 3; min 0.52 / max 0.53; MIN: 0.48)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
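
As a small, hedged illustration of the kind of work GMPbench stresses, this sketch multiplies two large integers with the mpz_* API, producing a result wider than either operand. The operand sizes are arbitrary and far smaller than the benchmark's own.

```cpp
// GMP widening-multiplication sketch. Build (assumed): g++ gmp_demo.cpp -lgmp
#include <gmp.h>
#include <cstdio>

int main() {
    mpz_t a, b, product;
    mpz_inits(a, b, product, NULL);

    mpz_ui_pow_ui(a, 2, 4096);        // a = 2^4096
    mpz_ui_pow_ui(b, 3, 2048);        // b = 3^2048
    mpz_mul(product, a, b);           // widening multiply: result has ~7300 bits

    std::printf("product has %zu bits\n", mpz_sizeinbase(product, 2));

    mpz_clears(a, b, product, NULL);
    return 0;
}
```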

GNU GMP GMPbench 6.2.1, Total Time (GMPbench Score, more is better)
Azure HBv3: 4893.4
Azure HBv2: 4155.5
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: CAST-256 (MiB/s, more is better)
Azure HBv3: 133.69 (SE +/- 0.03, N = 3; min 133.65 / max 133.74)
Azure HBv2: 113.60 (SE +/- 0.07, N = 3; min 113.46 / max 113.69)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: CAST-256 - Decrypt (MiB/s, more is better)
Azure HBv3: 133.67 (SE +/- 0.02, N = 3; min 133.63 / max 133.69)
Azure HBv2: 113.72 (SE +/- 0.01, N = 3; min 113.71 / max 113.73)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP Leukocyte (Seconds, fewer is better)
Azure HBv3: 49.35 (SE +/- 0.43, N = 8; min 47.97 / max 52.04)
Azure HBv2: 57.04 (SE +/- 0.57, N = 3; min 56.01 / max 57.97)
1. (CXX) g++ options: -O2 -lOpenCL

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471, Multiple Sequence Alignment - LSU RNA (Seconds, fewer is better)
Azure HBv3: 14.33 (SE +/- 0.10, N = 15; min 13.68 / max 15.05)
Azure HBv2: 16.06 (SE +/- 0.06, N = 3; min 16 / max 16.19)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with any number of scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11, Input: X3D-benchmarking input.i3d (Seconds, fewer is better)
Azure HBv3: 287.60 (SE +/- 0.17, N = 3; min 287.31 / max 287.89)
Azure HBv2: 318.91 (SE +/- 0.07, N = 3; min 318.81 / max 319.04)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, fewer is better)
Azure HBv3: 0.27566 (SE +/- 0.00027, N = 3; min 0.28 / max 0.28)
Azure HBv2: 0.30056 (SE +/- 0.00059, N = 3; min 0.3 / max 0.3)

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.4.9, Compression Level: 19, Long Mode - Compression Speed (MB/s, more is better)
Azure HBv3: 36.3 (SE +/- 0.34, N = 15; min 34 / max 38.5)
Azure HBv2: 33.4 (SE +/- 0.38, N = 15; min 31.2 / max 35.5)
1. (CC) gcc options: -O3 -pthread -lz -llzma

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Botan 2.17.3, Test: ChaCha20Poly1305 (MiB/s, more is better)
Azure HBv3: 667.77 (SE +/- 0.27, N = 3; min 667.33 / max 668.25)
Azure HBv2: 615.80 (SE +/- 0.90, N = 3; min 614.01 / max 616.82)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt (MiB/s, more is better)
Azure HBv3: 658.37 (SE +/- 0.06, N = 3; min 658.25 / max 658.44)
Azure HBv2: 610.86 (SE +/- 0.78, N = 3; min 609.48 / max 612.17)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.10.20, Time To Compile (Seconds, fewer is better)
Azure HBv3: 42.01 (SE +/- 0.58, N = 15; min 39.43 / max 49.12)
Azure HBv2: 44.92 (SE +/- 0.58, N = 13; min 43.11 / max 50.84)

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data set. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3, Water Benchmark (Ns Per Day, more is better)
Azure HBv3: 9.041 (SE +/- 0.009, N = 3; min 9.03 / max 9.06)
Azure HBv2: 8.458 (SE +/- 0.009, N = 3; min 8.45 / max 8.48)
1. (CXX) g++ options: -O2 -pthread -lrt -lpthread -lm

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 15.11, Time To Compile (Seconds, fewer is better)
Azure HBv3: 111.33 (SE +/- 1.09, N = 3; min 110.23 / max 113.5)
Azure HBv2: 118.97 (SE +/- 0.83, N = 3; min 118.09 / max 120.62)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 10.0, Time To Compile (Seconds, fewer is better)
Azure HBv3: 163.76 (SE +/- 1.89, N = 3; min 160.99 / max 167.37)
Azure HBv2: 174.74 (SE +/- 1.99, N = 3; min 172.31 / max 178.69)

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, more is better)
Azure HBv3: 56682.82 (SE +/- 428.57, N = 14; min 51213.56 / max 57398.81)
Azure HBv2: 53829.34 (SE +/- 21.24, N = 3; min 53787.12 / max 53854.58)
1. (F9X) gfortran options: -O3 -march=native -fexceptions -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a scientific benchmark from Sandia National Labs focused on supercomputer testing with modern, real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, more is better)
Azure HBv3: 39.06 (SE +/- 0.06, N = 3; min 38.95 / max 39.14)
Azure HBv2: 37.27 (SE +/- 0.06, N = 3; min 37.16 / max 37.36)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -fexceptions -pthread -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, fewer is better)
Azure HBv3: 3.337201 (SE +/- 0.017738, N = 3; min 3.3 / max 3.36)
Azure HBv2: 3.486503 (SE +/- 0.009939, N = 3; min 3.47 / max 3.51)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, fewer is better)
Azure HBv3: 37.80 (SE +/- 0.23, N = 3; min 37.37 / max 38.12)
Azure HBv2: 39.20 (SE +/- 0.20, N = 3; min 38.79 / max 39.41)
1. (CXX) g++ options: -O2 -lOpenCL

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, fewer is better)
Azure HBv3: 5.833419 (SE +/- 0.003905, N = 3; min 5.83 / max 5.84)
Azure HBv2: 6.026449 (SE +/- 0.012548, N = 3; min 6 / max 6.05)
1. (CXX) g++ options: -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2, Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 0.575238 (SE +/- 0.005232, N = 3; min 0.57 / max 0.59; MIN: 0.5)
Azure HBv2: 0.581981 (SE +/- 0.000573, N = 3; min 0.58 / max 0.58; MIN: 0.53)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2, Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, fewer is better)
Azure HBv3: 1.58182 (SE +/- 0.00509, N = 3; min 1.57 / max 1.59; MIN: 1.49)
Azure HBv2: 1.59557 (SE +/- 0.01470, N = 3; min 1.57 / max 1.62; MIN: 1.5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms, and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Kripke 1.2.4 (Throughput FoM, more is better)
Azure HBv3: 82341865 (SE +/- 1839362.01, N = 15; min 69796040 / max 94657260)
Azure HBv2: 43855550 (SE +/- 812916.11, N = 15; min 40401240 / max 51792940)
1. (CXX) g++ options: -O2 -fopenmp

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

PlaidML, FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU (FPS, more is better)
Azure HBv3: 6.27 (SE +/- 0.16, N = 9; min 6.01 / max 7.53)
Azure HBv2: 5.60 (SE +/- 0.03, N = 3; min 5.56 / max 5.65)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.
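As background for the CPU model results below, this is a hedged sketch of ncnn's C++ inference API; the "squeezenet.param"/"squeezenet.bin" files and the "data"/"prob" blob names are placeholders that depend on the converted model, and the include path may differ by install.

```cpp
// Rough ncnn inference sketch; not the benchmark harness itself.
#include "ncnn/net.h"    // header location is install-dependent
#include <vector>

int main() {
    ncnn::Net net;
    net.opt.num_threads = 4;                  // CPU threads for inference
    net.load_param("squeezenet.param");       // network structure (placeholder file)
    net.load_model("squeezenet.bin");         // weights (placeholder file)

    ncnn::Mat in(227, 227, 3);                // placeholder input tensor
    in.fill(0.5f);

    ncnn::Extractor ex = net.create_extractor();
    ex.input("data", in);                     // feed the input blob

    ncnn::Mat out;
    ex.extract("prob", out);                  // run the net and fetch the output blob

    std::vector<float> scores(out.w);
    for (int i = 0; i < out.w; ++i) scores[i] = out[i];
    return 0;
}
```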

NCNN 20201218, Target: CPU - Model: resnet50 (ms, fewer is better)
Azure HBv3: 30.03 (SE +/- 0.90, N = 9; min 26.33 / max 34.39; MIN: 25.77 / MAX: 962.03)
Azure HBv2: 64.80 (SE +/- 2.16, N = 12; min 53.36 / max 81.5; MIN: 44.36 / MAX: 2196.77)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: alexnet (ms, fewer is better)
Azure HBv3: 6.89 (SE +/- 0.57, N = 9; min 5.94 / max 11.35; MIN: 5.85 / MAX: 759.55)
Azure HBv2: 10.80 (SE +/- 0.21, N = 12; min 9.91 / max 11.94; MIN: 9.2 / MAX: 35.44)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: vgg16 (ms, fewer is better)
Azure HBv3: 47.67 (SE +/- 2.09, N = 9; min 41.22 / max 61.33; MIN: 40.11 / MAX: 1444.13)
Azure HBv2: 133.78 (SE +/- 6.58, N = 12; min 102.77 / max 180.34; MIN: 82.15 / MAX: 2364.48)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: efficientnet-b0 (ms, fewer is better)
Azure HBv3: 21.04 (SE +/- 3.62, N = 9; min 13.44 / max 49.11; MIN: 12.75 / MAX: 4928.42)
Azure HBv2: 32.42 (SE +/- 1.94, N = 12; min 25.07 / max 46.08; MIN: 20.32 / MAX: 786.11)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: shufflenet-v2 (ms, fewer is better)
Azure HBv3: 13.71 (SE +/- 0.83, N = 9; min 10 / max 17.91; MIN: 9.5 / MAX: 286.31)
Azure HBv2: 21.39 (SE +/- 1.32, N = 12; min 15.42 / max 31.92; MIN: 14.34 / MAX: 1188.39)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, fewer is better)
Azure HBv3: 23.24 (SE +/- 3.10, N = 9; min 12.73 / max 37.48; MIN: 10.66 / MAX: 3825.91)
Azure HBv2: 37.20 (SE +/- 2.17, N = 12; min 25.01 / max 52.55; MIN: 13.35 / MAX: 4343.96)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN 20201218, Target: CPU - Model: mobilenet (ms, fewer is better)
Azure HBv3: 27.87 (SE +/- 1.74, N = 9; min 22.17 / max 38.48; MIN: 21.41 / MAX: 63.52)
Azure HBv2: 52.56 (SE +/- 2.25, N = 12; min 44.28 / max 66.78; MIN: 40.97 / MAX: 511.3)
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.3, Model: mobilenet-v1-1.0 (ms, fewer is better)
Azure HBv3: 3.950 (SE +/- 0.151, N = 15; min 3.02 / max 4.71; MIN: 2.55 / MAX: 5.41)
Azure HBv2: 8.030 (SE +/- 0.500, N = 12; min 6.88 / max 13.38; MIN: 4.5 / MAX: 15.23)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224Azure HBv3Azure HBv23691215SE +/- 0.085, N = 15SE +/- 0.587, N = 124.78212.527MIN: 3.71 / MAX: 61.38MIN: 7.24 / MAX: 67.341. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224Azure HBv3Azure HBv248121620Min: 4.33 / Avg: 4.78 / Max: 5.5Min: 10.07 / Avg: 12.53 / Max: 16.11. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50Azure HBv3Azure HBv21224364860SE +/- 0.25, N = 15SE +/- 1.18, N = 1229.0952.53MIN: 25.79 / MAX: 252.29MIN: 41.27 / MAX: 326.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50Azure HBv3Azure HBv21122334455Min: 27.82 / Avg: 29.09 / Max: 31.32Min: 44.92 / Avg: 52.53 / Max: 56.921. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0Azure HBv3Azure HBv248121620SE +/- 0.113, N = 15SE +/- 0.657, N = 128.22414.136MIN: 5.66 / MAX: 54.17MIN: 10.98 / MAX: 78.481. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0Azure HBv3Azure HBv248121620Min: 7.01 / Avg: 8.22 / Max: 8.85Min: 11.71 / Avg: 14.14 / Max: 19.711. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. Current Linux support is limited to running on the CPU. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.
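For reference, the measurement this profile reports can be approximated with TensorFlow's Python API: load a .tflite model, run a warm-up pass, and time repeated invocations. This is only a minimal sketch under assumptions, not how the Phoronix Test Suite harness runs the test; the model path and iteration count below are hypothetical.

import time
import numpy as np
import tensorflow as tf

# Hypothetical model path; the actual test profile ships its own SqueezeNet model.
interpreter = tf.lite.Interpreter(model_path="squeezenet.tflite")
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]

# Feed random data of the expected shape and dtype.
interpreter.set_tensor(inp["index"], np.random.random_sample(inp["shape"]).astype(inp["dtype"]))
interpreter.invoke()  # warm-up run

runs = []
for _ in range(50):  # iteration count chosen arbitrarily for this sketch
    start = time.perf_counter()
    interpreter.invoke()
    runs.append((time.perf_counter() - start) * 1e6)  # microseconds, matching the table's unit

print(f"Avg: {np.mean(runs):.1f} us  Min: {np.min(runs):.1f}  Max: {np.max(runs):.1f}")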

TensorFlow Lite 2020-08-23 - Model: SqueezeNet - Microseconds, Fewer Is Better
  Azure HBv3: 66320.4 (SE +/- 2854.14, N = 15; Min: 48475.2 / Avg: 66320.38 / Max: 78669)
  Azure HBv2: 74885.4 (SE +/- 1151.94, N = 15; Min: 69961.9 / Avg: 74885.4 / Max: 83330.7)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 843.10 (SE +/- 14.29, N = 15; Min: 743.86 / Avg: 843.1 / Max: 928.84; MIN: 711.66)
  Azure HBv2: 1294.53 (SE +/- 11.10, N = 15; Min: 1239.94 / Avg: 1294.53 / Max: 1401.9; MIN: 1179.19)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.255501 (SE +/- 0.000278, N = 3; Min: 0.26 / Avg: 0.26 / Max: 0.26; MIN: 0.22)
  Azure HBv2: 0.350347 (SE +/- 0.006544, N = 12; Min: 0.33 / Avg: 0.35 / Max: 0.4; MIN: 0.3)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 561.77 (SE +/- 8.80, N = 15; Min: 490.11 / Avg: 561.77 / Max: 612.81; MIN: 471.83)
  Azure HBv2: 791.46 (SE +/- 5.73, N = 15; Min: 746 / Avg: 791.46 / Max: 823.74; MIN: 718.74)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 962.34 (SE +/- 102.39, N = 14; Min: 741.04 / Avg: 962.34 / Max: 2245.6; MIN: 722.13)
  Azure HBv2: 1287.03 (SE +/- 13.52, N = 15; Min: 1207.12 / Avg: 1287.03 / Max: 1368.08; MIN: 1149.25)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 530.90 (SE +/- 8.62, N = 15; Min: 476.45 / Avg: 530.9 / Max: 573.73; MIN: 458.96)
  Azure HBv2: 778.31 (SE +/- 8.13, N = 3; Min: 765.12 / Avg: 778.31 / Max: 793.12; MIN: 738.78)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 875.17 (SE +/- 25.25, N = 12; Min: 755.33 / Avg: 875.17 / Max: 1009.77; MIN: 727.38)
  Azure HBv2: 1314.00 (SE +/- 16.54, N = 15; Min: 1182.01 / Avg: 1314 / Max: 1419.56; MIN: 1140.59)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.406686 (SE +/- 0.004961, N = 3; Min: 0.4 / Avg: 0.41 / Max: 0.41; MIN: 0.35)
  Azure HBv2: 1.031942 (SE +/- 0.030825, N = 15; Min: 0.92 / Avg: 1.03 / Max: 1.21; MIN: 0.83)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 0.383855 (SE +/- 0.009558, N = 12; Min: 0.35 / Avg: 0.38 / Max: 0.47; MIN: 0.32)
  Azure HBv2: 0.441012 (SE +/- 0.003995, N = 7; Min: 0.43 / Avg: 0.44 / Max: 0.46; MIN: 0.4)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU - ms, Fewer Is Better
  Azure HBv3: 3.21204 (SE +/- 0.02162, N = 3; Min: 3.17 / Avg: 3.21 / Max: 3.25; MIN: 2.58)
  Azure HBv2: 9.68321 (SE +/- 0.28898, N = 15; Min: 9.08 / Avg: 9.68 / Max: 12.42; MIN: 5.56)
  1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-VP9

This is a test of SVT-VP9, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the VP9 video format, using a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 376.91 (SE +/- 5.29, N = 15; Min: 356.6 / Avg: 376.91 / Max: 425.74)
  Azure HBv2: 163.51 (SE +/- 9.44, N = 12; Min: 140.75 / Avg: 163.51 / Max: 260.45)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 357.06 (SE +/- 23.14, N = 12; Min: 106.25 / Avg: 357.06 / Max: 406.73)
  Azure HBv2: 140.30 (SE +/- 5.31, N = 15; Min: 97.49 / Avg: 140.3 / Max: 193.3)
  1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of SVT-HEVC, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the HEVC / H.265 video format, using a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 548.33 (SE +/- 7.76, N = 3; Min: 532.86 / Avg: 548.33 / Max: 557.1)
  Azure HBv2: 379.91 (SE +/- 21.86, N = 15; Min: 151.29 / Avg: 379.91 / Max: 460.12)
  1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p - Frames Per Second, More Is Better
  Azure HBv3: 378.01 (SE +/- 4.22, N = 3; Min: 369.69 / Avg: 378.01 / Max: 383.39)
  Azure HBv2: 166.90 (SE +/- 6.53, N = 12; Min: 129.12 / Avg: 166.9 / Max: 202.77)
  1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-AV1

This is a test of SVT-AV1, the Intel Open Visual Cloud Scalable Video Technology CPU-based multi-threaded video encoder for the AV1 video format, using a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.
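As a rough point of reference for reproducing this kind of encoder run outside the Phoronix Test Suite, the sketch below drives FFmpeg's libsvtav1 wrapper from Python and times a 1080p encode. This is only an approximation under stated assumptions: the test profile uses the standalone SVT-AV1 encoder application rather than FFmpeg, the input file name is hypothetical, and preset 4 is chosen simply to mirror the "Enc Mode 4" setting.

import subprocess
import time

# Hypothetical 1080p source clip; any YUV4MPEG file of known resolution works.
SOURCE = "Bosphorus_1920x1080.y4m"

cmd = [
    "ffmpeg", "-y",
    "-i", SOURCE,
    "-c:v", "libsvtav1",   # FFmpeg's SVT-AV1 wrapper (requires a build with SVT-AV1 enabled)
    "-preset", "4",        # roughly comparable to the test's "Enc Mode 4"
    "out.ivf",
]

start = time.perf_counter()
subprocess.run(cmd, check=True)
elapsed = time.perf_counter() - start

# The benchmark reports frames per second; dividing the clip's frame count by the
# elapsed wall time yields a comparable figure.
print(f"encode took {elapsed:.1f} s")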

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p - Frames Per Second, More Is Better
  Azure HBv3: 12.275 (SE +/- 0.031, N = 3; Min: 12.22 / Avg: 12.28 / Max: 12.32)
  Azure HBv2: 9.512 (SE +/- 0.218, N = 15; Min: 7.69 / Avg: 9.51 / Max: 10.39)
  1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
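As a rough illustration of what the compression-speed figures mean, the snippet below times a single level-19 compression pass in Python using the third-party zstandard bindings and converts it to MB/s. It is a minimal sketch, not the Phoronix Test Suite harness; the input path assumes a local copy of the sample image, and the thread setting simply mirrors the multi-threaded nature of the test.

import time
import zstandard as zstd  # third-party python-zstandard bindings

SAMPLE = "FreeBSD-12.2-RELEASE-amd64-memstick.img"  # assumed local copy of the sample file

with open(SAMPLE, "rb") as f:
    data = f.read()

# Level 19 matches the "Compression Level: 19" result; a negative thread count asks
# zstandard to use all logical CPUs (verify against the installed version's docs).
cctx = zstd.ZstdCompressor(level=19, threads=-1)

start = time.perf_counter()
compressed = cctx.compress(data)
elapsed = time.perf_counter() - start

print(f"{len(data) / 1e6 / elapsed:.1f} MB/s, ratio {len(data) / len(compressed):.2f}")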

Zstd Compression 1.4.9 - Compression Level: 19 - Compression Speed - MB/s, More Is Better
  Azure HBv3: 78.2 (SE +/- 0.71, N = 15; Min: 72.9 / Avg: 78.15 / Max: 83.1)
  Azure HBv2: 69.5 (SE +/- 1.18, N = 15; Min: 61.4 / Avg: 69.51 / Max: 77.6)
  1. (CC) gcc options: -O3 -pthread -lz -llzma

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP Streamcluster - Seconds, Fewer Is Better
  Azure HBv3: 7.379 (SE +/- 0.196, N = 15; Min: 6.73 / Avg: 7.38 / Max: 8.97)
  Azure HBv2: 12.912 (SE +/- 0.453, N = 15; Min: 10.48 / Avg: 12.91 / Max: 15.91)
  1. (CXX) g++ options: -O2 -lOpenCL

Rodinia 3.1 - Test: OpenMP CFD Solver - Seconds, Fewer Is Better
  Azure HBv3: 8.460 (SE +/- 0.801, N = 12; Min: 6.63 / Avg: 8.46 / Max: 15.47)
  Azure HBv2: 13.042 (SE +/- 0.748, N = 12; Min: 10.79 / Avg: 13.04 / Max: 19.76)
  1. (CXX) g++ options: -O2 -lOpenCL

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and is benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

CloverLeaf - Lagrangian-Eulerian Hydrodynamics - Seconds, Fewer Is Better
  Azure HBv3: 16.66 (SE +/- 0.83, N = 15; Min: 13.14 / Avg: 16.66 / Max: 24.43)
  Azure HBv2: 23.78 (SE +/- 0.45, N = 12; Min: 21.27 / Avg: 23.78 / Max: 26.48)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

miniFE

MiniFE is a finite element mini-application representative of unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

miniFE 2.2 - Problem Size: Small - CG Mflops, More Is Better
  Azure HBv3: 13785.30 (SE +/- 387.39, N = 15; Min: 11739.8 / Avg: 13785.25 / Max: 17284.4)
  Azure HBv2: 13165.09 (SE +/- 608.74, N = 12; Min: 8616.53 / Avg: 13165.09 / Max: 15973.2)
  1. (CXX) g++ options: -O3 -fopenmp -fexceptions -pthread -lmpi_cxx -lmpi

86 Results Shown

SVT-VP9
oneDNN
Mobile Neural Network
PlaidML:
  No - Inference - VGG16 - CPU
  No - Inference - VGG19 - CPU
oneDNN
Zstd Compression:
  19 - Decompression Speed
  8, Long Mode - Decompression Speed
  19, Long Mode - Decompression Speed
Botan:
  AES-256
  AES-256 - Decrypt
QuantLib
Zstd Compression
Rodinia
SVT-AV1
oneDNN
FinanceBench
oneDNN
FinanceBench
SVT-HEVC
Botan:
  Twofish
  Twofish - Decrypt
  Blowfish
  Blowfish - Decrypt
SVT-AV1
Zstd Compression
LULESH
Botan
TNN
Timed HMMer Search
Botan
oneDNN
GNU GMP GMPbench
Botan:
  CAST-256
  CAST-256 - Decrypt
Rodinia
Timed MAFFT Alignment
Xcompact3d Incompact3d
NAMD
Zstd Compression
Botan:
  ChaCha20Poly1305
  ChaCha20Poly1305 - Decrypt
Timed Linux Kernel Compilation
GROMACS
Timed Node.js Compilation
Timed LLVM Compilation
NAS Parallel Benchmarks
High Performance Conjugate Gradient
Pennant
Rodinia
Pennant
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
Kripke
PlaidML
NCNN:
  CPU - resnet50
  CPU - alexnet
  CPU - vgg16
  CPU - efficientnet-b0
  CPU - shufflenet-v2
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
Mobile Neural Network:
  mobilenet-v1-1.0
  MobileNetV2_224
  resnet-v2-50
  SqueezeNetV1.0
TensorFlow Lite
oneDNN:
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
SVT-VP9:
  PSNR/SSIM Optimized - Bosphorus 1080p
  VMAF Optimized - Bosphorus 1080p
SVT-HEVC:
  10 - Bosphorus 1080p
  7 - Bosphorus 1080p
SVT-AV1
Zstd Compression
Rodinia:
  OpenMP Streamcluster
  OpenMP CFD Solver
CloverLeaf
miniFE