Microsoft Azure EPYC 7003 HBv3 Benchmarks

Azure HBv3 vs. Azure HBv2 benchmarks.

Azure HBv3

Processor: 2 x AMD EPYC 7V13 64-Core (120 Cores), Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.1 BIOS), Memory: 442GB, Disk: 2 x 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk, Graphics: hyperv_fb

OS: CentOS Linux 8, Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64), Compiler: GCC 8.3.1 20190507, File-System: nfs, Screen Resolution: 1152x864, System Layer: microsoft

Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Notes: CPU Microcode: 0xffffffff
Python Notes: Python 3.6.8
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

Azure HBv2

Processor: 2 x AMD EPYC 7V12 64-Core (120 Cores), Motherboard: Microsoft Virtual Machine (Hyper-V UEFI v4.0 BIOS), Memory: 450GB, Disk: 960GB Microsoft NVMe Direct Disk + 32GB Virtual Disk + 515GB Virtual Disk, Graphics: hyperv_fb

Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Notes: CPU Microcode: 0xffffffff
Python Notes: Python 3.6.8
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

86 Results Shown

QuantLib
High Performance Conjugate Gradient
NAS Parallel Benchmarks
miniFE
CloverLeaf
Rodinia:
OpenMP LavaMD
OpenMP HotSpot3D
OpenMP Leukocyte
OpenMP CFD Solver
OpenMP Streamcluster
NAMD
Pennant:
sedovbig
leblancbig
Timed HMMer Search
Xcompact3d Incompact3d
Timed MAFFT Alignment
LULESH
GNU GMP GMPbench
Zstd Compression:
8 - Compression Speed
19 - Compression Speed
19 - Decompression Speed
8, Long Mode - Compression Speed
8, Long Mode - Decompression Speed
19, Long Mode - Compression Speed
19, Long Mode - Decompression Speed
Botan:
KASUMI
KASUMI - Decrypt
AES-256
AES-256 - Decrypt
Twofish
Twofish - Decrypt
Blowfish
Blowfish - Decrypt
CAST-256
CAST-256 - Decrypt
ChaCha20Poly1305
ChaCha20Poly1305 - Decrypt
SVT-AV1:
Enc Mode 0 - 1080p
Enc Mode 4 - 1080p
Enc Mode 8 - 1080p
SVT-HEVC:
1 - Bosphorus 1080p
7 - Bosphorus 1080p
10 - Bosphorus 1080p
SVT-VP9:
VMAF Optimized - Bosphorus 1080p
PSNR/SSIM Optimized - Bosphorus 1080p
Visual Quality Optimized - Bosphorus 1080p
Timed Linux Kernel Compilation
Timed LLVM Compilation
Timed Node.js Compilation
oneDNN:
IP Shapes 1D - f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
FinanceBench:
Repo OpenMP
Bonds OpenMP
GROMACS
TensorFlow Lite
Mobile Neural Network:
SqueezeNetV1.0
resnet-v2-50
MobileNetV2_224
mobilenet-v1-1.0
inception-v3
NCNN:
CPU - mobilenet
CPU-v2-v2 - mobilenet-v2
CPU - shufflenet-v2
CPU - efficientnet-b0
CPU - vgg16
CPU - alexnet
CPU - resnet50
TNN
PlaidML:
No - Inference - VGG16 - CPU
No - Inference - VGG19 - CPU
No - Inference - ResNet 50 - CPU
Kripke

Azure HBv3

OS: CentOS Linux 8, Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64), Compiler: GCC 8.3.1 20190507, File-System: nfs, Screen Resolution: 1152x864, System Layer: microsoft

Testing initiated at 9 April 2021 13:28 by user .

Azure HBv2

OS: CentOS Linux 8, Kernel: 4.18.0-147.8.1.el8_1.x86_64 (x86_64), Compiler: GCC 8.3.1 20190507, File-System: nfs, Screen Resolution: 1152x864, System Layer: microsoft

Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-redhat-linux --disable-libmpx --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-gcc-major-version-only --with-isl --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Notes: CPU Microcode: 0xffffffff
Python Notes: Python 3.6.8
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + tsx_async_abort: Not affected

Testing initiated at 10 April 2021 10:15 by user .

Microsoft Azure EPYC 7003 HBv3 Benchmarks

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

Azure HBv3

Azure HBv2

QuantLib

High Performance Conjugate Gradient

NAS Parallel Benchmarks

miniFE

CloverLeaf

Rodinia

NAMD

Pennant

Timed HMMer Search

Xcompact3d Incompact3d

Timed MAFFT Alignment

LULESH

GNU GMP GMPbench

Zstd Compression

Botan

SVT-AV1

SVT-HEVC

SVT-VP9

Timed Linux Kernel Compilation

Timed LLVM Compilation

Timed Node.js Compilation

oneDNN

FinanceBench

GROMACS

TensorFlow Lite

Mobile Neural Network

NCNN

TNN

PlaidML

Kripke

86 Results Shown

Azure HBv3

Azure HBv2