EPYC 7702 April 2021

AMD EPYC 7702 64-Core testing with a ASRockRack EPYCD8 (P2.40 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

1

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

2

3

Processor: AMD EPYC 7702 64-Core @ 2.00GHz (64 Cores / 128 Threads), Motherboard: ASRockRack EPYCD8 (P2.40 BIOS), Chipset: AMD Starship/Matisse, Memory: 126GB, Disk: 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel I350

OS: Ubuntu 20.04, Kernel: 5.9.0-050900rc6daily20200921-generic (x86_64) 20200920, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Mesa Compilation

This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

117 Results Shown

toyBrot Fractal Generator:
TBB
OpenMP
C++ Tasks
C++ Threads
Xcompact3d Incompact3d:
X3D-benchmarking input.i3d
input.i3d 129 Cells Per Direction
input.i3d 193 Cells Per Direction
simdjson:
Kostya
LargeRand
PartialTweets
DistinctUserID
GNU GMP GMPbench
Zstd Compression:
3 - Compression Speed
8 - Compression Speed
8 - Decompression Speed
19 - Compression Speed
19 - Decompression Speed
3, Long Mode - Compression Speed
3, Long Mode - Decompression Speed
8, Long Mode - Compression Speed
8, Long Mode - Decompression Speed
19, Long Mode - Compression Speed
19, Long Mode - Decompression Speed
Botan:
KASUMI
KASUMI - Decrypt
AES-256
AES-256 - Decrypt
Twofish
Twofish - Decrypt
Blowfish
Blowfish - Decrypt
CAST-256
CAST-256 - Decrypt
ChaCha20Poly1305
ChaCha20Poly1305 - Decrypt
LuaRadio:
Five Back to Back FIR Filters
FM Deemphasis Filter
Hilbert Transform
Complex Phase
GNU Radio:
Five Back to Back FIR Filters
Signal Source (Cosine)
FIR Filter
IIR Filter
FM Deemphasis Filter
Hilbert Transform
AOM AV1:
Speed 4 Two-Pass - Bosphorus 4K
Speed 6 Realtime - Bosphorus 4K
Speed 6 Two-Pass - Bosphorus 4K
Speed 8 Realtime - Bosphorus 4K
Speed 9 Realtime - Bosphorus 4K
Speed 4 Two-Pass - Bosphorus 1080p
Speed 6 Realtime - Bosphorus 1080p
Speed 6 Two-Pass - Bosphorus 1080p
Speed 8 Realtime - Bosphorus 1080p
Speed 9 Realtime - Bosphorus 1080p
SVT-HEVC:
1 - Bosphorus 1080p
7 - Bosphorus 1080p
10 - Bosphorus 1080p
SVT-VP9:
VMAF Optimized - Bosphorus 1080p
PSNR/SSIM Optimized - Bosphorus 1080p
Visual Quality Optimized - Bosphorus 1080p
Stockfish
libavif avifenc:
0
2
6
10
6, Lossless
10, Lossless
Timed Linux Kernel Compilation
Timed Mesa Compilation
Timed Node.js Compilation
oneDNN:
IP Shapes 1D - f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
Timed Erlang/OTP Compilation
Liquid-DSP:
1 - 256 - 57
2 - 256 - 57
4 - 256 - 57
8 - 256 - 57
16 - 256 - 57
32 - 256 - 57
64 - 256 - 57
128 - 256 - 57
ViennaCL:
CPU BLAS - sCOPY
CPU BLAS - sAXPY
CPU BLAS - sDOT
CPU BLAS - dCOPY
CPU BLAS - dAXPY
CPU BLAS - dDOT
CPU BLAS - dGEMV-N
CPU BLAS - dGEMV-T
CPU BLAS - dGEMM-NN
CPU BLAS - dGEMM-NT
CPU BLAS - dGEMM-TN
CPU BLAS - dGEMM-TT
Sysbench:
RAM / Memory
CPU
Blender:
BMW27 - CPU-Only
Classroom - CPU-Only
Fishy Cat - CPU-Only
Barbershop - CPU-Only
Pabellon Barcelona - CPU-Only

1

Testing initiated at 3 April 2021 18:15 by user phoronix.

2

Testing initiated at 3 April 2021 20:11 by user phoronix.

3

Testing initiated at 4 April 2021 05:18 by user phoronix.