xeon eo march

Tests for a future article. 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 22.10 via the Phoronix Test Suite.

a

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0xd000375
Python Notes: Python 3.10.7
Security Notes: dodt: Mitigation of DOITM + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

b

c

Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads), Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS), Chipset: Intel Ice Lake IEH, Memory: 512GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP

OS: Ubuntu 22.10, Kernel: 6.2.0-rc5-phx-dodt (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server 1.21.1.3, Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 1920x1080

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google as the AV1 Codec Library. Learn more via the OpenBenchmarking.org test page.

Embree

SVT-AV1

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

VVenC

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Stress-NG

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

Connections: 100

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

c: The test quit with a non-zero exit status.

Connections: 1000

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

c: The test quit with a non-zero exit status.

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Concurrent Requests: 100

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

Concurrent Requests: 1000

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

209 Results Shown

SPECFEM3D:
Mount St. Helens
Layered Halfspace
Tomographic Model
Homogeneous Halfspace
Water-layered Halfspace
Zstd Compression:
3 - Compression Speed
3 - Decompression Speed
8 - Compression Speed
8 - Decompression Speed
12 - Compression Speed
12 - Decompression Speed
19 - Compression Speed
19 - Decompression Speed
3, Long Mode - Compression Speed
3, Long Mode - Decompression Speed
8, Long Mode - Compression Speed
8, Long Mode - Decompression Speed
19, Long Mode - Compression Speed
19, Long Mode - Decompression Speed
John The Ripper:
bcrypt
WPA PSK
Blowfish
HMAC-SHA512
MD5
dav1d:
Chimera 1080p
Summer Nature 4K
Summer Nature 1080p
Chimera 1080p 10-bit
AOM AV1:
Speed 0 Two-Pass - Bosphorus 4K
Speed 4 Two-Pass - Bosphorus 4K
Speed 6 Realtime - Bosphorus 4K
Speed 6 Two-Pass - Bosphorus 4K
Speed 8 Realtime - Bosphorus 4K
Speed 9 Realtime - Bosphorus 4K
Speed 10 Realtime - Bosphorus 4K
Speed 0 Two-Pass - Bosphorus 1080p
Speed 4 Two-Pass - Bosphorus 1080p
Speed 6 Realtime - Bosphorus 1080p
Speed 6 Two-Pass - Bosphorus 1080p
Speed 8 Realtime - Bosphorus 1080p
Speed 9 Realtime - Bosphorus 1080p
Speed 10 Realtime - Bosphorus 1080p
Embree:
Pathtracer - Crown
Pathtracer ISPC - Crown
Pathtracer - Asian Dragon
Pathtracer - Asian Dragon Obj
Pathtracer ISPC - Asian Dragon
Pathtracer ISPC - Asian Dragon Obj
SVT-AV1:
Preset 4 - Bosphorus 4K
Preset 8 - Bosphorus 4K
Preset 12 - Bosphorus 4K
Preset 13 - Bosphorus 4K
VP9 libvpx Encoding:
Speed 0 - Bosphorus 4K
Speed 5 - Bosphorus 4K
Speed 0 - Bosphorus 1080p
Speed 5 - Bosphorus 1080p
VVenC:
Bosphorus 4K - Fast
Bosphorus 4K - Faster
Bosphorus 1080p - Fast
Bosphorus 1080p - Faster
Timed FFmpeg Compilation
Timed Godot Game Engine Compilation
Timed LLVM Compilation:
Ninja
Unix Makefiles
Timed Node.js Compilation
Build2
FFmpeg:
libx264 - Live:
Seconds
FPS
libx265 - Live:
Seconds
FPS
libx264 - Upload:
Seconds
FPS
libx265 - Upload:
Seconds
FPS
libx264 - Platform:
Seconds
FPS
libx265 - Platform:
Seconds
FPS
libx264 - Video On Demand:
Seconds
FPS
libx265 - Video On Demand:
Seconds
FPS
OpenSSL:
SHA256
SHA512
RSA4096
RSA4096
ChaCha20
AES-128-GCM
AES-256-GCM
ChaCha20-Poly1305
Memcached:
1:5
1:10
1:100
GROMACS
Darmstadt Automotive Parallel Heterogeneous Suite:
OpenMP - NDT Mapping
OpenMP - Points2Image
OpenMP - Euclidean Cluster
MariaDB:
512
1024
2048
4096
TensorFlow:
CPU - 16 - AlexNet
CPU - 32 - AlexNet
CPU - 64 - AlexNet
CPU - 256 - AlexNet
CPU - 512 - AlexNet
CPU - 16 - GoogLeNet
CPU - 16 - ResNet-50
CPU - 32 - GoogLeNet
CPU - 32 - ResNet-50
CPU - 64 - GoogLeNet
CPU - 64 - ResNet-50
CPU - 256 - GoogLeNet
CPU - 256 - ResNet-50
CPU - 512 - GoogLeNet
CPU - 512 - ResNet-50
Google Draco:
Lion
Church Facade
Stress-NG:
Hash
MMAP
NUMA
Poll
Zlib
Futex
MEMFD
Mutex
Atomic
Crypto
Malloc
Forking
Pthread
IO_uring
SENDFILE
CPU Cache
CPU Stress
Semaphores
Matrix Math
Vector Math
Function Call
x86_64 RdRand
Memory Copying
Socket Activity
Context Switching
Glibc C String Functions
Glibc Qsort Data Sorting
System V Message Passing
Blender:
BMW27 - CPU-Only
Classroom - CPU-Only
Fishy Cat - CPU-Only
Barbershop - CPU-Only
Pabellon Barcelona - CPU-Only
RocksDB:
Rand Fill
Rand Read
Update Rand
Seq Fill
Rand Fill Sync
Read While Writing
Read Rand Write Rand
nginx:
200
500
ONNX Runtime:
GPT-2 - CPU - Parallel
GPT-2 - CPU - Standard
yolov4 - CPU - Parallel
yolov4 - CPU - Standard
bertsquad-12 - CPU - Parallel
bertsquad-12 - CPU - Standard
CaffeNet 12-int8 - CPU - Parallel
CaffeNet 12-int8 - CPU - Standard
fcn-resnet101-11 - CPU - Parallel
fcn-resnet101-11 - CPU - Standard
ArcFace ResNet-100 - CPU - Parallel
ArcFace ResNet-100 - CPU - Standard
ResNet50 v1-12-int8 - CPU - Parallel
ResNet50 v1-12-int8 - CPU - Standard
super-resolution-10 - CPU - Parallel
super-resolution-10 - CPU - Standard
Faster R-CNN R-50-FPN-int8 - CPU - Parallel
Faster R-CNN R-50-FPN-int8 - CPU - Standard
Apache HTTP Server:
200
500
OpenCV:
Core
Video
Graph API
Stitching
Features 2D
Image Processing
Object Detection
DNN - Deep Neural Network
oneDNN:
IP Shapes 1D - f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
IP Shapes 1D - bf16bf16bf16 - CPU
IP Shapes 3D - bf16bf16bf16 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU

a

Testing initiated at 31 March 2023 11:44 by user phoronix.

b

Testing initiated at 31 March 2023 21:58 by user phoronix.

c

Testing initiated at 1 April 2023 06:25 by user phoronix.

xeon eo march

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

a

b

c

SPECFEM3D

Zstd Compression

John The Ripper

dav1d

AOM AV1

Embree

SVT-AV1

VP9 libvpx Encoding

VVenC

Timed FFmpeg Compilation

Timed Godot Game Engine Compilation

Timed LLVM Compilation

Timed Node.js Compilation

Build2

FFmpeg

OpenSSL

Memcached

GROMACS

Darmstadt Automotive Parallel Heterogeneous Suite

MariaDB

TensorFlow

Google Draco

Stress-NG

Blender

RocksDB

nginx

ONNX Runtime

Apache HTTP Server

OpenCV

oneDNN

209 Results Shown

a

b

c