ddf

AMD EPYC 8534PN 64-Core testing with an AMD Cinnabar (RCB1009C BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

a

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212
Python Notes: Python 3.11.5
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

b

c

Processor: AMD EPYC 8534PN 64-Core @ 2.00GHz (64 Cores / 128 Threads), Motherboard: AMD Cinnabar (RCB1009C BIOS), Chipset: AMD Device 14a4, Memory: 192GB, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 PCIe

OS: Ubuntu 23.10, Kernel: 6.5.0-5-generic (x86_64), Desktop: GNOME Shell, Display Server: X Server 1.21.1.7, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 640x480

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format, tested here with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

QuantLib

QuantLib is an open-source library/framework for quantitative finance covering modeling, trading, and risk management scenarios. QuantLib is written in C++ with Boost, and its built-in benchmark reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version. Learn more via the OpenBenchmarking.org test page.

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Speedb

Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile makes use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/], a benchmark for video-as-a-service workloads. The test profile offers a range of vbench scenarios based on freely distributable video content and the option of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

rav1e

Xiph rav1e is a Rust-written AV1 video encoder that claims to be the fastest and safest AV1 encoder. Learn more via the OpenBenchmarking.org test page.

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

easyWave

The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.

Y-Cruncher

Y-Cruncher is a multi-threaded Pi benchmark capable of computing Pi to trillions of digits. Learn more via the OpenBenchmarking.org test page.

137 Results Shown

LeelaChessZero
WebP2 Image Encode
Timed Gem5 Compilation
LeelaChessZero
SVT-AV1
WebP2 Image Encode
QuantLib
PyTorch
CloverLeaf
Xmrig
Speedb
SVT-AV1
OpenRadioss
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream:
items/sec
ms/batch
Speedb
SVT-AV1
WebP2 Image Encode
PyTorch
SVT-AV1:
Preset 12 - Bosphorus 4K
Preset 4 - Bosphorus 4K
PyTorch
SVT-AV1
PyTorch
OpenRadioss
Embree
Speedb
OpenRadioss
Embree
FFmpeg
PyTorch
rav1e
Neural Magic DeepSparse:
CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
ms/batch
items/sec
SVT-AV1
PyTorch
OpenRadioss
SVT-AV1
Speedb
Embree
Blender
PyTorch
Blender
Neural Magic DeepSparse
Timed FFmpeg Compilation
easyWave
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering - Synchronous Single-Stream:
ms/batch
items/sec
TensorFlow
PyTorch
Neural Magic DeepSparse:
ResNet-50, Baseline - Synchronous Single-Stream:
items/sec
ms/batch
OpenRadioss
Xmrig
TensorFlow
Neural Magic DeepSparse
Quicksilver
WebP2 Image Encode
Y-Cruncher
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream
easyWave
FFmpeg
Speedb
Embree
Neural Magic DeepSparse:
ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
rav1e
PyTorch:
CPU - 16 - Efficientnet_v2_l
CPU - 32 - Efficientnet_v2_l
Quicksilver
Neural Magic DeepSparse
rav1e
Neural Magic DeepSparse
OpenRadioss
Xmrig
Neural Magic DeepSparse:
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
ms/batch
items/sec
Xmrig
Neural Magic DeepSparse
Xmrig
PyTorch
Y-Cruncher
Neural Magic DeepSparse:
CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
items/sec
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
CloverLeaf
Embree
TensorFlow
Neural Magic DeepSparse:
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream
ResNet-50, Baseline - Asynchronous Multi-Stream
ResNet-50, Baseline - Asynchronous Multi-Stream
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream
FFmpeg:
libx265 - Video On Demand
libx265 - Platform
Blender
Y-Cruncher
Neural Magic DeepSparse
Quicksilver
Blender
Neural Magic DeepSparse:
NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
items/sec
ms/batch
QuantLib
Neural Magic DeepSparse
Speedb
Neural Magic DeepSparse:
NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream:
items/sec
Xmrig
Neural Magic DeepSparse
Y-Cruncher
Neural Magic DeepSparse:
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
ResNet-50, Sparse INT8 - Synchronous Single-Stream
Embree
Neural Magic DeepSparse:
CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
ms/batch
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
ms/batch
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
items/sec
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
items/sec
TensorFlow
easyWave
Neural Magic DeepSparse
Blender
Speedb
TensorFlow:
CPU - 16 - GoogLeNet
CPU - 1 - ResNet-50
CPU - 1 - GoogLeNet
CPU - 1 - VGG-16
PyTorch
rav1e
WebP2 Image Encode

a

Testing initiated at 7 January 2024 21:18 by user phoronix.

b

Testing initiated at 8 January 2024 00:13 by user phoronix.

c

Testing initiated at 8 January 2024 10:00 by user phoronix.
