dddas

AMD Ryzen Threadripper 3970X 32-Core testing with a ASUS ROG ZENITH II EXTREME (1603 BIOS) and AMD Radeon RX 5700 8GB on Ubuntu 22.04 via the Phoronix Test Suite.

a

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830104d
Graphics Notes: BAR1 / Visible vRAM Size: 256 MB - vBIOS Version: 113-D1820201-101
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

b

Processor: AMD Ryzen Threadripper 3970X 32-Core @ 3.70GHz (32 Cores / 64 Threads), Motherboard: ASUS ROG ZENITH II EXTREME (1603 BIOS), Chipset: AMD Starship/Matisse, Memory: 64GB, Disk: Samsung SSD 980 PRO 500GB, Graphics: AMD Radeon RX 5700 8GB (1750/875MHz), Audio: AMD Navi 10 HDMI Audio, Monitor: ASUS VP28U, Network: Aquantia AQC107 NBase-T/IEEE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 22.04, Kernel: 5.19.0-051900rc7-generic (x86_64), Desktop: GNOME Shell 42.2, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.0.1 (LLVM 13.0.1 DRM 3.47), Vulkan: 1.2.204, Compiler: GCC 11.3.0, File-System: ext4, Screen Resolution: 3840x2160

Whisper.cpp

Whisper.cpp is a port of OpenAI's Whisper model in C/C++. Whisper.cpp is developed by Georgi Gerganov for transcribing WAV audio files to text / speech recognition. Whisper.cpp supports ARM NEON, x86 AVX, and other advanced CPU features. Learn more via the OpenBenchmarking.org test page.

SQLite

This is a simple benchmark of SQLite. At present this test profile just measures the time to perform a pre-defined number of insertions on an indexed database with a variable number of concurrent repetitions -- up to the maximum number of CPU threads available. Learn more via the OpenBenchmarking.org test page.

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

SQLite

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

PETSc

PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build. Learn more via the OpenBenchmarking.org test page.

SQLite

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

libxsmm

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

QMCPACK

Palabos

The Palabos library is a framework for general purpose Computational Fluid Dynamics (CFD). Palabos uses a kernel based on the Lattice Boltzmann method. This test profile uses the Palabos MPI-based Cavity3D benchmark. Learn more via the OpenBenchmarking.org test page.

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Whisper.cpp

OSPRay

Palabos

QMCPACK

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

Xonotic

This is a benchmark of Xonotic, which is a fork of the DarkPlaces-based Nexuiz game. Development began in March of 2010 on the Xonotic game for this open-source first person shooter title. Learn more via the OpenBenchmarking.org test page.

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Laghos

Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

SQLite

OSPRay

Xonotic

oneDNN

Xonotic

oneDNN

Xonotic

Z3 Theorem Prover

The Z3 Theorem Prover / SMT solver is developed by Microsoft Research under the MIT license. Learn more via the OpenBenchmarking.org test page.

OSPRay

Xonotic

OSPRay

HeFFTe - Highly Efficient FFT for Exascale

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OSPRay

HeFFTe - Highly Efficient FFT for Exascale

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Xonotic

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. More details on the CP2K benchmark test cases and details can be found @ https://www.cp2k.org/performance Learn more via the OpenBenchmarking.org test page.

Xonotic

Neural Magic DeepSparse

oneDNN

Neural Magic DeepSparse

VVenC

Neural Magic DeepSparse

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

Laghos

Neural Magic DeepSparse

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

VVenC

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus five times. Learn more via the OpenBenchmarking.org test page.

oneDNN

Neural Magic DeepSparse

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

libxsmm

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

Z3 Theorem Prover

The Z3 Theorem Prover / SMT solver is developed by Microsoft Research under the MIT license. Learn more via the OpenBenchmarking.org test page.

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

QMCPACK

Embree

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

VVenC

dav1d

Dav1d is an open-source, speedy AV1 video decoder supporting modern SIMD CPU features. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

dav1d

CP2K Molecular Dynamics

oneDNN

Embree

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Embree

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

dav1d

SVT-AV1

Embree

HeFFTe - Highly Efficient FFT for Exascale

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

a: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

b: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

Monte Carlo Simulations of Ionised Nebulae

SVT-AV1

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

CP2K Molecular Dynamics

Input: H2O-DFT-LS

a: The test quit with a non-zero exit status. E: mpirun noticed that process rank 13 with PID 0 on node phoronix-System-Product-Name exited on signal 9 (Killed).

b: The test quit with a non-zero exit status. E: mpirun noticed that process rank 23 with PID 0 on node phoronix-System-Product-Name exited on signal 9 (Killed).

Palabos

Grid Size: 1000

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

oneDNN

SVT-AV1

HeFFTe - Highly Efficient FFT for Exascale

SVT-AV1

HeFFTe - Highly Efficient FFT for Exascale

SVT-AV1

oneDNN

dav1d

HeFFTe - Highly Efficient FFT for Exascale

oneDNN

SVT-AV1

Palabos

Grid Size: 4000

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

HeFFTe - Highly Efficient FFT for Exascale

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

a: The test run did not produce a result.

b: The test run did not produce a result.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Test: x86_64 RdRand

a: The test run did not produce a result. E: stress-ng: error: [1222741] No stress workers invoked (one or more were unsupported)

b: The test run did not produce a result. E: stress-ng: error: [3041301] No stress workers invoked (one or more were unsupported)

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

a: The test run did not produce a result.

b: The test run did not produce a result.

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

a: The test run did not produce a result.

b: The test run did not produce a result.

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

a: The test run did not produce a result.

b: The test run did not produce a result.

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

a: The test run did not produce a result.

b: The test run did not produce a result.

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Run: RTLightmap.hdr.4096x4096 - Device: Intel oneAPI SYCL

a: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

b: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

Run: RT.ldr_alb_nrm.3840x2160 - Device: Intel oneAPI SYCL

a: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

b: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

Run: RT.hdr_alb_nrm.3840x2160 - Device: Intel oneAPI SYCL

a: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

b: The test quit with a non-zero exit status. E: Error: unsupported device type: SYCL

Run: RTLightmap.hdr.4096x4096 - Device: Radeon HIP

a: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

b: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

Run: RT.ldr_alb_nrm.3840x2160 - Device: Radeon HIP

a: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

b: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

Run: RT.hdr_alb_nrm.3840x2160 - Device: Radeon HIP

a: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

b: The test quit with a non-zero exit status. E: Error: unsupported device type: HIP

218 Results Shown

Whisper.cpp:
ggml-medium.en - 2016 State of the Union
ggml-small.en - 2016 State of the Union
SQLite:
64
32
libxsmm
SQLite:
16
4
oneDNN
PETSc
SQLite:
8
2
nekRS:
Kershaw
TurboPipe Periodic
High Performance Conjugate Gradient
QMCPACK
libxsmm
Monte Carlo Simulations of Ionised Nebulae
QMCPACK
Palabos
OSPRay
Whisper.cpp
OSPRay
Palabos:
400
500
QMCPACK
LevelDB:
Seq Fill:
Microseconds Per Op
MB/s
Xonotic
LevelDB
HeFFTe - Highly Efficient FFT for Exascale:
c2c - FFTW - double-long - 512
c2c - Stock - double-long - 512
Stress-NG:
Socket Activity
Pipe
Laghos
GPAW
VVenC
SQLite
OSPRay
Xonotic:
2560 x 1440 - Ultimate
1920 x 1200 - Ultimate
1920 x 1080 - Ultimate
3840 x 2160 - Ultra
oneDNN:
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Xonotic
oneDNN:
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Xonotic:
1920 x 1080 - Ultra
1920 x 1200 - Ultra
2560 x 1440 - Ultra
Z3 Theorem Prover
OSPRay
Xonotic:
1920 x 1080 - High
2560 x 1440 - High
1920 x 1200 - High
OSPRay
HeFFTe - Highly Efficient FFT for Exascale
LevelDB
OSPRay
HeFFTe - Highly Efficient FFT for Exascale
Neural Magic DeepSparse:
NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream:
ms/batch
items/sec
Xonotic
CP2K Molecular Dynamics
Xonotic:
1920 x 1080 - Low
1920 x 1200 - Low
2560 x 1440 - Low
Neural Magic DeepSparse:
NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
ms/batch
items/sec
oneDNN
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream:
ms/batch
items/sec
VVenC
Neural Magic DeepSparse:
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
ms/batch
items/sec
Kripke
Neural Magic DeepSparse:
NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
ms/batch
items/sec
Laghos
Neural Magic DeepSparse:
NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Stream:
ms/batch
items/sec
SVT-AV1
VVenC
LevelDB:
Rand Read
Hot Read
Neural Magic DeepSparse:
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
ms/batch
items/sec
Opus Codec Encoding
oneDNN
Neural Magic DeepSparse:
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream:
ms/batch
items/sec
eSpeak-NG Speech Engine
Neural Magic DeepSparse:
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO - Synchronous Single-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
ms/batch
items/sec
Stress-NG
Neural Magic DeepSparse:
CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
ms/batch
items/sec
libxsmm:
64
32
Intel Open Image Denoise
Stress-NG:
IO_uring
MMAP
Malloc
Cloning
MEMFD
Atomic
CPU Cache
Liquid-DSP:
64 - 256 - 512
8 - 256 - 512
Stress-NG
Liquid-DSP
Stress-NG
Liquid-DSP:
8 - 256 - 32
8 - 256 - 57
Stress-NG:
Memory Copying
NUMA
Liquid-DSP
Stress-NG:
Matrix 3D Math
Vector Shuffle
Function Call
Semaphores
Wide Vector Math
Vector Floating Point
Glibc C String Functions
Liquid-DSP
Stress-NG:
System V Message Passing
Floating Point
Liquid-DSP
Stress-NG
Liquid-DSP
Stress-NG:
Mutex
AVL Tree
Crypto
Liquid-DSP
Stress-NG:
Context Switching
Forking
Vector Math
Matrix Math
Hash
Glibc Qsort Data Sorting
CPU Stress
SENDFILE
Fused Multiply-Add
Liquid-DSP:
32 - 256 - 32
2 - 256 - 512
16 - 256 - 57
16 - 256 - 32
1 - 256 - 512
1 - 256 - 32
2 - 256 - 32
4 - 256 - 57
4 - 256 - 32
2 - 256 - 57
1 - 256 - 57
Z3 Theorem Prover
Embree
QMCPACK
Embree
LevelDB:
Rand Fill:
Microseconds Per Op
MB/s
Overwrite:
Microseconds Per Op
MB/s
VVenC
dav1d
Remhos
dav1d
CP2K Molecular Dynamics
oneDNN:
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Embree:
Pathtracer ISPC - Crown
Pathtracer - Crown
Intel Open Image Denoise
Embree
Intel Open Image Denoise
dav1d
SVT-AV1
Embree
HeFFTe - Highly Efficient FFT for Exascale:
c2c - FFTW - double-long - 256
c2c - Stock - double-long - 256
Monte Carlo Simulations of Ionised Nebulae
SVT-AV1
LevelDB:
Fill Sync:
Microseconds Per Op
MB/s
oneDNN:
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
SVT-AV1
HeFFTe - Highly Efficient FFT for Exascale
SVT-AV1
HeFFTe - Highly Efficient FFT for Exascale
SVT-AV1
oneDNN:
Convolution Batch Shapes Auto - u8s8f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
dav1d
HeFFTe - Highly Efficient FFT for Exascale
oneDNN:
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
SVT-AV1:
Preset 12 - Bosphorus 1080p
Preset 13 - Bosphorus 1080p
HeFFTe - Highly Efficient FFT for Exascale:
c2c - Stock - double-long - 128
c2c - FFTW - double-long - 128
r2c - Stock - double-long - 128

a

Testing initiated at 23 June 2023 21:26 by user phoronix.

b

Testing initiated at 24 June 2023 11:43 by user phoronix.

dddas

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

a

b

Whisper.cpp

SQLite

libxsmm

SQLite

oneDNN

PETSc

SQLite

nekRS

High Performance Conjugate Gradient

QMCPACK

libxsmm

Monte Carlo Simulations of Ionised Nebulae

QMCPACK

Palabos

OSPRay

Whisper.cpp

OSPRay

Palabos

QMCPACK

LevelDB

Xonotic

LevelDB

HeFFTe - Highly Efficient FFT for Exascale

Stress-NG

Laghos

GPAW

VVenC

SQLite

OSPRay

Xonotic

oneDNN

Xonotic

oneDNN

Xonotic

Z3 Theorem Prover

OSPRay

Xonotic

OSPRay

HeFFTe - Highly Efficient FFT for Exascale

LevelDB

OSPRay

HeFFTe - Highly Efficient FFT for Exascale

Neural Magic DeepSparse

Xonotic

CP2K Molecular Dynamics

Xonotic

Neural Magic DeepSparse

oneDNN

Neural Magic DeepSparse

VVenC

Neural Magic DeepSparse

Kripke

Neural Magic DeepSparse

Laghos

Neural Magic DeepSparse

SVT-AV1

VVenC

LevelDB

Neural Magic DeepSparse

Opus Codec Encoding

oneDNN

Neural Magic DeepSparse

eSpeak-NG Speech Engine

Neural Magic DeepSparse

Stress-NG

Neural Magic DeepSparse

libxsmm

Intel Open Image Denoise

Stress-NG

Liquid-DSP

Stress-NG