AMD EPYC 9684X 3D V-Cache Benchmark

AMD EPYC 9684X 96-Core testing by Michael Larabel for a future article. Various benchmarks conducted with the EPYC 9684X 1P and then repeated after disabling 3D V-Cache from the BIOS to see direct comparison of 3DV impact. Plus monitoring CPU thermal / power / frequency for future follow-up article.

Default

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101121
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

3DV Disabled

Processor: AMD EPYC 9684X 96-Core @ 2.55GHz (96 Cores / 192 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 768GB, Disk: 2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe

OS: Ubuntu 22.04, Kernel: 5.19.0-41-generic (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.4, Vulkan: 1.3.224, Compiler: GCC 11.3.0, File-System: ext4, Screen Resolution: 1024x768

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Total Mop/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Result

Nodes Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Nodes Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Result

CG Mflops Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Result

Figure Of Merit Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

Result

GFLOPS/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOPS/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOPS/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOPS/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Palabos

The Palabos library is a framework for general purpose Computational Fluid Dynamics (CFD). Palabos uses a kernel based on the Lattice Boltzmann method. This test profile uses the Palabos MPI-based Cavity3D benchmark. Learn more via the OpenBenchmarking.org test page.

Result

Mega Site Updates Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Mega Site Updates Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Mega Site Updates Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

Result

z/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

Result

H/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

srsRAN Project

srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

Result

Mbps Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Result

Frames Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Frames Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Frames Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

Result

GFLOP/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Result

Items Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Items Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Items Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Items Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Items Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Result

Nodes Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Timed PHP Compilation

This test times how long it takes to build PHP. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Result

Score Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Result

samples/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Million Grid Points Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

Mpix/sec Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

Million Grid Points Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Iterations Per Second Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

Result

MT/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

MT/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

MT/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

Result

Ns Per Day Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Result

images/sec Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Bogo Ops/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

WRF

WRF, the Weather Research and Forecasting Model, is a "next-generation mesoscale numerical weather prediction system designed for both atmospheric research and operational forecasting applications. It features two dynamical cores, a data assimilation system, and a software architecture supporting parallel computation and system extensibility." Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Blender

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

PETSc

PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build. Learn more via the OpenBenchmarking.org test page.

Result

MB/s Per Watt

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

Result

CPU Peak Freq (Highest CPU Core Frequency

CPU Power Consumption

CPU Temp

Result Confidence

CPU Peak Freq (Highest CPU Core Frequency) Monitor

CPU Power Consumption Monitor

CPU Temperature Monitor

130 Results Shown

High Performance Conjugate Gradient:
104 104 104 - 60
144 144 144 - 60
160 160 160 - 60
192 192 192 - 60
NAS Parallel Benchmarks:
BT.C
CG.C
EP.D
IS.D
LU.C
MG.C
SP.C
LeelaChessZero:
BLAS
Eigen
miniFE
CloverLeaf
NAMD
Algebraic Multi-Grid Benchmark
libxsmm:
128
256
32
64
HeFFTe - Highly Efficient FFT for Exascale:
c2c - FFTW - float - 512
r2c - FFTW - float - 512
c2c - FFTW - double - 256
c2c - FFTW - double - 512
r2c - FFTW - double - 256
r2c - FFTW - double - 512
Palabos:
400
500
1000
Xcompact3d Incompact3d:
input.i3d 129 Cells Per Direction
input.i3d 193 Cells Per Direction
OpenFOAM:
drivaerFastback, Large Mesh Size - Mesh Time
drivaerFastback, Large Mesh Size - Execution Time
drivaerFastback, Medium Mesh Size - Mesh Time
drivaerFastback, Medium Mesh Size - Execution Time
Remhos
LULESH
Xmrig
srsRAN Project
Embree:
Pathtracer ISPC - Crown
Pathtracer ISPC - Asian Dragon
Pathtracer ISPC - Asian Dragon Obj
ACES DGEMM
OSPRay:
particle_volume/ao/real_time
particle_volume/scivis/real_time
gravity_spheres_volume/dim_512/ao/real_time
gravity_spheres_volume/dim_512/scivis/real_time
gravity_spheres_volume/dim_512/pathtracer/real_time
Stockfish
Timed Gem5 Compilation
Timed Godot Game Engine Compilation
Timed Linux Kernel Compilation
Timed LLVM Compilation:
Ninja
Unix Makefiles
Timed Node.js Compilation
Timed PHP Compilation
OSPRay Studio:
1 - 4K - 1 - Path Tracer
2 - 4K - 1 - Path Tracer
3 - 4K - 1 - Path Tracer
1 - 4K - 16 - Path Tracer
1 - 4K - 32 - Path Tracer
2 - 4K - 16 - Path Tracer
2 - 4K - 32 - Path Tracer
3 - 4K - 16 - Path Tracer
3 - 4K - 32 - Path Tracer
Numpy Benchmark
Ngspice:
C2670
C7552
Liquid-DSP
ASKAP:
tConvolve MT - Gridding
tConvolve MT - Degridding
tConvolve MPI - Degridding
tConvolve MPI - Gridding
tConvolve OpenMP - Gridding
tConvolve OpenMP - Degridding
Hogbom Clean OpenMP
ASTC Encoder:
Medium
Thorough
Exhaustive
GROMACS
TensorFlow
Neural Magic DeepSparse:
NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
items/sec
ms/batch
Google Draco:
Lion
Church Facade
Stress-NG:
Pipe
Futex
Mutex
Malloc
AVL Tree
CPU Cache
CPU Stress
Semaphores
Matrix Math
Vector Math
Matrix 3D Math
Memory Copying
Wide Vector Math
Fused Multiply-Add
Vector Floating Point
WRF
GPAW
Blender:
BMW27 - CPU-Only
Classroom - CPU-Only
Fishy Cat - CPU-Only
Barbershop - CPU-Only
Pabellon Barcelona - CPU-Only
OpenVINO:
Person Detection FP16 - CPU:
FPS
ms
Person Detection FP32 - CPU:
FPS
ms
Vehicle Detection FP16 - CPU:
FPS
ms
Vehicle Detection FP16-INT8 - CPU:
FPS
ms
Person Vehicle Bike Detection FP16 - CPU:
FPS
ms
PETSc
PyHPC Benchmarks:
CPU - Numpy - 4194304 - Equation of State
CPU - Numpy - 4194304 - Isoneutral Mixing
CPU Peak Freq (Highest CPU Core Frequency) Monitor:
Phoronix Test Suite System Monitoring:
Megahertz
Watts
Celsius

Default

Testing initiated at 17 July 2023 20:53 by user phoronix.

3DV Disabled

Testing initiated at 19 July 2023 04:28 by user phoronix.

AMD EPYC 9684X 3D V-Cache Benchmark

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

Default

3DV Disabled

High Performance Conjugate Gradient

NAS Parallel Benchmarks

LeelaChessZero

miniFE

CloverLeaf

NAMD

Algebraic Multi-Grid Benchmark

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

Palabos

Xcompact3d Incompact3d

OpenFOAM

Remhos

LULESH

Xmrig

srsRAN Project

Embree

ACES DGEMM

OSPRay

Stockfish

Timed Gem5 Compilation

Timed Godot Game Engine Compilation

Timed Linux Kernel Compilation

Timed LLVM Compilation

Timed Node.js Compilation

Timed PHP Compilation

OSPRay Studio

Numpy Benchmark

Ngspice

Liquid-DSP

ASKAP

ASTC Encoder

GROMACS

TensorFlow

Neural Magic DeepSparse

Google Draco

Stress-NG

WRF

GPAW

Blender

OpenVINO

PETSc

PyHPC Benchmarks

CPU Peak Freq (Highest CPU Core Frequency) Monitor

CPU Power Consumption Monitor

CPU Temperature Monitor

130 Results Shown

Default

3DV Disabled