AMD EPYC 9684X 3D V-Cache
AMD EPYC 9684X 96-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2307207-NE-UPLOAD92587&grw.
Stress-NG
Test: Hash
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512
Stress-NG
Test: MMAP
Stress-NG
Test: NUMA
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128
Stress-NG
Test: Pipe
Stress-NG
Test: Poll
Stress-NG
Test: Zlib
Stress-NG
Test: MEMFD
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512
Stress-NG
Test: Mutex
Stress-NG
Test: Crypto
Stress-NG
Test: Malloc
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128
Stress-NG
Test: Cloning
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256
Laghos
Test: Triple Point Problem
Laghos
Test: Sedov Blast Wave, ube_922_hex.mesh
libxsmm
M N K: 32
libxsmm
M N K: 64
libxsmm
M N K: 256
libxsmm
M N K: 128
Stress-NG
Test: Forking
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512
Stress-NG
Test: Pthread
Stress-NG
Test: AVL Tree
Stress-NG
Test: IO_uring
Stress-NG
Test: SENDFILE
Stress-NG
Test: CPU Cache
Palabos
Grid Size: 500
Palabos
Grid Size: 1000
Palabos
Grid Size: 400
Palabos
Grid Size: 100
Stress-NG
Test: CPU Stress
SPECFEM3D
Model: Water-layered Halfspace
Stress-NG
Test: Semaphores
Stress-NG
Test: Matrix Math
Stress-NG
Test: Vector Math
Stress-NG
Test: Function Call
Stress-NG
Test: Floating Point
SPECFEM3D
Model: Layered Halfspace
SPECFEM3D
Model: Tomographic Model
Remhos
Test: Sample Remap Example
SPECFEM3D
Model: Mount St. Helens
HeFFTe - Highly Efficient FFT for Exascale
Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256
Stress-NG
Test: Matrix 3D Math
Stress-NG
Test: Memory Copying
Stress-NG
Test: Vector Shuffle
Stress-NG
Test: Socket Activity
Stress-NG
Test: Wide Vector Math
HeFFTe - Highly Efficient FFT for Exascale
Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256
Stress-NG
Test: Atomic
SPECFEM3D
Model: Homogeneous Halfspace
Stress-NG
Test: Futex
Stress-NG
Test: Context Switching
Stress-NG
Test: Fused Multiply-Add
Stress-NG
Test: Vector Floating Point
Stress-NG
Test: Glibc C String Functions
Stress-NG
Test: Glibc Qsort Data Sorting
Stress-NG
Test: System V Message Passing
Ngspice
Circuit: C2670
Ngspice
Circuit: C7552
ASTC Encoder
Preset: Medium
ASTC Encoder
Preset: Thorough
ASTC Encoder
Preset: Exhaustive
Google Draco
Model: Lion
Google Draco
Model: Church Facade
Xmrig
Variant: Monero - Hash Count: 1M
Xmrig
Variant: Wownero - Hash Count: 1M
WRF
Input: conus 2.5km
TensorFlow
Device: CPU - Batch Size: 16 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 32 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 64 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 256 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 512 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 16 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 16 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 32 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 32 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 64 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 64 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 256 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 256 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 512 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 512 - Model: ResNet-50
LeelaChessZero
Backend: BLAS
LeelaChessZero
Backend: Eigen
Numpy Benchmark
CloverLeaf
Lagrangian-Eulerian Hydrodynamics
Neural Magic DeepSparse
Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream
Neural Magic DeepSparse
Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream
Whisper.cpp
Model: ggml-base.en - Input: 2016 State of the Union
Whisper.cpp
Model: ggml-small.en - Input: 2016 State of the Union
Whisper.cpp
Model: ggml-medium.en - Input: 2016 State of the Union
GROMACS
Implementation: MPI CPU - Input: water_GMX50_bare
LAMMPS Molecular Dynamics Simulator
Model: 20k Atoms
High Performance Conjugate Gradient
X Y Z: 104 104 104 - RT: 60
High Performance Conjugate Gradient
X Y Z: 144 144 144 - RT: 60
High Performance Conjugate Gradient
X Y Z: 160 160 160 - RT: 60
High Performance Conjugate Gradient
X Y Z: 192 192 192 - RT: 60
NAS Parallel Benchmarks
Test / Class: BT.C
NAS Parallel Benchmarks
Test / Class: CG.C
NAS Parallel Benchmarks
Test / Class: EP.D
NAS Parallel Benchmarks
Test / Class: FT.C
NAS Parallel Benchmarks
Test / Class: IS.D
NAS Parallel Benchmarks
Test / Class: LU.C
NAS Parallel Benchmarks
Test / Class: MG.C
NAS Parallel Benchmarks
Test / Class: SP.C
NAMD
ATPase Simulation - 327,506 Atoms
OpenVINO
Model: Face Detection FP16 - Device: CPU
OpenVINO
Model: Face Detection FP16 - Device: CPU
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Person Detection FP32 - Device: CPU
OpenVINO
Model: Person Detection FP32 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16 - Device: CPU
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
ASKAP
Test: tConvolve MT - Gridding
ASKAP
Test: tConvolve MT - Degridding
ASKAP
Test: tConvolve MPI - Degridding
ASKAP
Test: tConvolve MPI - Gridding
ASKAP
Test: tConvolve OpenMP - Gridding
ASKAP
Test: tConvolve OpenMP - Degridding
ASKAP
Test: Hogbom Clean OpenMP
ACES DGEMM
Sustained Floating-Point Rate
Algebraic Multi-Grid Benchmark
PyHPC Benchmarks
Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State
PyHPC Benchmarks
Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing
LULESH
OpenFOAM
Input: drivaerFastback, Large Mesh Size - Mesh Time
OpenFOAM
Input: drivaerFastback, Large Mesh Size - Execution Time
OpenFOAM
Input: drivaerFastback, Small Mesh Size - Mesh Time
OpenFOAM
Input: drivaerFastback, Small Mesh Size - Execution Time
OpenFOAM
Input: drivaerFastback, Medium Mesh Size - Mesh Time
OpenFOAM
Input: drivaerFastback, Medium Mesh Size - Execution Time
Monte Carlo Simulations of Ionised Nebulae
Input: Gas HII40
Monte Carlo Simulations of Ionised Nebulae
Input: Dust 2D tau100.0
miniFE
Problem Size: Small
Xcompact3d Incompact3d
Input: input.i3d 129 Cells Per Direction
Xcompact3d Incompact3d
Input: input.i3d 193 Cells Per Direction
GPAW
Input: Carbon Nanotube
Stockfish
Total Time
7-Zip Compression
Test: Compression Rating
7-Zip Compression
Test: Decompression Rating
Timed LLVM Compilation
Build System: Ninja
Timed LLVM Compilation
Build System: Unix Makefiles
Timed PHP Compilation
Time To Compile
Zstd Compression
Compression Level: 3 - Compression Speed
Zstd Compression
Compression Level: 3 - Decompression Speed
Zstd Compression
Compression Level: 8 - Compression Speed
Zstd Compression
Compression Level: 8 - Decompression Speed
Zstd Compression
Compression Level: 12 - Compression Speed
Zstd Compression
Compression Level: 12 - Decompression Speed
Zstd Compression
Compression Level: 19 - Compression Speed
Zstd Compression
Compression Level: 19 - Decompression Speed
Zstd Compression
Compression Level: 3, Long Mode - Compression Speed
Zstd Compression
Compression Level: 3, Long Mode - Decompression Speed
Zstd Compression
Compression Level: 8, Long Mode - Compression Speed
Zstd Compression
Compression Level: 8, Long Mode - Decompression Speed
Zstd Compression
Compression Level: 19, Long Mode - Compression Speed
Zstd Compression
Compression Level: 19, Long Mode - Decompression Speed
asmFish
1024 Hash Memory, 26 Depth
Timed Linux Kernel Compilation
Build: defconfig
Timed Linux Kernel Compilation
Build: allmodconfig
Blender
Blend File: BMW27 - Compute: CPU-Only
Blender
Blend File: Classroom - Compute: CPU-Only
Blender
Blend File: Fishy Cat - Compute: CPU-Only
Blender
Blend File: Barbershop - Compute: CPU-Only
Blender
Blend File: Pabellon Barcelona - Compute: CPU-Only
Timed Godot Game Engine Compilation
Time To Compile
Embree
Binary: Pathtracer ISPC - Model: Crown
Embree
Binary: Pathtracer ISPC - Model: Asian Dragon
Embree
Binary: Pathtracer ISPC - Model: Asian Dragon Obj
OSPRay
Benchmark: particle_volume/ao/real_time
OSPRay
Benchmark: particle_volume/scivis/real_time
OSPRay
Benchmark: particle_volume/pathtracer/real_time
OSPRay
Benchmark: gravity_spheres_volume/dim_512/ao/real_time
OSPRay
Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
OSPRay
Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
PETSc
Test: Streams
OSPRay Studio
Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
OSPRay Studio
Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
OSPRay Studio
Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer
Timed Gem5 Compilation
Time To Compile
Timed Node.js Compilation
Time To Compile
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 192 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 192 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 512
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 512
Liquid-DSP
Threads: 192 - Buffer Length: 256 - Filter Length: 512
srsRAN Project
Test: Downlink Processor Benchmark
srsRAN Project
Test: PUSCH Processor Benchmark, Throughput Total
Phoronix Test Suite v10.8.5