AMD EPYC 9755 DDR5 Turin Memory Performance
AMD EPYC 9755 memory performance at the default DDR5-6000 versus DDR5-4800. Benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2410130-NE-TURINDDR565&gru&sor.
PyTorch
Device: CPU - Batch Size: 1 - Model: ResNet-50
PyTorch
Device: CPU - Batch Size: 1 - Model: ResNet-152
PyTorch
Device: CPU - Batch Size: 64 - Model: ResNet-50
PyTorch
Device: CPU - Batch Size: 64 - Model: ResNet-152
PyTorch
Device: CPU - Batch Size: 256 - Model: ResNet-50
PyTorch
Device: CPU - Batch Size: 256 - Model: ResNet-152
PyTorch
Device: CPU - Batch Size: 512 - Model: ResNet-50
PyTorch
Device: CPU - Batch Size: 512 - Model: ResNet-152
miniBUDE
Implementation: OpenMP - Input Deck: BM1
miniBUDE
Implementation: OpenMP - Input Deck: BM2
OpenSSL
Algorithm: SHA256
OpenSSL
Algorithm: SHA512
OpenSSL
Algorithm: AES-128-GCM
OpenSSL
Algorithm: AES-256-GCM
OpenSSL
Algorithm: ChaCha20
OpenSSL
Algorithm: ChaCha20-Poly1305
miniFE
Problem Size: Small
Algebraic Multi-Grid Benchmark
FFmpeg
Encoder: libx265 - Scenario: Upload
FFmpeg
Encoder: libx265 - Scenario: Platform
FFmpeg
Encoder: libx265 - Scenario: Video On Demand
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Face Detection Retail FP16-INT8 - Device: CPU
OpenVINO
Model: Handwritten English Recognition FP16-INT8 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
OpenVINO
Model: Person Re-Identification Retail FP16 - Device: CPU
OpenVINO
Model: Noise Suppression Poconet-Like FP16 - Device: CPU
Embree
Binary: Pathtracer ISPC - Model: Asian Dragon
Embree
Binary: Pathtracer ISPC - Model: Asian Dragon Obj
Embree
Binary: Pathtracer ISPC - Model: Crown
SVT-AV1
Encoder Mode: Preset 13 - Input: Bosphorus 4K
SVT-AV1
Encoder Mode: Preset 12 - Input: Bosphorus 4K
SVT-AV1
Encoder Mode: Preset 8 - Input: Bosphorus 4K
SVT-AV1
Encoder Mode: Preset 4 - Input: Bosphorus 4K
x265
Video Input: Bosphorus 4K
Kvazaar
Video Input: Bosphorus 4K - Video Preset: Slow
Kvazaar
Video Input: Bosphorus 4K - Video Preset: Medium
Kvazaar
Video Input: Bosphorus 4K - Video Preset: Very Fast
uvg266
Video Input: Bosphorus 4K - Video Preset: Slow
uvg266
Video Input: Bosphorus 4K - Video Preset: Medium
uvg266
Video Input: Bosphorus 4K - Video Preset: Very Fast
VVenC
Video Input: Bosphorus 4K - Video Preset: Fast
VVenC
Video Input: Bosphorus 4K - Video Preset: Faster
miniBUDE
Implementation: OpenMP - Input Deck: BM1
miniBUDE
Implementation: OpenMP - Input Deck: BM2
ACES DGEMM
Sustained Floating-Point Rate
High Performance Conjugate Gradient
X Y Z: 144 144 144 - RT: 60
Xmrig
Variant: GhostRider - Hash Count: 1M
Intel Open Image Denoise
Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only
Intel Open Image Denoise
Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only
Intel Open Image Denoise
Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only
TensorFlow
Device: CPU - Batch Size: 1 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 1 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 1 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 64 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 64 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 64 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 256 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 256 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 256 - Model: GoogLeNet
TensorFlow
Device: CPU - Batch Size: 512 - Model: ResNet-50
TensorFlow
Device: CPU - Batch Size: 512 - Model: AlexNet
TensorFlow
Device: CPU - Batch Size: 512 - Model: GoogLeNet
ONNX Runtime
Model: yolov4 - Device: CPU - Executor: Standard
ONNX Runtime
Model: yolov4 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: super-resolution-10 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: bertsquad-12 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: GPT-2 - Device: CPU - Executor: Standard
ONNX Runtime
Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard
ONNX Runtime
Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: T5 Encoder - Device: CPU - Executor: Standard
OpenVKL
Benchmark: vklBenchmarkCPU ISPC
OSPRay
Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
OSPRay
Benchmark: particle_volume/ao/real_time
OSPRay
Benchmark: particle_volume/scivis/real_time
GraphicsMagick
Operation: Noise-Gaussian
GraphicsMagick
Operation: Enhanced
GraphicsMagick
Operation: Sharpen
GraphicsMagick
Operation: Swirl
Coremark
CoreMark Size 666 - Iterations Per Second
LuxCoreRender
Scene: DLSC - Acceleration: CPU
LuxCoreRender
Scene: LuxCore Benchmark - Acceleration: CPU
LuxCoreRender
Scene: Orange Juice - Acceleration: CPU
SecureMark
Benchmark: SecureMark-TLS
Zstd Compression
Compression Level: 19 - Compression Speed
Zstd Compression
Compression Level: 19 - Decompression Speed
Zstd Compression
Compression Level: 19, Long Mode - Compression Speed
Zstd Compression
Compression Level: 19, Long Mode - Decompression Speed
srsRAN Project
Test: PUSCH Processor Benchmark, Throughput Total
srsRAN Project
Test: PUSCH Processor Benchmark, Throughput Thread
srsRAN Project
Test: PDSCH Processor Benchmark, Throughput Total
srsRAN Project
Test: PDSCH Processor Benchmark, Throughput Thread
QuantLib
Configuration: Single-Threaded
QuantLib
Configuration: Multi-Threaded
ASKAP
Test: tConvolve MT - Gridding
ASKAP
Test: tConvolve MT - Degridding
7-Zip Compression
Test: Compression Rating
7-Zip Compression
Test: Decompression Rating
WebP Image Encode
Encode Settings: Quality 100, Highest Compression
WebP Image Encode
Encode Settings: Quality 100, Lossless, Highest Compression
ASKAP
Test: tConvolve MPI - Degridding
ASKAP
Test: tConvolve MPI - Gridding
ASTC Encoder
Preset: Medium
ASTC Encoder
Preset: Thorough
ASTC Encoder
Preset: Very Thorough
ASTC Encoder
Preset: Exhaustive
Stockfish
Chess Benchmark
GROMACS
Implementation: MPI CPU - Input: water_GMX50_bare
LAMMPS Molecular Dynamics Simulator
Model: Rhodopsin Protein
LAMMPS Molecular Dynamics Simulator
Model: 20k Atoms
NAMD
Input: ATPase with 327,506 Atoms
NAMD
Input: STMV with 1,066,628 Atoms
RocksDB
Test: Random Read
RocksDB
Test: Read While Writing
Speedb
Test: Random Read
Speedb
Test: Read While Writing
Memcached
Set To Get Ratio: 1:100
Apache IoTDB
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
Apache IoTDB
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100
ClickHouse
100M Rows Hits Dataset, First Run / Cold Cache
ClickHouse
100M Rows Hits Dataset, Second Run
ClickHouse
100M Rows Hits Dataset, Third Run
John The Ripper
Test: Blowfish
John The Ripper
Test: bcrypt
John The Ripper
Test: WPA PSK
Liquid-DSP
Threads: 1 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 1 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 1 - Buffer Length: 256 - Filter Length: 512
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 64 - Buffer Length: 256 - Filter Length: 512
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 128 - Buffer Length: 256 - Filter Length: 512
Liquid-DSP
Threads: 256 - Buffer Length: 256 - Filter Length: 32
Liquid-DSP
Threads: 256 - Buffer Length: 256 - Filter Length: 57
Liquid-DSP
Threads: 256 - Buffer Length: 256 - Filter Length: 512
Numpy Benchmark
OpenSSL
Algorithm: RSA4096
Llamafile
Test: llava-v1.5-7b-q4 - Acceleration: CPU
Llamafile
Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU
NAS Parallel Benchmarks
Test / Class: LU.C
NAS Parallel Benchmarks
Test / Class: SP.C
NAS Parallel Benchmarks
Test / Class: IS.D
NAS Parallel Benchmarks
Test / Class: MG.C
NAS Parallel Benchmarks
Test / Class: CG.C
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Only
OpenSSL
Algorithm: RSA4096
BRL-CAD
VGR Performance Metric
LULESH
Apache IoTDB
Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400
Apache IoTDB
Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100
Pennant
Test: leblancbig
ONNX Runtime
Model: yolov4 - Device: CPU - Executor: Standard
ONNX Runtime
Model: yolov4 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: super-resolution-10 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: bertsquad-12 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: GPT-2 - Device: CPU - Executor: Standard
ONNX Runtime
Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard
ONNX Runtime
Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel
ONNX Runtime
Model: T5 Encoder - Device: CPU - Executor: Standard
PyBench
Total For Average Test Times
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency
PostgreSQL
Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU
OSPRay Studio
Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU
OSPRay Studio
Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU
OpenVINO
Model: Face Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU
OpenVINO
Model: Person Detection FP16 - Device: CPU
OpenVINO
Model: Weld Porosity Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Vehicle Detection FP16-INT8 - Device: CPU
OpenVINO
Model: Person Vehicle Bike Detection FP16 - Device: CPU
OpenVINO
Model: Machine Translation EN To DE FP16 - Device: CPU
OpenVINO
Model: Face Detection Retail FP16-INT8 - Device: CPU
OpenVINO
Model: Handwritten English Recognition FP16-INT8 - Device: CPU
OpenVINO
Model: Road Segmentation ADAS FP16-INT8 - Device: CPU
OpenVINO
Model: Person Re-Identification Retail FP16 - Device: CPU
OpenVINO
Model: Noise Suppression Poconet-Like FP16 - Device: CPU
oneDNN
Harness: Deconvolution Batch shapes_3d - Engine: CPU
Timed Linux Kernel Compilation
Build: defconfig
Timed Linux Kernel Compilation
Build: allmodconfig
Timed FFmpeg Compilation
Time To Compile
Timed Godot Game Engine Compilation
Time To Compile
Timed Node.js Compilation
Time To Compile
Timed Gem5 Compilation
Time To Compile
Timed LLVM Compilation
Build System: Ninja
Timed Mesa Compilation
Time To Compile
Timed ImageMagick Compilation
Time To Compile
WRF
Input: conus 2.5km
m-queens
Time To Solve
CloverLeaf
Input: clover_bm64_short
CloverLeaf
Input: clover_bm16
GPAW
Input: Carbon Nanotube
NWChem
Input: C240 Buckyball
Xcompact3d Incompact3d
Input: input.i3d 193 Cells Per Direction
Xcompact3d Incompact3d
Input: X3D-benchmarking input.i3d
OpenFOAM
Input: drivaerFastback, Small Mesh Size - Mesh Time
OpenFOAM
Input: drivaerFastback, Small Mesh Size - Execution Time
OpenFOAM
Input: drivaerFastback, Medium Mesh Size - Execution Time
OpenRadioss
Model: Cell Phone Drop Test
OpenRadioss
Model: INIVOL and Fluid Structure Interaction Drop Container
OpenRadioss
Model: Chrysler Neon 1M
Blender
Blend File: BMW27 - Compute: CPU-Only
Blender
Blend File: Classroom - Compute: CPU-Only
Blender
Blend File: Fishy Cat - Compute: CPU-Only
Blender
Blend File: Pabellon Barcelona - Compute: CPU-Only
Blender
Blend File: Barbershop - Compute: CPU-Only
Blender
Blend File: Junkshop - Compute: CPU-Only
Appleseed
Scene: Disney Material
Parallel BZIP2 Compression
FreeBSD-13.0-RELEASE-amd64-memstick.img Compression
libavif avifenc
Encoder Speed: 0
libavif avifenc
Encoder Speed: 2
libavif avifenc
Encoder Speed: 6
libavif avifenc
Encoder Speed: 6, Lossless
libavif avifenc
Encoder Speed: 10, Lossless
GNU Octave Benchmark
Helsing
Digit Range: 14 digit
Primesieve
Length: 1e12
Primesieve
Length: 1e13
Y-Cruncher
Pi Digits To Calculate: 500M
Y-Cruncher
Pi Digits To Calculate: 1B
QMCPACK
Input: Li2_STO_ae
Phoronix Test Suite v10.8.5