AMD 3D V-Cache Comparison

Tests for a future article.

Phoronix Test Suite v10.8.5

OpenFOAM

Input: Motorbike 60M

Open Porous Media Git

OPM Benchmark: Flow MPI Extra - Threads: 1

LeelaChessZero

Backend: Eigen

Open Porous Media Git

OPM Benchmark: Flow MPI Extra - Threads: 8

Open Porous Media Git

OPM Benchmark: Flow MPI Extra - Threads: 2

Open Porous Media Git

OPM Benchmark: Flow MPI Extra - Threads: 4

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

Open Porous Media Git

OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 8

Open Porous Media Git

OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 1

Open Porous Media Git

OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 2

Open Porous Media Git

OPM Benchmark: Flow MPI Norne - Threads: 8

Open Porous Media Git

OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 4

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

LeelaChessZero

Backend: BLAS

ECP-CANDLE

Benchmark: P3B1

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

Open Porous Media Git

OPM Benchmark: Flow MPI Norne - Threads: 1

Open Porous Media Git

OPM Benchmark: Flow MPI Norne - Threads: 4

ECP-CANDLE

Benchmark: P3B2

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

Open Porous Media Git

OPM Benchmark: Flow MPI Norne - Threads: 2

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

NCNN

Target: CPU - Model: regnety_400m

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN

Target: CPU - Model: yolov4-tiny

NCNN

Target: CPU - Model: resnet50

NCNN

Target: CPU - Model: alexnet

NCNN

Target: CPU - Model: resnet18

NCNN

Target: CPU - Model: vgg16

NCNN

Target: CPU - Model: googlenet

NCNN

Target: CPU - Model: blazeface

NCNN

Target: CPU - Model: efficientnet-b0

NCNN

Target: CPU - Model: mnasnet

NCNN

Target: CPU - Model: shufflenet-v2

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN