onednn onnx threadripper

AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS) and AMD Radeon RX 5700 8GB on Pop 21.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2203314-PTS-ONEDNNON39&grs&sro&rro.

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Parallel

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

onednn onnx threadripper

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Parallel

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU