onednn onnx threadripper

AMD Ryzen Threadripper 3990X 64-Core testing with a Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS) and AMD Radeon RX 5700 8GB on Pop 21.10 via the Phoronix Test Suite.

A

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8301039
Python Notes: Python 3.9.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

B

C

D

Processor: AMD Ryzen Threadripper 3990X 64-Core @ 2.90GHz (64 Cores / 128 Threads), Motherboard: Gigabyte TRX40 AORUS PRO WIFI (F4p BIOS), Chipset: AMD Starship/Matisse, Memory: 128GB, Disk: Samsung SSD 970 EVO Plus 500GB, Graphics: AMD Radeon RX 5700 8GB (1750/875MHz), Audio: AMD Navi 10 HDMI Audio, Monitor: DELL P2415Q, Network: Intel I211 + Intel Wi-Fi 6 AX200

OS: Pop 21.10, Kernel: 5.17.0-rc1-sched-core-phx (x86_64), Desktop: GNOME Shell 40.5, Display Server: X Server, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.1), Vulkan: 1.2.182, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 3840x2160

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

ONNX Runtime

Result

Result Confidence

oneDNN

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

ONNX Runtime

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

oneDNN

Result

Result Confidence

ONNX Runtime

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

oneDNN

Result

Result Confidence

Result

Result Confidence

ONNX Runtime

Result

Result Confidence

oneDNN

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

ONNX Runtime

Result

Result Confidence

Result

Result Confidence

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Result

Result Confidence

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

A: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

B: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

D: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Result

Result Confidence

Result

Result Confidence

30 Results Shown

ONNX Runtime
oneDNN
ONNX Runtime
oneDNN:
Convolution Batch Shapes Auto - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
IP Shapes 3D - u8s8f32 - CPU
ONNX Runtime:
ArcFace ResNet-100 - CPU - Standard
fcn-resnet101-11 - CPU - Standard
bertsquad-12 - CPU - Parallel
fcn-resnet101-11 - CPU - Parallel
oneDNN
ONNX Runtime:
yolov4 - CPU - Standard
super-resolution-10 - CPU - Parallel
GPT-2 - CPU - Parallel
oneDNN:
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
ONNX Runtime
oneDNN:
Deconvolution Batch shapes_3d - u8s8f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
ONNX Runtime:
yolov4 - CPU - Parallel
super-resolution-10 - CPU - Standard
oneDNN:
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
IP Shapes 1D - f32 - CPU

A

Testing initiated at 31 March 2022 05:03 by user pts.

B

Testing initiated at 31 March 2022 08:24 by user pts.

C

Testing initiated at 31 March 2022 11:13 by user pts.

D

Testing initiated at 31 March 2022 13:27 by user pts.

onednn onnx threadripper

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

A

B

C

D

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

30 Results Shown

A

B

C

D