tgl onnx onednn

Intel Core i7-1185G7 testing with a Dell 0DXP1F (3.4.0 BIOS) and Intel Xe TGL GT2 3GB on Ubuntu 22.04 via the Phoronix Test Suite.

A

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XWYfV6/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XWYfV6/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x9a - Thermald 2.4.7
Python Notes: Python 3.10.2
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS plus Retpolines IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

B

C

Processor: Intel Core i7-1185G7 @ 4.80GHz (4 Cores / 8 Threads), Motherboard: Dell 0DXP1F (3.4.0 BIOS), Chipset: Intel Tiger Lake-LP, Memory: 16GB, Disk: Micron 2300 NVMe 512GB, Graphics: Intel Xe TGL GT2 3GB (1350MHz), Audio: Realtek ALC289, Network: Intel Wi-Fi 6 AX201

OS: Ubuntu 22.04, Kernel: 5.17.0-051700rc7daily20220309-generic (x86_64), Desktop: GNOME Shell 41.3, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.3.5, Vulkan: 1.2.195, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1920x1200

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

36 Results Shown

ONNX Runtime
oneDNN
ONNX Runtime
oneDNN
ONNX Runtime
oneDNN:
IP Shapes 3D - bf16bf16bf16 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
ONNX Runtime
oneDNN:
Deconvolution Batch shapes_3d - u8s8f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
ONNX Runtime
oneDNN
ONNX Runtime:
yolov4 - CPU - Parallel
yolov4 - CPU - Standard
oneDNN
ONNX Runtime
oneDNN:
Recurrent Neural Network Training - u8s8f32 - CPU
Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
ONNX Runtime:
super-resolution-10 - CPU - Standard
fcn-resnet101-11 - CPU - Standard
bertsquad-12 - CPU - Standard
bertsquad-12 - CPU - Parallel
oneDNN:
Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU
Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU
Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
Recurrent Neural Network Training - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
IP Shapes 1D - bf16bf16bf16 - CPU
IP Shapes 1D - f32 - CPU

A

Testing initiated at 30 March 2022 19:15 by user pts.

B

Testing initiated at 30 March 2022 20:05 by user pts.

C

Testing initiated at 31 March 2022 04:53 by user pts.

tgl onnx onednn

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

A

B

C

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

ONNX Runtime

oneDNN

36 Results Shown

A

B

C