mltestresults2

wsl testing on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2208042-NE-MLTESTRES66

Jump To Table - Results

rtx3080_1290k_2

Processor: Intel Core i9-12900K (12 Cores / 24 Threads), Memory: 16GB, Disk: 4 x 275GB Virtual Disk, Graphics: NVIDIA GeForce RTX 3080 10GB

OS: Ubuntu 20.04, Kernel: 5.10.16.3-microsoft-standard-WSL2 (x86_64), Display Server: Wayland, OpenGL: 3.3 Mesa 21.2.6, Vulkan: 1.1.182, Compiler: GCC 9.4.0 + CUDA 11.7, File-System: ext4, Screen Resolution: 1920x1080, System Layer: wsl

Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xffffffff
Python Notes: Python 3.9.12
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial timeseries data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

ONNX Runtime

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite

OpenVINO

Numenta Anomaly Benchmark

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

oneDNN

R Benchmark

This test is a quick-running survey of general R performance Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

oneDNN

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

oneDNN

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

oneDNN

ECP-CANDLE

Scikit-Learn

Scikit-learn is a Python module for machine learning Learn more via the OpenBenchmarking.org test page.

oneDNN

Numenta Anomaly Benchmark

oneDNN

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Tensorflow

This is a benchmark of the Tensorflow deep learning framework using the CIFAR10 data set. Learn more via the OpenBenchmarking.org test page.

Build: Cifar10

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: AttributeError: module 'tensorflow' has no attribute 'app'

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

Benchmark: scikit_qda

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: TypeError: load_all() missing 1 required positional argument: 'Loader'

Benchmark: scikit_ica

Benchmark: scikit_linearridgeregression

Benchmark: scikit_svm

OpenVINO

Model: Person Detection 0106 FP32 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Model: Face Detection 0106 FP16 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Model: Age Gender Recognition Retail 0013 FP32 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Model: Person Detection 0106 FP16 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Model: Age Gender Recognition Retail 0013 FP16 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Model: Face Detection 0106 FP32 - Device: Intel GPU

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "yolov4/yolov4.onnx" failed: No such file or directory

Model: yolov4 - Device: CPU - Executor: Standard

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Model: AlexNet - Acceleration: CPU - Iterations: 1000

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 200

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: GoogleNet - Acceleration: CPU - Iterations: 200

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: CPU - Iterations: 100

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

rtx3080_1290k_2: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

rtx3080_1290k_2: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

106 Results Shown

NCNN:
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
ONNX Runtime
Mobile Neural Network:
inception-v3
mobilenet-v1-1.0
MobileNetV2_224
SqueezeNetV1.0
resnet-v2-50
squeezenetv1.1
mobilenetV3
LeelaChessZero
AI Benchmark Alpha:
Device AI Score
Device Training Score
Device Inference Score
TensorFlow Lite
PlaidML
NCNN:
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
Numenta Anomaly Benchmark
ECP-CANDLE
TNN
ECP-CANDLE
ONNX Runtime:
fcn-resnet101-11 - CPU - Parallel
ArcFace ResNet-100 - CPU - Parallel
bertsquad-12 - CPU - Parallel
GPT-2 - CPU - Parallel
ArcFace ResNet-100 - CPU - Standard
GPT-2 - CPU - Standard
bertsquad-12 - CPU - Standard
super-resolution-10 - CPU - Parallel
super-resolution-10 - CPU - Standard
Numpy Benchmark
PlaidML
oneDNN:
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
IP Shapes 1D - f32 - CPU
Numenta Anomaly Benchmark
OpenVINO:
Face Detection 0106 FP16 - CPU:
ms
FPS
Person Detection 0106 FP16 - CPU:
ms
FPS
Face Detection 0106 FP32 - CPU:
ms
FPS
Person Detection 0106 FP32 - CPU:
ms
FPS
TensorFlow Lite:
Inception V4
Inception ResNet V2
NASNet Mobile
Mobilenet Float
Mobilenet Quant
OpenVINO:
Age Gender Recognition Retail 0013 FP16 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP32 - CPU:
ms
FPS
Numenta Anomaly Benchmark
DeepSpeech
oneDNN:
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
R Benchmark
Numenta Anomaly Benchmark
oneDNN:
Matrix Multiply Batch Shapes Transformer - f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
RNNoise
TNN
oneDNN
TNN
oneDNN:
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
ECP-CANDLE
Scikit-Learn
oneDNN:
Convolution Batch Shapes Auto - f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Numenta Anomaly Benchmark
oneDNN:
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
TNN

rtx3080_1290k_2

Processor: Intel Core i9-12900K (12 Cores / 24 Threads), Memory: 16GB, Disk: 4 x 275GB Virtual Disk, Graphics: NVIDIA GeForce RTX 3080 10GB

Testing initiated at 4 August 2022 15:57 by user allanlago.

mltestresults2

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

rtx3080_1290k_2

NCNN

ONNX Runtime

Mobile Neural Network

LeelaChessZero

AI Benchmark Alpha

TensorFlow Lite

PlaidML

NCNN

Numenta Anomaly Benchmark

ECP-CANDLE

TNN

ECP-CANDLE

ONNX Runtime

Numpy Benchmark

PlaidML

oneDNN

Numenta Anomaly Benchmark

OpenVINO

TensorFlow Lite

OpenVINO

Numenta Anomaly Benchmark

DeepSpeech

oneDNN

R Benchmark

Numenta Anomaly Benchmark

oneDNN

RNNoise

TNN

oneDNN

TNN

oneDNN

ECP-CANDLE

Scikit-Learn

oneDNN

Numenta Anomaly Benchmark

oneDNN

TNN

Tensorflow

Mlpack Benchmark

OpenVINO

ONNX Runtime

Caffe

oneDNN

Caffe

oneDNN

Caffe

oneDNN

Caffe

106 Results Shown

rtx3080_1290k_2