24.03.13.Pop.2204.ML.test1

AMD Ryzen 9 7950X 16-Core testing with a ASUS ProArt X670E-CREATOR WIFI (1710 BIOS) and Zotac NVIDIA GeForce RTX 4070 Ti 12GB on Pop 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2403157-NE-240313POP28

Jump To Table - Results

Initial test 1 No water cool

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: ASUS ProArt X670E-CREATOR WIFI (1710 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16 GB DDR5-4800MT/s G Skill F5-6000J3636F16G, Disk: 1000GB PNY CS2130 1TB SSD, Graphics: Zotac NVIDIA GeForce RTX 4070 Ti 12GB, Audio: NVIDIA Device 22bc, Monitor: 2 x DELL 2001FP, Network: Intel I225-V + Aquantia AQtion AQC113CS NBase-T/IEEE + MEDIATEK MT7922 802.11ax PCI

OS: Pop 22.04, Kernel: 6.6.10-76060610-generic (x86_64), Desktop: GNOME Shell 42.5, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 550.54.14, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.89, Vulkan: 1.3.277, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 3200x1200

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601206
Graphics Notes: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.31.00.3b
OpenCL Notes: GPU Compute Cores: 7680
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

TensorFlow

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

TensorFlow

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. This MNN test profile is building the OpenMP / CPU threaded version for processor benchmarking and not any GPU-accelerated test. MNN does allow making use of AVX-512 extensions. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: GPU - Batch Size: 512 - Model: VGG-16

Initial test 1 No water cool: The test quit with a non-zero exit status. E: Fatal Python error: Segmentation fault

PyTorch

TensorFlow

PyTorch

TensorFlow

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenVINO

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TensorFlow

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenVINO

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow

Device: GPU - Batch Size: 512 - Model: ResNet-50

Initial test 1 No water cool: The test quit with a non-zero exit status. E: Fatal Python error: Segmentation fault

Numenta Anomaly Benchmark

TensorFlow

oneDNN

PyTorch

Neural Magic DeepSparse

spaCy

The spaCy library is an open-source solution for advanced neural language processing (NLP). The spaCy library leverages Python and is a leading neural language processing solution. This test profile times the spaCy CPU performance with various models. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

TensorFlow

Device: CPU - Batch Size: 512 - Model: VGG-16

Initial test 1 No water cool: The test quit with a non-zero exit status. E: Fatal Python error: Segmentation fault

oneDNN

PyTorch

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

Initial test 1 No water cool: The test quit with a non-zero exit status. E: Fatal Python error: Segmentation fault

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

TensorFlow

Numenta Anomaly Benchmark

TensorFlow

Numenta Anomaly Benchmark

TensorFlow

PyTorch

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

TensorFlow

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Numenta Anomaly Benchmark

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Numenta Anomaly Benchmark

PyTorch

TensorFlow

oneDNN

PyTorch

SHOC Scalable HeterOgeneous Computing

oneDNN

SHOC Scalable HeterOgeneous Computing

PyTorch

Whisper.cpp

Whisper.cpp is a port of OpenAI's Whisper model in C/C++. Whisper.cpp is developed by Georgi Gerganov for transcribing WAV audio files to text / speech recognition. Whisper.cpp supports ARM NEON, x86 AVX, and other advanced CPU features. Learn more via the OpenBenchmarking.org test page.

TensorFlow

SHOC Scalable HeterOgeneous Computing

oneDNN

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing

Whisper.cpp

SHOC Scalable HeterOgeneous Computing

Whisper.cpp

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Tree

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

SHOC Scalable HeterOgeneous Computing

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Plot Non-Negative Matrix Factorization

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Kernel PCA Solvers / Time vs. N Samples

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Text Vectorizers

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Kernel PCA Solvers / Time vs. N Components

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Sample Without Replacement

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: RCV1 Logreg Convergencet

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Parallel Pairwise

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Hist Gradient Boosting

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: SGD Regression

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Neighbors

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Feature Expansions

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Incremental PCA

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Isolation Forest

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: LocalOutlierFactor

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Hist Gradient Boosting Adult

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Hist Gradient Boosting Higgs Boson

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: GLM

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Sparse Random Projections / 100 Iterations

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Hist Gradient Boosting Categorical Only

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Polynomial Kernel Approximation

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: 20 Newsgroups / Logistic Regression

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Hist Gradient Boosting Threading

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Covertype Dataset Benchmark

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Isotonic / Logistic

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot OMP vs. LARS

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Hierarchical

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Fast KMeans

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Lasso Path

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: MNIST Dataset

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Sparsify

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Glmnet

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Lasso

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: SAGA

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Isotonic / Perturbed Logarithm

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Isotonic / Pathological

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: TSNE MNIST Dataset

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Ward

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Benchmark: Plot Singular Value Decomposition

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/libblas.so.3: undefined symbol: gotoblas

Benchmark: SGDOneClassSVM

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ImportError: /lib/x86_64-linux-gnu/liblapack.so.3: undefined symbol: gotoblas

Llamafile

Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.

Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./run-mistral: line 2: ./mistral-7b-instruct-v0.2.Q8_0.llamafile: No such file or directory

Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./run-wizardcoder: line 2: ./wizardcoder-python-34b-v1.0.Q6_K.llamafile: No such file or directory

Test: llava-v1.5-7b-q4 - Acceleration: CPU

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./run-llava: line 2: ./llava-v1.5-7b-q4.llamafile: No such file or directory

Llama.cpp

Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

Model: llama-2-70b-chat.Q5_0.gguf

Initial test 1 No water cool: The test quit with a non-zero exit status. E: main: error: unable to load model

Model: llama-2-13b.Q4_0.gguf

Initial test 1 No water cool: The test quit with a non-zero exit status. E: main: error: unable to load model

Model: llama-2-7b.Q4_0.gguf

Initial test 1 No water cool: The test quit with a non-zero exit status. E: main: error: unable to load model

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.

Model: GPT-2 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: super-resolution-10 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: super-resolution-10 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: bertsquad-12 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: bertsquad-12 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: T5 Encoder - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: T5 Encoder - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: yolov4 - Device: CPU - Executor: Standard

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: yolov4 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: GPT-2 - Device: CPU - Executor: Parallel

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Model: GoogleNet - Acceleration: CPU - Iterations: 1000

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: GoogleNet - Acceleration: CPU - Iterations: 200

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: GoogleNet - Acceleration: CPU - Iterations: 100

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: CPU - Iterations: 1000

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: CPU - Iterations: 200

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

Model: AlexNet - Acceleration: CPU - Iterations: 100

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./caffe: 3: ./tools/caffe: not found

R Benchmark

This test is a quick-running survey of general R performance Learn more via the OpenBenchmarking.org test page.

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ERROR: Rscript is not found on the system!

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Backend: BLAS

Initial test 1 No water cool: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory

261 Results Shown

TensorFlow:
GPU - 256 - VGG-16
GPU - 256 - ResNet-50
GPU - 64 - VGG-16
GPU - 512 - GoogLeNet
GPU - 32 - VGG-16
GPU - 256 - GoogLeNet
SHOC Scalable HeterOgeneous Computing
TensorFlow:
GPU - 512 - AlexNet
CPU - 256 - VGG-16
GPU - 64 - ResNet-50
GPU - 16 - VGG-16
GPU - 256 - AlexNet
CPU - 256 - ResNet-50
GPU - 32 - ResNet-50
CPU - 512 - GoogLeNet
GPU - 64 - GoogLeNet
CPU - 64 - VGG-16
GPU - 16 - ResNet-50
Numenta Anomaly Benchmark
AI Benchmark Alpha:
Device AI Score
Device Training Score
Device Inference Score
TensorFlow:
CPU - 256 - GoogLeNet
GPU - 32 - GoogLeNet
CPU - 32 - VGG-16
GPU - 64 - AlexNet
CPU - 64 - ResNet-50
PyTorch:
CPU - 512 - Efficientnet_v2_l
CPU - 16 - Efficientnet_v2_l
CPU - 32 - Efficientnet_v2_l
CPU - 256 - Efficientnet_v2_l
CPU - 64 - Efficientnet_v2_l
OpenCV
OpenVINO:
Face Detection FP16 - CPU:
ms
FPS
TensorFlow
TNN
Mobile Neural Network:
inception-v3
mobilenet-v1-1.0
MobileNetV2_224
SqueezeNetV1.0
resnet-v2-50
squeezenetv1.1
mobilenetV3
nasnet
TensorFlow
Numpy Benchmark
TensorFlow
PyTorch:
CPU - 256 - ResNet-152
CPU - 64 - ResNet-152
CPU - 512 - ResNet-152
CPU - 32 - ResNet-152
CPU - 16 - ResNet-152
TensorFlow:
GPU - 32 - AlexNet
CPU - 32 - ResNet-50
PyTorch
TensorFlow
oneDNN:
Recurrent Neural Network Training - CPU
Recurrent Neural Network Inference - CPU
TensorFlow
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering - Synchronous Single-Stream:
ms/batch
items/sec
Mlpack Benchmark
OpenVINO:
Face Detection FP16-INT8 - CPU:
ms
FPS
NCNN:
CPU - FastestDet
CPU - vision_transformer
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
TensorFlow
NCNN:
Vulkan GPU - FastestDet
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
OpenVINO:
Machine Translation EN To DE FP16 - CPU:
ms
FPS
Person Detection FP16 - CPU:
ms
FPS
Person Detection FP32 - CPU:
ms
FPS
TensorFlow Lite
OpenVINO:
Noise Suppression Poconet-Like FP16 - CPU:
ms
FPS
TensorFlow Lite
OpenVINO:
Road Segmentation ADAS FP16-INT8 - CPU:
ms
FPS
TensorFlow Lite:
NASNet Mobile
Mobilenet Float
SqueezeNet
OpenVINO:
Person Vehicle Bike Detection FP16 - CPU:
ms
FPS
TensorFlow Lite
OpenVINO:
Road Segmentation ADAS FP16 - CPU:
ms
FPS
Person Re-Identification Retail FP16 - CPU:
ms
FPS
Handwritten English Recognition FP16-INT8 - CPU:
ms
FPS
Vehicle Detection FP16-INT8 - CPU:
ms
FPS
Face Detection Retail FP16-INT8 - CPU:
ms
FPS
Handwritten English Recognition FP16 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16 - CPU:
ms
FPS
Vehicle Detection FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16 - CPU:
ms
FPS
Face Detection Retail FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16-INT8 - CPU:
ms
FPS
TensorFlow
Numenta Anomaly Benchmark
TensorFlow
oneDNN
PyTorch:
CPU - 1 - ResNet-152
CPU - 512 - ResNet-50
CPU - 256 - ResNet-50
CPU - 64 - ResNet-50
CPU - 32 - ResNet-50
CPU - 16 - ResNet-50
Neural Magic DeepSparse:
NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
spaCy:
en_core_web_trf
en_core_web_lg
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
ms/batch
items/sec
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO - Synchronous Single-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Synchronous Single-Stream:
ms/batch
items/sec
ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
ms/batch
items/sec
DeepSpeech
Neural Magic DeepSparse:
ResNet-50, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
oneDNN
PyTorch:
NVIDIA CUDA GPU - 64 - Efficientnet_v2_l
NVIDIA CUDA GPU - 512 - Efficientnet_v2_l
NVIDIA CUDA GPU - 32 - Efficientnet_v2_l
NVIDIA CUDA GPU - 16 - Efficientnet_v2_l
NVIDIA CUDA GPU - 256 - Efficientnet_v2_l
Mlpack Benchmark
TensorFlow
Mlpack Benchmark
TensorFlow
Numenta Anomaly Benchmark
TensorFlow
Numenta Anomaly Benchmark
TensorFlow
PyTorch:
CPU - 1 - ResNet-50
NVIDIA CUDA GPU - 256 - ResNet-152
NVIDIA CUDA GPU - 16 - ResNet-152
NVIDIA CUDA GPU - 32 - ResNet-152
NVIDIA CUDA GPU - 64 - ResNet-152
NVIDIA CUDA GPU - 512 - ResNet-152
NVIDIA CUDA GPU - 1 - Efficientnet_v2_l
Mlpack Benchmark
TensorFlow:
CPU - 32 - AlexNet
CPU - 16 - GoogLeNet
RNNoise
TensorFlow
Numenta Anomaly Benchmark
TNN:
CPU - MobileNet v2
CPU - SqueezeNet v1.1
TensorFlow:
CPU - 1 - ResNet-50
GPU - 1 - GoogLeNet
Numenta Anomaly Benchmark
PyTorch
TensorFlow:
GPU - 1 - AlexNet
CPU - 1 - AlexNet
oneDNN
PyTorch:
NVIDIA CUDA GPU - 256 - ResNet-50
NVIDIA CUDA GPU - 64 - ResNet-50
NVIDIA CUDA GPU - 16 - ResNet-50
NVIDIA CUDA GPU - 32 - ResNet-50
NVIDIA CUDA GPU - 512 - ResNet-50
SHOC Scalable HeterOgeneous Computing
oneDNN
SHOC Scalable HeterOgeneous Computing
PyTorch
Whisper.cpp
TensorFlow
SHOC Scalable HeterOgeneous Computing
oneDNN
TNN
SHOC Scalable HeterOgeneous Computing
Whisper.cpp
SHOC Scalable HeterOgeneous Computing:
OpenCL - FFT SP
OpenCL - Triad
OpenCL - Reduction
OpenCL - Bus Speed Download
Whisper.cpp
SHOC Scalable HeterOgeneous Computing

Initial test 1 No water cool

Testing initiated at 13 March 2024 12:59 by user root.

24.03.13.Pop.2204.ML.test1

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

Initial test 1 No water cool

TensorFlow

SHOC Scalable HeterOgeneous Computing

TensorFlow

Numenta Anomaly Benchmark

AI Benchmark Alpha

TensorFlow

PyTorch

OpenCV

OpenVINO

TensorFlow

TNN

Mobile Neural Network

TensorFlow

Numpy Benchmark

TensorFlow

PyTorch

TensorFlow

PyTorch

TensorFlow

oneDNN

TensorFlow

Neural Magic DeepSparse

Mlpack Benchmark

OpenVINO

NCNN

TensorFlow

NCNN

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow Lite

OpenVINO

TensorFlow

Numenta Anomaly Benchmark

TensorFlow

oneDNN

PyTorch

Neural Magic DeepSparse

spaCy

Neural Magic DeepSparse

DeepSpeech

Neural Magic DeepSparse

TensorFlow

oneDNN

PyTorch

Mlpack Benchmark

TensorFlow

Mlpack Benchmark

TensorFlow

Numenta Anomaly Benchmark

TensorFlow

Numenta Anomaly Benchmark

TensorFlow

PyTorch

Mlpack Benchmark

TensorFlow

RNNoise

TensorFlow

Numenta Anomaly Benchmark

TNN

TensorFlow

Numenta Anomaly Benchmark

PyTorch

TensorFlow

oneDNN

PyTorch

SHOC Scalable HeterOgeneous Computing

oneDNN

SHOC Scalable HeterOgeneous Computing