ml-benchmark2

AMD Ryzen 9 7950X3D 16-Core testing with a ASUS PRIME X670E-PRO WIFI (1813 BIOS) and MSI NVIDIA GeForce RTX 3060 12GB on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312269-NE-MLBENCHMA88

Jump To Table - Results

ml-benchmark2-12-25-23

Processor: AMD Ryzen 9 7950X3D 16-Core @ 4.20GHz (16 Cores / 32 Threads), Motherboard: ASUS PRIME X670E-PRO WIFI (1813 BIOS), Chipset: AMD Device 14d8, Memory: 128GB, Disk: 4001GB CT4000P3PSSD8 + 1024GB SPCC M.2 PCIe SSD, Graphics: MSI NVIDIA GeForce RTX 3060 12GB, Audio: NVIDIA Device 228e, Monitor: LC27T55, Network: Realtek RTL8125 2.5GbE + MEDIATEK Device 0608

OS: Ubuntu 22.04, Kernel: 6.2.0-39-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Display Driver: NVIDIA 535.129.03, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.2.147, Vulkan: 1.3.242, Compiler: GCC 11.4.0 + CUDA 12.3, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa601206
Graphics Notes: GLAMOR - BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.2f.00.98
OpenCL Notes: GPU Compute Cores: 3584
Python Notes: Python 3.10.12
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Whisper.cpp

Whisper.cpp is a port of OpenAI's Whisper model in C/C++. Whisper.cpp is developed by Georgi Gerganov for transcribing WAV audio files to text / speech recognition. Whisper.cpp supports ARM NEON, x86 AVX, and other advanced CPU features. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. This MNN test profile is building the OpenMP / CPU threaded version for processor benchmarking and not any GPU-accelerated test. MNN does allow making use of AVX-512 extensions. Learn more via the OpenBenchmarking.org test page.

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

spaCy

The spaCy library is an open-source solution for advanced neural language processing (NLP). The spaCy library leverages Python and is a leading neural language processing solution. This test profile times the spaCy CPU performance with various models. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

PyTorch

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

R Benchmark

This test is a quick-running survey of general R performance Learn more via the OpenBenchmarking.org test page.

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Kernel PCA Solvers / Time vs. N Components

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: AttributeError: 'KernelPCA' object has no attribute 'lambdas_'

Benchmark: Kernel PCA Solvers / Time vs. N Samples

Benchmark: Hist Gradient Boosting Categorical Only

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'HistGradientBoostingClassifier' from 'sklearn.ensemble' (/usr/lib/python3/dist-packages/sklearn/ensemble/__init__.py)

Benchmark: Plot Non-Negative Matrix Factorization

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: TypeError: NMF.__init__() got an unexpected keyword argument 'alpha_W'

Benchmark: Plot Polynomial Kernel Approximation

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'PolynomialCountSketch' from 'sklearn.kernel_approximation' (/usr/lib/python3/dist-packages/sklearn/kernel_approximation.py)

Benchmark: Hist Gradient Boosting Higgs Boson

Benchmark: Plot Singular Value Decomposition

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'docstring' from 'matplotlib' (/home/jaelle/.local/lib/python3.10/site-packages/matplotlib/__init__.py)

Benchmark: Hist Gradient Boosting Threading

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'HistGradientBoostingRegressor' from 'sklearn.ensemble' (/usr/lib/python3/dist-packages/sklearn/ensemble/__init__.py)

Benchmark: Hist Gradient Boosting Adult

Benchmark: Sample Without Replacement

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: AttributeError: module 'numpy' has no attribute 'int'.

Benchmark: RCV1 Logreg Convergencet

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ValueError: The loss log_loss is not supported.

Benchmark: Plot Parallel Pairwise

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Benchmark: Hist Gradient Boosting

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'HistGradientBoostingRegressor' from 'sklearn.ensemble' (/usr/lib/python3/dist-packages/sklearn/ensemble/__init__.py)

Benchmark: TSNE MNIST Dataset

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: TypeError: fetch_openml() got an unexpected keyword argument 'parser'

Benchmark: LocalOutlierFactor

Benchmark: Plot OMP vs. LARS

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: TypeError: make_sparse_coded_signal() got an unexpected keyword argument 'data_transposed'

Benchmark: Plot Fast KMeans

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'docstring' from 'matplotlib' (/home/jaelle/.local/lib/python3.10/site-packages/matplotlib/__init__.py)

Benchmark: Isolation Forest

Benchmark: Plot Lasso Path

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'docstring' from 'matplotlib' (/home/jaelle/.local/lib/python3.10/site-packages/matplotlib/__init__.py)

Benchmark: SGDOneClassSVM

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'SGDOneClassSVM' from 'sklearn.linear_model' (/usr/lib/python3/dist-packages/sklearn/linear_model/__init__.py)

Benchmark: SGD Regression

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: TypeError: solve() got an unexpected keyword argument 'sym_pos'

Benchmark: MNIST Dataset

Benchmark: Glmnet

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'glmnet'

Benchmark: SAGA

Benchmark: GLM

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel

Model: super-resolution-10 - Device: CPU - Executor: Standard

Model: super-resolution-10 - Device: CPU - Executor: Parallel

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

Model: bertsquad-12 - Device: CPU - Executor: Standard

Model: bertsquad-12 - Device: CPU - Executor: Parallel

Model: yolov4 - Device: CPU - Executor: Standard

Model: yolov4 - Device: CPU - Executor: Parallel

Model: GPT-2 - Device: CPU - Executor: Standard

Model: GPT-2 - Device: CPU - Executor: Parallel

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

FP16: No - Mode: Inference - Network: VGG16 - Device: CPU

ml-benchmark2-12-25-23: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ImportError: cannot import name 'Iterable' from 'collections' (/usr/lib/python3.10/collections/__init__.py)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

oneDNN

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

Target: OpenCL - Benchmark: Texture Read Bandwidth