HDVR4-A8.9600-1

AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C+6G testing with a ASRock A320M-HDV R4.0 (P2.00 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312201-HERT-HDVR4A802

Jump To Table - Results

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C

Processor: AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C+6G @ 3.10GHz (2 Cores / 4 Threads), Motherboard: ASRock A320M-HDV R4.0 (P2.00 BIOS), Chipset: AMD 15h, Memory: 3584MB, Disk: 1000GB Western Digital WDS100T2B0A, Graphics: llvmpipe, Audio: AMD Kabini HDMI/DP, Network: Realtek RTL8111/8168/8411

OS: Ubuntu 20.04, Kernel: 5.15.0-89-generic (x86_64), Desktop: GNOME Shell 3.36.9, Display Server: X Server 1.20.13, OpenGL: 4.5 Mesa 21.2.6 (LLVM 12.0.0 256 bits), Vulkan: 1.1.182, Compiler: GCC 9.4.0, File-System: ext4, Screen Resolution: 1368x768

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-9QDOt0/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x600611a
Python Notes: Python 3.8.10
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Whisper.cpp

Whisper.cpp is a port of OpenAI's Whisper model in C/C++. Whisper.cpp is developed by Georgi Gerganov for transcribing WAV audio files to text / speech recognition. Whisper.cpp supports ARM NEON, x86 AVX, and other advanced CPU features. Learn more via the OpenBenchmarking.org test page.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Whisper.cpp

TensorFlow

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: SGDOneClassSVM

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Benchmark: Isolation Forest

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Plot Fast KMeans

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

TensorFlow

PyTorch

TensorFlow

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Whisper.cpp

TensorFlow

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

TensorFlow

PyTorch

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Isotonic / Perturbed Logarithm

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

TensorFlow

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Caffe

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. This MNN test profile is building the OpenMP / CPU threaded version for processor benchmarking and not any GPU-accelerated test. MNN does allow making use of AVX-512 extensions. Learn more via the OpenBenchmarking.org test page.

PyTorch

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

PyTorch

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

TensorFlow

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

PyTorch

TensorFlow

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

Benchmark: scikit_qda

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Benchmark: scikit_linearridgeregression

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Isotonic / Logistic

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Caffe

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Isotonic / Pathological

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

TensorFlow

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

PyTorch

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

TensorFlow

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Plot Non-Negative Matrix Factorization

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: KeyError:

Neural Magic DeepSparse

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Caffe

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Caffe

oneDNN

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Model Zoo. Learn more via the OpenBenchmarking.org test page.

Result

Inference Time Cost (ms)

Numenta Anomaly Benchmark

ONNX Runtime

Result

Inference Time Cost (ms)

Caffe

ONNX Runtime

Result

Inference Time Cost (ms)

TensorFlow

Device: CPU - Batch Size: 512 - Model: VGG-16

ONNX Runtime

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Result

Inference Time Cost (ms)

Neural Magic DeepSparse

OpenVINO

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Device: CPU - Batch Size: 256 - Model: ResNet-50

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Neural Magic DeepSparse

OpenVINO

TensorFlow Lite

OpenVINO

R Benchmark

This test is a quick-running survey of general R performance Learn more via the OpenBenchmarking.org test page.

OpenVINO

TensorFlow Lite

Neural Magic DeepSparse

OpenVINO

TensorFlow Lite

OpenVINO

Neural Magic DeepSparse

Numenta Anomaly Benchmark

TensorFlow

Device: CPU - Batch Size: 512 - Model: GoogLeNet

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

TensorFlow

Device: CPU - Batch Size: 256 - Model: VGG-16

Neural Magic DeepSparse

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: RCV1 Logreg Convergencet

oneDNN

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

oneDNN

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Plot Parallel Pairwise

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: numpy.core._exceptions.MemoryError: Unable to allocate 74.5 GiB for an array with shape (100000, 100000) and data type float64

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. E: AttributeError: module 'numpy' has no attribute 'typeDict'

Scikit-Learn

Scikit-learn is a Python module for machine learning built on NumPy, SciPy, and is BSD-licensed. Learn more via the OpenBenchmarking.org test page.

Benchmark: Glmnet

spaCy

The spaCy library is an open-source solution for advanced neural language processing (NLP). The spaCy library leverages Python and is a leading neural language processing solution. This test profile times the spaCy CPU performance with various models. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

Benchmark: P1B2

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. E: ImportError: initialization failed

Benchmark: P3B1

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. E: ImportError: initialization failed

Benchmark: P3B2

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. E: ImportError: initialization failed

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "yolov4/yolov4.onnx" failed: No such file or directory

Model: yolov4 - Device: CPU - Executor: Parallel

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "yolov4/yolov4.onnx" failed: No such file or directory

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

272 Results Shown

Whisper.cpp
TensorFlow
Scikit-Learn
TensorFlow
Whisper.cpp
TensorFlow:
CPU - 64 - ResNet-50
CPU - 32 - VGG-16
PyTorch
TensorFlow
PyTorch
TensorFlow
PyTorch:
CPU - 64 - Efficientnet_v2_l
CPU - 256 - Efficientnet_v2_l
CPU - 512 - Efficientnet_v2_l
Scikit-Learn:
SAGA
GLM
Whisper.cpp
TensorFlow:
CPU - 32 - ResNet-50
CPU - 256 - AlexNet
Caffe
PyTorch:
CPU - 32 - ResNet-50
CPU - 16 - ResNet-152
CPU - 32 - ResNet-152
Scikit-Learn
PyTorch:
CPU - 64 - ResNet-152
CPU - 256 - ResNet-152
CPU - 512 - ResNet-152
Scikit-Learn
PlaidML
TensorFlow
PyTorch
PlaidML
Scikit-Learn
TensorFlow
Scikit-Learn:
Plot Lasso Path
Hist Gradient Boosting Threading
Caffe
PyTorch
Scikit-Learn
Numenta Anomaly Benchmark
Scikit-Learn:
Plot Hierarchical
Kernel PCA Solvers / Time vs. N Samples
Numenta Anomaly Benchmark
Scikit-Learn
Mobile Neural Network:
inception-v3
mobilenet-v1-1.0
MobileNetV2_224
SqueezeNetV1.0
resnet-v2-50
squeezenetv1.1
mobilenetV3
nasnet
PyTorch
Scikit-Learn
PyTorch:
CPU - 64 - ResNet-50
CPU - 512 - ResNet-50
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering - Synchronous Single-Stream:
ms/batch
items/sec
TensorFlow
Scikit-Learn
NCNN:
Vulkan GPU - FastestDet
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
CPU - FastestDet
CPU - vision_transformer
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
PyTorch
TensorFlow
Scikit-Learn
TNN
Scikit-Learn:
Tree
Hist Gradient Boosting
OpenCV
Scikit-Learn
Numpy Benchmark
Scikit-Learn:
SGD Regression
Plot OMP vs. LARS
Caffe
Scikit-Learn:
Sample Without Replacement
Kernel PCA Solvers / Time vs. N Components
TensorFlow:
CPU - 16 - GoogLeNet
CPU - 32 - AlexNet
LeelaChessZero
DeepSpeech
Numenta Anomaly Benchmark
OpenVINO:
Handwritten English Recognition FP16 - CPU:
ms
FPS
Face Detection Retail FP16-INT8 - CPU:
ms
FPS
TensorFlow Lite
Scikit-Learn
PyTorch
oneDNN:
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Numenta Anomaly Benchmark
Scikit-Learn:
Plot Ward
Hist Gradient Boosting Adult
TensorFlow
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
ms/batch
items/sec
Scikit-Learn
Caffe
Mlpack Benchmark
Scikit-Learn
Caffe
oneDNN:
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - f32 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Mlpack Benchmark
Scikit-Learn:
20 Newsgroups / Logistic Regression
Plot Incremental PCA
ONNX Runtime:
super-resolution-10 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
Numenta Anomaly Benchmark
ONNX Runtime:
fcn-resnet101-11 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
Caffe
ONNX Runtime:
fcn-resnet101-11 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
bertsquad-12 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
bertsquad-12 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
ArcFace ResNet-100 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
Faster R-CNN R-50-FPN-int8 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
GPT-2 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
Faster R-CNN R-50-FPN-int8 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
GPT-2 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
ArcFace ResNet-100 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
CaffeNet 12-int8 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
ResNet50 v1-12-int8 - CPU - Parallel:
Inference Time Cost (ms)
Inferences Per Second
CaffeNet 12-int8 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
ResNet50 v1-12-int8 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
super-resolution-10 - CPU - Standard:
Inference Time Cost (ms)
Inferences Per Second
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Handwritten English Recognition FP16-INT8 - CPU:
ms
FPS
Scikit-Learn
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
OpenVINO:
Face Detection FP16 - CPU:
ms
FPS
Face Detection FP16-INT8 - CPU:
ms
FPS
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Person Detection FP32 - CPU:
ms
FPS
Person Detection FP16 - CPU:
ms
FPS
Machine Translation EN To DE FP16 - CPU:
ms
FPS
TensorFlow Lite:
Inception V4
Inception ResNet V2
OpenVINO:
Road Segmentation ADAS FP16-INT8 - CPU:
ms
FPS
Person Vehicle Bike Detection FP16 - CPU:
ms
FPS
Road Segmentation ADAS FP16 - CPU:
ms
FPS
Vehicle Detection FP16-INT8 - CPU:
ms
FPS
R Benchmark
OpenVINO:
Vehicle Detection FP16 - CPU:
ms
FPS
TensorFlow Lite
Neural Magic DeepSparse:
NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
ms/batch
items/sec
OpenVINO:
Weld Porosity Detection FP16-INT8 - CPU:
ms
FPS
Weld Porosity Detection FP16 - CPU:
ms
FPS
TensorFlow Lite:
SqueezeNet
Mobilenet Quant
OpenVINO:
Face Detection Retail FP16 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16 - CPU:
ms
FPS
Neural Magic DeepSparse:
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
ms/batch
items/sec
Numenta Anomaly Benchmark
TNN
Neural Magic DeepSparse:
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO - Synchronous Single-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Synchronous Single-Stream:
ms/batch
items/sec
TNN
Neural Magic DeepSparse:
ResNet-50, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
RNNoise
oneDNN:
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
IP Shapes 1D - f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
TNN
oneDNN:
Convolution Batch Shapes Auto - f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C

Testing initiated at 12 December 2023 15:12 by user hertz.

HDVR4-A8.9600-1

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

llvmpipe - AMD A8-9600 RADEON R7 10 COMPUTE CORES 4C

Whisper.cpp

TensorFlow

Scikit-Learn

TensorFlow

Whisper.cpp

TensorFlow

Scikit-Learn

PyTorch

Scikit-Learn

TensorFlow

PyTorch

TensorFlow

PyTorch

Scikit-Learn

Whisper.cpp

TensorFlow

Caffe

PyTorch

Scikit-Learn

PyTorch

Scikit-Learn

PlaidML

TensorFlow

PyTorch

PlaidML

Scikit-Learn

TensorFlow

Scikit-Learn

Caffe

PyTorch

Scikit-Learn

Numenta Anomaly Benchmark

Scikit-Learn

Numenta Anomaly Benchmark

Scikit-Learn

Mobile Neural Network

PyTorch

Scikit-Learn

PyTorch

Neural Magic DeepSparse

TensorFlow

Scikit-Learn

NCNN

PyTorch

TensorFlow

Scikit-Learn

TNN

Mlpack Benchmark

Scikit-Learn

OpenCV

Scikit-Learn

Numpy Benchmark

Scikit-Learn

Caffe

Scikit-Learn

TensorFlow

LeelaChessZero

DeepSpeech

Numenta Anomaly Benchmark

OpenVINO

TensorFlow Lite

Scikit-Learn

PyTorch

oneDNN

Numenta Anomaly Benchmark

Scikit-Learn

TensorFlow

Scikit-Learn

Neural Magic DeepSparse

Scikit-Learn

Caffe

Mlpack Benchmark

Scikit-Learn