installres1

AMD Ryzen 5 5600X 6-Core testing with an ASRock X570 Phantom Gaming-ITX/TB3 (P3.00 BIOS) and NVIDIA GeForce RTX 3090 24GB on Ubuntu 18.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2103310-HA-INSTALLRE08
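
For scripted comparisons, the command above could be wrapped along these lines. This is a minimal Python sketch, not part of the Phoronix Test Suite: the helper names build_command and run_comparison are hypothetical, and it assumes the phoronix-test-suite executable is on PATH. Only the benchmark subcommand and the result ID come from this page.

```python
import shutil
import subprocess
from typing import List, Optional

# Result file ID quoted on this page.
RESULT_ID = "2103310-HA-INSTALLRE08"


def build_command(result_id: str) -> List[str]:
    """Return the argv list for comparing a local system against a result file."""
    return ["phoronix-test-suite", "benchmark", result_id]


def run_comparison(result_id: str) -> Optional[int]:
    """Run the comparison; return the exit code, or None if the tool is missing.

    Assumption: phoronix-test-suite is installed and discoverable on PATH.
    """
    if shutil.which("phoronix-test-suite") is None:
        return None
    return subprocess.run(build_command(result_id)).returncode
```

Calling run_comparison(RESULT_ID) starts the same run as typing the command by hand. The Phoronix Test Suite may still prompt interactively, so this wrapper is not a fully unattended solution.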

NVIDIA GeForce RTX 3090

Processor: AMD Ryzen 5 5600X 6-Core @ 3.70GHz (6 Cores / 12 Threads), Motherboard: ASRock X570 Phantom Gaming-ITX/TB3 (P3.00 BIOS), Chipset: AMD Device 1480, Memory: 64GB, Disk: 2000GB Samsung SSD 970 EVO Plus 2TB + 4001GB Samsung SSD 870 + ProductCode, Graphics: NVIDIA GeForce RTX 3090 24GB, Audio: NVIDIA Device 1aef, Monitor: marantz-AVR, Network: Intel I211 + Intel Device 2723

OS: Ubuntu 18.04, Kernel: 5.4.0-70-generic (x86_64), Desktop: GNOME Shell 3.28.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 460.32.03, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.2.109, Vulkan: 1.2.155, Compiler: GCC 7.5.0 + CUDA 11.2, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 10496
Python Notes: Python 3.8.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Scikit-Learn

Scikit-learn is a Python module for machine learning. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating anomaly detection algorithms in streaming, real-time applications. It comprises more than 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test that measures the time to denoise a sample 26-minute, 16-bit RAW audio file using this noise suppression library. Learn more via the OpenBenchmarking.org test page.

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three-minute audio recording. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark

This test measures general NumPy performance. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing

This is the CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

92 Results Shown

Scikit-Learn
Mlpack Benchmark:
scikit_linearridgeregression
scikit_svm
scikit_qda
scikit_ica
AI Benchmark Alpha:
Device AI Score
Device Training Score
Device Inference Score
Numenta Anomaly Benchmark:
Bayesian Changepoint
Earthgecko Skyline
Windowed Gaussian
Relative Entropy
EXPoSE
ECP-CANDLE:
P3B2
P3B1
P1B2
PlaidML:
No - Inference - ResNet 50 - CPU
No - Inference - VGG16 - CPU
TNN:
CPU - SqueezeNet v1.1
CPU - MobileNet v2
NCNN:
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
Mobile Neural Network:
inception-v3
mobilenet-v1-1.0
MobileNetV2_224
resnet-v2-50
SqueezeNetV1.0
TensorFlow Lite:
Inception ResNet V2
Mobilenet Quant
Mobilenet Float
NASNet Mobile
Inception V4
SqueezeNet
RNNoise
DeepSpeech
Numpy Benchmark
oneDNN:
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
Convolution Batch Shapes Auto - u8s8f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 1D - f32 - CPU
SHOC Scalable HeterOgeneous Computing:
OpenCL - Texture Read Bandwidth
OpenCL - Bus Speed Readback
OpenCL - Bus Speed Download
OpenCL - Max SP Flops
OpenCL - GEMM SGEMM_N
OpenCL - Reduction
OpenCL - MD5 Hash
OpenCL - FFT SP
OpenCL - Triad
OpenCL - S3D
NCNN

NVIDIA GeForce RTX 3090

Testing initiated at 29 March 2021 23:38 by user brw.
