MBP M1 Max Machine Learning, sys76-kudu-ML

Apple M1 Max testing with an Apple MacBook Pro and Apple M1 Max on macOS 12.1 via the Phoronix Test Suite.

sys76-kudu-ML: AMD Ryzen 9 5900HX testing with a System76 Kudu (1.07.09RSA1 BIOS) and AMD Cezanne on Pop 21.10 via the Phoronix Test Suite.

MBP M1 Max Machine Learning

Processor: Apple M1 Max (10 Cores), Motherboard: Apple MacBook Pro, Memory: 64GB, Disk: 1859GB, Graphics: Apple M1 Max, Monitor: Color LCD

OS: macOS 12.1, Kernel: 21.2.0 (arm64), OpenCL: OpenCL 1.2 (Nov 13 2021 00:45:09), Compiler: GCC 13.0.0 + Clang 13.0.0, File-System: APFS, Screen Resolution: 3456x2234

Environment Notes: XPC_FLAGS=0x0
Python Notes: Python 2.7.18 + Python 3.8.9

ML Tests

Processor: AMD Ryzen 9 5900HX @ 3.30GHz (8 Cores / 16 Threads), Motherboard: System76 Kudu (1.07.09RSA1 BIOS), Chipset: AMD Renoir/Cezanne, Memory: 16GB, Disk: Samsung SSD 970 EVO Plus 500GB, Graphics: AMD Cezanne (2100/400MHz), Audio: AMD Renoir Radeon HD Audio, Network: Realtek RTL8125 2.5GbE + Intel Wi-Fi 6 AX200

OS: Pop 21.10, Kernel: 5.15.15-76051515-generic (x86_64), Desktop: GNOME Shell 40.5, Display Server: X Server 1.20.13, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.1), Vulkan: 1.2.182, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-ZPT0kp/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa50000c
Graphics Notes: GLAMOR - BAR1 / Visible vRAM Size: 512 MB
Python Notes: Python 3.9.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Caffe

This is a benchmark of the Caffe deep learning framework. It currently supports the AlexNet and GoogLeNet models and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Caffe

PlaidML

This test profile uses the PlaidML deep learning framework, developed by Intel, to run various benchmarks. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE

PlaidML

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for deep neural networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

MBP M1 Max Machine Learning: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onednn: line 6: ./benchdnn: No such file or directory
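
The failure lines in this report share a regular shape: the run identifier, one status sentence per failed attempt, and an optional `E:` message captured from the test's output. A minimal sketch of collapsing such a line into a summary follows, assuming that shape holds for every failure line; `summarize_failure` is a hypothetical helper and not part of the Phoronix Test Suite.

```python
from collections import Counter

# Status sentences as they appear in this report; each failed attempt adds one.
STATUS_SENTENCES = (
    "The test quit with a non-zero exit status.",
    "The test run did not produce a result.",
)


def summarize_failure(line):
    """Split '<run>: <status>... [E: <error>]' into its parts.

    Returns None when the line does not contain a known status sentence.
    """
    run, sep, rest = line.partition(": ")
    if not sep:
        return None

    # Anything after ' E: ' is the captured error text, if present.
    error = None
    if " E: " in rest:
        rest, _, error = rest.partition(" E: ")

    # Count how many times each status sentence was repeated.
    counts = Counter({s: rest.count(s) for s in STATUS_SENTENCES})
    attempts = sum(counts.values())
    if attempts == 0:
        return None

    status = max(counts, key=counts.get)
    return {"run": run, "status": status, "attempts": attempts, "error": error}


if __name__ == "__main__":
    sample = (
        "MBP M1 Max Machine Learning: "
        "The test quit with a non-zero exit status. "
        "The test quit with a non-zero exit status. "
        "The test quit with a non-zero exit status. "
        "E: onednn: line 6: ./benchdnn: No such file or directory"
    )
    print(summarize_failure(sample))
```

Applied to the line above, the summary records three failed attempts and keeps the `benchdnn` message as the error text, which makes it easier to see which failures have a captured cause and which do not.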

Caffe

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

MBP M1 Max Machine Learning: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

Caffe

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

Caffe

TensorFlow Lite

Mlpack Benchmark

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

Mlpack Benchmark

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

Caffe

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

Acceleration: CPU

MBP M1 Max Machine Learning: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status.

Mlpack Benchmark

R Benchmark

This test is a quick-running survey of general R performance. Learn more via the OpenBenchmarking.org test page.

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

TNN

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.
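
Because the input length is fixed, a measured run time converts directly into a real-time factor. The sketch below works that arithmetic through. The 26-minute duration and 16-bit sample width come from the description above; the 48 kHz mono sample rate is an assumption based on RNNoise's usual input format and is not stated in this result. Both function names are hypothetical.

```python
AUDIO_MINUTES = 26        # from the test description
BYTES_PER_SAMPLE = 2      # 16-bit samples, from the test description
SAMPLE_RATE_HZ = 48_000   # assumption: 48 kHz mono input, not stated in the result


def raw_size_bytes(minutes=AUDIO_MINUTES, rate=SAMPLE_RATE_HZ, width=BYTES_PER_SAMPLE):
    """Approximate size of a mono RAW PCM file of the given length."""
    return minutes * 60 * rate * width


def realtime_factor(elapsed_seconds, minutes=AUDIO_MINUTES):
    """Seconds of audio processed per second of wall-clock time."""
    return (minutes * 60) / elapsed_seconds


if __name__ == "__main__":
    print(raw_size_bytes())        # size in bytes under the stated assumptions
    print(realtime_factor(780.0))  # a hypothetical 780 s run, for illustration only
```

A factor above 1.0 means the audio is denoised faster than it plays back. The 780-second figure in the usage lines is illustrative and is not a measurement from either system.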

TNN

ECP-CANDLE

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

TNN

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

Tensorflow

This is a benchmark of the Tensorflow deep learning framework using the CIFAR10 data set. Learn more via the OpenBenchmarking.org test page.

Build: Cifar10

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: AttributeError: module 'tensorflow' has no attribute 'app'
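
This `AttributeError` is the kind raised when a script written against the TensorFlow 1.x API runs under TensorFlow 2.x, where the top-level `tf.app` namespace is no longer exposed and the 1.x entry points are reached through `tf.compat.v1`. The installed TensorFlow version is not recorded in this result, so that diagnosis is an assumption. A minimal sketch of a fallback lookup follows; `resolve_app` is a hypothetical helper, not part of the test profile, and the demo uses stand-in objects so it runs without TensorFlow installed.

```python
from types import SimpleNamespace


def resolve_app(tf_module):
    """Return the `app` namespace from a TensorFlow-like module.

    Prefers the 1.x location (tf.app) and falls back to tf.compat.v1.app.
    """
    app = getattr(tf_module, "app", None)
    if app is not None:
        return app
    return tf_module.compat.v1.app


if __name__ == "__main__":
    # Stand-ins for the two module layouts; real use would be:
    #   import tensorflow as tf
    #   app = resolve_app(tf)
    v1_style = SimpleNamespace(app="app-from-top-level")
    v2_style = SimpleNamespace(
        compat=SimpleNamespace(v1=SimpleNamespace(app="app-from-compat-v1"))
    )
    print(resolve_app(v1_style))
    print(resolve_app(v2_style))
```

Whether the benchmark script would run correctly after such a change is untested here; other 1.x-only calls in the script could fail in the same way.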

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It comprises over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

Detector: EXPoSE

MBP M1 Max Machine Learning: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'

Detector: Bayesian Changepoint

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'

Detector: Relative Entropy

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'

Detector: Earthgecko Skyline

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'

Detector: Windowed Gaussian

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'pandas'
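
Every detector above failed on the same missing import, so a preflight check before launching the suite would surface the problem once instead of once per detector. The sketch below uses only the standard library. `missing_modules` is a hypothetical helper, and the list of modules to check is an assumption drawn from the error messages in this report, not from the test profiles' declared dependencies.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Module names taken from the ModuleNotFoundError messages in this report.
    required = ["pandas", "tensorflow"]
    absent = missing_modules(required)
    if absent:
        print("Missing Python modules:", ", ".join(absent))
    else:
        print("All checked modules are importable.")
```

The check is only meaningful when run with the same Python interpreter the test suite invokes, which matters on a system that reports more than one Python version.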

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

MBP M1 Max Machine Learning: The test quit with a non-zero exit status. E: SyntaxError: invalid syntax

ML Tests: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'tensorflow'

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Model: yolov4 - Device: CPU

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "yolov4/yolov4.onnx" failed: No such file or directory

Model: super-resolution-10 - Device: CPU

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "super_resolution/super_resolution.onnx" failed: No such file or directory

Model: shufflenet-v2-10 - Device: CPU

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "model/test_shufflenetv2/model.onnx" failed: No such file or directory

Model: fcn-resnet101-11 - Device: CPU

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: onnxruntime/onnxruntime/test/onnx/onnx_model_info.cc:45 void OnnxModelInfo::InitOnnxModelInfo(const PATH_CHAR_TYPE*) open file "fcn-resnet101-11/model.onnx" failed: No such file or directory

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

Model: Age Gender Recognition Retail 0013 FP32 - Device: Intel GPU

ML Tests: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./openvino: line 2: ./openvino-github-2021/bin/intel64/Release/benchmark_app: No such file or directory

Model: Age Gender Recognition Retail 0013 FP16 - Device: Intel GPU

Model: Person Detection 0106 FP16 - Device: Intel GPU

Model: Face Detection 0106 FP32 - Device: CPU

Model: Face Detection 0106 FP16 - Device: CPU

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

ML Tests: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

OpenVINO

Model: Age Gender Recognition Retail 0013 FP32 - Device: CPU

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

Model: Person Detection 0106 FP32 - Device: Intel GPU

Model: Person Detection 0106 FP32 - Device: CPU

Model: Person Detection 0106 FP16 - Device: CPU

Model: Face Detection 0106 FP32 - Device: Intel GPU

Model: Face Detection 0106 FP16 - Device: Intel GPU

86 Results Shown

Caffe
LeelaChessZero
ECP-CANDLE
Mobile Neural Network:
inception-v3
mobilenet-v1-1.0
MobileNetV2_224
SqueezeNetV1.0
resnet-v2-50
squeezenetv1.1
mobilenetV3
Caffe
PlaidML
ECP-CANDLE
PlaidML
TNN
oneDNN
Caffe
TensorFlow Lite:
Inception V4
Inception ResNet V2
Mlpack Benchmark
Numpy Benchmark
NCNN:
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU-v2-v2 - mobilenet-v2
Vulkan GPU - mobilenet
Caffe
OpenCV
Caffe
TensorFlow Lite:
SqueezeNet
NASNet Mobile
Mobilenet Quant
Mobilenet Float
Mlpack Benchmark
oneDNN:
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Mlpack Benchmark
oneDNN:
Recurrent Neural Network Inference - u8s8f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Caffe
DeepSpeech
Mlpack Benchmark
R Benchmark
oneDNN
TNN
RNNoise
TNN
ECP-CANDLE
oneDNN:
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
IP Shapes 1D - u8s8f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
IP Shapes 3D - f32 - CPU
IP Shapes 3D - u8s8f32 - CPU
TNN
oneDNN:
Convolution Batch Shapes Auto - u8s8f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Deconvolution Batch shapes_3d - u8s8f32 - CPU

MBP M1 Max Machine Learning

Testing initiated at 16 February 2022 14:41 by user chrisf.

ML Tests

Testing initiated at 15 February 2022 18:57 by user chrisf.
