n1n1

ARMv8 Neoverse-N1 testing with a GIGABYTE G242-P36-00 MP32-AR2-00 v01000100 (F31k SCP: 2.10.20220531 BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

a

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Processor Notes: Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

aa

b

c

Processor: ARMv8 Neoverse-N1 @ 3.00GHz (128 Cores), Motherboard: GIGABYTE G242-P36-00 MP32-AR2-00 v01000100 (F31k SCP: 2.10.20220531 BIOS), Chipset: Ampere Computing LLC Altra PCI Root Complex A, Memory: 16 x 32 GB DDR4-3200MT/s Samsung M393A4K40DB3-CWE, Disk: 800GB Micron_7450_MTFDKBA800TFS, Graphics: ASPEED, Monitor: VGA HDMI, Network: 2 x Intel I350

OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (aarch64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1024x768

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

JPEG-XL libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.

OpenVINO

JPEG-XL libjxl

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation

JPEG-XL Decoding libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.

oneDNN

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

JPEG-XL libjxl

Neural Magic DeepSparse

JPEG-XL libjxl

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse

srsRAN Project

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

srsRAN Project

SVT-AV1

oneDNN

srsRAN Project

JPEG-XL Decoding libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

JPEG-XL libjxl

oneDNN

JPEG-XL libjxl

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

Google Draco

oneDNN

SVT-AV1

oneDNN

Primesieve

SVT-AV1

Parallel BZIP2 Compression

This test measures the time needed to compress a file (FreeBSD-13.0-RELEASE-amd64-memstick.img) using Parallel BZIP2 compression. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

120 Results Shown

Neural Magic DeepSparse:
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
ms/batch
items/sec
ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
Stockfish
Timed Linux Kernel Compilation
Neural Magic DeepSparse:
CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
ms/batch
items/sec
Llama2 Chat 7b Quantized - Asynchronous Multi-Stream:
ms/batch
items/sec
OpenVINO:
Face Detection FP16-INT8 - CPU:
ms
FPS
JPEG-XL libjxl
OpenVINO:
Face Detection FP16 - CPU:
ms
FPS
JPEG-XL libjxl
oneDNN
Timed Linux Kernel Compilation
JPEG-XL Decoding libjxl
oneDNN
Neural Magic DeepSparse:
ResNet-50, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Person Detection FP16 - CPU:
ms
FPS
Person Detection FP32 - CPU:
ms
FPS
Neural Magic DeepSparse:
Llama2 Chat 7b Quantized - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Road Segmentation ADAS FP16-INT8 - CPU:
ms
FPS
Machine Translation EN To DE FP16 - CPU:
ms
FPS
Road Segmentation ADAS FP16 - CPU:
ms
FPS
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Noise Suppression Poconet-Like FP16 - CPU:
ms
FPS
Vehicle Detection FP16-INT8 - CPU:
ms
FPS
Person Vehicle Bike Detection FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16-INT8 - CPU:
ms
FPS
Person Re-Identification Retail FP16 - CPU:
ms
FPS
Handwritten English Recognition FP16-INT8 - CPU:
ms
FPS
Vehicle Detection FP16 - CPU:
ms
FPS
Handwritten English Recognition FP16 - CPU:
ms
FPS
Face Detection Retail FP16-INT8 - CPU:
ms
FPS
Face Detection Retail FP16 - CPU:
ms
FPS
Neural Magic DeepSparse:
NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
ms/batch
items/sec
OpenVINO:
Age Gender Recognition Retail 0013 FP16 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
ms
FPS
Neural Magic DeepSparse:
NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
ms/batch
items/sec
SVT-AV1
Neural Magic DeepSparse:
BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
ms/batch
items/sec
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
ms/batch
items/sec
JPEG-XL libjxl
Neural Magic DeepSparse:
CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream:
ms/batch
items/sec
NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream:
ms/batch
items/sec
JPEG-XL libjxl
Primesieve
Neural Magic DeepSparse:
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Synchronous Single-Stream:
ms/batch
items/sec
ResNet-50, Baseline - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
ms/batch
items/sec
CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
ms/batch
items/sec
srsRAN Project
WavPack Audio Encoding
srsRAN Project:
PUSCH Processor Benchmark, Throughput Thread
PDSCH Processor Benchmark, Throughput Total
SVT-AV1
oneDNN
srsRAN Project
JPEG-XL Decoding libjxl
SVT-AV1
JPEG-XL libjxl
oneDNN
JPEG-XL libjxl
Google Draco
SVT-AV1
Google Draco
oneDNN
SVT-AV1:
Preset 12 - Bosphorus 4K
Preset 13 - Bosphorus 4K
oneDNN:
Deconvolution Batch shapes_3d - CPU
Convolution Batch Shapes Auto - CPU
Primesieve
SVT-AV1
Parallel BZIP2 Compression
SVT-AV1

a

Testing initiated at 17 March 2024 01:23 by user root.

aa

Testing initiated at 17 March 2024 01:47 by user root.

b

Testing initiated at 17 March 2024 09:55 by user root.

c

OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (aarch64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1024x768

Testing initiated at 17 March 2024 13:52 by user root.

n1n1

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

a

aa

b

c

Neural Magic DeepSparse

Stockfish

Timed Linux Kernel Compilation

Neural Magic DeepSparse

OpenVINO

JPEG-XL libjxl

OpenVINO

JPEG-XL libjxl

oneDNN

Timed Linux Kernel Compilation

JPEG-XL Decoding libjxl

oneDNN

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

OpenVINO

Neural Magic DeepSparse

SVT-AV1

Neural Magic DeepSparse

JPEG-XL libjxl

Neural Magic DeepSparse

JPEG-XL libjxl

Primesieve

Neural Magic DeepSparse

srsRAN Project

WavPack Audio Encoding

srsRAN Project

SVT-AV1

oneDNN

srsRAN Project

JPEG-XL Decoding libjxl

SVT-AV1

JPEG-XL libjxl

oneDNN

JPEG-XL libjxl

Google Draco

SVT-AV1

Google Draco

oneDNN

SVT-AV1

oneDNN

Primesieve

SVT-AV1

Parallel BZIP2 Compression

SVT-AV1

120 Results Shown

a

aa

b

c