AMD AOCC 4.0 Benchmarks

AMD Ryzen 9 7950X compiler benchmarks by Michael Larabel for a future article.

AOCC 4.0

Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR X670E HERO (0703 BIOS), Chipset: AMD Device 14d8, Memory: 32GB, Disk: 1000GB Sabrent Rocket 4.0 Plus, Graphics: AMD Radeon RX 6800 16GB (2475/1000MHz), Audio: AMD Navi 21/23, Monitor: ASUS MG28U, Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 14.0.6, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver4
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 12.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

GCC 13.0 14 Nov

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 13.0.0 20221114 + clang (GCC) 13.0.0 20221114 (experimental), File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Notes: --disable-multilib
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 14

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 14.0.6-2, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Processor Notes: Scaling Governor: amd-pstate schedutil (Boost: Enabled) - CPU Microcode: 0xa601203
Graphics Notes: BAR1 / Visible vRAM Size: 16368 MB - vBIOS Version: 113-D4120900-101
Python Notes: Python 3.10.7
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 15.0.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 15.0.2-1, File-System: ext4, Screen Resolution: 3840x2160

JPEG XL libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

Backend: BLAS

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory

Backend: Eigen

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Test: Keyed Algorithms

GCC 13.0 14 Nov: The test quit with a non-zero exit status.

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

JPEG XL libjxl

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Target: CPU - Model: DenseNet

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./tnn: 3: ./test/TNNTest: not found

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: yolov4 - Device: CPU - Executor: Standard

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: GPT-2 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: bertsquad-12 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: yolov4 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

Throughput Test: Kostya

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Caffe

This is a benchmark of the Caffe deep learning framework and currently supports the AlexNet and Googlenet model and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

simdjson

Throughput Test: LargeRandom

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

oneDNN

JPEG XL libjxl

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

simdjson

Throughput Test: DistinctUserID

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: PartialTweets

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

Throughput Test: TopTweet

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./simdjson: 3: ./bench_ondemand: not found

OpenVINO

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenVINO

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenSSL

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

Ngspice

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

Test: SET - Parallel Connections: 50

LLVM Clang 14: The test run did not produce a result. E: ./redis: 12: ./src/redis-benchmark: not found

LLVM Clang 15.0.2: The test run did not produce a result. E: ./redis: 12: ./src/redis-benchmark: not found

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

Test: Unkeyed Algorithms

GCC 13.0 14 Nov: The test quit with a non-zero exit status.

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

PJSIP

PJSIP is a free and open source multimedia communication library written in C language implementing standard based protocols such as SIP, SDP, RTP, STUN, TURN, and ICE. It combines signaling protocol (SIP) with rich multimedia framework and NAT traversal functionality into high level API that is portable and suitable for almost any type of systems ranging from desktops, embedded systems, to mobile handsets. This test profile is making use of pjsip-perf with both the client/server on teh system. More details on the PJSIP benchmark at https://www.pjsip.org/high-performance-sip.htm Learn more via the OpenBenchmarking.org test page.

Method: OPTIONS, Stateful

LLVM Clang 14: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

Method: INVITE

LLVM Clang 14: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Dragonflydb

Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

Clients: 50 - Set To Get Ratio: 1:5

LLVM Clang 14: The test run did not produce a result. E: ./dragonflydb: 7: ./memtier_benchmark: not found

LLVM Clang 15.0.2: The test run did not produce a result. E: ./dragonflydb: 7: ./memtier_benchmark: not found

Clients: 50 - Set To Get Ratio: 5:1

LLVM Clang 14: The test run did not produce a result. E: ./dragonflydb: 7: ./memtier_benchmark: not found

LLVM Clang 15.0.2: The test run did not produce a result. E: ./dragonflydb: 7: ./memtier_benchmark: not found

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google as the AV1 Codec Library. Learn more via the OpenBenchmarking.org test page.

JPEG XL Decoding libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: unknown location(0): [4;31;49mfatal error: in "QuantLib benchmark suite/ShortRateModel__Swaps": QuantLib::Error: cannot roll the asset back to0 (it is already at t = 1.75068)[0;39;49m

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

oneDNN

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

Caffe

Zstd Compression

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

oneDNN

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Target: CPU - Model: MobileNet v2

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./tnn: 3: ./test/TNNTest: not found

oneDNN

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

Text-To-Speech Synthesis

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./espeak: 3: ./espeak-ng: not found

JPEG XL Decoding libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. The JPEG XL encoding/decoding is done using the libjxl codebase. Learn more via the OpenBenchmarking.org test page.

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

oneDNN

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Target: CPU - Model: SqueezeNet v1.1

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./tnn: 3: ./test/TNNTest: not found

QuadRay

VectorChief's QuadRay is a real-time ray-tracing engine written to support SIMD across ARM, MIPS, PPC, and x86/x86_64 processors. QuadRay supports SSE/SSE2/SSE4 and AVX/AVX2/AVX-512 usage on Intel/AMD CPUs. Learn more via the OpenBenchmarking.org test page.

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

Trace Time

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./povray: 3: ./unix/povray: not found

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC audio format ten times using the --best preset settings. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Target: CPU - Model: SqueezeNet v2

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./tnn: 3: ./test/TNNTest: not found

Sockperf

This is a network socket API performance benchmark developed by Mellanox. This test profile runs both the client and server on the local host for evaluating individual system performance. Learn more via the OpenBenchmarking.org test page.

PJSIP

Method: OPTIONS, Stateless

LLVM Clang 14: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./pjsip: line 5: ./pjsip-apps/bin/samples/*/pjsip-perf: No such file or directory

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

C-Blosc

C-Blosc (c-blosc2) simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

oneDNN

ASTC Encoder

KTX-Software toktx

This is a benchmark of The Khronos Group's KTX-Software library and tools. KTX-Software provides "toktx" for converting/creating in the KTX container format for image textures. This benchmark times how long it takes to convert to KTX 2.0 format with various settings using a reference PNG sample input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenJPEG

OpenJPEG is an open-source JPEG 2000 codec written in the C programming language. The default input for this test profile is the NASA/JPL-Caltech/MSSS Curiosity panorama 717MB TIFF image file converting to JPEG2000 format. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google as the AV1 Codec Library. Learn more via the OpenBenchmarking.org test page.

Xsbench

XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

oneDNN

SVT-AV1

C-Blosc

C-Blosc (c-blosc2) simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

Kvazaar

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Model: Church Facade

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./draco: 3: ./draco_encoder: not found

SVT-VP9

SVT-HEVC

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve primarily benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

Length: 1e12

GCC 13.0 14 Nov: The test quit with a non-zero exit status.

oneDNN

SVT-VP9

ASTC Encoder

Kvazaar

SVT-AV1

oneDNN

KTX-Software toktx

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

Google Draco

Model: Lion

GCC 13.0 14 Nov: The test quit with a non-zero exit status. E: ./draco: 3: ./draco_encoder: not found

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

Renderer: Software CPU Color Light - AF: Off - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC

Nettle

GNU Nettle is a low-level cryptographic library used by GnuTLS and other software. Learn more via the OpenBenchmarking.org test page.

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

Renderer: Software CPU Color Light - AF: On - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU Color Light - AF: On - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

SVT-AV1

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

Renderer: Software CPU Color Light - AF: Off - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: On - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: On - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Renderer: Software CPU - AF: Off - MSAA: Off - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

Kvazaar

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

Renderer: Software CPU - AF: Off - MSAA: On - Resolution: 1920 x 1080

LLVM Clang 14: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

LLVM Clang 15.0.2: The test quit with a non-zero exit status. E: ./yquake2: 5: ./quake2: not found

Nettle

GNU Nettle is a low-level cryptographic library used by GnuTLS and other software. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

Nettle

GNU Nettle is a low-level cryptographic library used by GnuTLS and other software. Learn more via the OpenBenchmarking.org test page.

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

Nettle

GNU Nettle is a low-level cryptographic library used by GnuTLS and other software. Learn more via the OpenBenchmarking.org test page.

190 Results Shown

JPEG XL libjxl
ONNX Runtime
LeelaChessZero:
BLAS
Eigen
Crypto++
SecureMark
JPEG XL libjxl
TNN
ONNX Runtime:
GPT-2 - CPU - Standard
yolov4 - CPU - Standard
OpenSSL
CppPerformanceBenchmarks
ONNX Runtime:
super-resolution-10 - CPU - Parallel
GPT-2 - CPU - Parallel
bertsquad-12 - CPU - Parallel
yolov4 - CPU - Parallel
ArcFace ResNet-100 - CPU - Parallel
simdjson
ONNX Runtime
SVT-HEVC
NCNN:
CPU - regnety_400m
CPU - squeezenet_ssd
CPU - yolov4-tiny
CPU - resnet50
CPU - alexnet
CPU - resnet18
CPU - vgg16
CPU - googlenet
CPU - blazeface
CPU - efficientnet-b0
CPU - mnasnet
CPU - shufflenet-v2
CPU-v3-v3 - mobilenet-v3
CPU-v2-v2 - mobilenet-v2
CPU - mobilenet
Caffe
simdjson
oneDNN:
Recurrent Neural Network Training - u8s8f32 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - u8s8f32 - CPU
JPEG XL libjxl:
JPEG - 90
PNG - 90
Monte Carlo Simulations of Ionised Nebulae
libavif avifenc
OpenVINO:
Person Vehicle Bike Detection FP16 - CPU:
ms
FPS
simdjson:
DistinctUserID
PartialTweets
TopTweet
OpenVINO:
Person Detection FP16 - CPU:
ms
FPS
Person Detection FP32 - CPU:
ms
FPS
Face Detection FP16 - CPU:
ms
FPS
Face Detection FP16-INT8 - CPU:
ms
FPS
Machine Translation EN To DE FP16 - CPU:
ms
FPS
Ngspice
OpenVINO:
Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
ms
FPS
Age Gender Recognition Retail 0013 FP16 - CPU:
ms
FPS
Vehicle Detection FP16-INT8 - CPU:
ms
FPS
Vehicle Detection FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16 - CPU:
ms
FPS
Weld Porosity Detection FP16-INT8 - CPU:
ms
FPS
GraphicsMagick:
Sharpen
Noise-Gaussian
Resizing
Enhanced
Rotate
OpenSSL:
RSA4096:
verify/s
sign/s
GraphicsMagick:
Swirl
HWB Color Space
Ngspice
Tachyon
SVT-AV1
Redis
Crypto++
Kripke
PJSIP:
OPTIONS, Stateful
INVITE
Zstd Compression:
19 - Decompression Speed
19 - Compression Speed
Dragonflydb:
50 - 1:5
50 - 5:1
AOM AV1
JPEG XL Decoding libjxl
QuantLib
CppPerformanceBenchmarks
Zstd Compression:
19, Long Mode - Decompression Speed
19, Long Mode - Compression Speed
oneDNN
SQLite Speedtest
Caffe
Zstd Compression:
3 - Decompression Speed
3 - Compression Speed
libavif avifenc
CppPerformanceBenchmarks
Liquid-DSP
SVT-VP9
WebP Image Encode
CppPerformanceBenchmarks
oneDNN
Coremark
ASTC Encoder
libjpeg-turbo tjbench
TNN
oneDNN
eSpeak-NG Speech Engine
JPEG XL Decoding libjxl
C-Ray
oneDNN:
Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
Deconvolution Batch shapes_1d - u8s8f32 - CPU
AOBench
Liquid-DSP:
32 - 256 - 57
16 - 256 - 57
TNN
QuadRay:
5 - 4K
1 - 4K
5 - 1080p
1 - 1080p
CLOMP
POV-Ray
FLAC Audio Encoding
CppPerformanceBenchmarks
ASTC Encoder
TNN
Sockperf
PJSIP
Kvazaar
C-Blosc
oneDNN:
Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
ASTC Encoder
KTX-Software toktx
WebP Image Encode
Dolfyn
OpenJPEG
AOM AV1
Xsbench
CppPerformanceBenchmarks
oneDNN
SVT-AV1
C-Blosc
Kvazaar
Google Draco
SVT-VP9
SVT-HEVC
Primesieve
oneDNN
SVT-VP9
ASTC Encoder
Kvazaar
SVT-AV1
oneDNN
KTX-Software toktx
WebP Image Encode
Google Draco
LAME MP3 Encoding
libavif avifenc
yquake2
LAMMPS Molecular Dynamics Simulator
SVT-HEVC
Nettle
yquake2:
Software CPU Color Light - On - Off - 1920 x 1080
Software CPU Color Light - On - On - 1920 x 1080
SVT-AV1
yquake2:
Software CPU Color Light - Off - On - 1920 x 1080
Software CPU - On - On - 1920 x 1080
Software CPU - On - Off - 1920 x 1080
Software CPU - Off - Off - 1920 x 1080
libavif avifenc:
6
10, Lossless
Kvazaar
yquake2
Nettle
WebP Image Encode
Nettle
TSCP
WebP Image Encode
Nettle

AOCC 4.0

Testing initiated at 12 November 2022 04:55 by user pts.

GCC 12.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 12.2.0, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 13 November 2022 16:24 by user pts.

GCC 13.0 14 Nov

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: GCC 13.0.0 20221114 + clang (GCC) 13.0.0 20221114 (experimental), File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 14 November 2022 09:05 by user pts.

LLVM Clang 14

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 14.0.6-2, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 13 November 2022 04:35 by user pts.

LLVM Clang 15.0.2

OS: Ubuntu 22.10, Kernel: 6.1.0-060100rc3daily20221103-generic (x86_64), Desktop: GNOME Shell 43.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 22.2.1 (LLVM 15.0.2 DRM 3.49), Vulkan: 1.3.224, Compiler: Clang 15.0.2-1, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 14 November 2022 16:49 by user pts.

AMD AOCC 4.0 Benchmarks

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

AOCC 4.0

GCC 12.2

GCC 13.0 14 Nov

LLVM Clang 14

LLVM Clang 15.0.2

JPEG XL libjxl

ONNX Runtime

LeelaChessZero

Crypto++

SecureMark

JPEG XL libjxl

TNN

ONNX Runtime

OpenSSL

CppPerformanceBenchmarks

ONNX Runtime

simdjson

ONNX Runtime

SVT-HEVC

NCNN

Caffe

simdjson

oneDNN

JPEG XL libjxl

Monte Carlo Simulations of Ionised Nebulae

libavif avifenc

OpenVINO

simdjson

OpenVINO

Ngspice

OpenVINO

GraphicsMagick

OpenSSL

GraphicsMagick

Ngspice

Tachyon

SVT-AV1

Redis

Crypto++

Kripke

PJSIP

Zstd Compression

Dragonflydb

AOM AV1

JPEG XL Decoding libjxl

QuantLib

CppPerformanceBenchmarks

Zstd Compression

oneDNN

SQLite Speedtest

Caffe

Zstd Compression

libavif avifenc

CppPerformanceBenchmarks

Liquid-DSP

SVT-VP9

WebP Image Encode

CppPerformanceBenchmarks

oneDNN

Coremark

ASTC Encoder

libjpeg-turbo tjbench

TNN

oneDNN

eSpeak-NG Speech Engine

JPEG XL Decoding libjxl

C-Ray

oneDNN

AOBench

Liquid-DSP

TNN

QuadRay

CLOMP