Ryzen 9 5950X AOCC 3.0 Compiler Benchmarking

Benchmarks for a future article.

GCC 10.2

Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3204 BIOS), Chipset: AMD Starship/Matisse, Memory: 32GB, Disk: 2000GB Corsair Force MP600 + 2000GB, Graphics: AMD NAVY_FLOUNDER 12GB (2855/1000MHz), Audio: AMD Device ab28, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: GCC 10.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
Python Notes: Python 3.8.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

LLVM Clang 12

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 12.0.0-++rc3-1~exp1~oibaf~g, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
Python Notes: Python 3.8.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD AOCC 2.3

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 11.0.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Notes: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
Python Notes: Python 3.8.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

AMD AOCC 3.0

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 12.0.0, File-System: ext4, Screen Resolution: 3840x2160

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

Crypto++

Crypto++ is a C++ class library of cryptographic algorithms. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

Ogg Audio Encoding

This test times how long it takes to encode a sample WAV file to Ogg format using the reference Xiph.org tools/libraries. Learn more via the OpenBenchmarking.org test page.

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

147 Results Shown

TSCP
Crypto++
LZ4 Compression:
1 - Compression Speed
1 - Decompression Speed
3 - Compression Speed
3 - Decompression Speed
9 - Compression Speed
9 - Decompression Speed
Crafty
Basis Universal:
ETC1S
UASTC Level 0
UASTC Level 2
UASTC Level 3
Ngspice:
C2670
C7552
Opus Codec Encoding
WavPack Audio Encoding
ASTC Encoder:
Medium
Thorough
Exhaustive
Etcpak:
DXT1
ETC1
ETC2
JPEG XL:
PNG - 5
PNG - 7
PNG - 8
JPEG - 5
JPEG - 7
JPEG - 8
JPEG XL Decoding:
1
All
LibRaw
WebP2 Image Encode:
Default
Quality 75, Compression Effort 7
Quality 95, Compression Effort 7
Quality 100, Compression Effort 5
Quality 100, Lossless Compression
WebP Image Encode:
Default
Quality 100
Quality 100, Lossless
Quality 100, Highest Compression
Quality 100, Lossless, Highest Compression
Ogg Audio Encoding
Google SynthMark
Gcrypt Library
QuantLib
Timed MrBayes Analysis
RNNoise
Mobile Neural Network:
SqueezeNetV1.0
resnet-v2-50
MobileNetV2_224
mobilenet-v1-1.0
inception-v3
ONNX Runtime:
yolov4 - OpenMP CPU
bertsquad-10 - OpenMP CPU
fcn-resnet101-11 - OpenMP CPU
shufflenet-v2-10 - OpenMP CPU
super-resolution-10 - OpenMP CPU
TNN:
CPU - MobileNet v2
CPU - SqueezeNet v1.1
NCNN:
CPU - mobilenet
CPU-v2-v2 - mobilenet-v2
CPU-v3-v3 - mobilenet-v3
CPU - shufflenet-v2
CPU - mnasnet
CPU - efficientnet-b0
CPU - blazeface
CPU - googlenet
CPU - vgg16
CPU - resnet18
CPU - alexnet
CPU - resnet50
CPU - yolov4-tiny
CPU - squeezenet_ssd
CPU - regnety_400m
oneDNN:
IP Shapes 1D - f32 - CPU
IP Shapes 3D - f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
OpenFOAM
Timed LLVM Compilation
Zstd Compression:
8 - Compression Speed
8 - Decompression Speed
19 - Compression Speed
19 - Decompression Speed
3, Long Mode - Compression Speed
3, Long Mode - Decompression Speed
8, Long Mode - Compression Speed
8, Long Mode - Decompression Speed
19, Long Mode - Compression Speed
19, Long Mode - Decompression Speed
Sysbench
AOM AV1:
Speed 0 Two-Pass
Speed 4 Two-Pass
Speed 6 Realtime
Speed 6 Two-Pass
Speed 8 Realtime
GraphicsMagick:
Swirl
Rotate
Sharpen
Enhanced
Resizing
Noise-Gaussian
HWB Color Space
Tachyon
SVT-VP9:
PSNR/SSIM Optimized - Bosphorus 1080p
Visual Quality Optimized - Bosphorus 1080p
x264
dav1d:
Summer Nature 4K
Summer Nature 1080p
SVT-AV1:
Enc Mode 4 - 1080p
Enc Mode 8 - 1080p
x265:
Bosphorus 4K
Bosphorus 1080p
C-Ray
POV-Ray
libavif avifenc:
0
2
6
10
6, Lossless
10, Lossless
Timed Godot Game Engine Compilation
Smallpt
GNU Radio:
Five Back to Back FIR Filters
Signal Source (Cosine)
FIR Filter
IIR Filter
FM Deemphasis Filter
Hilbert Transform
Liquid-DSP:
1 - 256 - 57
16 - 256 - 57
32 - 256 - 57
Redis:
LPOP
SADD
LPUSH
GET
SET
SQLite Speedtest
simdjson:
Kostya
LargeRand
PartialTweets
DistinctUserID

GCC 10.2

Testing initiated at 14 March 2021 08:46 by user pts.

LLVM Clang 12

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 12.0.0-++rc3-1~exp1~oibaf~g, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 15 March 2021 05:52 by user pts.

AMD AOCC 2.3

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 11.0.0, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 14 March 2021 17:02 by user pts.

AMD AOCC 3.0

OS: Ubuntu 20.10, Kernel: 5.11.6-051106-generic (x86_64), Desktop: GNOME Shell 3.38.2, Display Server: X Server 1.20.9, OpenGL: 4.6 Mesa 21.1.0-devel (git-684f97d 2021-03-12 groovy-oibaf-ppa) (LLVM 11.0.1), Vulkan: 1.2.168, Compiler: Clang 12.0.0, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 15 March 2021 13:34 by user pts.

Ryzen 9 5950X AOCC 3.0 Compiler Benchmarking

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

GCC 10.2

LLVM Clang 12

AMD AOCC 2.3

AMD AOCC 3.0

TSCP

Crypto++

LZ4 Compression

Crafty

Basis Universal

Ngspice

Opus Codec Encoding

WavPack Audio Encoding

ASTC Encoder

Etcpak

JPEG XL

JPEG XL Decoding

LibRaw

WebP2 Image Encode

WebP Image Encode

Ogg Audio Encoding

Google SynthMark

Gcrypt Library

QuantLib

Timed MrBayes Analysis

RNNoise

Mobile Neural Network

ONNX Runtime

TNN

NCNN

oneDNN

OpenFOAM

Timed LLVM Compilation

Zstd Compression

Sysbench

AOM AV1

GraphicsMagick

Tachyon

SVT-VP9

x264

dav1d

SVT-AV1

x265

C-Ray

POV-Ray

libavif avifenc

Timed Godot Game Engine Compilation

Smallpt

GNU Radio

Liquid-DSP

Redis

SQLite Speedtest

simdjson

147 Results Shown

GCC 10.2

LLVM Clang 12

AMD AOCC 2.3

AMD AOCC 3.0