AMD AOCC 3.1 Compiler Comparison

AMD EPYC 7543 testing of AMD AOCC 3.1 compiler benchmarks by Michael Larabel for a future article.

AOCC 3.1

Processor: AMD EPYC 7543 32-Core @ 2.80GHz (32 Cores / 64 Threads), Motherboard: TYAN S8036GM2NE-LE (V2.00.B21 BIOS), Chipset: AMD Starship/Matisse, Memory: 64GB, Disk: 1000GB Corsair Force MP600, Graphics: ASPEED, Monitor: VE228, Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe

OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server, Compiler: Clang 12.0.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
Compiler Notes: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver3
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa001119
Python Notes: Python 3.9.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Clang 12.0

OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server, Compiler: Clang 12.0.1-++20210630032617+fed41342a82f-1~exp1~20210630133328.128, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa001119
Python Notes: Python 3.9.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

GCC 11.1

OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa001119
Python Notes: Python 3.9.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

C-Blosc

A simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Result

Result Confidence

Result

Result Confidence

WavPack Audio Encoding

This test times how long it takes to encode a sample WAV file to WavPack format with very high quality settings. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

Result

Result Confidence

Geometric Mean Of All Test Results

79 Results Shown

C-Blosc
QuantLib
Etcpak:
DXT1
ETC2
LZ4 Compression
Zstd Compression:
8 - Compression Speed
19, Long Mode - Compression Speed
19, Long Mode - Decompression Speed
JPEG XL:
PNG - 7
JPEG - 7
JPEG - 8
Botan:
AES-256
AES-256 - Decrypt
Twofish
Twofish - Decrypt
Blowfish
Blowfish - Decrypt
CAST-256
CAST-256 - Decrypt
LibRaw
John The Ripper:
Blowfish
MD5
GraphicsMagick:
Rotate
Enhanced
SVT-AV1:
Preset 4 - Bosphorus 4K
Preset 8 - Bosphorus 4K
SVT-HEVC:
7 - Bosphorus 1080p
10 - Bosphorus 1080p
SVT-VP9
VP9 libvpx Encoding
Himeno Benchmark
Stockfish
libavif avifenc:
2
6
10
6, Lossless
10, Lossless
POV-Ray
oneDNN:
IP Shapes 1D - f32 - CPU
IP Shapes 3D - f32 - CPU
Convolution Batch Shapes Auto - f32 - CPU
Deconvolution Batch shapes_1d - f32 - CPU
Deconvolution Batch shapes_3d - f32 - CPU
Recurrent Neural Network Training - f32 - CPU
Recurrent Neural Network Inference - f32 - CPU
Matrix Multiply Batch Shapes Transformer - f32 - CPU
FLAC Audio Encoding
LAME MP3 Encoding
Ngspice:
C2670
C7552
RNNoise
Tachyon
Google SynthMark
SecureMark
Liquid-DSP:
16 - 256 - 57
32 - 256 - 57
64 - 256 - 57
FinanceBench:
Repo OpenMP
Bonds OpenMP
libjpeg-turbo tjbench
ASTC Encoder
SQLite Speedtest
Google Draco:
Lion
Church Facade
NCNN:
CPU-v2-v2 - mobilenet-v2
CPU-v3-v3 - mobilenet-v3
CPU - vgg16
CPU - resnet18
TNN:
CPU - SqueezeNet v2
CPU - SqueezeNet v1.1
Facebook RocksDB:
Update Rand
Read While Writing
Read Rand Write Rand
ONNX Runtime:
bertsquad-10 - OpenMP CPU
fcn-resnet101-11 - OpenMP CPU
super-resolution-10 - OpenMP CPU
WavPack Audio Encoding
GnuPG
Geometric Mean Of All Test Results

AOCC 3.1

Testing initiated at 21 July 2021 16:23 by user phoronix.

Clang 12.0

Testing initiated at 27 July 2021 10:14 by user phoronix.

GCC 11.1

Testing initiated at 27 July 2021 15:16 by user phoronix.

AMD AOCC 3.1 Compiler Comparison

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

AOCC 3.1

Clang 12.0

GCC 11.1

C-Blosc

QuantLib

Etcpak

LZ4 Compression

Zstd Compression

JPEG XL

Botan

LibRaw

John The Ripper

GraphicsMagick

SVT-AV1

SVT-HEVC

SVT-VP9

VP9 libvpx Encoding

Himeno Benchmark

Stockfish

libavif avifenc

POV-Ray

oneDNN

FLAC Audio Encoding

LAME MP3 Encoding

Ngspice

RNNoise

Tachyon

Google SynthMark

SecureMark

Liquid-DSP

FinanceBench

libjpeg-turbo tjbench

ASTC Encoder

SQLite Speedtest

Google Draco

NCNN

TNN

Facebook RocksDB

ONNX Runtime

WavPack Audio Encoding

GnuPG

Geometric Mean Of All Test Results

79 Results Shown

AOCC 3.1

Clang 12.0

GCC 11.1