LLVM Clang 13 Bencmarking Intel Xeon Ice Lake

LLVM Clang benchmarks for a future article.

Clang 13

Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads), Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS), Chipset: Intel Device 0998, Memory: 504GB, Disk: 7682GB INTEL SSDPF2KX076TZ, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP

OS: Ubuntu 21.04, Kernel: 5.14.0-rc1-folio (x86_64) 20210715, Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, Compiler: Clang 13.0.0-++20210820072921+23ba3732246a-1~exp1~20210820174536.53, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Notes: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0
Python Notes: Python 3.9.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Clang 12

OS: Ubuntu 21.04, Kernel: 5.14.0-rc1-folio (x86_64) 20210715, Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, Compiler: Clang 12.0.0-3ubuntu1~21.04.1, File-System: ext4, Screen Resolution: 1920x1080

Clang 11

OS: Ubuntu 21.04, Kernel: 5.14.0-rc1-folio (x86_64) 20210715, Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, Compiler: Clang 11.0.1-2ubuntu4, File-System: ext4, Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Notes: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

C-Blosc

A simple, compressed, fast and persistent data store library for C. Learn more via the OpenBenchmarking.org test page.

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

dav1d

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

PJSIP

PJSIP is a free and open source multimedia communication library written in C language implementing standard based protocols such as SIP, SDP, RTP, STUN, TURN, and ICE. It combines signaling protocol (SIP) with rich multimedia framework and NAT traversal functionality into high level API that is portable and suitable for almost any type of systems ranging from desktops, embedded systems, to mobile handsets. This test profile is making use of pjsip-perf with both the client/server on teh system. More details on the PJSIP benchmark at https://www.pjsip.org/high-performance-sip.htm Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. Learn more via the OpenBenchmarking.org test page.

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

97 Results Shown

Aircrack-ng
AOBench
AOM AV1:
Speed 6 Realtime - Bosphorus 4K
Speed 8 Realtime - Bosphorus 4K
Speed 9 Realtime - Bosphorus 4K
Apache HTTP Server:
500
1000
Botan:
KASUMI
KASUMI - Decrypt
AES-256
AES-256 - Decrypt
Twofish
Twofish - Decrypt
Blowfish
Blowfish - Decrypt
CAST-256
CAST-256 - Decrypt
ChaCha20Poly1305
ChaCha20Poly1305 - Decrypt
C-Blosc
C-Ray
Coremark
dav1d:
Summer Nature 4K
Chimera 1080p 10-bit
FFTW
FinanceBench:
Repo OpenMP
Bonds OpenMP
Google Draco:
Lion
Church Facade
Google SynthMark
GraphicsMagick:
Swirl
Rotate
Sharpen
Enhanced
Resizing
Noise-Gaussian
HWB Color Space
John The Ripper:
Blowfish
MD5
LibRaw
MariaDB:
2048
4096
NCNN:
CPU - mobilenet
CPU - shufflenet-v2
CPU - mnasnet
CPU - efficientnet-b0
CPU - blazeface
CPU - googlenet
CPU - alexnet
CPU - resnet50
CPU - yolov4-tiny
CPU - squeezenet_ssd
CPU - regnety_400m
nginx:
1
20
100
200
500
1000
oneDNN:
IP Shapes 1D - bf16bf16bf16 - CPU
IP Shapes 3D - bf16bf16bf16 - CPU
Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU
Recurrent Neural Network Training - bf16bf16bf16 - CPU
Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Opus Codec Encoding
PJSIP:
INVITE
OPTIONS, Stateful
OPTIONS, Stateless
PostgreSQL pgbench:
100 - 250 - Read Only
100 - 250 - Read Only - Average Latency
100 - 250 - Read Write
100 - 250 - Read Write - Average Latency
QuantLib
SecureMark
SQLite Speedtest
SVT-AV1:
Preset 4 - Bosphorus 4K
Preset 8 - Bosphorus 4K
SVT-HEVC:
7 - Bosphorus 1080p
10 - Bosphorus 1080p
SVT-VP9:
PSNR/SSIM Optimized - Bosphorus 1080p
Visual Quality Optimized - Bosphorus 1080p
Tachyon
TNN:
CPU - DenseNet
CPU - MobileNet v2
CPU - SqueezeNet v2
CPU - SqueezeNet v1.1
TSCP
VP9 libvpx Encoding:
Speed 0 - Bosphorus 4K
Speed 5 - Bosphorus 4K
Zstd Compression:
3 - Compression Speed
8 - Compression Speed
19 - Compression Speed
3, Long Mode - Compression Speed
8, Long Mode - Compression Speed
19, Long Mode - Compression Speed

Clang 13

Testing initiated at 22 August 2021 07:45 by user phoronix.

Clang 12

Testing initiated at 22 August 2021 16:19 by user phoronix.

Clang 11

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Notes: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Testing initiated at 23 August 2021 05:54 by user phoronix.

LLVM Clang 13 Bencmarking Intel Xeon Ice Lake

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

Clang 13

Clang 12

Clang 11

Aircrack-ng

AOBench

AOM AV1

Apache HTTP Server

Botan

C-Blosc

C-Ray

Coremark

dav1d

FFTW

FinanceBench

Google Draco

Google SynthMark

GraphicsMagick

John The Ripper

LibRaw

MariaDB

NCNN

nginx

oneDNN

Opus Codec Encoding

PJSIP

PostgreSQL pgbench

QuantLib

SecureMark

SQLite Speedtest

SVT-AV1

SVT-HEVC

SVT-VP9

Tachyon

TNN

TSCP

VP9 libvpx Encoding

Zstd Compression

97 Results Shown

Clang 13

Clang 12

Clang 11