GCC Znver3 First Cut Benchmarks

Tests for a future article.

znver3

Environment Notes: CXXFLAGS="-O3 -march=znver3" CFLAGS="-O3 -march=znver3"
Compiler Notes: --disable-multilib --enable-checking=release
Disk Notes: NONE / errors=remount-ro,relatime,rw / Block Size: 4096
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Python Notes: Python 2.7.18 + Python 3.8.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

x86-64

Environment Notes: CXXFLAGS="-O3 -march=x86-64" CFLAGS="-O3 -march=x86-64"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

znver1

Environment Notes: CXXFLAGS="-O3 -march=znver1" CFLAGS="-O3 -march=znver1"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

znver2

Environment Notes: CXXFLAGS="-O3 -march=znver2" CFLAGS="-O3 -march=znver2"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

haswell

Environment Notes: CXXFLAGS="-O3 -march=haswell" CFLAGS="-O3 -march=haswell"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

skylake

Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (2311 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600 + 2000GB, Graphics: NVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz), Audio: NVIDIA Device 228b, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

OS: Ubuntu 20.04, Kernel: 5.4.0-54-generic (x86_64), Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: NVIDIA 455.45.01, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 11.1.114, Vulkan: 1.2.142, Compiler: GCC 11.0.0 20201203, File-System: ext4, Screen Resolution: 3840x2160

Environment Notes: CXXFLAGS="-O3 -march=skylake" CFLAGS="-O3 -march=skylake"
Compiler Notes: --disable-multilib --enable-checking=release
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009
OpenCL Notes: GPU Compute Cores: 4864
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
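
The six configurations above differ only in the -march value passed through CFLAGS and CXXFLAGS. As a minimal sketch of how such per-target environments could be assembled, the following Python snippet builds the flag strings shown in the Environment Notes. The helper name build_env and the idea of generating the environments from a script are assumptions for illustration; the source does not say how the runs were actually driven.

```python
import os

# The -march targets listed in the Environment Notes of this result file.
MARCH_TARGETS = ["znver3", "x86-64", "znver1", "znver2", "haswell", "skylake"]


def build_env(march, base_env=None):
    """Return a copy of an environment with CFLAGS/CXXFLAGS set for one target.

    build_env is a hypothetical helper, not part of any benchmarking tool.
    """
    env = dict(os.environ if base_env is None else base_env)
    flags = f"-O3 -march={march}"
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    # Print the flag pair for each target, starting from an empty environment.
    for march in MARCH_TARGETS:
        env = build_env(march, base_env={})
        print(f'{march}: CFLAGS="{env["CFLAGS"]}" CXXFLAGS="{env["CXXFLAGS"]}"')
```

The returned dictionary could be handed to a build step through the env argument of subprocess.run, so that each target is compiled with its own flags while the rest of the environment stays unchanged.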

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
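
Of the SciMark kernels listed above, Jacobi successive over-relaxation is perhaps the simplest to illustrate. The following is a minimal Python sketch of one relaxation sweep over a 2D grid, not the SciMark 2.0 source. The function name sor_sweep and the relaxation factor omega = 1.25 are assumptions chosen for illustration.

```python
def sor_sweep(grid, omega=1.25):
    """Run one in-place over-relaxation sweep over the interior grid points.

    Each interior value is replaced by a weighted blend of the average of its
    four neighbours and its own previous value. Because the update is done in
    place, neighbours above and to the left have already been updated.
    omega = 1.25 is an assumed relaxation factor.
    """
    rows, cols = len(grid), len(grid[0])
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            neighbours = (grid[i - 1][j] + grid[i + 1][j]
                          + grid[i][j - 1] + grid[i][j + 1])
            grid[i][j] = omega * 0.25 * neighbours + (1.0 - omega) * grid[i][j]
    return grid


if __name__ == "__main__":
    # Boundary held at 1.0, interior starting at 0.0.
    g = [[1.0] * 5 for _ in range(5)]
    for i in range(1, 4):
        for j in range(1, 4):
            g[i][j] = 0.0
    for _ in range(20):
        sor_sweep(g)
    print(g[2][2])
```

A grid that is already constant is left unchanged by the sweep, which is a quick sanity check for the update formula.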

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

Kvazaar

This is a test of Kvazaar, a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and is developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.
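
DGEMM is the BLAS routine for double-precision general matrix multiplication, computing C = alpha * A * B + beta * C. The following single-threaded, pure-Python sketch shows only the arithmetic being measured. It is not the ACES DGEMM source, and the function name dgemm with its list-of-lists interface is an assumption for illustration.

```python
def dgemm(alpha, a, b, beta, c):
    """Return alpha * A * B + beta * C for matrices given as lists of rows.

    A is m x k, B is k x n, and C is m x n. The input matrices are not
    modified; a new m x n matrix is returned.
    """
    m, k, n = len(a), len(b), len(b[0])
    result = []
    for i in range(m):
        row = []
        for j in range(n):
            # Dot product of row i of A with column j of B.
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            row.append(alpha * acc + beta * c[i][j])
        result.append(row)
    return result


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    c = [[1.0, 1.0], [1.0, 1.0]]
    print(dgemm(1.0, a, b, 0.0, c))
```

A real DGEMM benchmark relies on blocked, vectorized, multi-threaded kernels, which is where compiler tuning flags such as -march can matter.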

Coremark

This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite. It provides OpenCL / CUDA / OpenMP test cases for evaluating programming models in the context of autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

25 Results Shown

WebP Image Encode
SciMark:
    Composite
    Sparse Matrix Multiply
    Jacobi Successive Over-Relaxation
LibRaw
Crafty
GraphicsMagick:
    Sharpen
    Enhanced
    Resizing
dav1d
Kvazaar:
    Bosphorus 4K - Ultra Fast
    Bosphorus 1080p - Ultra Fast
VP9 libvpx Encoding:
    Speed 0
    Speed 5
ACES DGEMM
Coremark
C-Ray
Smallpt
AOBench
LAME MP3 Encoding
RNNoise
Darmstadt Automotive Parallel Heterogeneous Suite:
    OpenMP - NDT Mapping
    OpenMP - Points2Image
SQLite Speedtest
Hierarchical INTegration

znver3

Testing initiated at 3 December 2020 18:33 by user phoronix.

x86-64

Testing initiated at 4 December 2020 06:19 by user phoronix.

znver1

Testing initiated at 4 December 2020 10:12 by user phoronix.

znver2

Testing initiated at 4 December 2020 12:10 by user phoronix.

haswell

Testing initiated at 4 December 2020 14:56 by user phoronix.

skylake

Testing initiated at 4 December 2020 17:08 by user phoronix.
