GCC 9.1 PGO Optimizations AMD Threadripper Benchmarks

AMD Ryzen Threadripper 2990WX GCC 9 PGO benchmarks by Michael Larabel (Profile Guided Optimizations).

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

-O3 -march=native

Environment Notes: CXXFLAGS=-O3-march=native CFLAGS=-O3-march=native
Compiler Notes: --disable-multilib --enable-checing=release
Processor Notes: Scaling Governor: acpi-cpufreq ondemand
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

Testing initiated at 12 May 2019 09:23 by user phoronix.

-O3 -march=native + PGO

Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads), Motherboard: ASUS ROG ZENITH EXTREME (1701 BIOS), Chipset: AMD 17h, Memory: 32768MB, Disk: Samsung SSD 970 EVO 500GB, Graphics: AMD Radeon RX 64 8GB (1590/800MHz), Audio: Realtek ALC1220, Monitor: ASUS VP28U, Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad

OS: Ubuntu 18.04, Kernel: 4.18.0-18-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: amdgpu 18.1.0, OpenGL: 4.5 Mesa 18.2.8 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160

Testing initiated at 12 May 2019 19:00 by user phoronix.

GCC 9.1 PGO Optimizations AMD Threadripper

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

-O3 -march=native

-O3 -march=native + PGO

AOBench

AOM AV1

C-Ray

CppPerformanceBenchmarks

High Performance Conjugate Gradient

Himeno Benchmark

Memcached mcperf

SciMark

Smallpt

Stockfish

t-test1

Timed MAFFT Alignment

TSCP

x265

25 Results Shown

-O3 -march=native

-O3 -march=native + PGO