AMD Zen 2 GCC compiler benchmarks on Ubuntu Linux. Tests by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2003230-VE-1907117HV93
Znver2 GCC9 Compiler Tests
-O3 -march=x86-64:
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.2.0-999-generic (x86_64) 20190703, Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.2 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160
-O3 -march=znver1:
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.2.0-999-generic (x86_64) 20190703, Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.2 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160
-O3 -march=znver2:
Processor: AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0066 BIOS), Chipset: AMD Device 1480, Memory: 16384MB, Disk: 2000GB Force MP600, Graphics: Sapphire AMD Baffin [Polaris11] 4GB (1300/1750MHz), Audio: AMD Device aae0, Monitor: ASUS VP28U, Network: Realtek Device 8125 + Intel I211 + Intel Device 2723
OS: Ubuntu 18.04, Kernel: 5.2.0-999-generic (x86_64) 20190703, Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, Display Driver: modesetting 1.20.1, OpenGL: 4.5 Mesa 18.2.2 (LLVM 7.0.0), Compiler: GCC 9.1.0, File-System: ext4, Screen Resolution: 3840x2160
ryzen3800x4200:
Processor: AMD Ryzen 7 3800X 8-Core @ 4.20GHz (8 Cores / 16 Threads), Motherboard: ASUS TUF GAMING X570-PLUS_BR (1405 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 480GB Force MP510 + 320GB SAMSUNG HD322HJ, Graphics: NVIDIA GeForce GTX 960 2GB (1126/3505MHz), Audio: NVIDIA GM206 HD Audio, Network: Realtek RTL8111/8168/8411 + Broadcom BCM4360 802.11ac
OS: Ubuntu 20.04, Kernel: 5.4.0-18-generic (x86_64), Desktop: MATE 1.24.0, Display Server: X Server 1.20.7, Display Driver: NVIDIA 435.21, OpenGL: 4.6.0, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080
MKL-DNN 2019-04-16
Harness: Convolution Batch conv_alexnet - Data Type: f32
ms < Lower Is Better
-O3 -march=x86-64 . 2512.04 |=================================================
-O3 -march=znver1 . 2562.89 |==================================================
-O3 -march=znver2 . 2524.93 |=================================================
ryzen3800x4200 .... 331.47 |======
MKL-DNN 2019-04-16
Harness: Deconvolution Batch deconv_1d - Data Type: f32
ms < Lower Is Better
-O3 -march=x86-64 . 221.00 |===================================================
-O3 -march=znver1 . 219.29 |===================================================
-O3 -march=znver2 . 216.70 |==================================================
ryzen3800x4200 .... 31.01 |=======
SciMark 2.0
Computational Test: Composite
Mflops > Higher Is Better
-O3 -march=x86-64 . 2786.33 |======================================
-O3 -march=znver1 . 3128.65 |==========================================
-O3 -march=znver2 . 3700.64 |==================================================
ryzen3800x4200 .... 618.82 |========
SciMark 2.0
Computational Test: Monte Carlo
Mflops > Higher Is Better
-O3 -march=x86-64 . 766.81 |=================================================
-O3 -march=znver1 . 757.56 |================================================
-O3 -march=znver2 . 799.07 |===================================================
ryzen3800x4200 .... 153.28 |==========
SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops > Higher Is Better
-O3 -march=x86-64 . 3762.43 |==================================================
-O3 -march=znver1 . 3702.03 |=================================================
-O3 -march=znver2 . 3575.96 |================================================
ryzen3800x4200 .... 750.46 |==========
LAME MP3 Encoding 3.100
WAV To MP3
Seconds < Lower Is Better
-O3 -march=x86-64 . 7.160 |==============
-O3 -march=znver1 . 6.980 |=============
-O3 -march=znver2 . 7.040 |==============
ryzen3800x4200 .... 26.537 |===================================================
John The Ripper 1.9.0-jumbo-1
Test: Blowfish
Real C/S > Higher Is Better
-O3 -march=x86-64 . 28401 |====================================================
-O3 -march=znver1 . 28221 |====================================================
-O3 -march=znver2 . 20232 |=====================================
ryzen3800x4200 .... 13993 |==========================
SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
-O3 -march=x86-64 . 6959.69 |==============================
-O3 -march=znver1 . 8631.93 |=====================================
-O3 -march=znver2 . 11431.93 |=================================================
ryzen3800x4200 .... 607.72 |===
SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
-O3 -march=x86-64 . 2145.57 |============================================
-O3 -march=znver1 . 2291.62 |===============================================
-O3 -march=znver2 . 2422.10 |==================================================
ryzen3800x4200 .... 1259.58 |==========================
PostgreSQL pgbench 10.3
Scaling: Buffer Test - Test: Normal Load - Mode: Read Write
TPS > Higher Is Better
-O3 -march=x86-64 . 30143.80 |=================================================
-O3 -march=znver1 . 29380.50 |================================================
-O3 -march=znver2 . 30044.32 |=================================================
ryzen3800x4200 .... 16722.39 |===========================
C-Ray 1.1
Total Time - 4K, 16 Rays Per Pixel
Seconds < Lower Is Better
-O3 -march=x86-64 . 43.20 |=====================================
-O3 -march=znver1 . 39.42 |==================================
-O3 -march=znver2 . 39.46 |==================================
ryzen3800x4200 .... 60.32 |====================================================
Timed LLVM Compilation 6.0.1
Time To Compile
Seconds < Lower Is Better
-O3 -march=x86-64 . 281.90 |===================================
-O3 -march=znver1 . 284.24 |===================================
-O3 -march=znver2 . 286.56 |===================================
ryzen3800x4200 .... 415.04 |===================================================
PostgreSQL pgbench 10.3
Scaling: Buffer Test - Test: Normal Load - Mode: Read Only
TPS > Higher Is Better
-O3 -march=x86-64 . 385510.95 |================================================
-O3 -march=znver1 . 383329.30 |================================================
-O3 -march=znver2 . 382751.07 |================================================
ryzen3800x4200 .... 264125.57 |=================================
Stockfish 9
Total Time
Nodes Per Second > Higher Is Better
-O3 -march=x86-64 . 39537930 |=================================================
-O3 -march=znver1 . 39908751 |=================================================
-O3 -march=znver2 . 39471726 |================================================
ryzen3800x4200 .... 27465891 |==================================
7-Zip Compression 16.02
Compress Speed Test
MIPS > Higher Is Better
-O3 -march=x86-64 . 78655 |====================================================
-O3 -march=znver1 . 78412 |====================================================
-O3 -march=znver2 . 78562 |====================================================
ryzen3800x4200 .... 54535 |====================================
x264 2018-09-25
H.264 Video Encoding
Frames Per Second > Higher Is Better
-O3 -march=x86-64 . 143.27 |===================================================
-O3 -march=znver1 . 141.82 |==================================================
-O3 -march=znver2 . 140.96 |==================================================
ryzen3800x4200 .... 101.97 |====================================
FFTW 3.3.6
Build: Stock - Size: 1D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=x86-64 . 9534.70 |=========================================
-O3 -march=znver1 . 11448.00 |=================================================
-O3 -march=znver2 . 11231.00 |================================================
ryzen3800x4200 .... 8510.60 |====================================
FFTW 3.3.6
Build: Stock - Size: 2D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=x86-64 . 7039.43 |============================================
-O3 -march=znver1 . 7660.90 |================================================
-O3 -march=znver2 . 8001.67 |==================================================
ryzen3800x4200 .... 5968.30 |=====================================
XZ Compression 5.2.4
Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9
Seconds < Lower Is Better
-O3 -march=x86-64 . 25.22 |========================================
-O3 -march=znver1 . 25.09 |========================================
-O3 -march=znver2 . 25.21 |========================================
ryzen3800x4200 .... 32.87 |====================================================
GraphicsMagick 1.3.30
Operation: Sharpen
Iterations Per Minute > Higher Is Better
-O3 -march=x86-64 . 179 |==================================================
-O3 -march=znver1 . 193 |======================================================
-O3 -march=znver2 . 194 |======================================================
ryzen3800x4200 .... 156 |===========================================
SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops > Higher Is Better
-O3 -march=x86-64 . 297.13 |===============================================
-O3 -march=znver1 . 260.12 |=========================================
-O3 -march=znver2 . 274.11 |===========================================
ryzen3800x4200 .... 323.04 |===================================================
Memcached mcperf 1.5.10
Method: Get
Operations Per Second > Higher Is Better
-O3 -march=x86-64 . 107814.85 |==============================================
-O3 -march=znver1 . 112447.77 |================================================
-O3 -march=znver2 . 110755.89 |===============================================
ryzen3800x4200 .... 94377.50 |========================================
x265 3.0
H.265 1080p Video Encoding
Frames Per Second > Higher Is Better
-O3 -march=x86-64 . 53.33 |====================================================
-O3 -march=znver1 . 53.15 |====================================================
-O3 -march=znver2 . 52.91 |====================================================
ryzen3800x4200 .... 45.82 |=============================================
VP9 libvpx Encoding 1.8.0
vpxenc VP9 1080p Video Encode
Frames Per Second > Higher Is Better
-O3 -march=x86-64 . 175.99 |============================================
-O3 -march=znver1 . 174.40 |============================================
-O3 -march=znver2 . 175.34 |============================================
ryzen3800x4200 .... 202.41 |===================================================
AOBench
Size: 2048 x 2048 - Total Time
Seconds < Lower Is Better
-O3 -march=x86-64 . 36.49 |====================================================
-O3 -march=znver1 . 35.14 |==================================================
-O3 -march=znver2 . 34.64 |=================================================
ryzen3800x4200 .... 32.23 |==============================================
CppPerformanceBenchmarks 9
Test: Math Library
Seconds < Lower Is Better
-O3 -march=x86-64 . 312.01 |===================================================
-O3 -march=znver1 . 311.77 |===================================================
-O3 -march=znver2 . 309.02 |===================================================
ryzen3800x4200 .... 281.88 |==============================================
Bullet Physics Engine 2.81
Test: 1000 Stack
Seconds < Lower Is Better
-O3 -march=x86-64 . 4.080000 |================================================
-O3 -march=znver1 . 3.980000 |===============================================
-O3 -march=znver2 . 3.790000 |============================================
ryzen3800x4200 .... 4.189241 |=================================================
Redis 4.0.8
Test: SET
Requests Per Second > Higher Is Better
-O3 -march=x86-64 . 2074827.24 |===============================================
-O3 -march=znver1 . 2080013.12 |===============================================
-O3 -march=znver2 . 2089609.47 |===============================================
ryzen3800x4200 .... 1893390.85 |===========================================
FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver1 . 51757 |================================================
-O3 -march=znver2 . 56652 |====================================================
ryzen3800x4200 .... 54956 |==================================================
FFTW 3.3.6
Build: Float + SSE - Size: 2D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver1 . 19405 |===================================================
-O3 -march=znver2 . 19960 |====================================================
ryzen3800x4200 .... 18240 |================================================
Bullet Physics Engine 2.81
Test: 136 Ragdolls
Seconds < Lower Is Better
-O3 -march=x86-64 . 2.170000 |================================================
-O3 -march=znver1 . 2.130000 |===============================================
-O3 -march=znver2 . 2.050000 |=============================================
ryzen3800x4200 .... 2.229157 |=================================================
Bullet Physics Engine 2.81
Test: 3000 Fall
Seconds < Lower Is Better
-O3 -march=x86-64 . 3.370000 |===============================================
-O3 -march=znver1 . 3.360000 |===============================================
-O3 -march=znver2 . 3.230000 |=============================================
ryzen3800x4200 .... 3.509403 |=================================================
TSCP 1.81
AI Chess Performance
Nodes Per Second > Higher Is Better
-O3 -march=x86-64 . 1333926 |=================================================
-O3 -march=znver1 . 1372148 |==================================================
-O3 -march=znver2 . 1321681 |================================================
ryzen3800x4200 .... 1271423 |==============================================
Bullet Physics Engine 2.81
Test: 1000 Convex
Seconds < Lower Is Better
-O3 -march=x86-64 . 3.700000 |===============================================
-O3 -march=znver1 . 3.730000 |===============================================
-O3 -march=znver2 . 3.590000 |==============================================
ryzen3800x4200 .... 3.862570 |=================================================
CppPerformanceBenchmarks 9
Test: Function Objects
Seconds < Lower Is Better
-O3 -march=x86-64 . 14.90 |==================================================
-O3 -march=znver1 . 14.99 |==================================================
-O3 -march=znver2 . 14.56 |================================================
ryzen3800x4200 .... 15.63 |====================================================
GraphicsMagick 1.3.30
Operation: Rotate
Iterations Per Minute > Higher Is Better
-O3 -march=x86-64 . 261 |===================================================
-O3 -march=znver1 . 260 |===================================================
-O3 -march=znver2 . 276 |======================================================
ryzen3800x4200 .... 261 |===================================================
Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
-O3 -march=x86-64 . 1336.77 |==================================================
-O3 -march=znver1 . 1345.95 |==================================================
-O3 -march=znver2 . 1347.96 |==================================================
ryzen3800x4200 .... 1274.46 |===============================================
GraphicsMagick 1.3.30
Operation: Resizing
Iterations Per Minute > Higher Is Better
-O3 -march=x86-64 . 270 |===================================================
-O3 -march=znver1 . 279 |=====================================================
-O3 -march=znver2 . 285 |======================================================
ryzen3800x4200 .... 276 |====================================================
FLAC Audio Encoding 1.3.2
WAV To FLAC
Seconds < Lower Is Better
-O3 -march=x86-64 . 7.750 |=================================================
-O3 -march=znver1 . 8.150 |====================================================
-O3 -march=znver2 . 8.120 |====================================================
ryzen3800x4200 .... 7.922 |===================================================
Bullet Physics Engine 2.81
Test: Raytests
Seconds < Lower Is Better
-O3 -march=x86-64 . 2.090000 |================================================
-O3 -march=znver1 . 2.130000 |================================================
-O3 -march=znver2 . 2.060000 |===============================================
ryzen3800x4200 .... 2.154750 |=================================================
Timed PHP Compilation 7.1.9
Time To Compile
Seconds < Lower Is Better
-O3 -march=x86-64 . 52.89 |==================================================
-O3 -march=znver1 . 53.44 |===================================================
-O3 -march=znver2 . 53.41 |===================================================
ryzen3800x4200 .... 54.85 |====================================================
Memcached mcperf 1.5.10
Method: Set
Operations Per Second > Higher Is Better
-O3 -march=x86-64 . 60770.80 |===========================================
-O3 -march=znver1 . 60810.38 |===========================================
-O3 -march=znver2 . 69121.64 |=================================================
ryzen3800x4200 .... 63869.30 |=============================================
Redis 4.0.8
Test: GET
Requests Per Second > Higher Is Better
-O3 -march=x86-64 . 3026826.32 |=============================================
-O3 -march=znver1 . 3126726.23 |===============================================
-O3 -march=znver2 . 3090850.69 |==============================================
ryzen3800x4200 .... 2695667.50 |=========================================
MKL-DNN 2019-04-16
Harness: IP Batch 1D - Data Type: f32
ms < Lower Is Better
-O3 -march=x86-64 . 152.36 |=================================================
-O3 -march=znver1 . 159.95 |===================================================
-O3 -march=znver2 . 158.46 |===================================================
ryzen3800x4200 .... 22.64 |=======
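The result rows in this file share one layout: a configuration label, a run of dots, the measured value, and a bar. As a rough illustration of how the compiler-flag configurations can be compared, the following is a minimal sketch, assuming that row layout, which parses a block of rows and reports each configuration's percent change relative to a chosen baseline. The helper names parse_rows and percent_change are hypothetical and are not part of the Phoronix Test Suite; the sample rows are the SciMark 2.0 Composite results listed above.

```python
import re

# Assumed row layout, taken from the result rows in this file:
#   "<label> <one or more dots> <value> |<bar of '=' characters>"
ROW = re.compile(r"^(?P<label>.+?)\s+\.+\s+(?P<value>[\d.]+)\s+\|=*$")


def parse_rows(lines):
    """Return {label: value} for every line that matches the row layout."""
    results = {}
    for line in lines:
        match = ROW.match(line.strip())
        if match:
            results[match.group("label")] = float(match.group("value"))
    return results


def percent_change(results, baseline):
    """Percent change of each configuration relative to the baseline label.

    For "Higher Is Better" tests a positive number is an improvement;
    for "Lower Is Better" tests a negative number is an improvement.
    """
    base = results[baseline]
    return {
        label: (value - base) / base * 100.0
        for label, value in results.items()
        if label != baseline
    }


# SciMark 2.0 Composite rows (Mflops, higher is better), copied from above.
rows = [
    "-O3 -march=x86-64 . 2786.33 |======================================",
    "-O3 -march=znver1 . 3128.65 |==========================================",
    "-O3 -march=znver2 . 3700.64 |==================================================",
]

scores = parse_rows(rows)
changes = percent_change(scores, "-O3 -march=x86-64")
for label, pct in changes.items():
    print(f"{label}: {pct:+.1f}%")
```

The same two helpers can be applied to any other result block in this file. Only the sign convention changes, depending on whether the test is marked "Higher Is Better" or "Lower Is Better".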