AMD EPYC Compiler Tuning

GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1902194-SP-AMDEPYCCO19
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Audio Encoding 2 Tests
Bioinformatics 2 Tests
Chess Test Suite 2 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 23 Tests
CPU Massive 21 Tests
Creator Workloads 11 Tests
Encoding 7 Tests
HPC - High Performance Computing 3 Tests
Imaging 2 Tests
Common Kernel Benchmarks 2 Tests
Multi-Core 15 Tests
Programmer / Developer System Benchmarks 4 Tests
Renderers 2 Tests
Scientific Computing 3 Tests
Server CPU Tests 12 Tests
Single-Threaded 5 Tests
Video Encoding 5 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
-O0
February 15 2019
  5 Hours, 52 Minutes
-Og
February 19 2019
  5 Hours, 57 Minutes
-O1
February 16 2019
  6 Hours, 33 Minutes
-O2
February 16 2019
  6 Hours, 13 Minutes
-O2 -ftree-vectorize -ftree-slp-vectorize
February 18 2019
  5 Hours, 47 Minutes
-O2 -march=znver1
February 17 2019
  6 Hours, 29 Minutes
-O2 -flto
February 18 2019
  4 Hours, 46 Minutes
-O3
February 16 2019
  7 Hours, 15 Minutes
-O3 -march=znver1
February 15 2019
  4 Hours, 53 Minutes
-O3 -march=znver1 -flto
February 18 2019
  7 Hours, 2 Minutes
-Ofast -march=znver1
February 17 2019
  6 Hours, 52 Minutes
Invert Hiding All Results Option
  6 Hours, 9 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC Compiler Tuning GCC 9 compiler tuning benchmarks by Michael Larabel for a future article on Phoronix.com. ,,"-O0","-Og","-O1","-O2","-O2 -ftree-vectorize -ftree-slp-vectorize","-O2 -march=znver1","-O2 -flto","-O3","-O3 -march=znver1","-O3 -march=znver1 -flto","-Ofast -march=znver1" Processor,,2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads),2 x AMD EPYC 7601 32-Core (64 Cores / 128 Threads) Motherboard,,Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS),Dell 02MJ3T (1.2.5 BIOS) Chipset,,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h,AMD Family 17h Memory,,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2,16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2 Disk,,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860,120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860 Graphics,,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3,Matrox G200eW3 Monitor,,VE228,VE228,VE228,VE228,VE228,VE228,VE228,VE228,VE228,VE228,VE228 Network,,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe,2 x Broadcom BCM57416 NetXtreme-E 10GBase-T RDMA + 2 x Broadcom NetXtreme BCM5720 PCIe OS,,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04,Ubuntu 18.04 Kernel,,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210,5.0.0-050000rc6-generic (x86_64) 20190210 Desktop,,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3,GNOME Shell 3.28.3 Display Server,,X Server,X Server,X Server,X Server,X Server,X Server,X Server,X Server,X Server,X Server,X Server Compiler,,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210,GCC 9.0.1 20190210 File-System,,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4 Screen Resolution,,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200,1600x1200 ,,"-O0","-Og","-O1","-O2","-O2 -ftree-vectorize -ftree-slp-vectorize","-O2 -march=znver1","-O2 -flto","-O3","-O3 -march=znver1","-O3 -march=znver1 -flto","-Ofast -march=znver1" "SVT-AV1 - 1080p 8-bit YUV To AV1 Video Encode (FPS)",HIB,1.69,1.73,1.70,1.69,1.67,1.70,1.69,1.73,1.68,1.71,1.70 "VP9 libvpx Encoding - vpxenc VP9 1080p Video Encode (FPS)",HIB,12.50,12.52,12.53,12.54,12.56,12.42,,12.31,12.41,12.75,12.37 "x264 - H.264 Video Encoding (FPS)",HIB,102,142,145,144,144,144,,147,144,,144 "x265 - H.265 1080p Video Encoding (FPS)",HIB,35.00,34.76,35.62,34.55,35.41,34.80,35.07,35.21,35.57,,34.91 "SVT-AV1 - 1080p 8-bit YUV To AV1 Video Encode (FPS)",HIB,5.88,5.86,5.87,5.81,5.89,5.84,5.90,5.90,5.89,5.84,5.91 "SVT-VP9 - 1080p 8-bit YUV To VP9 Video Encode (FPS)",HIB,,92.68,,,94.82,95.91,95.79,,,97.26,97.80 "VP9 libvpx Encoding - vpxenc VP9 1080p Video Encode (FPS)",HIB,,20.39,,,20.34,20.05,,,,20.86,20.13 "GraphicsMagick - Operation: Swirl (Iterations/min)",HIB,96,181,194,195,196,196,196,189,195,194,196 "GraphicsMagick - Operation: Rotate (Iterations/min)",HIB,98,181,191,191,190,191,191,183,190,188,189 "GraphicsMagick - Operation: Sharpen (Iterations/min)",HIB,82,156,180,181,180,183,183,174,183,183,182 "GraphicsMagick - Operation: Enhanced (Iterations/min)",HIB,90,173,187,189,188,191,190,181,191,186,193 "GraphicsMagick - Operation: Resizing (Iterations/min)",HIB,74,120,126,131,128,127,128,118,127,125,124 "GraphicsMagick - Operation: Noise-Gaussian (Iterations/min)",HIB,92,168,179,180,178,180,180,172,180,178,187 "GraphicsMagick - Operation: HWB Color Space (Iterations/min)",HIB,102,195,210,211,212,211,214,203,210,209,209 "libjpeg-turbo tjbench - Test: Decompression Throughput (Megapixels/sec)",HIB,111,141,139,140,139,142,140,141,144,144,144 "FFTW - Build: Stock - Size: 2D FFT Size 4096 (Mflops)",HIB,1708,4366,4632,4625,4805,5074,5091,4751,5006,5571,4885 "FFTW - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops)",HIB,2193,12642,13468,13391,13285,13346,13214,13555,12752,13110,13166 "SciMark - Computational Test: Composite (Mflops)",HIB,434,1205,1519,1369,1724,1501,1307,1800,1961,1747,1825 "SciMark - Computational Test: Monte Carlo (Mflops)",HIB,108,210,576,560,560,557,568,560,557,1480,561 "SciMark - Computational Test: Fast Fourier Transform (Mflops)",HIB,201,257,226,230,231,229,232,232,227,230,221 "SciMark - Computational Test: Sparse Matrix Multiply (Mflops)",HIB,516,2188,2411,2527,2515,2584,2299,2475,2482,2052,2579 "SciMark - Computational Test: Dense LU Matrix Factorization (Mflops)",HIB,512,2539,3466,2609,4396,3231,2515,4307,4851,3300,4089 "SciMark - Computational Test: Jacobi Successive Over-Relaxation (Mflops)",HIB,832,919,919,919,919,1016,918,1427,1689,1675,1676 "Himeno Benchmark - Poisson Pressure Solver (MFLOPS)",HIB,383,772,785,1017,1007,1001,1022,1008,1011,1000,1022 "TSCP - AI Chess Performance (Nodes/s)",HIB,865459,865187,864102,864373,864916,864915,864101,864915,865732,863018,864373 "Stockfish - Total Time (Nodes/s)",HIB,105868175,105709690,105698092,104480422,104197865,106084276,104536605,104121840,106497994,,106507244 "Hierarchical INTegration - Test: FLOAT (QUIPs)",HIB,267404445,267368671,268455578,267311970,267172145,267268023,268173400,267315647,268506472,267239405,267055407 "Hierarchical INTegration - Test: DOUBLE (QUIPs)",HIB,598545342,597234266,585060029,599481605,602535297,617516626,626640400,595428047,589289926,618644101,605331833 "John The Ripper - Test: Blowfish (Real C/S)",HIB,15179,56453,65995,62718,63586,61309,65117,65806,66823,58764,62841 "John The Ripper - Test: Traditional DES (Real C/S)",HIB,218232000,239289333,257067200,257407667,257058000,255957000,260736667,253868583,260019667,254777333,258770667 "PostgreSQL pgbench - Scaling: Buffer Test - Test: Normal Load - Mode: Read Only (TPS)",HIB,419700,507203,515102,515340,529699,510425,520570,490551,505031,454256,508384 "PostgreSQL pgbench - Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS)",HIB,4585,3767,4301,4167,4239,4272,4095,4262,5068,4319,4102 "PostgreSQL pgbench - Scaling: Buffer Test - Test: Single Thread - Mode: Read Only (TPS)",HIB,9063,13333,13303,14931,15353,15111,14851,15099,15188,16012,15352 "PostgreSQL pgbench - Scaling: Buffer Test - Test: Single Thread - Mode: Read Write (TPS)",HIB,886,1080,1065,1037,1125,1060,1127,1079,1145,1074,1125 "ctx_clock - Context Switch Time (Clocks)",LIB,,132,,,132,,132,,,132, "Timed HMMer Search - Pfam Database Search (sec)",LIB,9.02,7.39,6.93,6.62,6.82,6.54,6.56,6.57,6.29,6.16,6.00 "Timed Apache Compilation - Time To Compile (sec)",LIB,11.43,14.59,17.51,23.82,24.03,23.82,26.50,26.08,25.94,28.62,26.11 "Timed ImageMagick Compilation - Time To Compile (sec)",LIB,5.23,7.89,18.42,23.63,23.91,23.78,98.67,25.06,24.88,118.48,25.21 "Timed PHP Compilation - Time To Compile (sec)",LIB,15.19,21.42,29.05,52.17,52.58,51.96,,78.19,78.13,, "C-Ray - Total Time - 4K, 16 Rays Per Pixel (sec)",LIB,44.92,28.64,28.74,25.84,25.77,21.58,25.96,12.60,11.35,11.31,10.40 "AOBench - Size: 2048 x 2048 - Total Time (sec)",LIB,92.50,77.41,56.61,55.54,55.53,54.35,55.52,53.53,51.49,52.08,51.73 "Bullet Physics Engine - Test: Raytests (sec)",LIB,3.11,3.11,3.12,3.12,3.11,3.10,3.05,3.11,3.09,,3.09 "Bullet Physics Engine - Test: 3000 Fall (sec)",LIB,5.14,5.15,5.16,5.14,5.14,5.08,5.21,5.16,5.07,,5.09 "Bullet Physics Engine - Test: 1000 Stack (sec)",LIB,6.00,6.02,6.00,6.01,5.98,5.80,6.32,6.05,5.80,,5.80 "Bullet Physics Engine - Test: 1000 Convex (sec)",LIB,5.36,5.37,5.37,5.37,5.37,5.19,5.40,5.38,5.18,,5.18 "Bullet Physics Engine - Test: 136 Ragdolls (sec)",LIB,3.14,3.14,3.15,3.14,3.15,3.05,3.24,3.15,3.06,,3.06 "Bullet Physics Engine - Test: Prim Trimesh (sec)",LIB,1.11,1.11,1.11,1.11,1.11,1.11,1.09,1.11,1.11,,1.11 "Bullet Physics Engine - Test: Convex Trimesh (sec)",LIB,1.35,1.35,1.35,1.35,1.35,1.32,1.33,1.35,1.32,,1.32 "Zstd Compression - Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (sec)",LIB,23.12,14.39,14.11,14.48,13.67,14.71,14.08,13.66,14.37,13.16,13.77 "FLAC Audio Encoding - WAV To FLAC (sec)",LIB,96.77,15.58,15.01,13.65,13.70,13.89,13.64,13.61,13.85,14.21,13.95 "LAME MP3 Encoding - WAV To MP3 (sec)",LIB,41.79,16.78,14.32,14.07,10.96,14.00,14.14,10.84,10.57,10.38,9.80