GCC 8.0 vs. Clang 6.0 AMD EPYC Tuning Comparison

Tests for a future article on Phoronix.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1801022-AL-CLANGCC8680
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Audio Encoding 2 Tests
Bioinformatics 2 Tests
C/C++ Compiler Tests 11 Tests
CPU Massive 9 Tests
Creator Workloads 4 Tests
Encoding 2 Tests
HPC - High Performance Computing 3 Tests
Multi-Core 3 Tests
Scientific Computing 3 Tests
Server 3 Tests
Server CPU Tests 2 Tests
Single-Threaded 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 8.0: -march=x86-64
January 02 2018
 
GCC 8.0: -march=znver1
January 02 2018
 
Clang 6.0: -march=x86-64
January 02 2018
 
Clang 6.0: -march=znver1
January 01 2018
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCC 8.0 vs. Clang 6.0 AMD EPYC Tuning Comparison Tests for a future article on Phoronix. Clang 6.0: -march=znver1: Processor: AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores), Motherboard: TYAN B8026T70AE24HR, Chipset: AMD Device 1450, Memory: 126976MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe OS: Ubuntu 17.10, Kernel: 4.13.0-21-generic (x86_64), Desktop: GNOME Shell 3.26.1, Display Driver: modesetting 1.19.5, OpenCL: OpenCL 1.2 pocl 1.0 LLVM 5.0.0, Compiler: Clang 6.0.0 (SVN 321623) + LLVM 6.0.0svn, File-System: ext4, Screen Resolution: 1920x1080, System Layer: vm-other Xen 4.9.0 Hypervisor Clang 6.0: -march=x86-64: Processor: AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores), Motherboard: TYAN B8026T70AE24HR, Chipset: AMD Device 1450, Memory: 126976MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe OS: Ubuntu 17.10, Kernel: 4.13.0-21-generic (x86_64), Desktop: GNOME Shell 3.26.1, Display Driver: modesetting 1.19.5, OpenCL: OpenCL 1.2 pocl 1.0 LLVM 5.0.0, Compiler: Clang 6.0.0 (SVN 321623) + LLVM 6.0.0svn, File-System: ext4, Screen Resolution: 1920x1080, System Layer: vm-other Xen 4.9.0 Hypervisor GCC 8.0: -march=znver1: Processor: AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores), Motherboard: TYAN B8026T70AE24HR, Chipset: AMD Device 1450, Memory: 126976MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe OS: Ubuntu 17.10, Kernel: 4.13.0-21-generic (x86_64), Desktop: GNOME Shell 3.26.1, Display Driver: modesetting 1.19.5, OpenCL: OpenCL 1.2 pocl 1.0 LLVM 5.0.0, Compiler: GCC 8.0.0 20171231 + clang (GCC) 8.0.0 20171231 (experimental) + LLVM 5.0.0, File-System: ext4, Screen Resolution: 1920x1080, System Layer: vm-other Xen 4.9.0 Hypervisor GCC 8.0: -march=x86-64: Processor: AMD EPYC 7601 32-Core @ 2.20GHz (64 Cores), Motherboard: TYAN B8026T70AE24HR, Chipset: AMD Device 1450, Memory: 126976MB, Disk: 280GB INTEL SSDPE21D280GA, Graphics: ASPEED ASPEED Family, Monitor: VE228, Network: Broadcom Limited NetXtreme BCM5720 Gigabit PCIe OS: Ubuntu 17.10, Kernel: 4.13.0-21-generic (x86_64), Desktop: GNOME Shell 3.26.1, Display Driver: modesetting 1.19.5, OpenCL: OpenCL 1.2 pocl 1.0 LLVM 5.0.0, Compiler: GCC 8.0.0 20171231 + clang (GCC) 8.0.0 20171231 (experimental) + LLVM 5.0.0, File-System: ext4, Screen Resolution: 1920x1080, System Layer: vm-other Xen 4.9.0 Hypervisor C-Ray 1.1 Total Time Seconds < Lower Is Better Clang 6.0: -march=znver1 . 4.48 |============================================= Clang 6.0: -march=x86-64 . 4.53 |============================================== GCC 8.0: -march=znver1 ... 3.37 |================================== GCC 8.0: -march=x86-64 ... 3.93 |======================================== GraphicsMagick 1.3.19 Operation: Sharpen Iterations Per Minute > Higher Is Better Clang 6.0: -march=znver1 . 136 |======================================= Clang 6.0: -march=x86-64 . 131 |===================================== GCC 8.0: -march=znver1 ... 165 |=============================================== GCC 8.0: -march=x86-64 ... 157 |============================================= GraphicsMagick 1.3.19 Operation: HWB Color Space Iterations Per Minute > Higher Is Better Clang 6.0: -march=znver1 . 155 |======================================= Clang 6.0: -march=x86-64 . 150 |====================================== GCC 8.0: -march=znver1 ... 186 |=============================================== GCC 8.0: -march=x86-64 ... 177 |============================================= GraphicsMagick 1.3.19 Operation: Blur Iterations Per Minute > Higher Is Better Clang 6.0: -march=znver1 . 104 |======================================== Clang 6.0: -march=x86-64 . 101 |======================================= GCC 8.0: -march=znver1 ... 123 |=============================================== GCC 8.0: -march=x86-64 ... 116 |============================================ FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Mflops > Higher Is Better Clang 6.0: -march=znver1 . 5031.60 |====================================== Clang 6.0: -march=x86-64 . 4660.83 |==================================== GCC 8.0: -march=znver1 ... 5627.83 |=========================================== GCC 8.0: -march=x86-64 ... 4959.73 |====================================== FLAC Audio Encoding 1.3.1 WAV To FLAC Seconds < Lower Is Better Clang 6.0: -march=znver1 . 6.63 |====================================== Clang 6.0: -march=x86-64 . 7.94 |============================================== GCC 8.0: -march=znver1 ... 7.45 |=========================================== GCC 8.0: -march=x86-64 ... 7.12 |========================================= LAME MP3 Encoding 3.99.5 WAV To MP3 Seconds < Lower Is Better Clang 6.0: -march=znver1 . 12.81 |============================================= Clang 6.0: -march=x86-64 . 11.33 |======================================== GCC 8.0: -march=znver1 ... 10.81 |====================================== GCC 8.0: -march=x86-64 ... 11.10 |======================================= Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better Clang 6.0: -march=znver1 . 1052.47 |=========================================== Clang 6.0: -march=x86-64 . 1032.71 |========================================== GCC 8.0: -march=znver1 ... 935.64 |====================================== GCC 8.0: -march=x86-64 ... 949.19 |======================================= FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Mflops > Higher Is Better Clang 6.0: -march=znver1 . 12481 |========================================= Clang 6.0: -march=x86-64 . 13649 |============================================= GCC 8.0: -march=znver1 ... 13630 |============================================= PolyBench-C 3.2 Test: 3 Matrix Multiplications Seconds < Lower Is Better Clang 6.0: -march=znver1 . 62.75 |=========================================== Clang 6.0: -march=x86-64 . 62.98 |=========================================== GCC 8.0: -march=znver1 ... 65.45 |============================================= GCC 8.0: -march=x86-64 ... 60.68 |========================================== GraphicsMagick 1.3.19 Operation: Local Adaptive Thresholding Iterations Per Minute > Higher Is Better Clang 6.0: -march=znver1 . 98 |================================================ Clang 6.0: -march=x86-64 . 97 |================================================ GCC 8.0: -march=znver1 ... 95 |=============================================== GCC 8.0: -march=x86-64 ... 92 |============================================= ebizzy 0.3 Records/s > Higher Is Better Clang 6.0: -march=znver1 . 1145405 |=========================================== Clang 6.0: -march=x86-64 . 1076648 |======================================== GCC 8.0: -march=znver1 ... 1101176 |========================================= GCC 8.0: -march=x86-64 ... 1126032 |========================================== SQLite 3.8.10.2 Test Target: Default Test Directory Seconds < Lower Is Better Clang 6.0: -march=znver1 . 7.48 |============================================= Clang 6.0: -march=x86-64 . 7.53 |============================================== GCC 8.0: -march=znver1 ... 7.16 |=========================================== GCC 8.0: -march=x86-64 ... 7.61 |============================================== Bullet Physics Engine 2.81 Test: 1000 Stack Seconds < Lower Is Better Clang 6.0: -march=znver1 . 6.08 |============================================ Clang 6.0: -march=x86-64 . 6.30 |============================================== GCC 8.0: -march=znver1 ... 5.93 |=========================================== GCC 8.0: -march=x86-64 ... 6.18 |============================================= SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better Clang 6.0: -march=znver1 . 552.19 |=========================================== Clang 6.0: -march=x86-64 . 531.38 |========================================== GCC 8.0: -march=znver1 ... 555.76 |============================================ GCC 8.0: -march=x86-64 ... 561.03 |============================================ Bullet Physics Engine 2.81 Test: Raytests Seconds < Lower Is Better Clang 6.0: -march=znver1 . 3.18 |============================================= Clang 6.0: -march=x86-64 . 3.22 |============================================== GCC 8.0: -march=znver1 ... 3.06 |============================================ GCC 8.0: -march=x86-64 ... 3.12 |============================================= TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better Clang 6.0: -march=znver1 . 918269 |============================================ Clang 6.0: -march=x86-64 . 917658 |============================================ GCC 8.0: -march=znver1 ... 875085 |========================================== GCC 8.0: -march=x86-64 ... 874251 |========================================== Bullet Physics Engine 2.81 Test: 3000 Fall Seconds < Lower Is Better Clang 6.0: -march=znver1 . 5.34 |============================================= Clang 6.0: -march=x86-64 . 5.48 |============================================== GCC 8.0: -march=znver1 ... 5.27 |============================================ GCC 8.0: -march=x86-64 ... 5.34 |============================================= SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better Clang 6.0: -march=znver1 . 2258.64 |=========================================== Clang 6.0: -march=x86-64 . 2190.10 |========================================== GCC 8.0: -march=znver1 ... 2259.95 |=========================================== GCC 8.0: -march=x86-64 ... 2263.87 |=========================================== Apache Benchmark 2.4.7 Static Web Page Serving Requests Per Second > Higher Is Better Clang 6.0: -march=znver1 . 9663.93 |========================================== Clang 6.0: -march=x86-64 . 9531.43 |========================================== GCC 8.0: -march=znver1 ... 9791.23 |=========================================== GCC 8.0: -march=x86-64 ... 9841.30 |=========================================== Bullet Physics Engine 2.81 Test: Convex Trimesh Seconds < Lower Is Better Clang 6.0: -march=znver1 . 1.32 |============================================= Clang 6.0: -march=x86-64 . 1.33 |============================================== GCC 8.0: -march=znver1 ... 1.30 |============================================= GCC 8.0: -march=x86-64 ... 1.34 |============================================== Bullet Physics Engine 2.81 Test: 1000 Convex Seconds < Lower Is Better Clang 6.0: -march=znver1 . 5.31 |============================================= Clang 6.0: -march=x86-64 . 5.43 |============================================== GCC 8.0: -march=znver1 ... 5.28 |============================================= GCC 8.0: -march=x86-64 ... 5.44 |============================================== Bullet Physics Engine 2.81 Test: 136 Ragdolls Seconds < Lower Is Better Clang 6.0: -march=znver1 . 3.23 |============================================= Clang 6.0: -march=x86-64 . 3.28 |============================================== GCC 8.0: -march=znver1 ... 3.19 |============================================= GCC 8.0: -march=x86-64 ... 3.26 |============================================== Bullet Physics Engine 2.81 Test: Prim Trimesh Seconds < Lower Is Better Clang 6.0: -march=znver1 . 1.09 |============================================== Clang 6.0: -march=x86-64 . 1.10 |============================================== GCC 8.0: -march=znver1 ... 1.10 |============================================== GCC 8.0: -march=x86-64 ... 1.10 |============================================== SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better Clang 6.0: -march=znver1 . 1424.21 |===================================== Clang 6.0: -march=x86-64 . 1110.65 |============================ GCC 8.0: -march=znver1 ... 1676.62 |=========================================== GCC 8.0: -march=x86-64 ... 1423.14 |==================================== SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better Clang 6.0: -march=znver1 . 4034.89 |=========================================== Clang 6.0: -march=x86-64 . 3190.43 |================================== GCC 8.0: -march=znver1 ... 3678.86 |======================================= GCC 8.0: -march=x86-64 ... 3513.11 |===================================== SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better Clang 6.0: -march=znver1 . 226.68 |=========================================== Clang 6.0: -march=x86-64 . 179.29 |================================== GCC 8.0: -march=znver1 ... 231.09 |=========================================== GCC 8.0: -march=x86-64 ... 233.89 |============================================ SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better Clang 6.0: -march=znver1 . 1699.32 |=========================================== Clang 6.0: -march=x86-64 . 1479.53 |===================================== GCC 8.0: -march=znver1 ... 1680.45 |=========================================== GCC 8.0: -march=x86-64 ... 1579.48 |======================================== Timed HMMer Search 2.3.2 Pfam Database Search Seconds < Lower Is Better Clang 6.0: -march=znver1 . 11.09 |===================================== Clang 6.0: -march=x86-64 . 12.85 |========================================== GCC 8.0: -march=znver1 ... 12.40 |========================================= GCC 8.0: -march=x86-64 ... 13.65 |=============================================