AMD AOCC 1.3 Compiler Benchmarks vs. GCC vs. LLVM Clang 2 x AMD EPYC 7601 compiler benchmarks on a future article for Phoronix.com by Michael Larabel. AMD AOCC 1.3: Processor: 2 x AMD EPYC 7601 32-Core @ 3.10GHz (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 516096MB, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox Matrox G200eW3, Monitor: VE228, Network: Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA OS: Ubuntu 18.10, Kernel: 4.19.0-041900-generic (x86_64), Compiler: Clang 7.0.0, File-System: ext4, Screen Resolution: 1600x1200 LLVM Clang 7.0: Processor: 2 x AMD EPYC 7601 32-Core @ 3.10GHz (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 516096MB, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox Matrox G200eW3, Monitor: VE228, Network: Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA OS: Ubuntu 18.10, Kernel: 4.19.0-041900-generic (x86_64), Compiler: Clang 7.0.0-3 + LLVM 7.0.0, File-System: ext4, Screen Resolution: 1600x1200 GCC 8.2.0: Processor: 2 x AMD EPYC 7601 32-Core @ 3.10GHz (64 Cores / 128 Threads), Motherboard: Dell 02MJ3T (1.2.5 BIOS), Chipset: AMD Family 17h, Memory: 516096MB, Disk: 120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860, Graphics: Matrox Matrox G200eW3, Monitor: VE228, Network: Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA OS: Ubuntu 18.10, Kernel: 4.19.0-041900-generic (x86_64), Compiler: GCC 8.2.0, File-System: ext4, Screen Resolution: 1600x1200 Rodinia 2.4 Test: OpenMP Streamcluster Seconds < Lower Is Better AMD AOCC 1.3 ... 22.41 |===================================================== LLVM Clang 7.0 . 21.81 |=================================================== GCC 8.2.0 ...... 23.33 |======================================================= High Performance Conjugate Gradient 3.0 GFLOP/s > Higher Is Better AMD AOCC 1.3 ... 0.60 |======================================================== LLVM Clang 7.0 . 0.50 |=============================================== GCC 8.2.0 ...... 0.50 |=============================================== PolyBench-C 4.2 Test: Correlation Computation Seconds < Lower Is Better AMD AOCC 1.3 ... 5.33 |====================================================== LLVM Clang 7.0 . 5.53 |======================================================== GCC 8.2.0 ...... 5.36 |====================================================== PolyBench-C 4.2 Test: 3 Matrix Multiplications Seconds < Lower Is Better AMD AOCC 1.3 ... 3.65 |================================================= LLVM Clang 7.0 . 4.13 |======================================================== GCC 8.2.0 ...... 3.70 |================================================== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 2048 Mflops > Higher Is Better AMD AOCC 1.3 ... 26619 |====================================================== LLVM Clang 7.0 . 26574 |===================================================== GCC 8.2.0 ...... 27361 |======================================================= FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 2048 Mflops > Higher Is Better AMD AOCC 1.3 ... 15056 |======================================================= LLVM Clang 7.0 . 14580 |===================================================== GCC 8.2.0 ...... 14829 |====================================================== Timed HMMer Search 2.3.2 Pfam Database Search Seconds < Lower Is Better AMD AOCC 1.3 ... 6.69 |======================================================== LLVM Clang 7.0 . 6.12 |=================================================== GCC 8.2.0 ...... 6.52 |======================================================= Timed MAFFT Alignment 7.392 Multiple Sequence Alignment Seconds < Lower Is Better AMD AOCC 1.3 ... 3.53 |====================================================== LLVM Clang 7.0 . 3.40 |==================================================== GCC 8.2.0 ...... 3.66 |======================================================== Fhourstones 3.1 Complex Connect-4 Solving Kpos / sec > Higher Is Better AMD AOCC 1.3 ... 10799 |====================================================== LLVM Clang 7.0 . 10905 |======================================================= GCC 8.2.0 ...... 10431 |===================================================== SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better AMD AOCC 1.3 ... 1766 |=================================================== LLVM Clang 7.0 . 1925 |======================================================== GCC 8.2.0 ...... 1796 |==================================================== SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better AMD AOCC 1.3 ... 552 |========================================================= LLVM Clang 7.0 . 552 |========================================================= GCC 8.2.0 ...... 556 |========================================================= SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better AMD AOCC 1.3 ... 226 |======================================================== LLVM Clang 7.0 . 218 |====================================================== GCC 8.2.0 ...... 231 |========================================================= SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better AMD AOCC 1.3 ... 2827 |======================================================== LLVM Clang 7.0 . 2513 |================================================== GCC 8.2.0 ...... 2554 |=================================================== SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better AMD AOCC 1.3 ... 3795 |=========================================== LLVM Clang 7.0 . 4911 |======================================================== GCC 8.2.0 ...... 3953 |============================================= SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better AMD AOCC 1.3 ... 1430 |=============================================== LLVM Clang 7.0 . 1429 |=============================================== GCC 8.2.0 ...... 1688 |======================================================== TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better AMD AOCC 1.3 ... 892960 |===================================================== LLVM Clang 7.0 . 911886 |====================================================== GCC 8.2.0 ...... 853137 |=================================================== x264 2018-09-25 H.264 Video Encoding Frames Per Second > Higher Is Better AMD AOCC 1.3 ... 145 |======================================================== LLVM Clang 7.0 . 147 |========================================================= GCC 8.2.0 ...... 143 |======================================================= Stockfish 9 Total Time Nodes Per Second > Higher Is Better AMD AOCC 1.3 ... 107516593 |=================================================== LLVM Clang 7.0 . 107358539 |=================================================== GCC 8.2.0 ...... 102909858 |================================================= Swet 1.5.16 Average Operations Per Second > Higher Is Better AMD AOCC 1.3 ... 230056272 |======================== LLVM Clang 7.0 . 212870591 |====================== GCC 8.2.0 ...... 493058080 |=================================================== Timed Apache Compilation 2.4.7 Time To Compile Seconds < Lower Is Better AMD AOCC 1.3 ... 36.18 |======================================================= LLVM Clang 7.0 . 25.07 |====================================== GCC 8.2.0 ...... 25.56 |======================================= Timed LLVM Compilation 6.0.1 Time To Compile Seconds < Lower Is Better AMD AOCC 1.3 ... 198 |========================================================= LLVM Clang 7.0 . 145 |========================================== GCC 8.2.0 ...... 150 |=========================================== Timed PHP Compilation 7.1.9 Time To Compile Seconds < Lower Is Better AMD AOCC 1.3 ... 179.00 |====================================================== LLVM Clang 7.0 . 109.00 |================================= GCC 8.2.0 ...... 69.33 |===================== Primesieve 7.1 1e12 Prime Number Generation Seconds < Lower Is Better AMD AOCC 1.3 ... 5.81 |==================================================== LLVM Clang 7.0 . 5.78 |==================================================== GCC 8.2.0 ...... 6.25 |======================================================== AOBench Size: 2048 x 2048 - Total Time Seconds < Lower Is Better AMD AOCC 1.3 ... 45.19 |=================================================== LLVM Clang 7.0 . 48.46 |======================================================= GCC 8.2.0 ...... 44.51 |=================================================== Bullet Physics Engine 2.81 Test: Raytests Seconds < Lower Is Better AMD AOCC 1.3 ... 3.14 |======================================================== LLVM Clang 7.0 . 3.15 |======================================================== GCC 8.2.0 ...... 3.14 |======================================================== Bullet Physics Engine 2.81 Test: 3000 Fall Seconds < Lower Is Better AMD AOCC 1.3 ... 5.07 |====================================================== LLVM Clang 7.0 . 5.23 |======================================================== GCC 8.2.0 ...... 5.07 |====================================================== Bullet Physics Engine 2.81 Test: 1000 Stack Seconds < Lower Is Better AMD AOCC 1.3 ... 5.87 |===================================================== LLVM Clang 7.0 . 6.15 |======================================================== GCC 8.2.0 ...... 5.81 |===================================================== Bullet Physics Engine 2.81 Test: 1000 Convex Seconds < Lower Is Better AMD AOCC 1.3 ... 5.23 |======================================================= LLVM Clang 7.0 . 5.26 |======================================================= GCC 8.2.0 ...... 5.37 |======================================================== Bullet Physics Engine 2.81 Test: 136 Ragdolls Seconds < Lower Is Better AMD AOCC 1.3 ... 3.16 |====================================================== LLVM Clang 7.0 . 3.29 |======================================================== GCC 8.2.0 ...... 3.11 |===================================================== Bullet Physics Engine 2.81 Test: Prim Trimesh Seconds < Lower Is Better AMD AOCC 1.3 ... 1.10 |====================================================== LLVM Clang 7.0 . 1.13 |======================================================== GCC 8.2.0 ...... 1.14 |======================================================== Bullet Physics Engine 2.81 Test: Convex Trimesh Seconds < Lower Is Better AMD AOCC 1.3 ... 1.33 |======================================================= LLVM Clang 7.0 . 1.35 |======================================================== GCC 8.2.0 ...... 1.35 |======================================================== Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 Seconds < Lower Is Better AMD AOCC 1.3 ... 13.47 |==================================================== LLVM Clang 7.0 . 14.29 |======================================================= GCC 8.2.0 ...... 13.48 |==================================================== FLAC Audio Encoding 1.3.2 WAV To FLAC Seconds < Lower Is Better AMD AOCC 1.3 ... 12.84 |===================================================== LLVM Clang 7.0 . 12.23 |================================================== GCC 8.2.0 ...... 13.32 |======================================================= Mencoder 1.3.0 AVI To LAVC Seconds < Lower Is Better AMD AOCC 1.3 ... 22.20 |======================================================= LLVM Clang 7.0 . 22.33 |======================================================= GCC 8.2.0 ...... 22.29 |======================================================= Minion 1.8 Benchmark: Graceful Seconds < Lower Is Better AMD AOCC 1.3 ... 55.19 |====================================================== LLVM Clang 7.0 . 55.67 |====================================================== GCC 8.2.0 ...... 56.41 |======================================================= Minion 1.8 Benchmark: Solitaire Seconds < Lower Is Better AMD AOCC 1.3 ... 86.13 |======================================================= LLVM Clang 7.0 . 83.51 |===================================================== GCC 8.2.0 ...... 86.81 |======================================================= Minion 1.8 Benchmark: Quasigroup Seconds < Lower Is Better AMD AOCC 1.3 ... 142 |======================================================= LLVM Clang 7.0 . 142 |======================================================= GCC 8.2.0 ...... 147 |========================================================= Aircrack-ng 1.3 k/s > Higher Is Better AMD AOCC 1.3 ... 83435 |======================================================= LLVM Clang 7.0 . 83710 |======================================================= GCC 8.2.0 ...... 81752 |====================================================== libjpeg-turbo tjbench 1.5.3 Test: Decompression Throughput Megapixels/sec > Higher Is Better AMD AOCC 1.3 ... 145 |======================================================== LLVM Clang 7.0 . 141 |======================================================= GCC 8.2.0 ...... 147 |========================================================= Redis 4.0.8 Test: GET Requests Per Second > Higher Is Better AMD AOCC 1.3 ... 2150203 |=================================================== LLVM Clang 7.0 . 2150558 |=================================================== GCC 8.2.0 ...... 2235376 |===================================================== Redis 4.0.8 Test: SET Requests Per Second > Higher Is Better AMD AOCC 1.3 ... 1405199 |==================================================== LLVM Clang 7.0 . 1364289 |=================================================== GCC 8.2.0 ...... 1419694 |===================================================== Memcached mcperf 1.5.10 Method: Get Operations Per Second > Higher Is Better AMD AOCC 1.3 ... 67604 |======================================================= LLVM Clang 7.0 . 58078 |=============================================== GCC 8.2.0 ...... 63050 |=================================================== Memcached mcperf 1.5.10 Method: Set Operations Per Second > Higher Is Better AMD AOCC 1.3 ... 43395 |====================================== LLVM Clang 7.0 . 57082 |================================================== GCC 8.2.0 ...... 62454 |======================================================= Hierarchical INTegration 1.0 Test: DOUBLE QUIPs > Higher Is Better AMD AOCC 1.3 ... 624671701 |=================================================== LLVM Clang 7.0 . 618373040 |================================================== GCC 8.2.0 ...... 615241438 |==================================================