LLVM Clang 3.4 Intel Core i7 Haswell Core-AVX2 Intel Core i7-4770K Haswell testing of GCC 4.8.1 and an early GCC 4.9.0 compiler snapshot along with LLVM Clang 3.3 and an LLVM Clang 3.4 development snapshot. Testing with CFLAGS/CXXFLAGS for the Intel Core i7 Haswell CPU of -O3 and -march=core-avx2. Benchmarking by Michael Larabel for a future article on phoronix.com GCC 4.8.1: Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores), Motherboard: Intel DH87RL, Chipset: Intel 4th Gen Core DRAM, Memory: 15360MB, Disk: 240GB OCZ VERTEX3, Graphics: Intel Haswell Desktop, Audio: Intel Haswell HDMI, Monitor: VA2431, Network: Intel Connection I217-V OS: Ubuntu 13.10, Kernel: 3.10.0-999-generic (x86_64), Desktop: KDE 4.10.4, Display Server: X Server 1.13.3, Display Driver: intel 2.21.9, OpenGL: 3.0 Mesa 9.2.0-devel (git-bbd2d57), Compiler: GCC 4.8.1 + Clang 3.3 + LLVM 3.3, File-System: ext4, Screen Resolution: 1920x1080 GCC 4.9.0 20130623: Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores), Motherboard: Intel DH87RL, Chipset: Intel 4th Gen Core DRAM, Memory: 15360MB, Disk: 240GB OCZ VERTEX3, Graphics: Intel Haswell Desktop, Audio: Intel Haswell HDMI, Monitor: VA2431, Network: Intel Connection I217-V OS: Ubuntu 13.10, Kernel: 3.10.0-999-generic (x86_64), Desktop: KDE 4.10.4, Display Server: X Server 1.13.3, Display Driver: intel 2.21.9, OpenGL: 3.0 Mesa 9.2.0-devel (git-bbd2d57), Compiler: GCC 4.9.0 20130623 + Clang 3.3 + LLVM 3.3, File-System: ext4, Screen Resolution: 1920x1080 LLVM Clang 3.3: Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores), Motherboard: Intel DH87RL, Chipset: Intel 4th Gen Core DRAM, Memory: 15360MB, Disk: 240GB OCZ VERTEX3, Graphics: Intel Haswell Desktop, Audio: Intel Haswell HDMI, Monitor: VA2431, Network: Intel Connection I217-V OS: Ubuntu 13.10, Kernel: 3.10.0-999-generic (x86_64), Desktop: KDE 4.10.4, Display Server: X Server 1.13.3, Display Driver: intel 2.21.9, OpenGL: 3.0 Mesa 9.2.0-devel (git-bbd2d57), Compiler: Clang 3.3 + LLVM 3.3, File-System: ext4, Screen Resolution: 1920x1080 LLVM 3.4 SVN 20130626: Processor: Intel Core i7-4770K @ 3.50GHz (8 Cores), Motherboard: Intel DH87RL, Chipset: Intel 4th Gen Core DRAM, Memory: 15360MB, Disk: 240GB OCZ VERTEX3, Graphics: Intel Haswell Desktop, Audio: Intel Haswell HDMI, Monitor: VA2431, Network: Intel Connection I217-V OS: Ubuntu 13.10, Kernel: 3.10.0-999-generic (x86_64), Desktop: KDE 4.10.4, Display Server: X Server 1.13.3, Display Driver: intel 2.21.9, OpenGL: 3.0 Mesa 9.2.0-devel (git-bbd2d57), Compiler: Clang 3.4 (SVN 185044) + LLVM 3.4svn, File-System: ext4, Screen Resolution: 1920x1080 Timed HMMer Search 2.3.2 Pfam Database Search Seconds < Lower Is Better GCC 4.8.1 ............. 10.47 |============================================== GCC 4.9.0 20130623 .... 10.40 |============================================== LLVM Clang 3.3 ........ 10.81 |================================================ LLVM 3.4 SVN 20130626 . 10.70 |================================================ Timed MAFFT Alignment 6.864 Multiple Sequence Alignment Seconds < Lower Is Better GCC 4.8.1 ............. 5.47 |============================================ GCC 4.9.0 20130623 .... 5.52 |============================================ LLVM Clang 3.3 ........ 6.09 |================================================= LLVM 3.4 SVN 20130626 . 5.98 |================================================ BLAKE2 20121223 Phoronix Test Suite v4.8.0m1 Cycles Per Byte < Lower Is Better GCC 4.8.1 ............. 5.71 |==================================== GCC 4.9.0 20130623 .... 5.29 |================================= LLVM Clang 3.3 ........ 7.40 |============================================== LLVM 3.4 SVN 20130626 . 7.81 |================================================= SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better GCC 4.8.1 ............. 1009.99 |======================================= GCC 4.9.0 20130623 .... 1005.56 |====================================== LLVM Clang 3.3 ........ 1102.35 |========================================== LLVM 3.4 SVN 20130626 . 1204.77 |============================================== SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better GCC 4.8.1 ............. 242.66 |============================================== GCC 4.9.0 20130623 .... 248.01 |=============================================== LLVM Clang 3.3 ........ 249.83 |=============================================== LLVM 3.4 SVN 20130626 . 239.90 |============================================= SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better GCC 4.8.1 ............. 1204.00 |============================================= GCC 4.9.0 20130623 .... 1148.95 |=========================================== LLVM Clang 3.3 ........ 1228.19 |============================================== LLVM 3.4 SVN 20130626 . 1182.30 |============================================ SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better GCC 4.8.1 ............. 1825.43 |=================================== GCC 4.9.0 20130623 .... 1851.86 |==================================== LLVM Clang 3.3 ........ 1755.85 |================================== LLVM 3.4 SVN 20130626 . 2397.89 |============================================== SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better GCC 4.8.1 ............. 1169.09 |================================ GCC 4.9.0 20130623 .... 1170.18 |================================ LLVM Clang 3.3 ........ 1673.05 |============================================== LLVM 3.4 SVN 20130626 . 1613.32 |============================================ x264 2013-06-08 H.264 Video Encoding Frames Per Second > Higher Is Better GCC 4.8.1 ............. 156.26 |=============================================== GCC 4.9.0 20130623 .... 155.67 |=============================================== LLVM Clang 3.3 ........ 155.01 |=============================================== LLVM 3.4 SVN 20130626 . 152.82 |============================================== GraphicsMagick 1.3.16 Operation: Resizing Iterations Per Minute > Higher Is Better GCC 4.8.1 ............. 175 |================================================ GCC 4.9.0 20130623 .... 181 |================================================== LLVM Clang 3.3 ........ 91 |========================= LLVM 3.4 SVN 20130626 . 91 |========================= Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better GCC 4.8.1 ............. 1048.77 |============================== GCC 4.9.0 20130623 .... 1588.37 |============================================== LLVM Clang 3.3 ........ 1586.39 |============================================== LLVM 3.4 SVN 20130626 . 1395.01 |======================================== Timed ImageMagick Compilation 6.8.1-10 Time To Compile Seconds < Lower Is Better GCC 4.8.1 ............. 79.18 |================================================ GCC 4.9.0 20130623 .... 74.59 |============================================= LLVM Clang 3.3 ........ 34.39 |===================== LLVM 3.4 SVN 20130626 . 34.37 |===================== C-Ray 1.1 Total Time Seconds < Lower Is Better GCC 4.8.1 ............. 17.06 |============================== GCC 4.9.0 20130623 .... 17.03 |============================== LLVM Clang 3.3 ........ 27.06 |================================================ LLVM 3.4 SVN 20130626 . 26.22 |=============================================== Primesieve 4.2 1e12 Prime Number Generation Seconds < Lower Is Better GCC 4.8.1 ............. 79.17 |============ GCC 4.9.0 20130623 .... 79.20 |============ LLVM Clang 3.3 ........ 323.37 |=============================================== LLVM 3.4 SVN 20130626 . 320.23 |=============================================== Smallpt 1.0 Global Illumination Renderer; 100 Samples Seconds < Lower Is Better GCC 4.8.1 ............. 25 |========= GCC 4.9.0 20130623 .... 25 |========= LLVM Clang 3.3 ........ 140 |================================================== LLVM 3.4 SVN 20130626 . 140 |================================================== FLAC Audio Encoding 1.3.0 WAV To FLAC Seconds < Lower Is Better GCC 4.8.1 ............. 5.37 |=============================================== GCC 4.9.0 20130623 .... 5.26 |============================================== LLVM Clang 3.3 ........ 5.64 |================================================= LLVM 3.4 SVN 20130626 . 4.55 |======================================== N-Queens 1.0 Elapsed Time Seconds < Lower Is Better GCC 4.8.1 ............. 36.69 |========= GCC 4.9.0 20130623 .... 36.72 |========= LLVM Clang 3.3 ........ 184.28 |============================================== LLVM 3.4 SVN 20130626 . 186.32 |===============================================