GCC 4.8 4.9 Compiler AMD Kaveri Benchmarking Benchmarks by Michael Larabel for a future article on Phoronix.com looking at AMD Kaveri A10-7850K compiler performance on GCC 4.8 and GCC 4.9 compilers. GCC 4.8.2: Processor: AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores), Motherboard: Gigabyte F2A88XM-D3H, Chipset: AMD Device 1422, Memory: 7168MB, Disk: 120GB KINGSTON SV300S3, Graphics: AMD Kaveri 1024MB, Audio: ATI R6xx HDMI, Monitor: TSB-TV, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 14.04, Kernel: 3.13.0-5-generic (x86_64), Desktop: Unity 7.1.2, Display Driver: radeon 7.2.99, Compiler: GCC 4.8.2, File-System: ext4, Screen Resolution: 1920x1080 GCC 4.9.0 20140126: Processor: AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores), Motherboard: Gigabyte F2A88XM-D3H, Chipset: AMD Device 1422, Memory: 7168MB, Disk: 120GB KINGSTON SV300S3, Graphics: AMD Kaveri 1024MB, Audio: ATI R6xx HDMI, Monitor: TSB-TV, Network: Realtek RTL8111/8168/8411 OS: Ubuntu 14.04, Kernel: 3.13.0-5-generic (x86_64), Desktop: Unity 7.1.2, Display Driver: radeon 7.2.99, Compiler: GCC 4.9.0 20140126, File-System: ext4, Screen Resolution: 1920x1080 Parboil 2.5 Test: OpenMP CUTCP Seconds < Lower Is Better GCC 4.8.2 . 37.17 |============================================================ Parboil 2.5 Test: OpenMP Stencil Seconds < Lower Is Better GCC 4.8.2 . 74.10 |============================================================ PolyBench-C 3.2 Test: 3 Matrix Multiplications Seconds < Lower Is Better GCC 4.8.2 .......... 129.29 |================================================== GCC 4.9.0 20140126 . 126.07 |================================================= LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein Loop Time < Lower Is Better GCC 4.8.2 .......... 62.59 |=================================================== GCC 4.9.0 20140126 . 59.71 |================================================= Timed HMMer Search 2.3.2 Pfam Database Search Seconds < Lower Is Better GCC 4.8.2 .......... 18.57 |================================================= GCC 4.9.0 20140126 . 19.34 |=================================================== BLAKE2 20130131 Phoronix Test Suite v5.0.0m0 Cycles Per Byte < Lower Is Better GCC 4.8.2 .......... 6.79 |==================================================== GCC 4.9.0 20140126 . 6.80 |==================================================== Fhourstones 3.1 Complex Connect-4 Solving Kpos / sec > Higher Is Better GCC 4.8.2 .......... 9402.17 |================================================ GCC 4.9.0 20140126 . 9569.50 |================================================= Botan 1.11.6 Test: Tiger Mbytes/s > Higher Is Better GCC 4.8.2 . 356.59 |=========================================================== Botan 1.11.6 Test: KASUMI Mbytes/s > Higher Is Better GCC 4.8.2 . 64.86 |============================================================ Botan 1.11.6 Test: AES-256 Mbytes/s > Higher Is Better GCC 4.8.2 . 3650.26 |========================================================== Botan 1.11.6 Test: Twofish Mbytes/s > Higher Is Better GCC 4.8.2 . 171.73 |=========================================================== Botan 1.11.6 Test: CAST-256 Mbytes/s > Higher Is Better GCC 4.8.2 . 80.04 |============================================================ Botan 1.11.6 Test: X9.19-MAC Mbytes/s > Higher Is Better GCC 4.8.2 . 64.72 |============================================================ SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better GCC 4.8.2 .......... 641.61 |================================================== GCC 4.9.0 20140126 . 641.01 |================================================== SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better GCC 4.8.2 .......... 428.81 |================================================== GCC 4.9.0 20140126 . 420.02 |================================================= SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better GCC 4.8.2 .......... 65.50 |=============================================== GCC 4.9.0 20140126 . 70.69 |=================================================== SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better GCC 4.8.2 .......... 874.24 |================================================== GCC 4.9.0 20140126 . 865.54 |================================================== SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better GCC 4.8.2 .......... 1154.43 |================================================= GCC 4.9.0 20140126 . 1164.81 |================================================= SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better GCC 4.8.2 .......... 685.06 |================================================== GCC 4.9.0 20140126 . 683.99 |================================================== TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better GCC 4.8.2 .......... 774874 |================================================== GCC 4.9.0 20140126 . 737919 |================================================ John The Ripper 1.8.0 Test: Blowfish Real C/S > Higher Is Better GCC 4.8.2 . 3735 |============================================================= John The Ripper 1.8.0 Test: Traditional DES Real C/S > Higher Is Better GCC 4.8.2 . 8390333 |========================================================== John The Ripper 1.8.0 Test: MD5 Real C/S > Higher Is Better GCC 4.8.2 . 61004 |============================================================ x264 2014-01-09 H.264 Video Encoding Frames Per Second > Higher Is Better GCC 4.8.2 .......... 83.34 |=================================================== GCC 4.9.0 20140126 . 82.37 |================================================== Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better GCC 4.8.2 .......... 870.15 |================================================ GCC 4.9.0 20140126 . 903.78 |================================================== Timed Apache Compilation 2.4.7 Time To Compile Seconds < Lower Is Better GCC 4.8.2 .......... 57.53 |================================================== GCC 4.9.0 20140126 . 58.93 |=================================================== Timed PHP Compilation 5.2.9 Time To Compile Seconds < Lower Is Better GCC 4.8.2 .......... 56.58 |================================================= GCC 4.9.0 20140126 . 58.39 |=================================================== C-Ray 1.1 Total Time Seconds < Lower Is Better GCC 4.8.2 .......... 40.53 |=================================================== GCC 4.9.0 20140126 . 40.50 |=================================================== Open Porous Media 2013-11-26 OPM Benchmark: Upscale-Relperm Seconds < Lower Is Better GCC 4.8.2 . 88.29 |============================================================ Primesieve 4.2 1e12 Prime Number Generation Seconds < Lower Is Better GCC 4.8.2 . 186.23 |=========================================================== Smallpt 1.0 Global Illumination Renderer; 100 Samples Seconds < Lower Is Better GCC 4.8.2 . 71 |=============================================================== Bullet Physics Engine 2.81 Test: Raytests Seconds < Lower Is Better GCC 4.8.2 .......... 4.74 |==================================================== GCC 4.9.0 20140126 . 4.52 |================================================== Bullet Physics Engine 2.81 Test: 3000 Fall Seconds < Lower Is Better GCC 4.8.2 .......... 8.57 |==================================================== GCC 4.9.0 20140126 . 8.56 |==================================================== Bullet Physics Engine 2.81 Test: 1000 Stack Seconds < Lower Is Better GCC 4.8.2 .......... 9.54 |==================================================== GCC 4.9.0 20140126 . 9.43 |=================================================== Bullet Physics Engine 2.81 Test: 1000 Convex Seconds < Lower Is Better GCC 4.8.2 .......... 7.30 |==================================================== GCC 4.9.0 20140126 . 7.20 |=================================================== Bullet Physics Engine 2.81 Test: 136 Ragdolls Seconds < Lower Is Better GCC 4.8.2 .......... 5.41 |==================================================== GCC 4.9.0 20140126 . 5.38 |==================================================== Bullet Physics Engine 2.81 Test: Prim Trimesh Seconds < Lower Is Better GCC 4.8.2 .......... 1.55 |==================================================== GCC 4.9.0 20140126 . 1.54 |==================================================== Bullet Physics Engine 2.81 Test: Convex Trimesh Seconds < Lower Is Better GCC 4.8.2 .......... 1.78 |==================================================== GCC 4.9.0 20140126 . 1.76 |=================================================== Crafty 23.4 Elapsed Time Seconds < Lower Is Better GCC 4.8.2 .......... 105.46 |================================================== GCC 4.9.0 20140126 . 103.35 |================================================= FFmpeg 2.1.1 H.264 HD To NTSC DV Seconds < Lower Is Better GCC 4.8.2 .......... 21.21 |=================================================== GCC 4.9.0 20140126 . 21.37 |=================================================== Hierarchical INTegration 1.0 Test: FLOAT QUIPs > Higher Is Better GCC 4.8.2 .......... 196895149.58 |==================================== GCC 4.9.0 20140126 . 239745129.57 |============================================ Apache Benchmark 2.4.7 Static Web Page Serving Requests Per Second > Higher Is Better GCC 4.8.2 .......... 16778.37 |============================================== GCC 4.9.0 20140126 . 17507.22 |================================================