GCC 4.8 4.9 Compiler AMD Kaveri Benchmarking
Benchmarks by Michael Larabel for a future article on Phoronix.com looking at compiler performance on the AMD Kaveri A10-7850K with GCC 4.8 and GCC 4.9.
HTML result view exported from: https://openbenchmarking.org/result/1401276-PL-GCC4849CO20
Configurations compared: GCC 4.8.2, GCC 4.9.0 20140126
Processor: AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores)
Motherboard: Gigabyte F2A88XM-D3H
Chipset: AMD Device 1422
Memory: 7168MB
Disk: 120GB KINGSTON SV300S3
Graphics: AMD Kaveri 1024MB
Audio: ATI R6xx HDMI
Monitor: TSB-TV
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 14.04
Kernel: 3.13.0-5-generic (x86_64)
Desktop: Unity 7.1.2
Display Driver: radeon 7.2.99
Compiler: GCC 4.8.2 (first configuration), GCC 4.9.0 20140126 (second configuration)
File-System: ext4
Screen Resolution: 1920x1080
Kernel Details - radeon.dpm=1
Compiler Details - --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran
Processor Details - Scaling Governor: acpi-cpufreq ondemand
Test                                                GCC 4.8.2       GCC 4.9.0 20140126
parboil: OpenMP CUTCP                               37.17
parboil: OpenMP Stencil                             74.10
polybench-c: 3 Matrix Multiplications               129.29          126.07
lammps: Rhodopsin Protein                           62.59           59.71
hmmer: Pfam Database Search                         18.57           19.34
blake2: Phoronix Test Suite v5.0.0m0                6.79            6.80
fhourstones: Complex Connect-4 Solving              9402.17         9569.50
botan: Tiger                                        356.59
botan: KASUMI                                       64.86
botan: AES-256                                      3650.26
botan: Twofish                                      171.73
botan: CAST-256                                     80.04
botan: X9.19-MAC                                    64.72
scimark2: Composite                                 641.61          641.01
scimark2: Monte Carlo                               428.81          420.02
scimark2: Fast Fourier Transform                    65.50           70.69
scimark2: Sparse Matrix Multiply                    874.24          865.54
scimark2: Dense LU Matrix Factorization             1154.43         1164.81
scimark2: Jacobi Successive Over-Relaxation         685.06          683.99
tscp: AI Chess Performance                          774874          737919
john-the-ripper: Blowfish                           3735
john-the-ripper: Traditional DES                    8390333
john-the-ripper: MD5                                61004
x264: H.264 Video Encoding                          83.34           82.37
himeno: Poisson Pressure Solver                     870.15          903.78
build-apache: Time To Compile                       57.53           58.93
build-php: Time To Compile                          56.58           58.39
c-ray: Total Time                                   40.53           40.50
open-porous-media: Upscale-Relperm                  88.29
primesieve: 1e12 Prime Number Generation            186.23
smallpt: Global Illumination Renderer; 100 Samples  71
bullet: Raytests                                    4.74            4.52
bullet: 3000 Fall                                   8.57            8.56
bullet: 1000 Stack                                  9.54            9.43
bullet: 1000 Convex                                 7.30            7.20
bullet: 136 Ragdolls                                5.41            5.38
bullet: Prim Trimesh                                1.55            1.54
bullet: Convex Trimesh                              1.78            1.76
crafty: Elapsed Time                                105.46          103.35
ffmpeg: H.264 HD To NTSC DV                         21.21           21.37
hint: FLOAT                                         196895149.58    239745129.57
apache: Static Web Page Serving                     16778.37        17507.22
Tests with no GCC 4.9.0 20140126 value have no result for that configuration in this export.
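The overview above mixes "More Is Better" and "Fewer Is Better" metrics, so raw differences are not directly comparable across tests. A minimal sketch of how the two compilers could be compared on one scale is given here, assuming a signed percentage where positive means GCC 4.9.0 did better. The helper name `improvement_pct` and the choice of sample rows are illustrative and not part of the Phoronix Test Suite; the numbers are copied from the overview, and the direction of each metric is taken from the per-test results.

```python
# Illustrative helper (hypothetical name, not a Phoronix Test Suite API):
# percentage change of GCC 4.9.0 relative to GCC 4.8.2, signed so that a
# positive value always means "GCC 4.9.0 was better".

def improvement_pct(gcc48: float, gcc49: float, higher_is_better: bool) -> float:
    """Return the percentage by which the GCC 4.9.0 result beats GCC 4.8.2."""
    change = (gcc49 - gcc48) / gcc48 * 100.0
    # For time-like metrics a decrease is an improvement, so flip the sign.
    return change if higher_is_better else -change


# (test, GCC 4.8.2, GCC 4.9.0 20140126, higher_is_better)
# Values copied from the results overview; only a few rows are shown.
results = [
    ("hint: FLOAT", 196895149.58, 239745129.57, True),
    ("scimark2: Fast Fourier Transform", 65.50, 70.69, True),
    ("lammps: Rhodopsin Protein", 62.59, 59.71, False),
    ("tscp: AI Chess Performance", 774874, 737919, True),
    ("build-php: Time To Compile", 56.58, 58.39, False),
]

for name, gcc48, gcc49, higher in results:
    print(f"{name}: {improvement_pct(gcc48, gcc49, higher):+.2f}%")
```

A percentage like this says nothing about run-to-run noise; the standard errors listed with each test still need to be taken into account before reading a small change as a real difference.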
Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better)
    GCC 4.8.2: 37.17 (SE +/- 0.11, N = 3)
    (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better)
    GCC 4.8.2: 74.10 (SE +/- 0.25, N = 3)
    (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
PolyBench-C 3.2, Test: 3 Matrix Multiplications (Seconds, Fewer Is Better)
    GCC 4.8.2: 129.29 (SE +/- 1.30, N = 3)
    GCC 4.9.0 20140126: 126.07 (SE +/- 0.07, N = 3)
    (CC) gcc options: -O3 -march=native
LAMMPS Molecular Dynamics Simulator 1.0, Test: Rhodopsin Protein (Loop Time, Fewer Is Better)
    GCC 4.8.2: 62.59 (SE +/- 0.28, N = 3)
    GCC 4.9.0 20140126: 59.71 (SE +/- 0.17, N = 3)
    (CXX) g++ options: -lfftw -lmpich
Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)
    GCC 4.8.2: 18.57 (SE +/- 0.10, N = 3)
    GCC 4.9.0 20140126: 19.34 (SE +/- 0.21, N = 3)
    (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
BLAKE2 20130131, Phoronix Test Suite v5.0.0m0 (Cycles Per Byte, Fewer Is Better)
    GCC 4.8.2: 6.79 (SE +/- 0.00, N = 3)
    GCC 4.9.0 20140126: 6.80 (SE +/- 0.00, N = 3)
    (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz
Fhourstones 3.1, Complex Connect-4 Solving (Kpos / sec, More Is Better)
    GCC 4.8.2: 9402.17 (SE +/- 9.66, N = 3)
    GCC 4.9.0 20140126: 9569.50 (SE +/- 15.15, N = 3)
    (CC) gcc options: -O3
Botan 1.11.6, Test: Tiger (Mbytes/s, More Is Better)
    GCC 4.8.2: 356.59
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: KASUMI (Mbytes/s, More Is Better)
    GCC 4.8.2: 64.86
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: AES-256 (Mbytes/s, More Is Better)
    GCC 4.8.2: 3650.26
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: Twofish (Mbytes/s, More Is Better)
    GCC 4.8.2: 171.73
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: CAST-256 (Mbytes/s, More Is Better)
    GCC 4.8.2: 80.04
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: X9.19-MAC (Mbytes/s, More Is Better)
    GCC 4.8.2: 64.72
    (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
    GCC 4.8.2: 641.61 (SE +/- 0.80, N = 4)
    GCC 4.9.0 20140126: 641.01 (SE +/- 0.78, N = 4)
    (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
    GCC 4.8.2: 428.81 (SE +/- 0.15, N = 4)
    GCC 4.9.0 20140126: 420.02 (SE +/- 4.58, N = 4)
    (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
    GCC 4.8.2: 65.50 (SE +/- 1.23, N = 4)
    GCC 4.9.0 20140126: 70.69 (SE +/- 0.69, N = 4)
    (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
    GCC 4.8.2: 874.24 (SE +/- 4.50, N = 4)
    GCC 4.9.0 20140126: 865.54 (SE +/- 6.95, N = 4)
    (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
    GCC 4.8.2: 1154.43 (SE +/- 0.32, N = 4)
    GCC 4.9.0 20140126: 1164.81 (SE +/- 1.14, N = 4)
    (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
    GCC 4.8.2: 685.06 (SE +/- 0.06, N = 4)
    GCC 4.9.0 20140126: 683.99 (SE +/- 0.06, N = 4)
    (CXX) g++ options: -O3 -march=native
TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
    GCC 4.8.2: 774874 (SE +/- 873.88, N = 5)
    GCC 4.9.0 20140126: 737919 (SE +/- 1107.98, N = 5)
    (CC) gcc options: -O3 -march=native
John The Ripper 1.8.0, Test: Blowfish (Real C/S, More Is Better)
    GCC 4.8.2: 3735 (SE +/- 0.67, N = 3)
    (CC) gcc options: -fopenmp -lcrypt
John The Ripper 1.8.0, Test: Traditional DES (Real C/S, More Is Better)
    GCC 4.8.2: 8390333 (SE +/- 6887.99, N = 3)
    (CC) gcc options: -fopenmp -lcrypt
John The Ripper 1.8.0, Test: MD5 (Real C/S, More Is Better)
    GCC 4.8.2: 61004 (SE +/- 59.23, N = 3)
    (CC) gcc options: -fopenmp -lcrypt
x264 2014-01-09, H.264 Video Encoding (Frames Per Second, More Is Better)
    GCC 4.8.2: 83.34 (SE +/- 0.60, N = 5)
    GCC 4.9.0 20140126: 82.37 (SE +/- 0.71, N = 5)
    (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize
Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
    GCC 4.8.2: 870.15 (SE +/- 25.84, N = 6)
    GCC 4.9.0 20140126: 903.78 (SE +/- 2.10, N = 3)
    (CC) gcc options: -O3 -march=native
Timed Apache Compilation 2.4.7, Time To Compile (Seconds, Fewer Is Better)
    GCC 4.8.2: 57.53 (SE +/- 0.31, N = 3)
    GCC 4.9.0 20140126: 58.93 (SE +/- 0.14, N = 3)
Timed PHP Compilation 5.2.9, Time To Compile (Seconds, Fewer Is Better)
    GCC 4.8.2: 56.58 (SE +/- 0.03, N = 3)
    GCC 4.9.0 20140126: 58.39 (SE +/- 0.12, N = 3)
    (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
    GCC 4.8.2: 40.53 (SE +/- 0.02, N = 3)
    GCC 4.9.0 20140126: 40.50 (SE +/- 0.02, N = 3)
    (CC) gcc options: -lm -lpthread -O3 -march=native
Open Porous Media 2013-11-26, OPM Benchmark: Upscale-Relperm (Seconds, Fewer Is Better)
    GCC 4.8.2: 88.29 (SE +/- 0.39, N = 3)
    (F9X) gfortran options: -rdynamic
Primesieve 4.2, 1e12 Prime Number Generation (Seconds, Fewer Is Better)
    GCC 4.8.2: 186.23 (SE +/- 0.49, N = 3)
    (CXX) g++ options: -O2 -fopenmp
Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
    GCC 4.8.2: 71 (SE +/- 0.00, N = 3)
    (CXX) g++ options: -fopenmp -O3 -march=native
Bullet Physics Engine 2.81, Test: Raytests (Seconds, Fewer Is Better)
    GCC 4.8.2: 4.74 (SE +/- 0.00, N = 3)
    GCC 4.9.0 20140126: 4.52 (SE +/- 0.01, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
    GCC 4.8.2: 8.57 (SE +/- 0.02, N = 3)
    GCC 4.9.0 20140126: 8.56 (SE +/- 0.01, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
    GCC 4.8.2: 9.54 (SE +/- 0.07, N = 3)
    GCC 4.9.0 20140126: 9.43 (SE +/- 0.04, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
    GCC 4.8.2: 7.30 (SE +/- 0.02, N = 3)
    GCC 4.9.0 20140126: 7.20 (SE +/- 0.02, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
    GCC 4.8.2: 5.41 (SE +/- 0.01, N = 3)
    GCC 4.9.0 20140126: 5.38 (SE +/- 0.00, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
    GCC 4.8.2: 1.55 (SE +/- 0.00, N = 3)
    GCC 4.9.0 20140126: 1.54 (SE +/- 0.01, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: Convex Trimesh (Seconds, Fewer Is Better)
    GCC 4.8.2: 1.78 (SE +/- 0.01, N = 3)
    GCC 4.9.0 20140126: 1.76 (SE +/- 0.01, N = 3)
    (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Crafty 23.4, Elapsed Time (Seconds, Fewer Is Better)
    GCC 4.8.2: 105.46 (SE +/- 0.07, N = 3)
    GCC 4.9.0 20140126: 103.35 (SE +/- 0.26, N = 3)
    (CC) gcc options: -lstdc++ -lm
FFmpeg 2.1.1, H.264 HD To NTSC DV (Seconds, Fewer Is Better)
    GCC 4.8.2: 21.21 (SE +/- 0.12, N = 3)
    GCC 4.9.0 20140126: 21.37 (SE +/- 0.14, N = 3)
    (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)
    GCC 4.8.2: 196895149.58 (SE +/- 2980363.43, N = 3)
    GCC 4.9.0 20140126: 239745129.57 (SE +/- 1044977.11, N = 3)
    (CC) gcc options: -O3 -march=native -lm
Apache Benchmark 2.4.7, Static Web Page Serving (Requests Per Second, More Is Better)
    GCC 4.8.2: 16778.37 (SE +/- 316.90, N = 3)
    GCC 4.9.0 20140126: 17507.22 (SE +/- 172.83, N = 3)
    (CC) gcc options: -shared -fPIC -pthread -O3 -march=native
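Each per-test result above carries a standard error (SE) and a run count (N), which gives a rough way to judge whether a gap between the two compilers stands out from run-to-run noise. The sketch below is one possible screen, not something the Phoronix Test Suite reports: it divides the difference of the two means by the combined standard error, sqrt(SE_a^2 + SE_b^2). The function name `separation` is hypothetical, and with only 3 to 6 runs per result the output should be read as an informal indicator rather than a formal significance test. The formula is symmetric in the two SE values, so the outcome does not depend on which SE belongs to which compiler.

```python
import math


def separation(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Gap between two means, in units of their combined standard error.

    Raises ZeroDivisionError if both standard errors are reported as 0.00
    (as for BLAKE2 above), where the rounded SE carries no usable information.
    """
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# Values copied from the per-test results above.
himeno = separation(870.15, 25.84, 903.78, 2.10)
hint_float = separation(196895149.58, 2980363.43, 239745129.57, 1044977.11)

print(f"Himeno Poisson Pressure Solver: {himeno:.2f} combined SE apart")
print(f"Hierarchical INTegration FLOAT: {hint_float:.2f} combined SE apart")
```

By this measure the Himeno gap is small relative to its noise (one compiler's SE is 25.84 MFLOPS), while the Hierarchical INTegration FLOAT gap is many times its combined standard error.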
Phoronix Test Suite v10.8.4