GCC 4.8 4.9 Compiler AMD Kaveri Benchmarking
Benchmarks by Michael Larabel for a future article on Phoronix.com looking at AMD Kaveri A10-7850K performance with the GCC 4.8 and GCC 4.9 compilers.
HTML result view exported from: https://openbenchmarking.org/result/1401276-PL-GCC4849CO20&grs .
System details (the two runs, "GCC 4.8.2" and "GCC 4.9.0 20140126", differ only in the compiler)
  Processor:          AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores)
  Motherboard:        Gigabyte F2A88XM-D3H
  Chipset:            AMD Device 1422
  Memory:             7168MB
  Disk:               120GB KINGSTON SV300S3
  Graphics:           AMD Kaveri 1024MB
  Audio:              ATI R6xx HDMI
  Monitor:            TSB-TV
  Network:            Realtek RTL8111/8168/8411
  OS:                 Ubuntu 14.04
  Kernel:             3.13.0-5-generic (x86_64)
  Desktop:            Unity 7.1.2
  Display Driver:     radeon 7.2.99
  Compiler:           GCC 4.8.2 (first run), GCC 4.9.0 20140126 (second run)
  File-System:        ext4
  Screen Resolution:  1920x1080
Kernel Details - radeon.dpm=1
Compiler Details - --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran
Processor Details - Scaling Governor: acpi-cpufreq ondemand
Results summary ("-" means no result was recorded for that run)
Test                                                GCC 4.8.2       GCC 4.9.0 20140126
hint: FLOAT                                         196895149.58    239745129.57
scimark2: Fast Fourier Transform                    65.50           70.69
tscp: AI Chess Performance                          774874          737919
bullet: Raytests                                    4.74            4.52
lammps: Rhodopsin Protein                           62.59           59.71
apache: Static Web Page Serving                     16778.37        17507.22
hmmer: Pfam Database Search                         18.57           19.34
build-php: Time To Compile                          56.58           58.39
polybench-c: 3 Matrix Multiplications               129.29          126.07
build-apache: Time To Compile                       57.53           58.93
scimark2: Monte Carlo                               428.81          420.02
crafty: Elapsed Time                                105.46          103.35
fhourstones: Complex Connect-4 Solving              9402.17         9569.50
bullet: 1000 Convex                                 7.30            7.20
x264: H.264 Video Encoding                          83.34           82.37
bullet: 1000 Stack                                  9.54            9.43
bullet: Convex Trimesh                              1.78            1.76
scimark2: Sparse Matrix Multiply                    874.24          865.54
scimark2: Dense LU Matrix Factorization             1154.43         1164.81
ffmpeg: H.264 HD To NTSC DV                         21.21           21.37
bullet: Prim Trimesh                                1.55            1.54
bullet: 136 Ragdolls                                5.41            5.38
scimark2: Jacobi Successive Over-Relaxation         685.06          683.99
blake2: Phoronix Test Suite v5.0.0m0                6.79            6.80
bullet: 3000 Fall                                   8.57            8.56
scimark2: Composite                                 641.61          641.01
c-ray: Total Time                                   40.53           40.50
smallpt: Global Illumination Renderer; 100 Samples  71              -
primesieve: 1e12 Prime Number Generation            186.23          -
open-porous-media: Upscale-Relperm                  88.29           -
john-the-ripper: MD5                                61004           -
john-the-ripper: Traditional DES                    8390333         -
john-the-ripper: Blowfish                           3735            -
botan: X9.19-MAC                                    64.72           -
botan: CAST-256                                     80.04           -
botan: Twofish                                      171.73          -
botan: AES-256                                      3650.26         -
botan: KASUMI                                       64.86           -
botan: Tiger                                        356.59          -
parboil: OpenMP Stencil                             74.10           -
parboil: OpenMP CUTCP                               37.17           -
himeno: Poisson Pressure Solver                     870.15          903.78
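The summary mixes "More Is Better" and "Fewer Is Better" units, so the raw numbers cannot be compared by sign alone. As a minimal sketch (not part of the original export), the relative change from GCC 4.8.2 to GCC 4.9.0 could be computed as below. The helper name `percent_gain` and its sign convention (positive means GCC 4.9.0 did better) are assumptions made for illustration; the three sample rows are copied from the results.

```python
# Relative change of GCC 4.9.0 20140126 versus GCC 4.8.2 for a few result rows.
# percent_gain is a hypothetical helper, not something the Phoronix Test Suite provides.

def percent_gain(old, new, more_is_better):
    """Return the change from old to new in percent of old.

    Positive means the new result is better, for either direction:
    higher scores when more_is_better is True, lower values otherwise.
    """
    if more_is_better:
        return (new - old) / old * 100.0
    return (old - new) / old * 100.0

# (test, GCC 4.8.2, GCC 4.9.0 20140126, more_is_better)
rows = [
    ("hint: FLOAT (QUIPs)", 196895149.58, 239745129.57, True),
    ("tscp: AI Chess Performance (nodes per second)", 774874, 737919, True),
    ("bullet: Raytests (seconds)", 4.74, 4.52, False),
]

for name, gcc48, gcc49, more_is_better in rows:
    gain = percent_gain(gcc48, gcc49, more_is_better)
    print(f"{name}: {gain:+.2f}%")
```

For time-based tests this convention reports the reduction in run time relative to GCC 4.8.2; a speedup ratio (old time divided by new time) would be an equally valid choice and gives slightly different percentages.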
Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)
  GCC 4.8.2:           196895149.58 (SE +/- 2980363.43, N = 3)
  GCC 4.9.0 20140126:  239745129.57 (SE +/- 1044977.11, N = 3)
  (CC) gcc options: -O3 -march=native -lm
SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
  GCC 4.8.2:           65.50 (SE +/- 1.23, N = 4)
  GCC 4.9.0 20140126:  70.69 (SE +/- 0.69, N = 4)
  (CXX) g++ options: -O3 -march=native
TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
  GCC 4.8.2:           774874 (SE +/- 873.88, N = 5)
  GCC 4.9.0 20140126:  737919 (SE +/- 1107.98, N = 5)
  (CC) gcc options: -O3 -march=native
Bullet Physics Engine 2.81, Test: Raytests (Seconds, Fewer Is Better)
  GCC 4.8.2:           4.74 (SE +/- 0.00, N = 3)
  GCC 4.9.0 20140126:  4.52 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
LAMMPS Molecular Dynamics Simulator 1.0, Test: Rhodopsin Protein (Loop Time, Fewer Is Better)
  GCC 4.8.2:           62.59 (SE +/- 0.28, N = 3)
  GCC 4.9.0 20140126:  59.71 (SE +/- 0.17, N = 3)
  (CXX) g++ options: -lfftw -lmpich
Apache Benchmark 2.4.7, Static Web Page Serving (Requests Per Second, More Is Better)
  GCC 4.8.2:           16778.37 (SE +/- 316.90, N = 3)
  GCC 4.9.0 20140126:  17507.22 (SE +/- 172.83, N = 3)
  (CC) gcc options: -shared -fPIC -pthread -O3 -march=native
Timed HMMer Search 2.3.2, Pfam Database Search (Seconds, Fewer Is Better)
  GCC 4.8.2:           18.57 (SE +/- 0.10, N = 3)
  GCC 4.9.0 20140126:  19.34 (SE +/- 0.21, N = 3)
  (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
Timed PHP Compilation 5.2.9, Time To Compile (Seconds, Fewer Is Better)
  GCC 4.8.2:           56.58 (SE +/- 0.03, N = 3)
  GCC 4.9.0 20140126:  58.39 (SE +/- 0.12, N = 3)
  (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
PolyBench-C 3.2, Test: 3 Matrix Multiplications (Seconds, Fewer Is Better)
  GCC 4.8.2:           129.29 (SE +/- 1.30, N = 3)
  GCC 4.9.0 20140126:  126.07 (SE +/- 0.07, N = 3)
  (CC) gcc options: -O3 -march=native
Timed Apache Compilation 2.4.7, Time To Compile (Seconds, Fewer Is Better)
  GCC 4.8.2:           57.53 (SE +/- 0.31, N = 3)
  GCC 4.9.0 20140126:  58.93 (SE +/- 0.14, N = 3)
SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
  GCC 4.8.2:           428.81 (SE +/- 0.15, N = 4)
  GCC 4.9.0 20140126:  420.02 (SE +/- 4.58, N = 4)
  (CXX) g++ options: -O3 -march=native
Crafty 23.4, Elapsed Time (Seconds, Fewer Is Better)
  GCC 4.8.2:           105.46 (SE +/- 0.07, N = 3)
  GCC 4.9.0 20140126:  103.35 (SE +/- 0.26, N = 3)
  (CC) gcc options: -lstdc++ -lm
Fhourstones 3.1, Complex Connect-4 Solving (Kpos / sec, More Is Better)
  GCC 4.8.2:           9402.17 (SE +/- 9.66, N = 3)
  GCC 4.9.0 20140126:  9569.50 (SE +/- 15.15, N = 3)
  (CC) gcc options: -O3
Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
  GCC 4.8.2:           7.30 (SE +/- 0.02, N = 3)
  GCC 4.9.0 20140126:  7.20 (SE +/- 0.02, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
x264 2014-01-09, H.264 Video Encoding (Frames Per Second, More Is Better)
  GCC 4.8.2:           83.34 (SE +/- 0.60, N = 5)
  GCC 4.9.0 20140126:  82.37 (SE +/- 0.71, N = 5)
  (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize
Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
  GCC 4.8.2:           9.54 (SE +/- 0.07, N = 3)
  GCC 4.9.0 20140126:  9.43 (SE +/- 0.04, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: Convex Trimesh (Seconds, Fewer Is Better)
  GCC 4.8.2:           1.78 (SE +/- 0.01, N = 3)
  GCC 4.9.0 20140126:  1.76 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
  GCC 4.8.2:           874.24 (SE +/- 4.50, N = 4)
  GCC 4.9.0 20140126:  865.54 (SE +/- 6.95, N = 4)
  (CXX) g++ options: -O3 -march=native
SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
  GCC 4.8.2:           1154.43 (SE +/- 0.32, N = 4)
  GCC 4.9.0 20140126:  1164.81 (SE +/- 1.14, N = 4)
  (CXX) g++ options: -O3 -march=native
FFmpeg 2.1.1, H.264 HD To NTSC DV (Seconds, Fewer Is Better)
  GCC 4.8.2:           21.21 (SE +/- 0.12, N = 3)
  GCC 4.9.0 20140126:  21.37 (SE +/- 0.14, N = 3)
  (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT
Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
  GCC 4.8.2:           1.55 (SE +/- 0.00, N = 3)
  GCC 4.9.0 20140126:  1.54 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
  GCC 4.8.2:           5.41 (SE +/- 0.01, N = 3)
  GCC 4.9.0 20140126:  5.38 (SE +/- 0.00, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
  GCC 4.8.2:           685.06 (SE +/- 0.06, N = 4)
  GCC 4.9.0 20140126:  683.99 (SE +/- 0.06, N = 4)
  (CXX) g++ options: -O3 -march=native
BLAKE2 20130131, Phoronix Test Suite v5.0.0m0 (Cycles Per Byte, Fewer Is Better)
  GCC 4.8.2:           6.79 (SE +/- 0.00, N = 3)
  GCC 4.9.0 20140126:  6.80 (SE +/- 0.00, N = 3)
  (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz
Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
  GCC 4.8.2:           8.57 (SE +/- 0.02, N = 3)
  GCC 4.9.0 20140126:  8.56 (SE +/- 0.01, N = 3)
  (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU
SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
  GCC 4.8.2:           641.61 (SE +/- 0.80, N = 4)
  GCC 4.9.0 20140126:  641.01 (SE +/- 0.78, N = 4)
  (CXX) g++ options: -O3 -march=native
C-Ray 1.1, Total Time (Seconds, Fewer Is Better)
  GCC 4.8.2:           40.53 (SE +/- 0.02, N = 3)
  GCC 4.9.0 20140126:  40.50 (SE +/- 0.02, N = 3)
  (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt 1.0, Global Illumination Renderer; 100 Samples (Seconds, Fewer Is Better)
  GCC 4.8.2:           71 (SE +/- 0.00, N = 3)
  (CXX) g++ options: -fopenmp -O3 -march=native
Primesieve 4.2, 1e12 Prime Number Generation (Seconds, Fewer Is Better)
  GCC 4.8.2:           186.23 (SE +/- 0.49, N = 3)
  (CXX) g++ options: -O2 -fopenmp
Open Porous Media 2013-11-26, OPM Benchmark: Upscale-Relperm (Seconds, Fewer Is Better)
  GCC 4.8.2:           88.29 (SE +/- 0.39, N = 3)
  (F9X) gfortran options: -rdynamic
John The Ripper 1.8.0, Test: MD5 (Real C/S, More Is Better)
  GCC 4.8.2:           61004 (SE +/- 59.23, N = 3)
  (CC) gcc options: -fopenmp -lcrypt
John The Ripper 1.8.0, Test: Traditional DES (Real C/S, More Is Better)
  GCC 4.8.2:           8390333 (SE +/- 6887.99, N = 3)
  (CC) gcc options: -fopenmp -lcrypt
John The Ripper 1.8.0, Test: Blowfish (Real C/S, More Is Better)
  GCC 4.8.2:           3735 (SE +/- 0.67, N = 3)
  (CC) gcc options: -fopenmp -lcrypt
Botan 1.11.6, Test: X9.19-MAC (Mbytes/s, More Is Better)
  GCC 4.8.2:           64.72
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: CAST-256 (Mbytes/s, More Is Better)
  GCC 4.8.2:           80.04
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: Twofish (Mbytes/s, More Is Better)
  GCC 4.8.2:           171.73
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: AES-256 (Mbytes/s, More Is Better)
  GCC 4.8.2:           3650.26
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: KASUMI (Mbytes/s, More Is Better)
  GCC 4.8.2:           64.86
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Botan 1.11.6, Test: Tiger (Mbytes/s, More Is Better)
  GCC 4.8.2:           356.59
  (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better)
  GCC 4.8.2:           74.10 (SE +/- 0.25, N = 3)
  (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better)
  GCC 4.8.2:           37.17 (SE +/- 0.11, N = 3)
  (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp
Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, More Is Better)
  GCC 4.8.2:           870.15 (SE +/- 25.84, N = 6)
  GCC 4.9.0 20140126:  903.78 (SE +/- 2.10, N = 3)
  (CC) gcc options: -O3 -march=native
Phoronix Test Suite v10.8.4