GCC 4.8 4.9 Compiler AMD Kaveri Benchmarking

Benchmarks by Michael Larabel for a future article on Phoronix.com looking at AMD Kaveri A10-7850K compiler performance on GCC 4.8 and GCC 4.9 compilers.

HTML result view exported from: https://openbenchmarking.org/result/1401276-PL-GCC4849CO20&grs.

GCC 4.8 4.9 Compiler AMD Kaveri BenchmarkingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay DriverCompilerFile-SystemScreen ResolutionGCC 4.8.2GCC 4.9.0 20140126AMD A10-7850K APU with Radeon R7 @ 3.70GHz (4 Cores)Gigabyte F2A88XM-D3HAMD Device 14227168MB120GB KINGSTON SV300S3AMD Kaveri 1024MBATI R6xx HDMITSB-TVRealtek RTL8111/8168/8411Ubuntu 14.043.13.0-5-generic (x86_64)Unity 7.1.2radeon 7.2.99GCC 4.8.2ext41920x1080GCC 4.9.0 20140126OpenBenchmarking.orgKernel Details- radeon.dpm=1Compiler Details- --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran Processor Details- Scaling Governor: acpi-cpufreq ondemand

GCC 4.8 4.9 Compiler AMD Kaveri Benchmarkinghint: FLOATscimark2: Fast Fourier Transformtscp: AI Chess Performancebullet: Raytestslammps: Rhodopsin Proteinapache: Static Web Page Servinghmmer: Pfam Database Searchbuild-php: Time To Compilepolybench-c: 3 Matrix Multiplicationsbuild-apache: Time To Compilescimark2: Monte Carlocrafty: Elapsed Timefhourstones: Complex Connect-4 Solvingbullet: 1000 Convexx264: H.264 Video Encodingbullet: 1000 Stackbullet: Convex Trimeshscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationffmpeg: H.264 HD To NTSC DVbullet: Prim Trimeshbullet: 136 Ragdollsscimark2: Jacobi Successive Over-Relaxationblake2: Phoronix Test Suite v5.0.0m0bullet: 3000 Fallscimark2: Compositec-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplesprimesieve: 1e12 Prime Number Generationopen-porous-media: Upscale-Relpermjohn-the-ripper: MD5john-the-ripper: Traditional DESjohn-the-ripper: Blowfishbotan: X9.19-MACbotan: CAST-256botan: Twofishbotan: AES-256botan: KASUMIbotan: Tigerparboil: OpenMP Stencilparboil: OpenMP CUTCPhimeno: Poisson Pressure SolverGCC 4.8.2GCC 4.9.0 20140126196895149.5865.507748744.7462.5916778.3718.5756.58129.2957.53428.81105.469402.177.3083.349.541.78874.241154.4321.211.555.41685.066.798.57641.6140.5371186.2388.29610048390333373564.7280.04171.733650.2664.86356.5974.1037.17870.15239745129.5770.697379194.5259.7117507.2219.3458.39126.0758.93420.02103.359569.507.2082.379.431.76865.541164.8121.371.545.38683.996.808.56641.0140.50903.78OpenBenchmarking.org

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATGCC 4.8.2GCC 4.9.0 2014012650M100M150M200M250MSE +/- 2980363.43, N = 3SE +/- 1044977.11, N = 3196895149.58239745129.571. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 4.8.2GCC 4.9.0 201401261632486480SE +/- 1.23, N = 4SE +/- 0.69, N = 465.5070.691. (CXX) g++ options: -O3 -march=native

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 4.8.2GCC 4.9.0 20140126170K340K510K680K850KSE +/- 873.88, N = 5SE +/- 1107.98, N = 57748747379191. (CC) gcc options: -O3 -march=native

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 4.8.2GCC 4.9.0 201401261.06652.1333.19954.2665.3325SE +/- 0.00, N = 3SE +/- 0.01, N = 34.744.521. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

LAMMPS Molecular Dynamics Simulator

Test: Rhodopsin Protein

OpenBenchmarking.orgLoop Time, Fewer Is BetterLAMMPS Molecular Dynamics Simulator 1.0Test: Rhodopsin ProteinGCC 4.8.2GCC 4.9.0 201401261428425670SE +/- 0.28, N = 3SE +/- 0.17, N = 362.5959.711. (CXX) g++ options: -lfftw -lmpich

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingGCC 4.8.2GCC 4.9.0 201401264K8K12K16K20KSE +/- 316.90, N = 3SE +/- 172.83, N = 316778.3717507.221. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchGCC 4.8.2GCC 4.9.0 20140126510152025SE +/- 0.10, N = 3SE +/- 0.21, N = 318.5719.341. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To CompileGCC 4.8.2GCC 4.9.0 201401261326395265SE +/- 0.03, N = 3SE +/- 0.12, N = 356.5858.391. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

PolyBench-C

Test: 3 Matrix Multiplications

OpenBenchmarking.orgSeconds, Fewer Is BetterPolyBench-C 3.2Test: 3 Matrix MultiplicationsGCC 4.8.2GCC 4.9.0 20140126306090120150SE +/- 1.30, N = 3SE +/- 0.07, N = 3129.29126.071. (CC) gcc options: -O3 -march=native

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.7Time To CompileGCC 4.8.2GCC 4.9.0 201401261326395265SE +/- 0.31, N = 3SE +/- 0.14, N = 357.5358.93

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 4.8.2GCC 4.9.0 2014012690180270360450SE +/- 0.15, N = 4SE +/- 4.58, N = 4428.81420.021. (CXX) g++ options: -O3 -march=native

Crafty

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterCrafty 23.4Elapsed TimeGCC 4.8.2GCC 4.9.0 2014012620406080100SE +/- 0.07, N = 3SE +/- 0.26, N = 3105.46103.351. (CC) gcc options: -lstdc++ -lm

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingGCC 4.8.2GCC 4.9.0 201401262K4K6K8K10KSE +/- 9.66, N = 3SE +/- 15.15, N = 39402.179569.501. (CC) gcc options: -O3

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 4.8.2GCC 4.9.0 20140126246810SE +/- 0.02, N = 3SE +/- 0.02, N = 37.307.201. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2014-01-09H.264 Video EncodingGCC 4.8.2GCC 4.9.0 2014012620406080100SE +/- 0.60, N = 5SE +/- 0.71, N = 583.3482.371. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 4.8.2GCC 4.9.0 201401263691215SE +/- 0.07, N = 3SE +/- 0.04, N = 39.549.431. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 4.8.2GCC 4.9.0 201401260.40050.8011.20151.6022.0025SE +/- 0.01, N = 3SE +/- 0.01, N = 31.781.761. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 4.8.2GCC 4.9.0 201401262004006008001000SE +/- 4.50, N = 4SE +/- 6.95, N = 4874.24865.541. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 4.8.2GCC 4.9.0 2014012630060090012001500SE +/- 0.32, N = 4SE +/- 1.14, N = 41154.431164.811. (CXX) g++ options: -O3 -march=native

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 2.1.1H.264 HD To NTSC DVGCC 4.8.2GCC 4.9.0 20140126510152025SE +/- 0.12, N = 3SE +/- 0.14, N = 321.2121.371. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 4.8.2GCC 4.9.0 201401260.34880.69761.04641.39521.744SE +/- 0.00, N = 3SE +/- 0.01, N = 31.551.541. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 4.8.2GCC 4.9.0 201401261.21732.43463.65194.86926.0865SE +/- 0.01, N = 3SE +/- 0.00, N = 35.415.381. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 4.8.2GCC 4.9.0 20140126150300450600750SE +/- 0.06, N = 4SE +/- 0.06, N = 4685.06683.991. (CXX) g++ options: -O3 -march=native

BLAKE2

Phoronix Test Suite v5.0.0m0

OpenBenchmarking.orgCycles Per Byte, Fewer Is BetterBLAKE2 20130131Phoronix Test Suite v5.0.0m0GCC 4.8.2GCC 4.9.0 20140126246810SE +/- 0.00, N = 3SE +/- 0.00, N = 36.796.801. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 4.8.2GCC 4.9.0 20140126246810SE +/- 0.02, N = 3SE +/- 0.01, N = 38.578.561. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 4.8.2GCC 4.9.0 20140126140280420560700SE +/- 0.80, N = 4SE +/- 0.78, N = 4641.61641.011. (CXX) g++ options: -O3 -march=native

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeGCC 4.8.2GCC 4.9.0 20140126918273645SE +/- 0.02, N = 3SE +/- 0.02, N = 340.5340.501. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesGCC 4.8.21632486480SE +/- 0.00, N = 3711. (CXX) g++ options: -fopenmp -O3 -march=native

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 4.21e12 Prime Number GenerationGCC 4.8.24080120160200SE +/- 0.49, N = 3186.231. (CXX) g++ options: -O2 -fopenmp

Open Porous Media

OPM Benchmark: Upscale-Relperm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen Porous Media 2013-11-26OPM Benchmark: Upscale-RelpermGCC 4.8.220406080100SE +/- 0.39, N = 388.291. (F9X) gfortran options: -rdynamic

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: MD5GCC 4.8.213K26K39K52K65KSE +/- 59.23, N = 3610041. (CC) gcc options: -fopenmp -lcrypt

John The Ripper

Test: Traditional DES

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: Traditional DESGCC 4.8.22M4M6M8M10MSE +/- 6887.99, N = 383903331. (CC) gcc options: -fopenmp -lcrypt

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishGCC 4.8.28001600240032004000SE +/- 0.67, N = 337351. (CC) gcc options: -fopenmp -lcrypt

Botan

Test: X9.19-MAC

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: X9.19-MACGCC 4.8.2142842567064.721. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Botan

Test: CAST-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: CAST-256GCC 4.8.22040608010080.041. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Botan

Test: Twofish

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: TwofishGCC 4.8.24080120160200171.731. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Botan

Test: AES-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: AES-256GCC 4.8.280016002400320040003650.261. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Botan

Test: KASUMI

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: KASUMIGCC 4.8.2142842567064.861. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Botan

Test: Tiger

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.11.6Test: TigerGCC 4.8.280160240320400356.591. (CXX) g++ options: -m64 -pthread -lboost_filesystem -lboost_system -ldl -lrt -std=c++11 -fstack-protector -O2

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilGCC 4.8.21632486480SE +/- 0.25, N = 374.101. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPGCC 4.8.2918273645SE +/- 0.11, N = 337.171. (CXX) g++ options: -lm -lpthread -lgomp -ffast-math -fopenmp

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 4.8.2GCC 4.9.0 201401262004006008001000SE +/- 25.84, N = 6SE +/- 2.10, N = 3870.15903.781. (CC) gcc options: -O3 -march=native


Phoronix Test Suite v10.8.4