ARM Cortex A15 GCC Compiler Benchmarks

Benchmarks of GCC 4.6.3 vs. GCC 4.7.2 on a Samsung Exynos 5 Dual SoC with a dual-core 1.7GHz ARM Cortex A15 (ARMv7) processor. Cortex A15 ARM compiler benchmarking for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1309242-UT-1211249RA95&grs

GCC 4.6.3
  Processor: Exynos ARMv7 rev 4 @ 1.70GHz (2 Cores)
  Motherboard: SAMSUNG EXYNOS5 (Flattened Device Tree)
  Memory: 2048MB
  Disk: 16GB SEM16G
  OS: Ubuntu 12.04
  Kernel: 3.4.0 (armv7l)
  Desktop: Unity 2D 5.16.0
  Display Server: X Server 1.11.3
  Compiler: GCC 4.6
  File-System: ext4
  Screen Resolution: 1366x768

GCC 4.7.2 (fields listed in the export)
  Compiler: GCC 4.7

GCC 4.8 Quad (fields listed in the export)
  Processor: ARMv7 rev 3 @ 1.60GHz (4 Cores)
  Motherboard: ODROIDXU
  Disk: 8GB SU08G
  OS: Ubuntu 13.10
  Kernel: 3.4.5 (armv7l)
  Compiler: GCC 4.8
  File-System: msdos
  Screen Resolution: 1920x1080

Compiler Details
- GCC 4.6.3: --build=arm-linux-gnueabihf --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v
- GCC 4.7.2: --build=arm-linux-gnueabihf --disable-libitm --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch=armv7-a --with-float=hard --with-fpu=vfpv3-d16 --with-mode=thumb -v
- GCC 4.8 Quad: --build=arm-linux-gnueabihf --disable-browser-plugin --disable-libitm --disable-libquadmath --disable-sjlj-exceptions --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf --with-arch-directory=arm --with-arch=armv7-a --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-float=hard --with-fpu=vfpv3-d16 --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-armhf/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-armhf --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-armhf --with-mode=thumb -v

Processor Details
- GCC 4.6.3: Scaling Governor: ondemand
- GCC 4.7.2: Scaling Governor: ondemand
- GCC 4.8 Quad: Scaling Governor: exynos freq performance

System Details
- GCC 4.6.3: Python 2.7.3.

Test                                                GCC 4.6.3  GCC 4.7.2  GCC 4.8 Quad
primesieve: 1e12 Prime Number Generation            957.91     810.27     497.76
compress-pbzip2: 256MB File Compression             88.41      84.30      47.26
compress-7zip: Compress Speed Test                  1951       1703       3184
smallpt: Global Illumination Renderer; 100 Samples  1208       1181       661
vpxenc: vpxenc                                      1.21       1.29       1.84
scimark2: Fast Fourier Transform                    37.65      44.62      50.69
build-php: Time To Compile                          255.79     284.15     217.53
scimark2: Jacobi Successive Over-Relaxation         361.97     344.10     296.51
ffmpeg: H.264 HD To NTSC DV                         162.54     157.40     134.11
ffte: N=64, 1D Complex FFT Routine                  1257.00    1476.59    1434.27
tscp: AI Chess Performance                          146968     146096     134928
dolfyn: Computational Fluid Dynamics                158.91     171.65     169.19
gmpbench: Total Time                                451.29     452.59     419.62
encode-mp3: WAV To MP3                              52.30      54.48      51.45
cachebench: Write                                   3521.37    3722.67
scimark2: Monte Carlo                               167.90     160.62     159.31
scimark2: Dense LU Matrix Factorization             435.03     450.12     427.37
scimark2: Sparse Matrix Multiply                    333.34     338.23     344.50
x264: H.264 Video Encoding                          10.62      10.83
cachebench: Read / Modify / Write                   6979.64    7092.44
nginx: Static Web Page Serving                      3535.77    3578.55
openssl: RSA 4096-bit Performance                   10         9.90
encode-ogg: WAV To Ogg                              31.50      31.75
cachebench: Read                                    2570.83    2582.33
encode-flac: WAV To FLAC                            19.81      19.18      21.43
crafty: Elapsed Time                                0.06       0.06       0.04
c-ray: Total Time                                   447.38     348.61     143.22
clomp: Static OMP Speedup                           1.70       1.81       3.11
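
The summary figures mix "Fewer Is Better" and "More Is Better" units, so a raw difference between two columns is easy to misread. As a minimal sketch (not part of the Phoronix Test Suite), the Python below turns a handful of GCC 4.6.3 and GCC 4.7.2 values copied from the summary into a signed percentage where positive always means GCC 4.7.2 was faster. The function name `improvement_pct` and the `RESULTS` layout are hypothetical. The GCC 4.8 Quad column is left out because that run was made on different hardware (ODROIDXU, 4 cores).

```python
# (GCC 4.6.3 value, GCC 4.7.2 value, higher_is_better), copied from the summary.
# higher_is_better follows each test's "More Is Better" / "Fewer Is Better" label.
RESULTS = {
    "primesieve (s)": (957.91, 810.27, False),
    "compress-7zip (MIPS)": (1951, 1703, True),
    "scimark2 FFT (Mflops)": (37.65, 44.62, True),
    "build-php (s)": (255.79, 284.15, False),
    "c-ray (s)": (447.38, 348.61, False),
}


def improvement_pct(old, new, higher_is_better):
    """Percent speedup of `new` over `old`; positive means `new` is faster."""
    # For time-like metrics the ratio is inverted so that "faster" is > 1.
    ratio = new / old if higher_is_better else old / new
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    for name, (old, new, hib) in RESULTS.items():
        print(f"{name:24s} {improvement_pct(old, new, hib):+6.1f}%")
```

Inverting the ratio for time-based tests keeps the sign convention uniform, so compression throughput and compile time can sit in the same column.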

Primesieve 4.0: 1e12 Prime Number Generation
Seconds, Fewer Is Better
  GCC 4.6.3: 957.91 (SE +/- 6.82, N = 3)
  GCC 4.7.2: 810.27 (SE +/- 9.65, N = 3)
  GCC 4.8 Quad: 497.76 (SE +/- 8.42, N = 3)
(CXX) g++ options: -O2 -fopenmp

Parallel BZIP2 Compression 1.1.6: 256MB File Compression
Seconds, Fewer Is Better
  GCC 4.6.3: 88.41 (SE +/- 0.38, N = 3)
  GCC 4.7.2: 84.30 (SE +/- 0.42, N = 3)
  GCC 4.8 Quad: 47.26 (SE +/- 0.73, N = 5)
(CXX) g++ options: -O2 -pthread -lbz2 -lpthread

7-Zip Compression 9.20.1: Compress Speed Test
MIPS, More Is Better
  GCC 4.6.3: 1951 (SE +/- 6.69, N = 3)
  GCC 4.7.2: 1703 (SE +/- 13.53, N = 3)
  GCC 4.8 Quad: 3184 (SE +/- 11.78, N = 3)
(CXX) g++ options: -pipe -lpthread

Smallpt 1.0: Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
  GCC 4.6.3: 1208 (SE +/- 1.45, N = 3)
  GCC 4.7.2: 1181 (SE +/- 2.08, N = 3)
  GCC 4.8 Quad: 661 (SE +/- 2.19, N = 3)
(CXX) g++ options: -fopenmp

VP8 libvpx Encoding 1.1.0: vpxenc
Frames Per Second, More Is Better
  GCC 4.6.3: 1.21 (SE +/- 0.00, N = 3)
  GCC 4.7.2: 1.29 (SE +/- 0.01, N = 3)
  GCC 4.8 Quad: 1.84 (SE +/- 0.01, N = 3)
(CC) gcc options: -lvpx -lm -lpthread
Additional options listed on the chart: -O3

SciMark 2.0: Computational Test: Fast Fourier Transform
Mflops, More Is Better
  GCC 4.6.3: 37.65 (SE +/- 0.82, N = 4)
  GCC 4.7.2: 44.62 (SE +/- 0.29, N = 4)
  GCC 4.8 Quad: 50.69 (SE +/- 0.10, N = 4)

Timed PHP Compilation 5.2.9: Time To Compile
Seconds, Fewer Is Better
  GCC 4.6.3: 255.79 (SE +/- 1.85, N = 3)
  GCC 4.7.2: 284.15 (SE +/- 0.26, N = 3)
  GCC 4.8 Quad: 217.53 (SE +/- 0.79, N = 3)
(CC) gcc options: -O2 -pedantic -ldl -lz -lm

SciMark 2.0: Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
  GCC 4.6.3: 361.97 (SE +/- 0.82, N = 4)
  GCC 4.7.2: 344.10 (SE +/- 1.15, N = 4)
  GCC 4.8 Quad: 296.51 (SE +/- 0.74, N = 4)

FFmpeg 1.0: H.264 HD To NTSC DV
Seconds, Fewer Is Better
  GCC 4.6.3: 162.54 (SE +/- 0.55, N = 3)
  GCC 4.7.2: 157.40 (SE +/- 0.77, N = 3)
  GCC 4.8 Quad: 134.11 (SE +/- 1.76, N = 3)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lm -pthread
Additional options listed on the chart: -lbz2 -lrt -lbz2 -lrt -std=c99 -fomit-frame-pointer -marm -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

FFTE 5.0: Test: N=64, 1D Complex FFT Routine
MFLOPS, More Is Better
  GCC 4.6.3: 1257.00 (SE +/- 4.27, N = 3)
  GCC 4.7.2: 1476.59 (SE +/- 2.17, N = 3)
  GCC 4.8 Quad: 1434.27 (SE +/- 0.28, N = 3)
(F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Additional options listed on the chart: -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread

TSCP 1.81: AI Chess Performance
Nodes Per Second, More Is Better
  GCC 4.6.3: 146968 (SE +/- 150.74, N = 5)
  GCC 4.7.2: 146096 (SE +/- 233.82, N = 5)
  GCC 4.8 Quad: 134928 (SE +/- 61.74, N = 5)

Dolfyn 0.527: Computational Fluid Dynamics
Seconds, Fewer Is Better
  GCC 4.6.3: 158.91 (SE +/- 3.19, N = 6)
  GCC 4.7.2: 171.65 (SE +/- 1.08, N = 3)
  GCC 4.8 Quad: 169.19 (SE +/- 1.52, N = 3)

GMPbench 0.2: Total Time
GMPbench Score, More Is Better
  GCC 4.6.3: 451.29
  GCC 4.7.2: 452.59
  GCC 4.8 Quad: 419.62
(CC) gcc options: -O3 -fomit-frame-pointer
Additional options listed on the chart: -lm

LAME MP3 Encoding 3.99.3: WAV To MP3
Seconds, Fewer Is Better
  GCC 4.6.3: 52.30 (SE +/- 0.05, N = 5)
  GCC 4.7.2: 54.48 (SE +/- 0.05, N = 5)
  GCC 4.8 Quad: 51.45 (SE +/- 0.08, N = 5)
(CC) gcc options: -O3 -fomit-frame-pointer -ffast-math -pipe -lm

CacheBench: Test: Write
MB/s, More Is Better
  GCC 4.6.3: 3521.37 (SE +/- 6.16, N = 3)
  GCC 4.7.2: 3722.67 (SE +/- 17.55, N = 3)
(CC) gcc options: -lrt

SciMark 2.0: Computational Test: Monte Carlo
Mflops, More Is Better
  GCC 4.6.3: 167.90 (SE +/- 0.13, N = 4)
  GCC 4.7.2: 160.62 (SE +/- 0.12, N = 4)
  GCC 4.8 Quad: 159.31 (SE +/- 0.00, N = 4)

SciMark 2.0: Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
  GCC 4.6.3: 435.03 (SE +/- 0.92, N = 4)
  GCC 4.7.2: 450.12 (SE +/- 2.60, N = 4)
  GCC 4.8 Quad: 427.37 (SE +/- 1.67, N = 4)

SciMark 2.0: Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
  GCC 4.6.3: 333.34 (SE +/- 2.68, N = 4)
  GCC 4.7.2: 338.23 (SE +/- 0.53, N = 4)
  GCC 4.8 Quad: 344.50 (SE +/- 1.09, N = 4)

x264 2012-10-03: H.264 Video Encoding
Frames Per Second, More Is Better
  GCC 4.6.3: 10.62 (SE +/- 0.08, N = 3)
  GCC 4.7.2: 10.83 (SE +/- 0.06, N = 3)

CacheBench: Test: Read / Modify / Write
MB/s, More Is Better
  GCC 4.6.3: 6979.64 (SE +/- 42.99, N = 3)
  GCC 4.7.2: 7092.44 (SE +/- 39.96, N = 3)
(CC) gcc options: -lrt

NGINX Benchmark 1.0.11: Static Web Page Serving
Requests Per Second, More Is Better
  GCC 4.6.3: 3535.77 (SE +/- 9.45, N = 3)
  GCC 4.7.2: 3578.55 (SE +/- 70.90, N = 3)
(CC) gcc options: -lpthread -lcrypt -lcrypto -lz

OpenSSL 1.0.1c: RSA 4096-bit Performance
Signs Per Second, More Is Better
  GCC 4.6.3: 10.00 (SE +/- 0.00, N = 3)
  GCC 4.7.2: 9.90 (SE +/- 0.00, N = 3)
(CC) gcc options: -march=armv7-a -O3 -lssl -lcrypto -ldl

Ogg Encoding 1.3.0: WAV To Ogg
Seconds, Fewer Is Better
  GCC 4.6.3: 31.50 (SE +/- 0.13, N = 5)
  GCC 4.7.2: 31.75 (SE +/- 0.16, N = 5)
(CC) gcc options: -O2 -ffast-math -fsigned-char -lm -logg

CacheBench: Test: Read
MB/s, More Is Better
  GCC 4.6.3: 2570.83 (SE +/- 6.33, N = 3)
  GCC 4.7.2: 2582.33 (SE +/- 0.34, N = 3)
(CC) gcc options: -lrt

FLAC Audio Encoding 1.2.1: WAV To FLAC
Seconds, Fewer Is Better
  GCC 4.6.3: 19.81 (SE +/- 0.04, N = 5)
  GCC 4.7.2: 19.18 (SE +/- 0.06, N = 5)
  GCC 4.8 Quad: 21.43 (SE +/- 1.35, N = 10)
(CXX) g++ options: -O2 -lm
Additional options listed on the chart: -logg -logg

Crafty 23.4: Elapsed Time
Seconds, Fewer Is Better
  GCC 4.6.3: 0.06 (SE +/- 0.01, N = 6)
  GCC 4.7.2: 0.06 (SE +/- 0.00, N = 6)
  GCC 4.8 Quad: 0.04 (SE +/- 0.00, N = 6)
(CC) gcc options: -lstdc++ -lm

C-Ray 1.1: Total Time
Seconds, Fewer Is Better
  GCC 4.6.3: 447.38 (SE +/- 7.03, N = 6)
  GCC 4.7.2: 348.61 (SE +/- 16.52, N = 6)
  GCC 4.8 Quad: 143.22 (SE +/- 10.30, N = 6)
(CC) gcc options: -lm -lpthread -O3

CLOMP 3.3: Static OMP Speedup
Speedup, More Is Better
  GCC 4.6.3: 1.70 (SE +/- 0.06, N = 10)
  GCC 4.7.2: 1.81 (SE +/- 0.01, N = 5)
  GCC 4.8 Quad: 3.11 (SE +/- 0.30, N = 10)
(CC) gcc options: --openmp -O3 -lm
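
Each result above reports a mean together with a standard error (SE) and run count (N). One rough way to read those figures, sketched below in Python, is to express the gap between two means in units of their combined standard error. This is only a screening heuristic: the helper `se_gap` and the threshold of 2.0 are assumptions for illustration, not something the Phoronix Test Suite reports, and with N as low as 3 the result should not be read as a formal significance test. The (mean, SE) pairs are copied from the GCC 4.6.3 and GCC 4.7.2 results.

```python
import math


def se_gap(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# ((mean, SE) for GCC 4.6.3, (mean, SE) for GCC 4.7.2), copied from the results.
PAIRS = {
    "Primesieve": ((957.91, 6.82), (810.27, 9.65)),
    "Dolfyn": ((158.91, 3.19), (171.65, 1.08)),
    "NGINX": ((3535.77, 9.45), (3578.55, 70.90)),
    "Ogg Encoding": ((31.50, 0.13), (31.75, 0.16)),
}

THRESHOLD = 2.0  # assumption: rule-of-thumb cutoff, not taken from the source

if __name__ == "__main__":
    for name, ((m_a, s_a), (m_b, s_b)) in PAIRS.items():
        gap = se_gap(m_a, s_a, m_b, s_b)
        verdict = "likely real" if gap > THRESHOLD else "within noise"
        print(f"{name:14s} gap = {gap:5.2f} combined SE ({verdict})")
```

Under this heuristic, small differences such as the NGINX and Ogg Encoding results fall inside run-to-run variation, while the Primesieve difference is far outside it.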
Phoronix Test Suite v10.8.4