Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler PandaBoard ES Texas Instruments OMAP4460 ARMv7 dual-core ARM Cortex A9 1.2GHz benchmarking of Ubuntu 12.04 LTS hard-float with Linux 3.2 kernel. This ARMv7 compiler testing of the Precise packages for LLVM Clang 3.0 versus GCC 4.6.3 compilers. Same compiler flags used each time with -O3 CFLAGS/CXXFLAGS. Initial look at LLVM Clang C/C++ compiler performance under dual-core ARMv7 platform dual-core. ARM Linux benchmarking for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1204218-SU-ARMCLANG163&grs&sro .
Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler Processor Motherboard Memory Disk Graphics OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution ARM GCC 4.6.3 ARM LLVM-Clang 3.0 ARMv7 rev 10 @ 1.20GHz (2 Cores) OMAP4 Panda board 904MB 16GB SD16G Texas Instruments OMAP Ubuntu 12.04 3.2.0-1412-omap4 (armv7l) Unity 2D 5.10.0 X Server 1.11.3 fbdev 0.4.2 GCC 4.6 ext4 1920x1080 Clang 4.2.1 OpenBenchmarking.org
Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler scimark2: Jacobi Successive Over-Relaxation scimark2: Monte Carlo scimark2: Composite scimark2: Dense LU Matrix Factorization cachebench: Read cachebench: Read / Modify / Write scimark2: Sparse Matrix Multiply c-ray: Total Time scimark2: Fast Fourier Transform cachebench: Write tscp: AI Chess Performance n-queens: Elapsed Time compress-7zip: Compress Speed Test ffte: N=64, 1D Complex FFT Routine smallpt: Global Illumination Renderer; 100 Samples x264: H.264 Video Encoding vpxenc: vpxenc openssl: RSA 4096-bit Performance ARM GCC 4.6.3 ARM LLVM-Clang 3.0 204.81 76.81 101.97 107.83 1676.59 2376.22 104.74 1170.77 15.67 2397.41 75346 821.79 859 467.39 591 4.03 1.01 4.60 24.83 11.31 15.36 16.45 273.84 416.29 19.09 5452.07 5.12 2987.02 63969 705.09 844 467.07 4.60 OpenBenchmarking.org
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation ARM GCC 4.6.3 ARM LLVM-Clang 3.0 40 80 120 160 200 SE +/- 0.34, N = 4 SE +/- 0.09, N = 4 204.81 24.83
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.14, N = 4 SE +/- 0.02, N = 4 76.81 11.31
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.25, N = 4 SE +/- 0.05, N = 4 101.97 15.36
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.15, N = 4 SE +/- 0.04, N = 4 107.83 16.45
CacheBench Test: Read OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read ARM GCC 4.6.3 ARM LLVM-Clang 3.0 400 800 1200 1600 2000 SE +/- 15.87, N = 3 SE +/- 2.26, N = 3 1676.59 273.84 1. (CC) gcc options: -lrt
CacheBench Test: Read / Modify / Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write ARM GCC 4.6.3 ARM LLVM-Clang 3.0 500 1000 1500 2000 2500 SE +/- 57.26, N = 6 SE +/- 2.61, N = 3 2376.22 416.29 1. (CC) gcc options: -lrt
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 1.11, N = 4 SE +/- 0.09, N = 4 104.74 19.09
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time ARM GCC 4.6.3 ARM LLVM-Clang 3.0 1200 2400 3600 4800 6000 SE +/- 6.39, N = 3 SE +/- 63.86, N = 3 1170.77 5452.07 1. (CC) gcc options: -lm -lpthread -O3 -march=native
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform ARM GCC 4.6.3 ARM LLVM-Clang 3.0 4 8 12 16 20 SE +/- 0.26, N = 4 SE +/- 0.05, N = 4 15.67 5.12
CacheBench Test: Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write ARM GCC 4.6.3 ARM LLVM-Clang 3.0 600 1200 1800 2400 3000 SE +/- 47.53, N = 3 SE +/- 40.45, N = 3 2397.41 2987.02 1. (CC) gcc options: -lrt
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance ARM GCC 4.6.3 ARM LLVM-Clang 3.0 16K 32K 48K 64K 80K SE +/- 72.20, N = 5 SE +/- 56.82, N = 5 75346 63969
N-Queens Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better N-Queens 1.0 Elapsed Time ARM GCC 4.6.3 ARM LLVM-Clang 3.0 200 400 600 800 1000 SE +/- 0.99, N = 2 SE +/- 0.35, N = 2 821.79 705.09 1. (CC) gcc options: -static -O2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test ARM GCC 4.6.3 ARM LLVM-Clang 3.0 200 400 600 800 1000 SE +/- 10.40, N = 3 SE +/- 5.84, N = 3 859 844 1. (CXX) g++ options: -pipe -lpthread
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine ARM GCC 4.6.3 ARM LLVM-Clang 3.0 100 200 300 400 500 SE +/- 0.31, N = 3 SE +/- 1.11, N = 3 467.39 467.07 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples ARM GCC 4.6.3 130 260 390 520 650 SE +/- 0.33, N = 3 591 1. (CXX) g++ options: -fopenmp -O3 -march=native
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2011-12-06 H.264 Video Encoding ARM GCC 4.6.3 0.9068 1.8136 2.7204 3.6272 4.534 SE +/- 0.02, N = 3 4.03
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc ARM GCC 4.6.3 0.2273 0.4546 0.6819 0.9092 1.1365 SE +/- 0.00, N = 3 1.01 1. (CC) gcc options: -lvpx -lm -lpthread
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance ARM GCC 4.6.3 ARM LLVM-Clang 3.0 1.035 2.07 3.105 4.14 5.175 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 4.60 4.60 1. (CC) gcc options: -O3 -lssl -lcrypto -ldl
Phoronix Test Suite v10.8.5