Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler PandaBoard ES Texas Instruments OMAP4460 ARMv7 dual-core ARM Cortex A9 1.2GHz benchmarking of Ubuntu 12.04 LTS hard-float with Linux 3.2 kernel. This ARMv7 compiler testing of the Precise packages for LLVM Clang 3.0 versus GCC 4.6.3 compilers. Same compiler flags used each time with -O3 CFLAGS/CXXFLAGS. Initial look at LLVM Clang C/C++ compiler performance under dual-core ARMv7 platform dual-core. ARM Linux benchmarking for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1204218-SU-ARMCLANG163&grr&sor .
Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler Processor Motherboard Memory Disk Graphics OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution ARM GCC 4.6.3 ARM LLVM-Clang 3.0 ARMv7 rev 10 @ 1.20GHz (2 Cores) OMAP4 Panda board 904MB 16GB SD16G Texas Instruments OMAP Ubuntu 12.04 3.2.0-1412-omap4 (armv7l) Unity 2D 5.10.0 X Server 1.11.3 fbdev 0.4.2 GCC 4.6 ext4 1920x1080 Clang 4.2.1 OpenBenchmarking.org
Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time x264: H.264 Video Encoding vpxenc: vpxenc openssl: RSA 4096-bit Performance n-queens: Elapsed Time compress-7zip: Compress Speed Test tscp: AI Chess Performance scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo scimark2: Composite cachebench: Read / Modify / Write cachebench: Write cachebench: Read ffte: N=64, 1D Complex FFT Routine ARM GCC 4.6.3 ARM LLVM-Clang 3.0 591 1170.77 4.03 1.01 4.60 821.79 859 75346 204.81 107.83 104.74 15.67 76.81 101.97 2376.22 2397.41 1676.59 467.39 5452.07 4.60 705.09 844 63969 24.83 16.45 19.09 5.12 11.31 15.36 416.29 2987.02 273.84 467.07 OpenBenchmarking.org
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples ARM GCC 4.6.3 130 260 390 520 650 SE +/- 0.33, N = 3 591 1. (CXX) g++ options: -fopenmp -O3 -march=native
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time ARM GCC 4.6.3 ARM LLVM-Clang 3.0 1200 2400 3600 4800 6000 SE +/- 6.39, N = 3 SE +/- 63.86, N = 3 1170.77 5452.07 1. (CC) gcc options: -lm -lpthread -O3 -march=native
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2011-12-06 H.264 Video Encoding ARM GCC 4.6.3 0.9068 1.8136 2.7204 3.6272 4.534 SE +/- 0.02, N = 3 4.03
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc ARM GCC 4.6.3 0.2273 0.4546 0.6819 0.9092 1.1365 SE +/- 0.00, N = 3 1.01 1. (CC) gcc options: -lvpx -lm -lpthread
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance ARM LLVM-Clang 3.0 ARM GCC 4.6.3 1.035 2.07 3.105 4.14 5.175 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 4.60 4.60 1. (CC) gcc options: -O3 -lssl -lcrypto -ldl
N-Queens Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better N-Queens 1.0 Elapsed Time ARM LLVM-Clang 3.0 ARM GCC 4.6.3 200 400 600 800 1000 SE +/- 0.35, N = 2 SE +/- 0.99, N = 2 705.09 821.79 1. (CC) gcc options: -static -O2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test ARM GCC 4.6.3 ARM LLVM-Clang 3.0 200 400 600 800 1000 SE +/- 10.40, N = 3 SE +/- 5.84, N = 3 859 844 1. (CXX) g++ options: -pipe -lpthread
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance ARM GCC 4.6.3 ARM LLVM-Clang 3.0 16K 32K 48K 64K 80K SE +/- 72.20, N = 5 SE +/- 56.82, N = 5 75346 63969
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation ARM GCC 4.6.3 ARM LLVM-Clang 3.0 40 80 120 160 200 SE +/- 0.34, N = 4 SE +/- 0.09, N = 4 204.81 24.83
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.15, N = 4 SE +/- 0.04, N = 4 107.83 16.45
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 1.11, N = 4 SE +/- 0.09, N = 4 104.74 19.09
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform ARM GCC 4.6.3 ARM LLVM-Clang 3.0 4 8 12 16 20 SE +/- 0.26, N = 4 SE +/- 0.05, N = 4 15.67 5.12
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.14, N = 4 SE +/- 0.02, N = 4 76.81 11.31
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite ARM GCC 4.6.3 ARM LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.25, N = 4 SE +/- 0.05, N = 4 101.97 15.36
CacheBench Test: Read / Modify / Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write ARM GCC 4.6.3 ARM LLVM-Clang 3.0 500 1000 1500 2000 2500 SE +/- 57.26, N = 6 SE +/- 2.61, N = 3 2376.22 416.29 1. (CC) gcc options: -lrt
CacheBench Test: Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write ARM LLVM-Clang 3.0 ARM GCC 4.6.3 600 1200 1800 2400 3000 SE +/- 40.45, N = 3 SE +/- 47.53, N = 3 2987.02 2397.41 1. (CC) gcc options: -lrt
CacheBench Test: Read OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read ARM GCC 4.6.3 ARM LLVM-Clang 3.0 400 800 1200 1600 2000 SE +/- 15.87, N = 3 SE +/- 2.26, N = 3 1676.59 273.84 1. (CC) gcc options: -lrt
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine ARM GCC 4.6.3 ARM LLVM-Clang 3.0 100 200 300 400 500 SE +/- 0.31, N = 3 SE +/- 1.11, N = 3 467.39 467.07 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Phoronix Test Suite v10.8.5