Ubuntu 12.04 LTS ARMv7 LLVM Clang 3.0 vs. GCC 4.6 Compiler

Benchmarks of Ubuntu 12.04 LTS (hard-float, Linux 3.2 kernel) on a PandaBoard ES with the Texas Instruments OMAP4460, a dual-core ARMv7 ARM Cortex-A9 at 1.2GHz. This ARMv7 compiler testing compares the Precise packages of the LLVM Clang 3.0 and GCC 4.6.3 compilers. The same compiler flags were used each time, with -O3 CFLAGS/CXXFLAGS. This is an initial look at LLVM Clang C/C++ compiler performance on a dual-core ARMv7 platform, run as ARM Linux benchmarking for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1204218-SU-ARMCLANG163.

System details (ARM GCC 4.6.3 and ARM LLVM-Clang 3.0):

Processor: ARMv7 rev 10 @ 1.20GHz (2 Cores)
Motherboard: OMAP4 Panda board
Memory: 904MB
Disk: 16GB SD16G
Graphics: Texas Instruments OMAP
OS: Ubuntu 12.04
Kernel: 3.2.0-1412-omap4 (armv7l)
Desktop: Unity 2D 5.10.0
Display Server: X Server 1.11.3
Display Driver: fbdev 0.4.2
Compiler: GCC 4.6 (ARM GCC 4.6.3); Clang 4.2.1 (ARM LLVM-Clang 3.0)
File-System: ext4
Screen Resolution: 1920x1080

Results overview:

Test                                                  ARM GCC 4.6.3   ARM LLVM-Clang 3.0
ffte: N=64, 1D Complex FFT Routine                    467.39          467.07
cachebench: Read                                      1676.59         273.84
cachebench: Write                                     2397.41         2987.02
cachebench: Read / Modify / Write                     2376.22         416.29
scimark2: Composite                                   101.97          15.36
scimark2: Monte Carlo                                 76.81           11.31
scimark2: Fast Fourier Transform                      15.67           5.12
scimark2: Sparse Matrix Multiply                      104.74          19.09
scimark2: Dense LU Matrix Factorization               107.83          16.45
scimark2: Jacobi Successive Over-Relaxation           204.81          24.83
tscp: AI Chess Performance                            75346           63969
compress-7zip: Compress Speed Test                    859             844
n-queens: Elapsed Time                                821.79          705.09
openssl: RSA 4096-bit Performance                     4.60            4.60
vpxenc: vpxenc                                        1.01            -
x264: H.264 Video Encoding                            4.03            -
c-ray: Total Time                                     1170.77         5452.07
smallpt: Global Illumination Renderer; 100 Samples    591             -
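Because the tests mix "More Is Better" and "Fewer Is Better" units, the raw numbers are not directly comparable across rows. The sketch below is one possible way to normalize them into a Clang-versus-GCC ratio and summarize with a geometric mean. The values are copied from the overview; the choice of tests (only those with results for both compilers, and only the SciMark composite rather than its kernels) and the names `RESULTS`, `clang_speedup`, and `geometric_mean` are assumptions made for illustration, not part of the Phoronix Test Suite.

```python
from math import prod

# (test, GCC 4.6.3 result, Clang 3.0 result, higher_is_better)
# Values copied from the results overview; the selection of rows is an assumption.
RESULTS = [
    ("ffte", 467.39, 467.07, True),
    ("cachebench read", 1676.59, 273.84, True),
    ("cachebench write", 2397.41, 2987.02, True),
    ("cachebench rmw", 2376.22, 416.29, True),
    ("scimark2 composite", 101.97, 15.36, True),
    ("tscp", 75346, 63969, True),
    ("compress-7zip", 859, 844, True),
    ("n-queens", 821.79, 705.09, False),   # seconds, fewer is better
    ("openssl rsa4096", 4.60, 4.60, True),
    ("c-ray", 1170.77, 5452.07, False),    # seconds, fewer is better
]


def clang_speedup(gcc, clang, higher_is_better):
    """Clang performance relative to GCC; a value above 1 means Clang is faster."""
    return clang / gcc if higher_is_better else gcc / clang


def geometric_mean(values):
    """Geometric mean of positive ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


speedups = {name: clang_speedup(g, c, hib) for name, g, c, hib in RESULTS}

for name, ratio in speedups.items():
    print(f"{name:20s} {ratio:5.2f}x")
print(f"{'geometric mean':20s} {geometric_mean(speedups.values()):5.2f}x")
```

A geometric mean is used here because it treats a 2x win and a 2x loss symmetrically; an arithmetic mean of ratios would not.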

FFTE

Test: N=64, 1D Complex FFT Routine

FFTE 5.0 (MFLOPS, More Is Better)
ARM GCC 4.6.3: 467.39 (SE +/- 0.31, N = 3)
ARM LLVM-Clang 3.0: 467.07 (SE +/- 1.11, N = 3)
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

CacheBench

Test: Read

CacheBench (MB/s, More Is Better)
ARM GCC 4.6.3: 1676.59 (SE +/- 15.87, N = 3)
ARM LLVM-Clang 3.0: 273.84 (SE +/- 2.26, N = 3)
1. (CC) gcc options: -lrt

CacheBench

Test: Write

CacheBench (MB/s, More Is Better)
ARM GCC 4.6.3: 2397.41 (SE +/- 47.53, N = 3)
ARM LLVM-Clang 3.0: 2987.02 (SE +/- 40.45, N = 3)
1. (CC) gcc options: -lrt

CacheBench

Test: Read / Modify / Write

CacheBench (MB/s, More Is Better)
ARM GCC 4.6.3: 2376.22 (SE +/- 57.26, N = 6)
ARM LLVM-Clang 3.0: 416.29 (SE +/- 2.61, N = 3)
1. (CC) gcc options: -lrt

SciMark

Computational Test: Composite

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 101.97 (SE +/- 0.25, N = 4)
ARM LLVM-Clang 3.0: 15.36 (SE +/- 0.05, N = 4)

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 76.81 (SE +/- 0.14, N = 4)
ARM LLVM-Clang 3.0: 11.31 (SE +/- 0.02, N = 4)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 15.67 (SE +/- 0.26, N = 4)
ARM LLVM-Clang 3.0: 5.12 (SE +/- 0.05, N = 4)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 104.74 (SE +/- 1.11, N = 4)
ARM LLVM-Clang 3.0: 19.09 (SE +/- 0.09, N = 4)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 107.83 (SE +/- 0.15, N = 4)
ARM LLVM-Clang 3.0: 16.45 (SE +/- 0.04, N = 4)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 (Mflops, More Is Better)
ARM GCC 4.6.3: 204.81 (SE +/- 0.34, N = 4)
ARM LLVM-Clang 3.0: 24.83 (SE +/- 0.09, N = 4)
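The SciMark 2.0 composite score is the arithmetic mean of the five kernel scores, so the composite figures reported earlier can be cross-checked against the individual kernel results. The following is a minimal sketch of that check, using the Mflops values from this report; the names `KERNELS`, `REPORTED`, and `composite` are chosen here for illustration.

```python
# Kernel scores in Mflops, in the order reported above:
# Monte Carlo, FFT, Sparse Matrix Multiply, Dense LU, Jacobi SOR.
KERNELS = {
    "ARM GCC 4.6.3": [76.81, 15.67, 104.74, 107.83, 204.81],
    "ARM LLVM-Clang 3.0": [11.31, 5.12, 19.09, 16.45, 24.83],
}

# Composite scores as reported in the Composite section.
REPORTED = {"ARM GCC 4.6.3": 101.97, "ARM LLVM-Clang 3.0": 15.36}


def composite(scores):
    """Arithmetic mean of the kernel scores."""
    return sum(scores) / len(scores)


for name, scores in KERNELS.items():
    computed = composite(scores)
    # Allow for rounding in the published two-decimal figures.
    matches = abs(computed - REPORTED[name]) < 0.01
    print(f"{name}: computed {computed:.2f}, reported {REPORTED[name]:.2f}, match={matches}")
```

Both composites agree with the kernel averages to within rounding, which suggests the Clang deficit is spread across all five kernels rather than caused by a single outlier.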

TSCP

AI Chess Performance

TSCP 1.81 (Nodes Per Second, More Is Better)
ARM GCC 4.6.3: 75346 (SE +/- 72.20, N = 5)
ARM LLVM-Clang 3.0: 63969 (SE +/- 56.82, N = 5)

7-Zip Compression

Compress Speed Test

7-Zip Compression 9.20.1 (MIPS, More Is Better)
ARM GCC 4.6.3: 859 (SE +/- 10.40, N = 3)
ARM LLVM-Clang 3.0: 844 (SE +/- 5.84, N = 3)
1. (CXX) g++ options: -pipe -lpthread

N-Queens

Elapsed Time

N-Queens 1.0 (Seconds, Fewer Is Better)
ARM GCC 4.6.3: 821.79 (SE +/- 0.99, N = 2)
ARM LLVM-Clang 3.0: 705.09 (SE +/- 0.35, N = 2)
1. (CC) gcc options: -static -O2

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.0e (Signs Per Second, More Is Better)
ARM GCC 4.6.3: 4.60 (SE +/- 0.00, N = 4)
ARM LLVM-Clang 3.0: 4.60 (SE +/- 0.00, N = 4)
1. (CC) gcc options: -O3 -lssl -lcrypto -ldl

VP8 libvpx Encoding

vpxenc

VP8 libvpx Encoding 0.9.7-p1 (Frames Per Second, More Is Better)
ARM GCC 4.6.3: 1.01 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -lvpx -lm -lpthread

x264

H.264 Video Encoding

x264 2011-12-06 (Frames Per Second, More Is Better)
ARM GCC 4.6.3: 4.03 (SE +/- 0.02, N = 3)

C-Ray

Total Time

C-Ray 1.1 (Seconds, Fewer Is Better)
ARM GCC 4.6.3: 1170.77 (SE +/- 6.39, N = 3)
ARM LLVM-Clang 3.0: 5452.07 (SE +/- 63.86, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
ARM GCC 4.6.3: 591 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native


Phoronix Test Suite v10.8.4