Intel Atom D525 Compiler Testing Intel Atom D525 testing with GCC 4.7 and LLVM/Clang 3.0 from the Ubuntu Quantal repository. Benchmarking by Michael Larabel for a future article on Phoronix.com just looking at GCC vs. Clang on low-end low-power x86 hardware.
HTML result view exported from: https://openbenchmarking.org/result/1209140-RA-INTELATOM04&gru .
Intel Atom D525 Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM-Clang 3.0 GCC 4.7.1 Intel Atom D525 @ 1.80GHz (4 Cores) FOXCONN NETBOX nT-435/535 Intel Atom D4xx/D5xx/N4xx/N5xx 2048MB 500GB Western Digital WD5000BEVT-2 Intel Atom D4xx/D5xx/N4xx/N5xx IGP Realtek ALC888 Realtek RTL8111/8168B + Realtek RTL8191SEvA Wireless LAN Ubuntu 12.10 3.5.0-14-generic (x86_64) Unity 6.4.0 X Server 1.13.0 intel 2.20.7 1.4 Mesa 8.1-devel Clang 3.0-6ubuntu3 ext4 1366x768 GCC 4.7 OpenBenchmarking.org System Details - Compiz was running on this system. Compiler Details - GCC 4.7.1: --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v
Intel Atom D525 Compiler Testing vpxenc: vpxenc ffte: N=64, 1D Complex FFT Routine scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test tscp: AI Chess Performance apache: Static Web Page Serving openssl: RSA 4096-bit Performance lammps: Rhodopsin Protein hmmer: Pfam Database Search mafft: Multiple Sequence Alignment c-ray: Total Time compress-pbzip2: 256MB File Compression smallpt: Global Illumination Renderer; 100 Samples crafty: Elapsed Time encode-flac: WAV To FLAC encode-mp3: WAV To MP3 LLVM-Clang 3.0 GCC 4.7.1 2.85 943.53 69.54 26.46 189.46 202.33 341.03 85.38 2030 117819 3135.73 7.60 426.79 110.46 67.35 490.12 106.75 1337 517.85 50.29 110.59 2.98 943.95 64.45 27.76 170.67 181.53 321.04 138.38 2094 118279 3039.21 7.60 456.43 101.73 60.53 363.44 89.67 326 463.93 92.63 OpenBenchmarking.org
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc LLVM-Clang 3.0 GCC 4.7.1 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 2.85 2.98 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine LLVM-Clang 3.0 GCC 4.7.1 200 400 600 800 1000 SE +/- 0.43, N = 3 SE +/- 0.32, N = 3 943.53 943.95 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo LLVM-Clang 3.0 GCC 4.7.1 15 30 45 60 75 SE +/- 0.00, N = 4 SE +/- 0.08, N = 4 69.54 64.45
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform LLVM-Clang 3.0 GCC 4.7.1 7 14 21 28 35 SE +/- 0.05, N = 4 SE +/- 0.03, N = 4 26.46 27.76
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.17, N = 4 SE +/- 0.23, N = 4 189.46 170.67
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.18, N = 4 SE +/- 0.24, N = 4 202.33 181.53
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation LLVM-Clang 3.0 GCC 4.7.1 70 140 210 280 350 SE +/- 0.38, N = 4 SE +/- 0.64, N = 4 341.03 321.04
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver LLVM-Clang 3.0 GCC 4.7.1 30 60 90 120 150 SE +/- 1.69, N = 6 SE +/- 0.06, N = 3 85.38 138.38 1. (CC) gcc options: -O3 -march=native
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test LLVM-Clang 3.0 GCC 4.7.1 400 800 1200 1600 2000 SE +/- 4.93, N = 3 SE +/- 2.40, N = 3 2030 2094 1. (CXX) g++ options: -pipe -lpthread
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance LLVM-Clang 3.0 GCC 4.7.1 30K 60K 90K 120K 150K SE +/- 121.45, N = 5 SE +/- 27.43, N = 5 117819 118279
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving LLVM-Clang 3.0 GCC 4.7.1 700 1400 2100 2800 3500 SE +/- 2.91, N = 3 SE +/- 6.86, N = 3 3135.73 3039.21 1. (CC) gcc options: -pthread -O3 -march=native -lm -lrt -lcrypt -lpthread -ldl
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance LLVM-Clang 3.0 GCC 4.7.1 2 4 6 8 10 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 7.60 7.60 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein LLVM-Clang 3.0 GCC 4.7.1 100 200 300 400 500 SE +/- 0.64, N = 3 SE +/- 0.67, N = 3 426.79 456.43 1. (CXX) g++ options: -lfftw -lmpich
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 0.88, N = 3 110.46 101.73 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment LLVM-Clang 3.0 GCC 4.7.1 15 30 45 60 75 SE +/- 1.10, N = 6 SE +/- 0.12, N = 3 67.35 60.53 1. (CC) gcc options: -O3 -lm -lpthread
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time LLVM-Clang 3.0 GCC 4.7.1 110 220 330 440 550 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 490.12 363.44 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Parallel BZIP2 Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.0.5 256MB File Compression LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.83, N = 3 SE +/- 1.38, N = 3 106.75 89.67 1. (CXX) g++ options: -O2 -pthread -lpthread -lbz2
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples LLVM-Clang 3.0 GCC 4.7.1 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 1337 326 1. (CXX) g++ options: -fopenmp -O3 -march=native
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time LLVM-Clang 3.0 GCC 4.7.1 110 220 330 440 550 SE +/- 0.63, N = 3 SE +/- 0.72, N = 3 517.85 463.93 1. (CC) gcc options: -lstdc++ -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC LLVM-Clang 3.0 11 22 33 44 55 SE +/- 0.05, N = 5 50.29 1. (CXX) g++ options: -O3 -march=native -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.07, N = 5 SE +/- 0.04, N = 5 110.59 92.63
Phoronix Test Suite v10.8.4