Intel Atom D525 Compiler Testing Intel Atom D525 testing with GCC 4.7 and LLVM/Clang 3.0 from the Ubuntu Quantal repository. Benchmarking by Michael Larabel for a future article on Phoronix.com just looking at GCC vs. Clang on low-end low-power x86 hardware.
HTML result view exported from: https://openbenchmarking.org/result/1209140-RA-INTELATOM04&grs&rdt .
Intel Atom D525 Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM-Clang 3.0 GCC 4.7.1 Intel Atom D525 @ 1.80GHz (4 Cores) FOXCONN NETBOX nT-435/535 Intel Atom D4xx/D5xx/N4xx/N5xx 2048MB 500GB Western Digital WD5000BEVT-2 Intel Atom D4xx/D5xx/N4xx/N5xx IGP Realtek ALC888 Realtek RTL8111/8168B + Realtek RTL8191SEvA Wireless LAN Ubuntu 12.10 3.5.0-14-generic (x86_64) Unity 6.4.0 X Server 1.13.0 intel 2.20.7 1.4 Mesa 8.1-devel Clang 3.0-6ubuntu3 ext4 1366x768 GCC 4.7 OpenBenchmarking.org System Details - Compiz was running on this system. Compiler Details - GCC 4.7.1: --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v
Intel Atom D525 Compiler Testing smallpt: Global Illumination Renderer; 100 Samples himeno: Poisson Pressure Solver c-ray: Total Time encode-mp3: WAV To MP3 compress-pbzip2: 256MB File Compression crafty: Elapsed Time scimark2: Dense LU Matrix Factorization mafft: Multiple Sequence Alignment scimark2: Sparse Matrix Multiply hmmer: Pfam Database Search scimark2: Monte Carlo lammps: Rhodopsin Protein scimark2: Jacobi Successive Over-Relaxation scimark2: Fast Fourier Transform vpxenc: vpxenc apache: Static Web Page Serving compress-7zip: Compress Speed Test tscp: AI Chess Performance ffte: N=64, 1D Complex FFT Routine openssl: RSA 4096-bit Performance encode-flac: WAV To FLAC LLVM-Clang 3.0 GCC 4.7.1 1337 85.38 490.12 110.59 106.75 517.85 202.33 67.35 189.46 110.46 69.54 426.79 341.03 26.46 2.85 3135.73 2030 117819 943.53 7.60 50.29 326 138.38 363.44 92.63 89.67 463.93 181.53 60.53 170.67 101.73 64.45 456.43 321.04 27.76 2.98 3039.21 2094 118279 943.95 7.60 OpenBenchmarking.org
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples LLVM-Clang 3.0 GCC 4.7.1 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 1337 326 1. (CXX) g++ options: -fopenmp -O3 -march=native
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver LLVM-Clang 3.0 GCC 4.7.1 30 60 90 120 150 SE +/- 1.69, N = 6 SE +/- 0.06, N = 3 85.38 138.38 1. (CC) gcc options: -O3 -march=native
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time LLVM-Clang 3.0 GCC 4.7.1 110 220 330 440 550 SE +/- 0.08, N = 3 SE +/- 0.06, N = 3 490.12 363.44 1. (CC) gcc options: -lm -lpthread -O3 -march=native
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.07, N = 5 SE +/- 0.04, N = 5 110.59 92.63
Parallel BZIP2 Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.0.5 256MB File Compression LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.83, N = 3 SE +/- 1.38, N = 3 106.75 89.67 1. (CXX) g++ options: -O2 -pthread -lpthread -lbz2
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time LLVM-Clang 3.0 GCC 4.7.1 110 220 330 440 550 SE +/- 0.63, N = 3 SE +/- 0.72, N = 3 517.85 463.93 1. (CC) gcc options: -lstdc++ -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.18, N = 4 SE +/- 0.24, N = 4 202.33 181.53
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment LLVM-Clang 3.0 GCC 4.7.1 15 30 45 60 75 SE +/- 1.10, N = 6 SE +/- 0.12, N = 3 67.35 60.53 1. (CC) gcc options: -O3 -lm -lpthread
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.17, N = 4 SE +/- 0.23, N = 4 189.46 170.67
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search LLVM-Clang 3.0 GCC 4.7.1 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 0.88, N = 3 110.46 101.73 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo LLVM-Clang 3.0 GCC 4.7.1 15 30 45 60 75 SE +/- 0.00, N = 4 SE +/- 0.08, N = 4 69.54 64.45
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein LLVM-Clang 3.0 GCC 4.7.1 100 200 300 400 500 SE +/- 0.64, N = 3 SE +/- 0.67, N = 3 426.79 456.43 1. (CXX) g++ options: -lfftw -lmpich
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation LLVM-Clang 3.0 GCC 4.7.1 70 140 210 280 350 SE +/- 0.38, N = 4 SE +/- 0.64, N = 4 341.03 321.04
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform LLVM-Clang 3.0 GCC 4.7.1 7 14 21 28 35 SE +/- 0.05, N = 4 SE +/- 0.03, N = 4 26.46 27.76
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc LLVM-Clang 3.0 GCC 4.7.1 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 2.85 2.98 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving LLVM-Clang 3.0 GCC 4.7.1 700 1400 2100 2800 3500 SE +/- 2.91, N = 3 SE +/- 6.86, N = 3 3135.73 3039.21 1. (CC) gcc options: -pthread -O3 -march=native -lm -lrt -lcrypt -lpthread -ldl
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test LLVM-Clang 3.0 GCC 4.7.1 400 800 1200 1600 2000 SE +/- 4.93, N = 3 SE +/- 2.40, N = 3 2030 2094 1. (CXX) g++ options: -pipe -lpthread
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance LLVM-Clang 3.0 GCC 4.7.1 30K 60K 90K 120K 150K SE +/- 121.45, N = 5 SE +/- 27.43, N = 5 117819 118279
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine LLVM-Clang 3.0 GCC 4.7.1 200 400 600 800 1000 SE +/- 0.43, N = 3 SE +/- 0.32, N = 3 943.53 943.95 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance LLVM-Clang 3.0 GCC 4.7.1 2 4 6 8 10 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 7.60 7.60 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC LLVM-Clang 3.0 11 22 33 44 55 SE +/- 0.05, N = 5 50.29 1. (CXX) g++ options: -O3 -march=native -logg -lm
Phoronix Test Suite v10.8.4