Intel Atom D525 Compiler Testing Intel Atom D525 testing with GCC 4.7 and LLVM/Clang 3.0 from the Ubuntu Quantal repository. Benchmarking by Michael Larabel for a future article on Phoronix.com just looking at GCC vs. Clang on low-end low-power x86 hardware.
HTML result view exported from: https://openbenchmarking.org/result/1209140-RA-INTELATOM04&sor&grw .
Intel Atom D525 Compiler Testing Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM-Clang 3.0 GCC 4.7.1 Intel Atom D525 @ 1.80GHz (4 Cores) FOXCONN NETBOX nT-435/535 Intel Atom D4xx/D5xx/N4xx/N5xx 2048MB 500GB Western Digital WD5000BEVT-2 Intel Atom D4xx/D5xx/N4xx/N5xx IGP Realtek ALC888 Realtek RTL8111/8168B + Realtek RTL8191SEvA Wireless LAN Ubuntu 12.10 3.5.0-14-generic (x86_64) Unity 6.4.0 X Server 1.13.0 intel 2.20.7 1.4 Mesa 8.1-devel Clang 3.0-6ubuntu3 ext4 1366x768 GCC 4.7 OpenBenchmarking.org System Details - Compiz was running on this system. Compiler Details - GCC 4.7.1: --build=x86_64-linux-gnu --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-languages=c,c++,go,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-arch-32=i686 --with-tune=generic -v
Intel Atom D525 Compiler Testing tscp: AI Chess Performance scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation crafty: Elapsed Time encode-flac: WAV To FLAC encode-mp3: WAV To MP3 hmmer: Pfam Database Search mafft: Multiple Sequence Alignment himeno: Poisson Pressure Solver lammps: Rhodopsin Protein ffte: N=64, 1D Complex FFT Routine compress-7zip: Compress Speed Test compress-pbzip2: 256MB File Compression vpxenc: vpxenc c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples apache: Static Web Page Serving openssl: RSA 4096-bit Performance LLVM-Clang 3.0 GCC 4.7.1 117819 69.54 26.46 189.46 202.33 341.03 517.85 50.29 110.59 110.46 67.35 85.38 426.79 943.53 2030 106.75 2.85 490.12 1337 3135.73 7.60 118279 64.45 27.76 170.67 181.53 321.04 463.93 92.63 101.73 60.53 138.38 456.43 943.95 2094 89.67 2.98 363.44 326 3039.21 7.60 OpenBenchmarking.org
TSCP AI Chess Performance OpenBenchmarking.org Nodes Per Second, More Is Better TSCP 1.81 AI Chess Performance GCC 4.7.1 LLVM-Clang 3.0 30K 60K 90K 120K 150K SE +/- 27.43, N = 5 SE +/- 121.45, N = 5 118279 117819
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo LLVM-Clang 3.0 GCC 4.7.1 15 30 45 60 75 SE +/- 0.00, N = 4 SE +/- 0.08, N = 4 69.54 64.45
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.7.1 LLVM-Clang 3.0 7 14 21 28 35 SE +/- 0.03, N = 4 SE +/- 0.05, N = 4 27.76 26.46
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.17, N = 4 SE +/- 0.23, N = 4 189.46 170.67
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization LLVM-Clang 3.0 GCC 4.7.1 40 80 120 160 200 SE +/- 0.18, N = 4 SE +/- 0.24, N = 4 202.33 181.53
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation LLVM-Clang 3.0 GCC 4.7.1 70 140 210 280 350 SE +/- 0.38, N = 4 SE +/- 0.64, N = 4 341.03 321.04
Crafty Elapsed Time OpenBenchmarking.org Seconds, Fewer Is Better Crafty 23.4 Elapsed Time GCC 4.7.1 LLVM-Clang 3.0 110 220 330 440 550 SE +/- 0.72, N = 3 SE +/- 0.63, N = 3 463.93 517.85 1. (CC) gcc options: -lstdc++ -lm
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.2.1 WAV To FLAC LLVM-Clang 3.0 11 22 33 44 55 SE +/- 0.05, N = 5 50.29 1. (CXX) g++ options: -O3 -march=native -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.7.1 LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.04, N = 5 SE +/- 0.07, N = 5 92.63 110.59
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.7.1 LLVM-Clang 3.0 20 40 60 80 100 SE +/- 0.88, N = 3 SE +/- 0.40, N = 3 101.73 110.46 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment GCC 4.7.1 LLVM-Clang 3.0 15 30 45 60 75 SE +/- 0.12, N = 3 SE +/- 1.10, N = 6 60.53 67.35 1. (CC) gcc options: -O3 -lm -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 4.7.1 LLVM-Clang 3.0 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 1.69, N = 6 138.38 85.38 1. (CC) gcc options: -O3 -march=native
LAMMPS Molecular Dynamics Simulator Test: Rhodopsin Protein OpenBenchmarking.org Loop Time, Fewer Is Better LAMMPS Molecular Dynamics Simulator 1.0 Test: Rhodopsin Protein LLVM-Clang 3.0 GCC 4.7.1 100 200 300 400 500 SE +/- 0.64, N = 3 SE +/- 0.67, N = 3 426.79 456.43 1. (CXX) g++ options: -lfftw -lmpich
FFTE Test: N=64, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 5.0 Test: N=64, 1D Complex FFT Routine GCC 4.7.1 LLVM-Clang 3.0 200 400 600 800 1000 SE +/- 0.32, N = 3 SE +/- 0.43, N = 3 943.95 943.53 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp -lmpichf90 -lmpich -lopa -lmpl -lrt -lcr -lpthread
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.7.1 LLVM-Clang 3.0 400 800 1200 1600 2000 SE +/- 2.40, N = 3 SE +/- 4.93, N = 3 2094 2030 1. (CXX) g++ options: -pipe -lpthread
Parallel BZIP2 Compression 256MB File Compression OpenBenchmarking.org Seconds, Fewer Is Better Parallel BZIP2 Compression 1.0.5 256MB File Compression GCC 4.7.1 LLVM-Clang 3.0 20 40 60 80 100 SE +/- 1.38, N = 3 SE +/- 0.83, N = 3 89.67 106.75 1. (CXX) g++ options: -O2 -pthread -lpthread -lbz2
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 0.9.7-p1 vpxenc GCC 4.7.1 LLVM-Clang 3.0 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 2.98 2.85 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.7.1 LLVM-Clang 3.0 110 220 330 440 550 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 363.44 490.12 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.7.1 LLVM-Clang 3.0 300 600 900 1200 1500 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 326 1337 1. (CXX) g++ options: -fopenmp -O3 -march=native
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.2.21 Static Web Page Serving LLVM-Clang 3.0 GCC 4.7.1 700 1400 2100 2800 3500 SE +/- 2.91, N = 3 SE +/- 6.86, N = 3 3135.73 3039.21 1. (CC) gcc options: -pthread -O3 -march=native -lm -lrt -lcrypt -lpthread -ldl
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.0e RSA 4096-bit Performance GCC 4.7.1 LLVM-Clang 3.0 2 4 6 8 10 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 7.60 7.60 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Phoronix Test Suite v10.8.4