LLVM Clang 3.2 Release
Benchmarks of the final LLVM Clang 3.2 release, using optimized (non-debug, non-assert) builds of LLVM Clang 3.1, LLVM Clang 3.2, GCC 4.7.2, and a GCC 4.8 snapshot from late December. Testing was done on an Intel Core i7 3770K Ivy Bridge processor with the -O3 and -march=native flags. This compiler benchmarking is for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1212278-RA-LLVMCLANG86&grs .
LLVM Clang 3.2 Release
Processor: Intel Core i7-3770K @ 3.50GHz (8 Cores)
Motherboard: ECS Z77H2-A2X v1.0
Chipset: Intel Xeon E3-1200 v2/3rd
Memory: 8192MB
Disk: 60GB OCZ VERTEX2
Graphics: NVIDIA GeForce GT 220 (405/324MHz)
Audio: Realtek ALC892
Monitor: DELL P2210H
Network: Realtek RTL8111/8168B + Intel Centrino Advanced-N 6205
OS: Ubuntu 13.04
Kernel: 3.7.0-7-generic (x86_64)
Desktop: Unity 6.6.0
Display Server: X Server 1.13.0.902 (1.13.1 RC 2)
Display Driver: nouveau 1.0.4
OpenGL: 3.0 Mesa 9.0.1 Gallium 0.4
File-System: ext4
Screen Resolution: 1920x1080
Compiler
 - LLVM Clang 3.1: Clang 3.1 + LLVM 3.1
 - LLVM Clang 3.2: Clang 3.2 + LLVM 3.2svn
 - GCC 4.7.2: GCC 4.7.2
 - GCC 4.8.0 20121223: GCC 4.8.0 20121223
Compiler Details
 - LLVM Clang 3.1: Optimized build; Built Dec 26 2012 (09:07:02); Default target: x86_64-unknown-linux-gnu; Host CPU: i686
 - LLVM Clang 3.2: Optimized build; Built Dec 26 2012 (08:48:06); Default target: x86_64-unknown-linux-gnu; Host CPU: core-avx-i
 - GCC 4.7.2: --enable-checking=release --enable-languages=c,c++ --enable-lto
 - GCC 4.8.0 20121223: --enable-checking=release --enable-languages=c,c++ --enable-lto
Processor Details
 - Scaling Governor: ondemand
System Details
 - Compiz was running on this system.
LLVM Clang 3.2 Release: results overview (an empty cell means no result was recorded for that compiler)

| Test | LLVM Clang 3.1 | LLVM Clang 3.2 | GCC 4.7.2 | GCC 4.8.0 20121223 |
| john-the-ripper: Blowfish | 971 | 981 | 6035 | |
| smallpt: Global Illumination Renderer; 100 Samples | 217 | 227 | 38 | 38 |
| graphics-magick: Sharpen | 54 | 31 | 95 | 95 |
| graphics-magick: Local Adaptive Thresholding | 46 | 46 | 118 | 118 |
| graphics-magick: Resizing | 106 | 91 | 168 | 167 |
| graphics-magick: Blur | 91 | 86 | 141 | 140 |
| scimark2: Monte Carlo | 614.02 | 618.43 | 423.57 | 555.63 |
| scimark2: Jacobi Successive Over-Relaxation | 1681.08 | 1682.23 | 1180.32 | 1179.18 |
| graphics-magick: HWB Color Space | 141 | 149 | 198 | 197 |
| c-ray: Total Time | 39.28 | 32.31 | 32.13 | 28.15 |
| scimark2: Dense LU Matrix Factorization | 3049.85 | 2402.43 | 2407.84 | 2391.65 |
| hmmer: Pfam Database Search | 12.16 | 12.20 | 10.09 | 10.15 |
| tachyon: Total Time | 11.03 | 11.15 | 13.06 | 13.03 |
| scimark2: Sparse Matrix Multiply | 2352.37 | 2566.30 | 2250.55 | 2260.04 |
| mafft: Multiple Sequence Alignment | 6.45 | 6.27 | 5.69 | 5.86 |
| blake2: Phoronix Test Suite v4.2.0m3 | | 5.84 | 5.36 | 5.32 |
| vpxenc: vpxenc | 27.00 | 27.73 | 28.27 | 27.67 |
| x264: H.264 Video Encoding | 149.76 | 150.40 | 156.08 | |
| encode-mp3: WAV To MP3 | 13.89 | 13.34 | 13.76 | 13.46 |
| scimark2: Fast Fourier Transform | 334.23 | 334.01 | 347.48 | 338.87 |
| ffmpeg: H.264 HD To NTSC DV | 17.08 | 16.84 | 16.55 | 16.60 |
| himeno: Poisson Pressure Solver | 1626.13 | 1641.07 | 1651.82 | 1676.60 |
| compress-7zip: Compress Speed Test | 21415 | 21169 | 21824 | 21393 |
| apache: Static Web Page Serving | 31339.19 | 31118.52 | 31218.72 | 31010.59 |
| nginx: Static Web Page Serving | 36541.41 | 36743.93 | 36874.67 | 36709.66 |
| openssl: RSA 4096-bit Performance | 129.57 | 130.37 | 130.73 | 130.57 |

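The overview mixes tests where a higher number is better with tests where a lower number is better, so rows cannot be compared directly. As a rough illustration only, the Python sketch below normalizes a few rows into a ratio where a value above 1.0 favours LLVM Clang 3.2 over GCC 4.7.2. The numbers are copied from the overview and the direction comes from each test's "More Is Better" or "Fewer Is Better" label; the names `ROWS` and `clang32_vs_gcc472` are invented for this example and are not part of the Phoronix Test Suite.

```python
# Each row: (direction, LLVM Clang 3.2 result, GCC 4.7.2 result).
# Values are copied from the overview; "more" means higher is better,
# "fewer" means lower is better.
ROWS = {
    "john-the-ripper: Blowfish": ("more", 981, 6035),
    "smallpt: Global Illumination Renderer": ("fewer", 227, 38),
    "scimark2: Monte Carlo": ("more", 618.43, 423.57),
    "c-ray: Total Time": ("fewer", 32.31, 32.13),
    "tachyon: Total Time": ("fewer", 11.15, 13.06),
}


def clang32_vs_gcc472(direction, clang, gcc):
    """Return a ratio where > 1.0 means Clang 3.2 did better than GCC 4.7.2."""
    if direction == "more":
        return clang / gcc
    if direction == "fewer":
        return gcc / clang
    raise ValueError(f"unknown direction: {direction!r}")


ratios = {name: clang32_vs_gcc472(*row) for name, row in ROWS.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

Inverting the ratio for "fewer is better" tests keeps the reading consistent: in every row, 1.0 means parity and larger means Clang 3.2 is ahead.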
John The Ripper 1.7.9-jumbo-7
Test: Blowfish
Real C/S, More Is Better
 - LLVM Clang 3.1: 971 (SE +/- 2.67, N = 3)
 - LLVM Clang 3.2: 981 (SE +/- 4.37, N = 3)
 - GCC 4.7.2: 6035 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl

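Each result is reported as a mean followed by "SE +/- x, N = n". The export does not define SE; assuming it is the standard error of the mean (sample standard deviation divided by the square root of the run count), a minimal Python sketch of that calculation looks like this. The run values are hypothetical, since per-run numbers are not part of this export, and `standard_error` is a name chosen for the example.

```python
import math
import statistics


def standard_error(runs):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(runs) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(runs) / math.sqrt(len(runs))


# Hypothetical per-run results; NOT taken from the benchmark export.
hypothetical_runs = [968.0, 970.0, 975.0]

mean = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)
print(f"{mean:.0f} (SE +/- {se:.2f}, N = {len(hypothetical_runs)})")
```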
Smallpt 1.0
Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 217 (SE +/- 0.00, N = 3)
 - LLVM Clang 3.2: 227 (SE +/- 2.33, N = 3)
 - GCC 4.7.2: 38 (SE +/- 0.33, N = 3)
 - GCC 4.8.0 20121223: 38 (SE +/- 0.00, N = 3)
Per-result options: -march=native (listed for three of the four results)
1. (CXX) g++ options: -fopenmp -O3

GraphicsMagick 1.3.16
Operation: Sharpen
Iterations Per Minute, More Is Better
 - LLVM Clang 3.1: 54 (SE +/- 0.00, N = 3)
 - LLVM Clang 3.2: 31 (SE +/- 0.00, N = 3)
 - GCC 4.7.2: 95 (SE +/- 0.00, N = 3)
 - GCC 4.8.0 20121223: 95 (SE +/- 0.00, N = 3)
Per-result options: -march=native; -std=gnu99 -fopenmp -march=native -lrt; -std=gnu99 -fopenmp -march=native -lrt
1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Local Adaptive Thresholding
Iterations Per Minute, More Is Better
 - LLVM Clang 3.1: 46 (SE +/- 0.00, N = 3)
 - LLVM Clang 3.2: 46 (SE +/- 0.00, N = 3)
 - GCC 4.7.2: 118 (SE +/- 0.00, N = 3)
 - GCC 4.8.0 20121223: 118 (SE +/- 0.00, N = 3)
Per-result options: -march=native; -std=gnu99 -fopenmp -march=native -lrt; -std=gnu99 -fopenmp -march=native -lrt
1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Resizing
Iterations Per Minute, More Is Better
 - LLVM Clang 3.1: 106 (SE +/- 0.33, N = 3)
 - LLVM Clang 3.2: 91 (SE +/- 0.00, N = 3)
 - GCC 4.7.2: 168 (SE +/- 0.33, N = 3)
 - GCC 4.8.0 20121223: 167 (SE +/- 0.33, N = 3)
Per-result options: -march=native; -std=gnu99 -fopenmp -march=native -lrt; -std=gnu99 -fopenmp -march=native -lrt
1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.16
Operation: Blur
Iterations Per Minute, More Is Better
 - LLVM Clang 3.1: 91 (SE +/- 0.33, N = 3)
 - LLVM Clang 3.2: 86 (SE +/- 0.00, N = 3)
 - GCC 4.7.2: 141 (SE +/- 0.00, N = 3)
 - GCC 4.8.0 20121223: 140 (SE +/- 0.00, N = 3)
Per-result options: -march=native; -std=gnu99 -fopenmp -march=native -lrt; -std=gnu99 -fopenmp -march=native -lrt
1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread

SciMark 2.0
Computational Test: Monte Carlo
Mflops, More Is Better
 - LLVM Clang 3.1: 614.02 (SE +/- 1.50, N = 4)
 - LLVM Clang 3.2: 618.43 (SE +/- 0.85, N = 4)
 - GCC 4.7.2: 423.57 (SE +/- 0.48, N = 4)
 - GCC 4.8.0 20121223: 555.63 (SE +/- 0.41, N = 4)

SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
 - LLVM Clang 3.1: 1681.08 (SE +/- 1.15, N = 4)
 - LLVM Clang 3.2: 1682.23 (SE +/- 1.33, N = 4)
 - GCC 4.7.2: 1180.32 (SE +/- 1.14, N = 4)
 - GCC 4.8.0 20121223: 1179.18 (SE +/- 0.00, N = 4)

GraphicsMagick 1.3.16
Operation: HWB Color Space
Iterations Per Minute, More Is Better
 - LLVM Clang 3.1: 141 (SE +/- 0.33, N = 3)
 - LLVM Clang 3.2: 149 (SE +/- 0.00, N = 3)
 - GCC 4.7.2: 198 (SE +/- 0.00, N = 3)
 - GCC 4.8.0 20121223: 197 (SE +/- 0.33, N = 3)
Per-result options: -march=native; -std=gnu99 -fopenmp -march=native -lrt; -std=gnu99 -fopenmp -march=native -lrt
1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread

C-Ray 1.1
Total Time
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 39.28 (SE +/- 0.01, N = 3)
 - LLVM Clang 3.2: 32.31 (SE +/- 0.01, N = 3)
 - GCC 4.7.2: 32.13 (SE +/- 0.01, N = 3)
 - GCC 4.8.0 20121223: 28.15 (SE +/- 0.00, N = 3)
Per-result options: -march=native (listed for three of the four results)
1. (CC) gcc options: -lm -lpthread -O3

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
 - LLVM Clang 3.1: 3049.85 (SE +/- 7.43, N = 4)
 - LLVM Clang 3.2: 2402.43 (SE +/- 4.42, N = 4)
 - GCC 4.7.2: 2407.84 (SE +/- 3.14, N = 4)
 - GCC 4.8.0 20121223: 2391.65 (SE +/- 4.38, N = 4)

Timed HMMer Search 2.3.2
Pfam Database Search
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 12.16 (SE +/- 0.10, N = 3)
 - LLVM Clang 3.2: 12.20 (SE +/- 0.03, N = 3)
 - GCC 4.7.2: 10.09 (SE +/- 0.01, N = 3)
 - GCC 4.8.0 20121223: 10.15 (SE +/- 0.01, N = 3)
Per-result options: -march=native (listed for three of the four results)
1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm

Tachyon 0.98.9
Total Time
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 11.03 (SE +/- 0.01, N = 3)
 - LLVM Clang 3.2: 11.15 (SE +/- 0.02, N = 3)
 - GCC 4.7.2: 13.06 (SE +/- 0.02, N = 3)
 - GCC 4.8.0 20121223: 13.03 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
 - LLVM Clang 3.1: 2352.37 (SE +/- 5.06, N = 4)
 - LLVM Clang 3.2: 2566.30 (SE +/- 15.62, N = 4)
 - GCC 4.7.2: 2250.55 (SE +/- 0.00, N = 4)
 - GCC 4.8.0 20121223: 2260.04 (SE +/- 11.55, N = 4)

Timed MAFFT Alignment 6.864
Multiple Sequence Alignment
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 6.45 (SE +/- 0.13, N = 6)
 - LLVM Clang 3.2: 6.27 (SE +/- 0.02, N = 3)
 - GCC 4.7.2: 5.69 (SE +/- 0.12, N = 6)
 - GCC 4.8.0 20121223: 5.86 (SE +/- 0.12, N = 6)
1. (CC) gcc options: -O3 -lm -lpthread

BLAKE2 20121223
Phoronix Test Suite v4.2.0m3
Cycles Per Byte, Fewer Is Better
 - LLVM Clang 3.2: 5.84 (SE +/- 0.03, N = 3)
 - GCC 4.7.2: 5.36 (SE +/- 0.01, N = 3)
 - GCC 4.8.0 20121223: 5.32 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -std=gnu99 -O3 -march=native

VP8 libvpx Encoding 1.1.0
vpxenc
Frames Per Second, More Is Better
 - LLVM Clang 3.1: 27.00 (SE +/- 0.16, N = 3)
 - LLVM Clang 3.2: 27.73 (SE +/- 0.08, N = 3)
 - GCC 4.7.2: 28.27 (SE +/- 0.14, N = 3)
 - GCC 4.8.0 20121223: 27.67 (SE +/- 0.19, N = 3)
1. (CC) gcc options: -m64 -lvpx -lm -lpthread

x264 2012-10-03
H.264 Video Encoding
Frames Per Second, More Is Better
 - LLVM Clang 3.1: 149.76 (SE +/- 1.17, N = 3)
 - LLVM Clang 3.2: 150.40 (SE +/- 1.79, N = 3)
 - GCC 4.7.2: 156.08 (SE +/- 0.60, N = 3)

LAME MP3 Encoding 3.99.3
WAV To MP3
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 13.89 (SE +/- 0.01, N = 5)
 - LLVM Clang 3.2: 13.34 (SE +/- 0.01, N = 5)
 - GCC 4.7.2: 13.76 (SE +/- 0.01, N = 5)
 - GCC 4.8.0 20121223: 13.46 (SE +/- 0.02, N = 5)

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops, More Is Better
 - LLVM Clang 3.1: 334.23 (SE +/- 0.53, N = 4)
 - LLVM Clang 3.2: 334.01 (SE +/- 3.32, N = 4)
 - GCC 4.7.2: 347.48 (SE +/- 1.06, N = 4)
 - GCC 4.8.0 20121223: 338.87 (SE +/- 0.67, N = 4)

FFmpeg 1.0
H.264 HD To NTSC DV
Seconds, Fewer Is Better
 - LLVM Clang 3.1: 17.08 (SE +/- 0.20, N = 3)
 - LLVM Clang 3.2: 16.84 (SE +/- 0.21, N = 3)
 - GCC 4.7.2: 16.55 (SE +/- 0.09, N = 3)
 - GCC 4.8.0 20121223: 16.60 (SE +/- 0.08, N = 3)
1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -lrt

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS, More Is Better
 - LLVM Clang 3.1: 1626.13 (SE +/- 2.66, N = 3)
 - LLVM Clang 3.2: 1641.07 (SE +/- 3.70, N = 3)
 - GCC 4.7.2: 1651.82 (SE +/- 30.11, N = 3)
 - GCC 4.8.0 20121223: 1676.60 (SE +/- 2.28, N = 3)
Per-result options: -march=native (listed for three of the four results)
1. (CC) gcc options: -O3

7-Zip Compression 9.20.1
Compress Speed Test
MIPS, More Is Better
 - LLVM Clang 3.1: 21415 (SE +/- 370.18, N = 4)
 - LLVM Clang 3.2: 21169 (SE +/- 392.50, N = 3)
 - GCC 4.7.2: 21824 (SE +/- 103.34, N = 3)
 - GCC 4.8.0 20121223: 21393 (SE +/- 81.74, N = 3)
1. (CXX) g++ options: -pipe -lpthread

Apache Benchmark 2.4.3
Static Web Page Serving
Requests Per Second, More Is Better
 - LLVM Clang 3.1: 31339.19 (SE +/- 289.39, N = 3)
 - LLVM Clang 3.2: 31118.52 (SE +/- 215.45, N = 3)
 - GCC 4.7.2: 31218.72 (SE +/- 315.04, N = 3)
 - GCC 4.8.0 20121223: 31010.59 (SE +/- 211.88, N = 3)
Per-result options: -march=native (listed for three of the four results)
1. (CC) gcc options: -shared -fPIC -pthread -O3

NGINX Benchmark 1.0.11
Static Web Page Serving
Requests Per Second, More Is Better
 - LLVM Clang 3.1: 36541.41 (SE +/- 54.40, N = 3)
 - LLVM Clang 3.2: 36743.93 (SE +/- 266.37, N = 3)
 - GCC 4.7.2: 36874.67 (SE +/- 69.09, N = 3)
 - GCC 4.8.0 20121223: 36709.66 (SE +/- 182.99, N = 3)
1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz

OpenSSL 1.0.1c
RSA 4096-bit Performance
Signs Per Second, More Is Better
 - LLVM Clang 3.1: 129.57 (SE +/- 0.34, N = 3)
 - LLVM Clang 3.2: 130.37 (SE +/- 0.15, N = 3)
 - GCC 4.7.2: 130.73 (SE +/- 0.12, N = 3)
 - GCC 4.8.0 20121223: 130.57 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

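Several of the results above differ by less than their reported standard errors, while others differ by far more. One rough way to tell the two apart is to express the gap between two means in units of their combined standard error. The Python sketch below does that for two pairs taken from this export (Apache: LLVM Clang 3.1 vs GCC 4.8.0 20121223; C-Ray: LLVM Clang 3.1 vs LLVM Clang 3.2). The function name `separation` and the threshold of 2 are assumptions made for illustration; this is a rule of thumb, not a formal significance test, and it is not something the Phoronix Test Suite reports.

```python
import math


def separation(mean_a, se_a, mean_b, se_b):
    """Gap between two means, measured in combined standard errors."""
    combined = math.hypot(se_a, se_b)  # sqrt(se_a**2 + se_b**2)
    if combined == 0:
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined


# Apache, Requests Per Second: LLVM Clang 3.1 vs GCC 4.8.0 20121223.
apache = separation(31339.19, 289.39, 31010.59, 211.88)

# C-Ray, Seconds: LLVM Clang 3.1 vs LLVM Clang 3.2.
c_ray = separation(39.28, 0.01, 32.31, 0.01)

THRESHOLD = 2.0  # illustrative cut-off, chosen for this sketch

for name, value in (("apache", apache), ("c-ray", c_ray)):
    verdict = "likely a real difference" if value > THRESHOLD else "within run-to-run noise"
    print(f"{name}: {value:.2f} combined SE -> {verdict}")
```

With only three to six runs per result, the standard errors themselves are coarse estimates, so borderline cases are best treated as inconclusive.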
Phoronix Test Suite v10.8.5