LLVM Clang 3.2 Release Final release benchmarking of LLVM Clang 3.2 final release using optimized/non-debug/non-assert builds of LLVM Clang 3.1, LLVM Clang 3.2, GCC 4.7.2, and GCC 4.8 snapshot from late December. Benchmarking on an Intel Core i7 3770K Ivy Bridge processor with flags of -O3 and -march=native. Compiler benchmarking for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1212278-RA-LLVMCLANG86&rdt&grr .
LLVM Clang 3.2 Release Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 Intel Core i7-3770K @ 3.50GHz (8 Cores) ECS Z77H2-A2X v1.0 Intel Xeon E3-1200 v2/3rd 8192MB 60GB OCZ VERTEX2 NVIDIA GeForce GT 220 (405/324MHz) Realtek ALC892 DELL P2210H Realtek RTL8111/8168B + Intel Centrino Advanced-N 6205 Ubuntu 13.04 3.7.0-7-generic (x86_64) Unity 6.6.0 X Server 1.13.0.902 (1.13.1 RC 2) nouveau 1.0.4 3.0 Mesa 9.0.1 Gallium 0.4 Clang 3.2 + LLVM 3.2svn ext4 1920x1080 Clang 3.1 + LLVM 3.1 GCC 4.7.2 GCC 4.8.0 20121223 OpenBenchmarking.org Compiler Details - LLVM Clang 3.2: Optimized build; Built Dec 26 2012 (08:48:06); Default target: x86_64-unknown-linux-gnu; Host CPU: core-avx-i - LLVM Clang 3.1: Optimized build; Built Dec 26 2012 (09:07:02); Default target: x86_64-unknown-linux-gnu; Host CPU: i686 - GCC 4.7.2: --enable-checking=release --enable-languages=c,c++ --enable-lto - GCC 4.8.0 20121223: --enable-checking=release --enable-languages=c,c++ --enable-lto Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
LLVM Clang 3.2 Release apache: Static Web Page Serving nginx: Static Web Page Serving openssl: RSA 4096-bit Performance tachyon: Total Time ffmpeg: H.264 HD To NTSC DV encode-mp3: WAV To MP3 smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time compress-7zip: Compress Speed Test himeno: Poisson Pressure Solver graphics-magick: Local Adaptive Thresholding graphics-magick: HWB Color Space graphics-magick: Resizing graphics-magick: Sharpen graphics-magick: Blur x264: H.264 Video Encoding vpxenc: vpxenc john-the-ripper: Blowfish scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo blake2: Phoronix Test Suite v4.2.0m3 mafft: Multiple Sequence Alignment hmmer: Pfam Database Search LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 31118.52 36743.93 130.37 11.15 16.84 13.34 227 32.31 21169 1641.07 46 149 91 31 86 150.40 27.73 981 1682.23 2402.43 2566.30 334.01 618.43 5.84 6.27 12.20 31339.19 36541.41 129.57 11.03 17.08 13.89 217 39.28 21415 1626.13 46 141 106 54 91 149.76 27.00 971 1681.08 3049.85 2352.37 334.23 614.02 6.45 12.16 31218.72 36874.67 130.73 13.06 16.55 13.76 38 32.13 21824 1651.82 118 198 168 95 141 156.08 28.27 6035 1180.32 2407.84 2250.55 347.48 423.57 5.36 5.69 10.09 31010.59 36709.66 130.57 13.03 16.60 13.46 38 28.15 21393 1676.60 118 197 167 95 140 27.67 1179.18 2391.65 2260.04 338.87 555.63 5.32 5.86 10.15 OpenBenchmarking.org
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.3 Static Web Page Serving LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 7K 14K 21K 28K 35K SE +/- 215.45, N = 3 SE +/- 289.39, N = 3 SE +/- 315.04, N = 3 SE +/- 211.88, N = 3 31118.52 31339.19 31218.72 31010.59 -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.0.11 Static Web Page Serving LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 8K 16K 24K 32K 40K SE +/- 266.37, N = 3 SE +/- 54.40, N = 3 SE +/- 69.09, N = 3 SE +/- 182.99, N = 3 36743.93 36541.41 36874.67 36709.66 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.1c RSA 4096-bit Performance LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 30 60 90 120 150 SE +/- 0.15, N = 3 SE +/- 0.34, N = 3 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 130.37 129.57 130.73 130.57 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.98.9 Total Time LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 11.15 11.03 13.06 13.03 1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 1.0 H.264 HD To NTSC DV LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 16.84 17.08 16.55 16.60 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -lrt
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 4 8 12 16 20 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 13.34 13.89 13.76 13.46
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 50 100 150 200 250 SE +/- 2.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 227 217 38 38 -march=native -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 32.31 39.28 32.13 28.15 -march=native -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 5K 10K 15K 20K 25K SE +/- 392.50, N = 3 SE +/- 370.18, N = 4 SE +/- 103.34, N = 3 SE +/- 81.74, N = 3 21169 21415 21824 21393 1. (CXX) g++ options: -pipe -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 400 800 1200 1600 2000 SE +/- 3.70, N = 3 SE +/- 2.66, N = 3 SE +/- 30.11, N = 3 SE +/- 2.28, N = 3 1641.07 1626.13 1651.82 1676.60 -march=native -march=native -march=native 1. (CC) gcc options: -O3
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Local Adaptive Thresholding LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 46 46 118 118 -march=native -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: HWB Color Space LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 149 141 198 197 -march=native -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Resizing LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 91 106 168 167 -march=native -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Sharpen LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 31 54 95 95 -march=native -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Blur LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 86 91 141 140 -march=native -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2012-10-03 H.264 Video Encoding LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 30 60 90 120 150 SE +/- 1.79, N = 3 SE +/- 1.17, N = 3 SE +/- 0.60, N = 3 150.40 149.76 156.08
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 1.1.0 vpxenc LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 7 14 21 28 35 SE +/- 0.08, N = 3 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 27.73 27.00 28.27 27.67 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9-jumbo-7 Test: Blowfish LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 1300 2600 3900 5200 6500 SE +/- 4.37, N = 3 SE +/- 2.67, N = 3 SE +/- 0.00, N = 3 981 971 6035 1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 400 800 1200 1600 2000 SE +/- 1.33, N = 4 SE +/- 1.15, N = 4 SE +/- 1.14, N = 4 SE +/- 0.00, N = 4 1682.23 1681.08 1180.32 1179.18
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 700 1400 2100 2800 3500 SE +/- 4.42, N = 4 SE +/- 7.43, N = 4 SE +/- 3.14, N = 4 SE +/- 4.38, N = 4 2402.43 3049.85 2407.84 2391.65
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 600 1200 1800 2400 3000 SE +/- 15.62, N = 4 SE +/- 5.06, N = 4 SE +/- 0.00, N = 4 SE +/- 11.55, N = 4 2566.30 2352.37 2250.55 2260.04
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 80 160 240 320 400 SE +/- 3.32, N = 4 SE +/- 0.53, N = 4 SE +/- 1.06, N = 4 SE +/- 0.67, N = 4 334.01 334.23 347.48 338.87
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 130 260 390 520 650 SE +/- 0.85, N = 4 SE +/- 1.50, N = 4 SE +/- 0.48, N = 4 SE +/- 0.41, N = 4 618.43 614.02 423.57 555.63
BLAKE2 Phoronix Test Suite v4.2.0m3 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20121223 Phoronix Test Suite v4.2.0m3 LLVM Clang 3.2 GCC 4.7.2 GCC 4.8.0 20121223 1.314 2.628 3.942 5.256 6.57 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 5.84 5.36 5.32 1. (CC) gcc options: -std=gnu99 -O3 -march=native
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.13, N = 6 SE +/- 0.12, N = 6 SE +/- 0.12, N = 6 6.27 6.45 5.69 5.86 1. (CC) gcc options: -O3 -lm -lpthread
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search LLVM Clang 3.2 LLVM Clang 3.1 GCC 4.7.2 GCC 4.8.0 20121223 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.10, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.20 12.16 10.09 10.15 -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Phoronix Test Suite v10.8.5