LLVM Clang 3.2 Release Final release benchmarking of LLVM Clang 3.2 final release using optimized/non-debug/non-assert builds of LLVM Clang 3.1, LLVM Clang 3.2, GCC 4.7.2, and GCC 4.8 snapshot from late December. Benchmarking on an Intel Core i7 3770K Ivy Bridge processor with flags of -O3 and -march=native. Compiler benchmarking for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1212278-RA-LLVMCLANG86&sro&grr .
LLVM Clang 3.2 Release Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM Clang 3.1 LLVM Clang 3.2 GCC 4.7.2 GCC 4.8.0 20121223 Intel Core i7-3770K @ 3.50GHz (8 Cores) ECS Z77H2-A2X v1.0 Intel Xeon E3-1200 v2/3rd 8192MB 60GB OCZ VERTEX2 NVIDIA GeForce GT 220 (405/324MHz) Realtek ALC892 DELL P2210H Realtek RTL8111/8168B + Intel Centrino Advanced-N 6205 Ubuntu 13.04 3.7.0-7-generic (x86_64) Unity 6.6.0 X Server 1.13.0.902 (1.13.1 RC 2) nouveau 1.0.4 3.0 Mesa 9.0.1 Gallium 0.4 Clang 3.1 + LLVM 3.1 ext4 1920x1080 Clang 3.2 + LLVM 3.2svn GCC 4.7.2 GCC 4.8.0 20121223 OpenBenchmarking.org Compiler Details - LLVM Clang 3.1: Optimized build; Built Dec 26 2012 (09:07:02); Default target: x86_64-unknown-linux-gnu; Host CPU: i686 - LLVM Clang 3.2: Optimized build; Built Dec 26 2012 (08:48:06); Default target: x86_64-unknown-linux-gnu; Host CPU: core-avx-i - GCC 4.7.2: --enable-checking=release --enable-languages=c,c++ --enable-lto - GCC 4.8.0 20121223: --enable-checking=release --enable-languages=c,c++ --enable-lto Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
LLVM Clang 3.2 Release apache: Static Web Page Serving nginx: Static Web Page Serving openssl: RSA 4096-bit Performance tachyon: Total Time ffmpeg: H.264 HD To NTSC DV encode-mp3: WAV To MP3 smallpt: Global Illumination Renderer; 100 Samples c-ray: Total Time compress-7zip: Compress Speed Test himeno: Poisson Pressure Solver graphics-magick: Local Adaptive Thresholding graphics-magick: HWB Color Space graphics-magick: Resizing graphics-magick: Sharpen graphics-magick: Blur x264: H.264 Video Encoding vpxenc: vpxenc john-the-ripper: Blowfish scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo blake2: Phoronix Test Suite v4.2.0m3 mafft: Multiple Sequence Alignment hmmer: Pfam Database Search LLVM Clang 3.1 LLVM Clang 3.2 GCC 4.7.2 GCC 4.8.0 20121223 31339.19 36541.41 129.57 11.03 17.08 13.89 217 39.28 21415 1626.13 46 141 106 54 91 149.76 27.00 971 1681.08 3049.85 2352.37 334.23 614.02 6.45 12.16 31118.52 36743.93 130.37 11.15 16.84 13.34 227 32.31 21169 1641.07 46 149 91 31 86 150.40 27.73 981 1682.23 2402.43 2566.30 334.01 618.43 5.84 6.27 12.20 31218.72 36874.67 130.73 13.06 16.55 13.76 38 32.13 21824 1651.82 118 198 168 95 141 156.08 28.27 6035 1180.32 2407.84 2250.55 347.48 423.57 5.36 5.69 10.09 31010.59 36709.66 130.57 13.03 16.60 13.46 38 28.15 21393 1676.60 118 197 167 95 140 27.67 1179.18 2391.65 2260.04 338.87 555.63 5.32 5.86 10.15 OpenBenchmarking.org
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.3 Static Web Page Serving GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 7K 14K 21K 28K 35K SE +/- 315.04, N = 3 SE +/- 211.88, N = 3 SE +/- 289.39, N = 3 SE +/- 215.45, N = 3 31218.72 31010.59 31339.19 31118.52 -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.0.11 Static Web Page Serving GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 8K 16K 24K 32K 40K SE +/- 69.09, N = 3 SE +/- 182.99, N = 3 SE +/- 54.40, N = 3 SE +/- 266.37, N = 3 36874.67 36709.66 36541.41 36743.93 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.1c RSA 4096-bit Performance GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 SE +/- 0.34, N = 3 SE +/- 0.15, N = 3 130.73 130.57 129.57 130.37 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.98.9 Total Time GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 13.06 13.03 11.03 11.15 1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 1.0 H.264 HD To NTSC DV GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.21, N = 3 16.55 16.60 17.08 16.84 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -lrt
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 4 8 12 16 20 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 13.76 13.46 13.89 13.34
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 2.33, N = 3 38 38 217 227 -march=native -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 32.13 28.15 39.28 32.31 -march=native -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 5K 10K 15K 20K 25K SE +/- 103.34, N = 3 SE +/- 81.74, N = 3 SE +/- 370.18, N = 4 SE +/- 392.50, N = 3 21824 21393 21415 21169 1. (CXX) g++ options: -pipe -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 400 800 1200 1600 2000 SE +/- 30.11, N = 3 SE +/- 2.28, N = 3 SE +/- 2.66, N = 3 SE +/- 3.70, N = 3 1651.82 1676.60 1626.13 1641.07 -march=native -march=native -march=native 1. (CC) gcc options: -O3
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Local Adaptive Thresholding GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 118 118 46 46 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: HWB Color Space GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 198 197 141 149 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Resizing GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 168 167 106 91 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Sharpen GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 95 95 54 31 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Blur GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 141 140 91 86 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2012-10-03 H.264 Video Encoding GCC 4.7.2 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.60, N = 3 SE +/- 1.17, N = 3 SE +/- 1.79, N = 3 156.08 149.76 150.40
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 1.1.0 vpxenc GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 7 14 21 28 35 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.16, N = 3 SE +/- 0.08, N = 3 28.27 27.67 27.00 27.73 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9-jumbo-7 Test: Blowfish GCC 4.7.2 LLVM Clang 3.1 LLVM Clang 3.2 1300 2600 3900 5200 6500 SE +/- 0.00, N = 3 SE +/- 2.67, N = 3 SE +/- 4.37, N = 3 6035 971 981 1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 400 800 1200 1600 2000 SE +/- 1.14, N = 4 SE +/- 0.00, N = 4 SE +/- 1.15, N = 4 SE +/- 1.33, N = 4 1180.32 1179.18 1681.08 1682.23
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 700 1400 2100 2800 3500 SE +/- 3.14, N = 4 SE +/- 4.38, N = 4 SE +/- 7.43, N = 4 SE +/- 4.42, N = 4 2407.84 2391.65 3049.85 2402.43
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 600 1200 1800 2400 3000 SE +/- 0.00, N = 4 SE +/- 11.55, N = 4 SE +/- 5.06, N = 4 SE +/- 15.62, N = 4 2250.55 2260.04 2352.37 2566.30
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 80 160 240 320 400 SE +/- 1.06, N = 4 SE +/- 0.67, N = 4 SE +/- 0.53, N = 4 SE +/- 3.32, N = 4 347.48 338.87 334.23 334.01
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 130 260 390 520 650 SE +/- 0.48, N = 4 SE +/- 0.41, N = 4 SE +/- 1.50, N = 4 SE +/- 0.85, N = 4 423.57 555.63 614.02 618.43
BLAKE2 Phoronix Test Suite v4.2.0m3 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20121223 Phoronix Test Suite v4.2.0m3 GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.2 1.314 2.628 3.942 5.256 6.57 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 5.36 5.32 5.84 1. (CC) gcc options: -std=gnu99 -O3 -march=native
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 2 4 6 8 10 SE +/- 0.12, N = 6 SE +/- 0.12, N = 6 SE +/- 0.13, N = 6 SE +/- 0.02, N = 3 5.69 5.86 6.45 6.27 1. (CC) gcc options: -O3 -lm -lpthread
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 10.09 10.15 12.16 12.20 -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Phoronix Test Suite v10.8.5