LLVM Clang 3.2 Release Final release benchmarking of LLVM Clang 3.2 final release using optimized/non-debug/non-assert builds of LLVM Clang 3.1, LLVM Clang 3.2, GCC 4.7.2, and GCC 4.8 snapshot from late December. Benchmarking on an Intel Core i7 3770K Ivy Bridge processor with flags of -O3 and -march=native. Compiler benchmarking for a future article on Phoronix.com.
HTML result view exported from: https://openbenchmarking.org/result/1212278-RA-LLVMCLANG86&sro&grw .
LLVM Clang 3.2 Release Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution LLVM Clang 3.1 LLVM Clang 3.2 GCC 4.7.2 GCC 4.8.0 20121223 Intel Core i7-3770K @ 3.50GHz (8 Cores) ECS Z77H2-A2X v1.0 Intel Xeon E3-1200 v2/3rd 8192MB 60GB OCZ VERTEX2 NVIDIA GeForce GT 220 (405/324MHz) Realtek ALC892 DELL P2210H Realtek RTL8111/8168B + Intel Centrino Advanced-N 6205 Ubuntu 13.04 3.7.0-7-generic (x86_64) Unity 6.6.0 X Server 1.13.0.902 (1.13.1 RC 2) nouveau 1.0.4 3.0 Mesa 9.0.1 Gallium 0.4 Clang 3.1 + LLVM 3.1 ext4 1920x1080 Clang 3.2 + LLVM 3.2svn GCC 4.7.2 GCC 4.8.0 20121223 OpenBenchmarking.org Compiler Details - LLVM Clang 3.1: Optimized build; Built Dec 26 2012 (09:07:02); Default target: x86_64-unknown-linux-gnu; Host CPU: i686 - LLVM Clang 3.2: Optimized build; Built Dec 26 2012 (08:48:06); Default target: x86_64-unknown-linux-gnu; Host CPU: core-avx-i - GCC 4.7.2: --enable-checking=release --enable-languages=c,c++ --enable-lto - GCC 4.8.0 20121223: --enable-checking=release --enable-languages=c,c++ --enable-lto Processor Details - Scaling Governor: ondemand System Details - Compiz was running on this system.
LLVM Clang 3.2 Release scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation blake2: Phoronix Test Suite v4.2.0m3 encode-mp3: WAV To MP3 hmmer: Pfam Database Search mafft: Multiple Sequence Alignment himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test john-the-ripper: Blowfish vpxenc: vpxenc graphics-magick: Blur graphics-magick: Sharpen graphics-magick: Resizing graphics-magick: HWB Color Space graphics-magick: Local Adaptive Thresholding tachyon: Total Time x264: H.264 Video Encoding c-ray: Total Time ffmpeg: H.264 HD To NTSC DV smallpt: Global Illumination Renderer; 100 Samples nginx: Static Web Page Serving apache: Static Web Page Serving openssl: RSA 4096-bit Performance LLVM Clang 3.1 LLVM Clang 3.2 GCC 4.7.2 GCC 4.8.0 20121223 614.02 334.23 2352.37 3049.85 1681.08 13.89 12.16 6.45 1626.13 21415 971 27.00 91 54 106 141 46 11.03 149.76 39.28 17.08 217 36541.41 31339.19 129.57 618.43 334.01 2566.30 2402.43 1682.23 5.84 13.34 12.20 6.27 1641.07 21169 981 27.73 86 31 91 149 46 11.15 150.40 32.31 16.84 227 36743.93 31118.52 130.37 423.57 347.48 2250.55 2407.84 1180.32 5.36 13.76 10.09 5.69 1651.82 21824 6035 28.27 141 95 168 198 118 13.06 156.08 32.13 16.55 38 36874.67 31218.72 130.73 555.63 338.87 2260.04 2391.65 1179.18 5.32 13.46 10.15 5.86 1676.60 21393 27.67 140 95 167 197 118 13.03 28.15 16.60 38 36709.66 31010.59 130.57 OpenBenchmarking.org
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 130 260 390 520 650 SE +/- 0.48, N = 4 SE +/- 0.41, N = 4 SE +/- 1.50, N = 4 SE +/- 0.85, N = 4 423.57 555.63 614.02 618.43
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 80 160 240 320 400 SE +/- 1.06, N = 4 SE +/- 0.67, N = 4 SE +/- 0.53, N = 4 SE +/- 3.32, N = 4 347.48 338.87 334.23 334.01
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 600 1200 1800 2400 3000 SE +/- 0.00, N = 4 SE +/- 11.55, N = 4 SE +/- 5.06, N = 4 SE +/- 15.62, N = 4 2250.55 2260.04 2352.37 2566.30
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 700 1400 2100 2800 3500 SE +/- 3.14, N = 4 SE +/- 4.38, N = 4 SE +/- 7.43, N = 4 SE +/- 4.42, N = 4 2407.84 2391.65 3049.85 2402.43
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 400 800 1200 1600 2000 SE +/- 1.14, N = 4 SE +/- 0.00, N = 4 SE +/- 1.15, N = 4 SE +/- 1.33, N = 4 1180.32 1179.18 1681.08 1682.23
BLAKE2 Phoronix Test Suite v4.2.0m3 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20121223 Phoronix Test Suite v4.2.0m3 GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.2 1.314 2.628 3.942 5.256 6.57 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 5.36 5.32 5.84 1. (CC) gcc options: -std=gnu99 -O3 -march=native
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 4 8 12 16 20 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 13.76 13.46 13.89 13.34
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 2.3.2 Pfam Database Search GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 10.09 10.15 12.16 12.20 -march=native -march=native -march=native 1. (CC) gcc options: -O3 -pthread -lhmmer -lsquid -lm
Timed MAFFT Alignment Multiple Sequence Alignment OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 6.864 Multiple Sequence Alignment GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 2 4 6 8 10 SE +/- 0.12, N = 6 SE +/- 0.12, N = 6 SE +/- 0.13, N = 6 SE +/- 0.02, N = 3 5.69 5.86 6.45 6.27 1. (CC) gcc options: -O3 -lm -lpthread
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 400 800 1200 1600 2000 SE +/- 30.11, N = 3 SE +/- 2.28, N = 3 SE +/- 2.66, N = 3 SE +/- 3.70, N = 3 1651.82 1676.60 1626.13 1641.07 -march=native -march=native -march=native 1. (CC) gcc options: -O3
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 9.20.1 Compress Speed Test GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 5K 10K 15K 20K 25K SE +/- 103.34, N = 3 SE +/- 81.74, N = 3 SE +/- 370.18, N = 4 SE +/- 392.50, N = 3 21824 21393 21415 21169 1. (CXX) g++ options: -pipe -lpthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.7.9-jumbo-7 Test: Blowfish GCC 4.7.2 LLVM Clang 3.1 LLVM Clang 3.2 1300 2600 3900 5200 6500 SE +/- 0.00, N = 3 SE +/- 2.67, N = 3 SE +/- 4.37, N = 3 6035 971 981 1. (CC) gcc options: -lssl -lcrypto -lm -lz -fopenmp -lcrypt -ldl
VP8 libvpx Encoding vpxenc OpenBenchmarking.org Frames Per Second, More Is Better VP8 libvpx Encoding 1.1.0 vpxenc GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 7 14 21 28 35 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 SE +/- 0.16, N = 3 SE +/- 0.08, N = 3 28.27 27.67 27.00 27.73 1. (CC) gcc options: -m64 -lvpx -lm -lpthread
GraphicsMagick Operation: Blur OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Blur GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 141 140 91 86 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Sharpen GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 20 40 60 80 100 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 95 95 54 31 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Resizing GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 168 167 106 91 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: HWB Color Space GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 40 80 120 160 200 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 198 197 141 149 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
GraphicsMagick Operation: Local Adaptive Thresholding OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.16 Operation: Local Adaptive Thresholding GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 118 118 46 46 -std=gnu99 -fopenmp -march=native -lrt -std=gnu99 -fopenmp -march=native -lrt -march=native 1. (CC) gcc options: -O3 -pthread -lXext -lX11 -lbz2 -lz -lm -lpthread
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.98.9 Total Time GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 13.06 13.03 11.03 11.15 1. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
x264 H.264 Video Encoding OpenBenchmarking.org Frames Per Second, More Is Better x264 2012-10-03 H.264 Video Encoding GCC 4.7.2 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.60, N = 3 SE +/- 1.17, N = 3 SE +/- 1.79, N = 3 156.08 149.76 150.40
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 9 18 27 36 45 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 32.13 28.15 39.28 32.31 -march=native -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
FFmpeg H.264 HD To NTSC DV OpenBenchmarking.org Seconds, Fewer Is Better FFmpeg 1.0 H.264 HD To NTSC DV GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.08, N = 3 SE +/- 0.20, N = 3 SE +/- 0.21, N = 3 16.55 16.60 17.08 16.84 1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -lbz2 -lrt
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 2.33, N = 3 38 38 217 227 -march=native -march=native -march=native 1. (CXX) g++ options: -fopenmp -O3
NGINX Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better NGINX Benchmark 1.0.11 Static Web Page Serving GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 8K 16K 24K 32K 40K SE +/- 69.09, N = 3 SE +/- 182.99, N = 3 SE +/- 54.40, N = 3 SE +/- 266.37, N = 3 36874.67 36709.66 36541.41 36743.93 1. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz
Apache Benchmark Static Web Page Serving OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.3 Static Web Page Serving GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 7K 14K 21K 28K 35K SE +/- 315.04, N = 3 SE +/- 211.88, N = 3 SE +/- 289.39, N = 3 SE +/- 215.45, N = 3 31218.72 31010.59 31339.19 31118.52 -march=native -march=native -march=native 1. (CC) gcc options: -shared -fPIC -pthread -O3
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.0.1c RSA 4096-bit Performance GCC 4.7.2 GCC 4.8.0 20121223 LLVM Clang 3.1 LLVM Clang 3.2 30 60 90 120 150 SE +/- 0.12, N = 3 SE +/- 0.09, N = 3 SE +/- 0.34, N = 3 SE +/- 0.15, N = 3 130.73 130.57 129.57 130.37 1. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl
Phoronix Test Suite v10.8.5