LLVM Clang 3.3/3.4 AMD FX-8150 Bulldozer Benchmark Compiler

AMD FX-8150 Eight-Core testing of the LLVM Clang compiler: benchmarks of LLVM Clang 3.1, LLVM Clang 3.3, and LLVM Clang 3.4 (from mid-June) under Ubuntu Linux on an AMD FX Bulldozer eight-core CPU. The benchmarks are for a future article on Phoronix by Michael Larabel. Testing of the LLVM Clang 3.2 compiler had to be skipped due to instability problems on this particular system.
HTML result view exported from: https://openbenchmarking.org/result/1306228-PTS-LLVMAMDF95&sro&grw .
System configuration (identical for LLVM Clang 3.1, LLVM Clang 3.3, and LLVM Clang 3.4 20130621 unless noted)

Processor: AMD FX-8150 Eight-Core @ 3.60GHz (8 Cores)
Motherboard: MSI 760GM-P34(FX) (MS-7641) v4.0
Chipset: AMD RS780 + SB7x0/SB8x0/SB9x0
Memory: 4096MB
Disk: 500GB Western Digital WD5000AZRX-0 (LLVM Clang 3.3 run: 500GB Western Digital WD5000AZRX-0 + 128GB RVS02 SuperSpeed)
Graphics: NVIDIA GeForce 8600 GTS 256MB (675/1008MHz)
Audio: Realtek ALC887-VD
Network: Realtek RTL8111/8168
OS: Ubuntu 13.04
Kernel: 3.8.0-19-generic (x86_64)
Desktop: Unity 7.0.0
Display Server: X Server 1.13.3
Display Driver: NVIDIA 304.88
OpenGL: 3.3.0 NVIDIA 304.88
Compiler:
- LLVM Clang 3.1: Clang 3.1 + LLVM 3.1
- LLVM Clang 3.3: Clang 3.3 + LLVM 3.3
- LLVM Clang 3.4 20130621: Clang 3.4 (SVN 184563) + LLVM 3.4svn
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details
- LLVM Clang 3.1: Optimized build; Built Jun 21 2013 (13:15:04); Default target: x86_64-unknown-linux-gnu; Host CPU: bdver1
- LLVM Clang 3.3: Optimized build; Built Jun 21 2013 (12:32:02); Default target: x86_64-unknown-linux-gnu; Host CPU: bdver1
- LLVM Clang 3.4 20130621: Optimized build; Built Jun 21 2013 (12:56:45); Default target: x86_64-unknown-linux-gnu; Host CPU: bdver1

Processor Details
- Scaling Governor: acpi-cpufreq ondemand
Results overview (n/a: no result recorded for that compiler)

Test                                                LLVM Clang 3.1  LLVM Clang 3.3  LLVM Clang 3.4 20130621
scimark2: Monte Carlo                               352.05          352.34          350.90
scimark2: Fast Fourier Transform                    74.86           71.07           72.33
scimark2: Sparse Matrix Multiply                    979.92          996.60          935.19
scimark2: Dense LU Matrix Factorization             1674.52         1644.85         1916.76
scimark2: Jacobi Successive Over-Relaxation         1163.52         1156.94         1156.93
blake2: Phoronix Test Suite v4.8.0m1                11.78           9.19            9.07
encode-flac: WAV To FLAC                            n/a             9.65            8.30
encode-mp3: WAV To MP3                              21.49           20.55           20.65
hmmer: Pfam Database Search                         15.02           13.16           13.10
mafft: Multiple Sequence Alignment                  7.90            7.98            8.26
mrbayes: Primate Phylogeny Analysis                 n/a             25.98           24.63
himeno: Poisson Pressure Solver                     641.68          579.69          638.49
build-imagemagick: Time To Compile                  32.56           33.26           35.02
primesieve: 1e12 Prime Number Generation            n/a             885.89          883.72
compress-7zip: Compress Speed Test                  16376           18691           16529
build-php: Time To Compile                          33.19           25.50           25.45
tachyon: Total Time                                 17.21           17.31           17.73
x264: H.264 Video Encoding                          131.10          132.02          130.64
c-ray: Total Time                                   44.90           38.43           37.54
ffmpeg: H.264 HD To NTSC DV                         22.13           22.15           22.00
smallpt: Global Illumination Renderer; 100 Samples  267             267             266
apache: Static Web Page Serving                     16873.75        16869.43        n/a
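Because some tests report "more is better" units (Mflops, MIPS, frames per second) and others "fewer is better" units (seconds, cycles per byte), comparing compilers across the overview needs a direction-aware percentage. The following is a minimal Python sketch of that arithmetic, not part of the Phoronix Test Suite; the function name `relative_improvement` and the `samples` list are hypothetical, and the numbers are copied from the LLVM Clang 3.1 and LLVM Clang 3.4 20130621 results in this export.

```python
def relative_improvement(baseline, candidate, higher_is_better):
    """Return how much better `candidate` is than `baseline`, in percent.

    A positive value means the candidate compiler did better, regardless of
    whether the test's unit is "more is better" or "fewer is better".
    """
    if higher_is_better:
        return (candidate - baseline) / baseline * 100.0
    return (baseline - candidate) / baseline * 100.0


# (test, LLVM Clang 3.1 result, LLVM Clang 3.4 20130621 result, higher_is_better)
samples = [
    ("scimark2: Dense LU Matrix Factorization", 1674.52, 1916.76, True),
    ("c-ray: Total Time", 44.90, 37.54, False),
    ("build-php: Time To Compile", 33.19, 25.45, False),
]

for name, clang31, clang34, higher_is_better in samples:
    change = relative_improvement(clang31, clang34, higher_is_better)
    print(f"{name}: {change:+.1f}%")
```

Using the older compiler as the baseline keeps the sign convention the same for every test, so a negative value always marks a regression.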
SciMark 2.0 Computational Test: Monte Carlo
Mflops, More Is Better
LLVM Clang 3.1: 352.05 (SE +/- 0.47, N = 4)
LLVM Clang 3.3: 352.34 (SE +/- 0.55, N = 4)
LLVM Clang 3.4 20130621: 350.90 (SE +/- 0.47, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark 2.0 Computational Test: Fast Fourier Transform
Mflops, More Is Better
LLVM Clang 3.1: 74.86 (SE +/- 0.07, N = 4)
LLVM Clang 3.3: 71.07 (SE +/- 0.10, N = 4)
LLVM Clang 3.4 20130621: 72.33 (SE +/- 0.12, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark 2.0 Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
LLVM Clang 3.1: 979.92 (SE +/- 1.91, N = 4)
LLVM Clang 3.3: 996.60 (SE +/- 1.40, N = 4)
LLVM Clang 3.4 20130621: 935.19 (SE +/- 3.01, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark 2.0 Computational Test: Dense LU Matrix Factorization
Mflops, More Is Better
LLVM Clang 3.1: 1674.52 (SE +/- 1.52, N = 4)
LLVM Clang 3.3: 1644.85 (SE +/- 3.82, N = 4)
LLVM Clang 3.4 20130621: 1916.76 (SE +/- 3.30, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
LLVM Clang 3.1: 1163.52 (SE +/- 1.28, N = 4)
LLVM Clang 3.3: 1156.94 (SE +/- 1.78, N = 4)
LLVM Clang 3.4 20130621: 1156.93 (SE +/- 0.00, N = 4)
(CXX) g++ options: -O3 -march=native

BLAKE2 20121223: Phoronix Test Suite v4.8.0m1
Cycles Per Byte, Fewer Is Better
LLVM Clang 3.1: 11.78 (SE +/- 0.00, N = 3)
LLVM Clang 3.3: 9.19 (SE +/- 0.17, N = 3)
LLVM Clang 3.4 20130621: 9.07 (SE +/- 0.16, N = 4)
(CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

FLAC Audio Encoding 1.3.0: WAV To FLAC
Seconds, Fewer Is Better
LLVM Clang 3.3: 9.65 (SE +/- 0.00, N = 5)
LLVM Clang 3.4 20130621: 8.30 (SE +/- 0.00, N = 5)
(CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm

LAME MP3 Encoding 3.99.3: WAV To MP3
Seconds, Fewer Is Better
LLVM Clang 3.1: 21.49 (SE +/- 0.01, N = 5)
LLVM Clang 3.3: 20.55 (SE +/- 0.02, N = 5)
LLVM Clang 3.4 20130621: 20.65 (SE +/- 0.02, N = 5)
(CC) gcc options: -pipe -O3 -march=native -lm

Timed HMMer Search 2.3.2: Pfam Database Search
Seconds, Fewer Is Better
LLVM Clang 3.1: 15.02 (SE +/- 0.16, N = 3)
LLVM Clang 3.3: 13.16 (SE +/- 0.18, N = 3)
LLVM Clang 3.4 20130621: 13.10 (SE +/- 0.21, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment 6.864: Multiple Sequence Alignment
Seconds, Fewer Is Better
LLVM Clang 3.1: 7.90 (SE +/- 0.01, N = 3)
LLVM Clang 3.3: 7.98 (SE +/- 0.01, N = 3)
LLVM Clang 3.4 20130621: 8.26 (SE +/- 0.16, N = 6)
(CC) gcc options: -O3 -lm -lpthread

Timed MrBayes Analysis 3.1.2: Primate Phylogeny Analysis
Seconds, Fewer Is Better
LLVM Clang 3.3: 25.98 (SE +/- 0.04, N = 3)
LLVM Clang 3.4 20130621: 24.63 (SE +/- 0.36, N = 3)

Himeno Benchmark 3.0: Poisson Pressure Solver
MFLOPS, More Is Better
LLVM Clang 3.1: 641.68 (SE +/- 0.81, N = 3)
LLVM Clang 3.3: 579.69 (SE +/- 0.62, N = 3)
LLVM Clang 3.4 20130621: 638.49 (SE +/- 0.72, N = 3)
(CC) gcc options: -O3 -march=native

Timed ImageMagick Compilation 6.8.1-10: Time To Compile
Seconds, Fewer Is Better
LLVM Clang 3.1: 32.56 (SE +/- 0.08, N = 3)
LLVM Clang 3.3: 33.26 (SE +/- 0.05, N = 3)
LLVM Clang 3.4 20130621: 35.02 (SE +/- 0.14, N = 3)

Primesieve 4.2: 1e12 Prime Number Generation
Seconds, Fewer Is Better
LLVM Clang 3.3: 885.89 (SE +/- 0.65, N = 3)
LLVM Clang 3.4 20130621: 883.72 (SE +/- 0.28, N = 3)
(CXX) g++ options: -O2

7-Zip Compression 9.20.1: Compress Speed Test
MIPS, More Is Better
LLVM Clang 3.1: 16376 (SE +/- 99.71, N = 3)
LLVM Clang 3.3: 18691 (SE +/- 53.80, N = 3)
LLVM Clang 3.4 20130621: 16529 (SE +/- 51.53, N = 3)
(CXX) g++ options: -pipe -lpthread

Timed PHP Compilation 5.2.9: Time To Compile
Seconds, Fewer Is Better
LLVM Clang 3.1: 33.19 (SE +/- 0.06, N = 3)
LLVM Clang 3.3: 25.50 (SE +/- 0.19, N = 3)
LLVM Clang 3.4 20130621: 25.45 (SE +/- 0.16, N = 3)
(CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm
The export also lists -lpthread separately from these shared options, without indicating which run it applies to.

Tachyon 0.98.9: Total Time
Seconds, Fewer Is Better
LLVM Clang 3.1: 17.21 (SE +/- 0.49, N = 6)
LLVM Clang 3.3: 17.31 (SE +/- 0.37, N = 6)
LLVM Clang 3.4 20130621: 17.73 (SE +/- 0.04, N = 3)
(CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

x264 2013-06-08: H.264 Video Encoding
Frames Per Second, More Is Better
LLVM Clang 3.1: 131.10 (SE +/- 1.53, N = 5)
LLVM Clang 3.3: 132.02 (SE +/- 0.28, N = 5)
LLVM Clang 3.4 20130621: 130.64 (SE +/- 1.32, N = 5)
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

C-Ray 1.1: Total Time
Seconds, Fewer Is Better
LLVM Clang 3.1: 44.90 (SE +/- 0.04, N = 3)
LLVM Clang 3.3: 38.43 (SE +/- 0.01, N = 3)
LLVM Clang 3.4 20130621: 37.54 (SE +/- 0.03, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native

FFmpeg 1.1: H.264 HD To NTSC DV
Seconds, Fewer Is Better
LLVM Clang 3.1: 22.13 (SE +/- 0.14, N = 3)
LLVM Clang 3.3: 22.15 (SE +/- 0.21, N = 3)
LLVM Clang 3.4 20130621: 22.00 (SE +/- 0.21, N = 3)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lasound -lSDL -lm -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -Qunused-arguments -MMD

Smallpt 1.0: Global Illumination Renderer; 100 Samples
Seconds, Fewer Is Better
LLVM Clang 3.1: 267 (SE +/- 1.15, N = 3)
LLVM Clang 3.3: 267 (SE +/- 0.33, N = 3)
LLVM Clang 3.4 20130621: 266 (SE +/- 1.20, N = 3)
(CXX) g++ options: -fopenmp -O3 -march=native

Apache Benchmark 2.4.3: Static Web Page Serving
Requests Per Second, More Is Better
LLVM Clang 3.1: 16873.75 (SE +/- 157.08, N = 3)
LLVM Clang 3.3: 16869.43 (SE +/- 84.36, N = 3)
(CC) gcc options: -shared -fPIC -pthread -O3 -march=native

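Each result above is reported with a standard error (SE) over N runs, which gives a rough way to tell a real difference between compilers from run-to-run noise. The sketch below is an informal rule of thumb and not a formal significance test, especially with N as small as 3. It assumes the SE values in the export are listed in the same order as the compilers; the helper name `difference_in_se_units` is hypothetical.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error.

    Values well above roughly 2 suggest the gap is larger than the noise;
    values below 1 suggest the two results are indistinguishable.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined_se == 0.0:
        # Both SEs were reported as 0.00: any difference is "infinitely" clear.
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined_se


# FFmpeg: LLVM Clang 3.1 (22.13, SE 0.14) vs LLVM Clang 3.4 20130621 (22.00, SE 0.21)
ffmpeg_gap = difference_in_se_units(22.13, 0.14, 22.00, 0.21)

# C-Ray: LLVM Clang 3.1 (44.90, SE 0.04) vs LLVM Clang 3.4 20130621 (37.54, SE 0.03)
cray_gap = difference_in_se_units(44.90, 0.04, 37.54, 0.03)

print(f"FFmpeg gap in SE units: {ffmpeg_gap:.2f}")
print(f"C-Ray gap in SE units: {cray_gap:.2f}")
```

On these inputs the FFmpeg gap comes out below one combined SE, while the C-Ray gap is more than a hundred times the combined SE, which matches the impression given by the raw numbers.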
Phoronix Test Suite v10.8.5