LLVM Clang 3.4 AMD APU Benchmarks Benchmarks by Michael Larabel for a future article on Phoronix.com. Quick look at LLVM Clang 3.3 vs. Clang 3.4 compiler performance. More tests forthcoming.
HTML result view exported from: https://openbenchmarking.org/result/1411077-SO-1312033SO50&rdt&gru .
LLVM Clang 3.4 AMD APU Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution Clang 3.3 Clang 3.5 SVN i7 4700HQ AMD A10-6800K APU @ 4.70GHz (4 Cores) MSI FM2-A85XA-G65 (MS-7793) v1.0 AMD Family 15h 8192MB 64GB OCZ AGILITY Sapphire AMD Radeon HD 6950 2048MB Realtek ALC892 SyncMaster Realtek RTL8111/8168/8411 Ubuntu 13.10 3.13.0-999-generic (x86_64) Unity 7.1.2 X Server 1.14.3 radeon 7.2.99 3.1 Mesa 10.1.0-devel (git-5b331f6 saucy-oibaf-ppa) Gallium 0.4 Clang 3.3-5ubuntu4 ext4 2560x1600 Clang 3.5-1~exp1 Intel Core i7-4700HQ @ 2.40GHz (8 Cores) ASUS G750JM v1.0 Intel Xeon E3-1200 v3/4th 31744MB 1000GB Seagate ST1000LM014-1EJ1 + 1000GB TOSHIBA MQ01ABD1 + 1500GB HGST HTS541515A9 ASUS NVIDIA GeForce GTX 860M 2048MB (540/2505MHz) Intel Haswell HDMI Qualcomm Atheros QCA8171 Gigabit + Broadcom BCM4352 802.11ac Wireless Ubuntu 14.04 3.13.0-39-generic (x86_64) Unity 7.2.3 X Server 1.15.1 4.3.0 GCC 4.8 + Clang 3.4-1ubuntu3 + LLVM 3.4 + CUDA 6.5 1920x1080 OpenBenchmarking.org Compiler Details - i7 4700HQ: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details - i7 4700HQ: Scaling Governor: acpi-cpufreq ondemand
LLVM Clang 3.4 AMD APU Benchmarks botan: Tiger botan: KASUMI botan: AES-256 botan: Twofish botan: CAST-256 botan: X9.19-MAC scimark2: Composite scimark2: Monte Carlo scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Dense LU Matrix Factorization scimark2: Jacobi Successive Over-Relaxation himeno: Poisson Pressure Solver blake2: Phoronix Test Suite v4.8.5 c-ray: Total Time smallpt: Global Illumination Renderer; 100 Samples encode-flac: WAV To FLAC encode-mp3: WAV To MP3 Clang 3.3 Clang 3.5 SVN i7 4700HQ 361.16 79.39 180.17 217.19 115.62 75.17 830.17 493.11 74.16 1016.39 1198.51 1368.69 770.32 9.66 63.66 211 7.09 16.36 360.78 63.17 178.18 218.46 112.45 75.86 850.62 474.59 69.01 1015.77 1325.06 1368.69 811.80 9.11 62.34 220 6.31 18.76 361.13 65.29 133.77 178.01 81.16 74.95 1204.07 525.11 255.92 1902.11 2347.95 996.27 1480.80 4.16 27.36 126 5.42 14.60 OpenBenchmarking.org
Botan Test: Tiger OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: Tiger Clang 3.3 Clang 3.5 SVN i7 4700HQ 80 160 240 320 400 361.16 360.78 361.13 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
Botan Test: KASUMI OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: KASUMI Clang 3.3 Clang 3.5 SVN i7 4700HQ 20 40 60 80 100 79.39 63.17 65.29 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
Botan Test: AES-256 OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: AES-256 Clang 3.3 Clang 3.5 SVN i7 4700HQ 40 80 120 160 200 180.17 178.18 133.77 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
Botan Test: Twofish OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: Twofish Clang 3.3 Clang 3.5 SVN i7 4700HQ 50 100 150 200 250 217.19 218.46 178.01 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
Botan Test: CAST-256 OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: CAST-256 Clang 3.3 Clang 3.5 SVN i7 4700HQ 30 60 90 120 150 115.62 112.45 81.16 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
Botan Test: X9.19-MAC OpenBenchmarking.org Mbytes/s, More Is Better Botan 1.10.3 Test: X9.19-MAC Clang 3.3 Clang 3.5 SVN i7 4700HQ 20 40 60 80 100 75.17 75.86 74.95 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite Clang 3.3 Clang 3.5 SVN i7 4700HQ 300 600 900 1200 1500 SE +/- 0.55, N = 4 SE +/- 0.52, N = 4 SE +/- 7.60, N = 4 830.17 850.62 1204.07 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo Clang 3.3 Clang 3.5 SVN i7 4700HQ 110 220 330 440 550 SE +/- 0.57, N = 4 SE +/- 0.53, N = 4 SE +/- 0.59, N = 4 493.11 474.59 525.11 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform Clang 3.3 Clang 3.5 SVN i7 4700HQ 60 120 180 240 300 SE +/- 0.64, N = 4 SE +/- 0.23, N = 2 SE +/- 4.38, N = 4 74.16 69.01 255.92 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply Clang 3.3 Clang 3.5 SVN i7 4700HQ 400 800 1200 1600 2000 SE +/- 1.46, N = 4 SE +/- 2.79, N = 4 SE +/- 35.22, N = 3 1016.39 1015.77 1902.11 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization Clang 3.3 Clang 3.5 SVN i7 4700HQ 500 1000 1500 2000 2500 SE +/- 1.56, N = 4 SE +/- 1.64, N = 4 SE +/- 10.10, N = 4 1198.51 1325.06 2347.95 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Clang 3.3 Clang 3.5 SVN i7 4700HQ 300 600 900 1200 1500 SE +/- 0.00, N = 4 SE +/- 0.00, N = 4 SE +/- 2.16, N = 4 1368.69 1368.69 996.27 -O3 -march=native -O3 -march=native 1. (CXX) g++ options:
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver Clang 3.3 Clang 3.5 SVN i7 4700HQ 300 600 900 1200 1500 SE +/- 8.56, N = 3 SE +/- 2.25, N = 3 SE +/- 15.25, N = 3 770.32 811.80 1480.80 -march=native -march=native 1. (CC) gcc options: -O3
BLAKE2 Phoronix Test Suite v4.8.5 OpenBenchmarking.org Cycles Per Byte, Fewer Is Better BLAKE2 20121223 Phoronix Test Suite v4.8.5 Clang 3.3 Clang 3.5 SVN i7 4700HQ 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 9.66 9.11 4.16 1. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz
C-Ray Total Time OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time Clang 3.3 Clang 3.5 SVN i7 4700HQ 14 28 42 56 70 SE +/- 1.09, N = 6 SE +/- 1.12, N = 6 SE +/- 0.16, N = 3 63.66 62.34 27.36 -march=native -march=native 1. (CC) gcc options: -lm -lpthread -O3
Smallpt Global Illumination Renderer; 100 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 100 Samples Clang 3.3 Clang 3.5 SVN i7 4700HQ 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.93, N = 6 211 220 126 -O3 -march=native -O3 -march=native 1. (CXX) g++ options: -fopenmp
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.0 WAV To FLAC Clang 3.3 Clang 3.5 SVN i7 4700HQ 2 4 6 8 10 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.06, N = 9 7.09 6.31 5.42 -O3 -march=native -O3 -march=native -O2 1. (CXX) g++ options: -fvisibility=hidden -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.99.3 WAV To MP3 Clang 3.3 Clang 3.5 SVN i7 4700HQ 5 10 15 20 25 SE +/- 0.00, N = 5 SE +/- 0.01, N = 5 SE +/- 0.09, N = 5 16.36 18.76 14.60 -march=native -march=native -fomit-frame-pointer -ffast-math 1. (CC) gcc options: -pipe -O3 -lm
Phoronix Test Suite v10.8.5