LLVM Clang 3.4 AMD APU Benchmarks

Benchmarks by Michael Larabel for a future article on Phoronix.com. Quick look at LLVM Clang 3.3 vs. Clang 3.4 compiler performance. More tests forthcoming.

HTML result view exported from: https://openbenchmarking.org/result/1411077-SO-1312033SO50.

System details

Clang 3.3
Processor: AMD A10-6800K APU @ 4.70GHz (4 Cores)
Motherboard: MSI FM2-A85XA-G65 (MS-7793) v1.0
Chipset: AMD Family 15h
Memory: 8192MB
Disk: 64GB OCZ AGILITY
Graphics: Sapphire AMD Radeon HD 6950 2048MB
Audio: Realtek ALC892
Monitor: SyncMaster
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 13.10
Kernel: 3.13.0-999-generic (x86_64)
Desktop: Unity 7.1.2
Display Server: X Server 1.14.3
Display Driver: radeon 7.2.99
OpenGL: 3.1 Mesa 10.1.0-devel (git-5b331f6 saucy-oibaf-ppa) Gallium 0.4
Compiler: Clang 3.3-5ubuntu4
File-System: ext4
Screen Resolution: 2560x1600

Clang 3.5 SVN (only the changed field is listed)
Compiler: Clang 3.5-1~exp1

i7 4700HQ
Processor: Intel Core i7-4700HQ @ 2.40GHz (8 Cores)
Motherboard: ASUS G750JM v1.0
Chipset: Intel Xeon E3-1200 v3/4th
Memory: 31744MB
Disk: 1000GB Seagate ST1000LM014-1EJ1 + 1000GB TOSHIBA MQ01ABD1 + 1500GB HGST HTS541515A9
Graphics: ASUS NVIDIA GeForce GTX 860M 2048MB (540/2505MHz)
Audio: Intel Haswell HDMI
Network: Qualcomm Atheros QCA8171 Gigabit + Broadcom BCM4352 802.11ac Wireless
OS: Ubuntu 14.04
Kernel: 3.13.0-39-generic (x86_64)
Desktop: Unity 7.2.3
Display Server: X Server 1.15.1
OpenGL: 4.3.0
Compiler: GCC 4.8 + Clang 3.4-1ubuntu3 + LLVM 3.4 + CUDA 6.5
Screen Resolution: 1920x1080

Compiler Details
- i7 4700HQ: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- i7 4700HQ: Scaling Governor: acpi-cpufreq ondemand

Results overview

Test                                                  Clang 3.3   Clang 3.5 SVN   i7 4700HQ
blake2: Phoronix Test Suite v4.8.5                         9.66            9.11        4.16
botan: Tiger                                             361.16          360.78      361.13
botan: KASUMI                                             79.39           63.17       65.29
botan: AES-256                                           180.17          178.18      133.77
botan: Twofish                                           217.19          218.46      178.01
botan: CAST-256                                          115.62          112.45       81.16
botan: X9.19-MAC                                          75.17           75.86       74.95
scimark2: Composite                                      830.17          850.62     1204.07
scimark2: Monte Carlo                                    493.11          474.59      525.11
scimark2: Fast Fourier Transform                          74.16           69.01      255.92
scimark2: Sparse Matrix Multiply                        1016.39         1015.77     1902.11
scimark2: Dense LU Matrix Factorization                 1198.51         1325.06     2347.95
scimark2: Jacobi Successive Over-Relaxation             1368.69         1368.69      996.27
himeno: Poisson Pressure Solver                          770.32          811.80     1480.80
c-ray: Total Time                                         63.66           62.34       27.36
smallpt: Global Illumination Renderer; 100 Samples          211             220         126
encode-flac: WAV To FLAC                                   7.09            6.31        5.42
encode-mp3: WAV To MP3                                    16.36           18.76       14.60
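
When comparing Clang 3.3 with Clang 3.5 SVN from the summary figures above, the direction of each metric matters: some tests report throughput (more is better) and others report time or cycles (fewer is better). The following is a minimal sketch, not part of the exported result, that turns a pair of values into a signed improvement percentage. The function name improvement_pct and the choice of the four sample rows are assumptions made for illustration; the numbers themselves are copied from this result file.

```python
# Sample rows copied from the results above:
# (Clang 3.3 value, Clang 3.5 SVN value, higher_is_better)
RESULTS = {
    "blake2 (cycles per byte)": (9.66, 9.11, False),
    "botan KASUMI (Mbytes/s)": (79.39, 63.17, True),
    "scimark2 Dense LU (Mflops)": (1198.51, 1325.06, True),
    "encode-mp3 (seconds)": (16.36, 18.76, False),
}


def improvement_pct(old, new, higher_is_better):
    """Signed percentage by which `new` improves on `old`.

    Positive means the newer compiler did better, negative means a
    regression, regardless of whether the metric is "more is better"
    or "fewer is better".
    """
    if higher_is_better:
        return (new - old) / old * 100.0
    return (old - new) / old * 100.0


if __name__ == "__main__":
    for name, (old, new, higher_is_better) in RESULTS.items():
        change = improvement_pct(old, new, higher_is_better)
        print(f"{name}: {change:+.1f}%")
```

Normalizing the sign this way keeps a table of mixed metrics readable: a negative number always means Clang 3.5 SVN was slower than Clang 3.3 on that test.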

BLAKE2

Phoronix Test Suite v4.8.5

BLAKE2 20121223, Cycles Per Byte, Fewer Is Better
Clang 3.3: 9.66 (SE +/- 0.03, N = 3)
Clang 3.5 SVN: 9.11 (SE +/- 0.00, N = 3)
i7 4700HQ: 4.16 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

Botan

Test: Tiger

Botan 1.10.3, Test: Tiger, Mbytes/s, More Is Better
Clang 3.3: 361.16
Clang 3.5 SVN: 360.78
i7 4700HQ: 361.13
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: KASUMI

Botan 1.10.3, Test: KASUMI, Mbytes/s, More Is Better
Clang 3.3: 79.39
Clang 3.5 SVN: 63.17
i7 4700HQ: 65.29
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: AES-256

Botan 1.10.3, Test: AES-256, Mbytes/s, More Is Better
Clang 3.3: 180.17
Clang 3.5 SVN: 178.18
i7 4700HQ: 133.77
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: Twofish

Botan 1.10.3, Test: Twofish, Mbytes/s, More Is Better
Clang 3.3: 217.19
Clang 3.5 SVN: 218.46
i7 4700HQ: 178.01
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: CAST-256

Botan 1.10.3, Test: CAST-256, Mbytes/s, More Is Better
Clang 3.3: 115.62
Clang 3.5 SVN: 112.45
i7 4700HQ: 81.16
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: X9.19-MAC

Botan 1.10.3, Test: X9.19-MAC, Mbytes/s, More Is Better
Clang 3.3: 75.17
Clang 3.5 SVN: 75.86
i7 4700HQ: 74.95
1. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite, Mflops, More Is Better
Clang 3.3: 830.17 (SE +/- 0.55, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 850.62 (SE +/- 0.52, N = 4), per-run options: -O3 -march=native
i7 4700HQ: 1204.07 (SE +/- 7.60, N = 4)
1. (CXX) g++ options: none listed

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo, Mflops, More Is Better
Clang 3.3: 493.11 (SE +/- 0.57, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 474.59 (SE +/- 0.53, N = 4), per-run options: -O3 -march=native
i7 4700HQ: 525.11 (SE +/- 0.59, N = 4)
1. (CXX) g++ options: none listed

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform, Mflops, More Is Better
Clang 3.3: 74.16 (SE +/- 0.64, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 69.01 (SE +/- 0.23, N = 2), per-run options: -O3 -march=native
i7 4700HQ: 255.92 (SE +/- 4.38, N = 4)
1. (CXX) g++ options: none listed

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply, Mflops, More Is Better
Clang 3.3: 1016.39 (SE +/- 1.46, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 1015.77 (SE +/- 2.79, N = 4), per-run options: -O3 -march=native
i7 4700HQ: 1902.11 (SE +/- 35.22, N = 3)
1. (CXX) g++ options: none listed

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization, Mflops, More Is Better
Clang 3.3: 1198.51 (SE +/- 1.56, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 1325.06 (SE +/- 1.64, N = 4), per-run options: -O3 -march=native
i7 4700HQ: 2347.95 (SE +/- 10.10, N = 4)
1. (CXX) g++ options: none listed

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation, Mflops, More Is Better
Clang 3.3: 1368.69 (SE +/- 0.00, N = 4), per-run options: -O3 -march=native
Clang 3.5 SVN: 1368.69 (SE +/- 0.00, N = 4), per-run options: -O3 -march=native
i7 4700HQ: 996.27 (SE +/- 2.16, N = 4)
1. (CXX) g++ options: none listed
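
SciMark 2.0 defines its composite score as the arithmetic mean of its five kernel scores (Monte Carlo, Fast Fourier Transform, Sparse Matrix Multiply, Dense LU Matrix Factorization, Jacobi Successive Over-Relaxation). The sketch below is an illustrative cross-check, not part of the exported result: it recomputes that mean from the per-kernel averages reported above. The names KERNELS, REPORTED, and mean are hypothetical helpers. Because each reported figure is itself an average over N runs, and N differs between tests, the recomputed value is expected to be close to the reported composite but not necessarily identical.

```python
# Per-kernel averages copied from the SciMark results above, in the order:
# Monte Carlo, FFT, Sparse Matrix Multiply, Dense LU, Jacobi SOR (Mflops).
KERNELS = {
    "Clang 3.3": [493.11, 74.16, 1016.39, 1198.51, 1368.69],
    "Clang 3.5 SVN": [474.59, 69.01, 1015.77, 1325.06, 1368.69],
    "i7 4700HQ": [525.11, 255.92, 1902.11, 2347.95, 996.27],
}

# Composite scores as reported in this result file (Mflops).
REPORTED = {
    "Clang 3.3": 830.17,
    "Clang 3.5 SVN": 850.62,
    "i7 4700HQ": 1204.07,
}


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for system, scores in KERNELS.items():
        recomputed = mean(scores)
        delta = recomputed - REPORTED[system]
        print(f"{system}: recomputed {recomputed:.2f}, "
              f"reported {REPORTED[system]:.2f}, difference {delta:+.2f}")
```

A small difference between the recomputed and reported composite is consistent with averaging over a different number of runs per test rather than with an error in the data.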

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver, MFLOPS, More Is Better
Clang 3.3: 770.32 (SE +/- 8.56, N = 3), per-run options: -march=native
Clang 3.5 SVN: 811.80 (SE +/- 2.25, N = 3), per-run options: -march=native
i7 4700HQ: 1480.80 (SE +/- 15.25, N = 3)
1. (CC) gcc options: -O3

C-Ray

Total Time

C-Ray 1.1, Total Time, Seconds, Fewer Is Better
Clang 3.3: 63.66 (SE +/- 1.09, N = 6), per-run options: -march=native
Clang 3.5 SVN: 62.34 (SE +/- 1.12, N = 6), per-run options: -march=native
i7 4700HQ: 27.36 (SE +/- 0.16, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 100 Samples

Smallpt 1.0, Global Illumination Renderer; 100 Samples, Seconds, Fewer Is Better
Clang 3.3: 211 (SE +/- 0.33, N = 3), per-run options: -O3 -march=native
Clang 3.5 SVN: 220 (SE +/- 0.33, N = 3), per-run options: -O3 -march=native
i7 4700HQ: 126 (SE +/- 1.93, N = 6)
1. (CXX) g++ options: -fopenmp

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.0, WAV To FLAC, Seconds, Fewer Is Better
Clang 3.3: 7.09 (SE +/- 0.00, N = 5), per-run options: -O3 -march=native
Clang 3.5 SVN: 6.31 (SE +/- 0.00, N = 5), per-run options: -O3 -march=native
i7 4700HQ: 5.42 (SE +/- 0.06, N = 9), per-run options: -O2
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3, WAV To MP3, Seconds, Fewer Is Better
Clang 3.3: 16.36 (SE +/- 0.00, N = 5), per-run options: -march=native
Clang 3.5 SVN: 18.76 (SE +/- 0.01, N = 5), per-run options: -march=native
i7 4700HQ: 14.60 (SE +/- 0.09, N = 5), per-run options: -fomit-frame-pointer -ffast-math
1. (CC) gcc options: -pipe -O3 -lm


Phoronix Test Suite v10.8.4