LLVM Clang 3.4 AMD APU Benchmarks

Benchmarks by Michael Larabel for a future article on Phoronix.com. Quick look at LLVM Clang 3.3 vs. Clang 3.4 compiler performance. More tests forthcoming.

HTML result view exported from: https://openbenchmarking.org/result/1411077-SO-1312033SO50&sor&gru.

LLVM Clang 3.4 AMD APU BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionClang 3.3Clang 3.5 SVNi7 4700HQAMD A10-6800K APU @ 4.70GHz (4 Cores)MSI FM2-A85XA-G65 (MS-7793) v1.0AMD Family 15h8192MB64GB OCZ AGILITYSapphire AMD Radeon HD 6950 2048MBRealtek ALC892SyncMasterRealtek RTL8111/8168/8411Ubuntu 13.103.13.0-999-generic (x86_64)Unity 7.1.2X Server 1.14.3radeon 7.2.993.1 Mesa 10.1.0-devel (git-5b331f6 saucy-oibaf-ppa) Gallium 0.4Clang 3.3-5ubuntu4ext42560x1600Clang 3.5-1~exp1Intel Core i7-4700HQ @ 2.40GHz (8 Cores)ASUS G750JM v1.0Intel Xeon E3-1200 v3/4th31744MB1000GB Seagate ST1000LM014-1EJ1 + 1000GB TOSHIBA MQ01ABD1 + 1500GB HGST HTS541515A9ASUS NVIDIA GeForce GTX 860M 2048MB (540/2505MHz)Intel Haswell HDMIQualcomm Atheros QCA8171 Gigabit + Broadcom BCM4352 802.11ac WirelessUbuntu 14.043.13.0-39-generic (x86_64)Unity 7.2.3X Server 1.15.14.3.0GCC 4.8 + Clang 3.4-1ubuntu3 + LLVM 3.4 + CUDA 6.51920x1080OpenBenchmarking.orgCompiler Details- i7 4700HQ: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- i7 4700HQ: Scaling Governor: acpi-cpufreq ondemand

LLVM Clang 3.4 AMD APU Benchmarksbotan: Tigerbotan: KASUMIbotan: AES-256botan: Twofishbotan: CAST-256botan: X9.19-MACscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationhimeno: Poisson Pressure Solverblake2: Phoronix Test Suite v4.8.5c-ray: Total Timesmallpt: Global Illumination Renderer; 100 Samplesencode-flac: WAV To FLACencode-mp3: WAV To MP3Clang 3.3Clang 3.5 SVNi7 4700HQ361.1679.39180.17217.19115.6275.17830.17493.1174.161016.391198.511368.69770.329.6663.662117.0916.36360.7863.17178.18218.46112.4575.86850.62474.5969.011015.771325.061368.69811.809.1162.342206.3118.76361.1365.29133.77178.0181.1674.951204.07525.11255.921902.112347.95996.271480.804.1627.361265.4214.60OpenBenchmarking.org

Botan

Test: Tiger

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: TigerClang 3.3i7 4700HQClang 3.5 SVN80160240320400361.16361.13360.781. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: KASUMI

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: KASUMIClang 3.3i7 4700HQClang 3.5 SVN2040608010079.3965.2963.171. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: AES-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: AES-256Clang 3.3Clang 3.5 SVNi7 4700HQ4080120160200180.17178.18133.771. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: Twofish

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: TwofishClang 3.5 SVNClang 3.3i7 4700HQ50100150200250218.46217.19178.011. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: CAST-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: CAST-256Clang 3.3Clang 3.5 SVNi7 4700HQ306090120150115.62112.4581.161. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: X9.19-MAC

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: X9.19-MACClang 3.5 SVNClang 3.3i7 4700HQ2040608010075.8675.1774.951. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositei7 4700HQClang 3.5 SVNClang 3.330060090012001500SE +/- 7.60, N = 4SE +/- 0.52, N = 4SE +/- 0.55, N = 41204.07850.62830.17-O3 -march=native-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte Carloi7 4700HQClang 3.3Clang 3.5 SVN110220330440550SE +/- 0.59, N = 4SE +/- 0.57, N = 4SE +/- 0.53, N = 4525.11493.11474.59-O3 -march=native-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier Transformi7 4700HQClang 3.3Clang 3.5 SVN60120180240300SE +/- 4.38, N = 4SE +/- 0.64, N = 4SE +/- 0.23, N = 2255.9274.1669.01-O3 -march=native-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix Multiplyi7 4700HQClang 3.3Clang 3.5 SVN400800120016002000SE +/- 35.22, N = 3SE +/- 1.46, N = 4SE +/- 2.79, N = 41902.111016.391015.77-O3 -march=native-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix Factorizationi7 4700HQClang 3.5 SVNClang 3.35001000150020002500SE +/- 10.10, N = 4SE +/- 1.64, N = 4SE +/- 1.56, N = 42347.951325.061198.51-O3 -march=native-O3 -march=native1. (CXX) g++ options:

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClang 3.5 SVNClang 3.3i7 4700HQ30060090012001500SE +/- 0.00, N = 4SE +/- 0.00, N = 4SE +/- 2.16, N = 41368.691368.69996.27-O3 -march=native-O3 -march=native1. (CXX) g++ options:

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solveri7 4700HQClang 3.5 SVNClang 3.330060090012001500SE +/- 15.25, N = 3SE +/- 2.25, N = 3SE +/- 8.56, N = 31480.80811.80770.32-march=native-march=native1. (CC) gcc options: -O3

BLAKE2

Phoronix Test Suite v4.8.5

OpenBenchmarking.orgCycles Per Byte, Fewer Is BetterBLAKE2 20121223Phoronix Test Suite v4.8.5i7 4700HQClang 3.5 SVNClang 3.33691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 34.169.119.661. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Timei7 4700HQClang 3.5 SVNClang 3.31428425670SE +/- 0.16, N = 3SE +/- 1.12, N = 6SE +/- 1.09, N = 627.3662.3463.66-march=native-march=native1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 Samplesi7 4700HQClang 3.3Clang 3.5 SVN50100150200250SE +/- 1.93, N = 6SE +/- 0.33, N = 3SE +/- 0.33, N = 3126211220-O3 -march=native-O3 -march=native1. (CXX) g++ options: -fopenmp

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.0WAV To FLACi7 4700HQClang 3.5 SVNClang 3.3246810SE +/- 0.06, N = 9SE +/- 0.00, N = 5SE +/- 0.00, N = 55.426.317.09-O2-O3 -march=native-O3 -march=native1. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3i7 4700HQClang 3.3Clang 3.5 SVN510152025SE +/- 0.09, N = 5SE +/- 0.00, N = 5SE +/- 0.01, N = 514.6016.3618.76-fomit-frame-pointer -ffast-math-march=native-march=native1. (CC) gcc options: -O3 -pipe -lm


Phoronix Test Suite v10.8.4