AMD A10-6800K Compiler GCC vs. LLVM/Clang Performance

AMD A10-6800K Richland APU testing of GCC 4.8.1 and LLVM Clang 3.3 Benchmarking by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1307069-UT-1306306SO97.

AMD A10-6800K Compiler GCC vs. LLVM/Clang PerformanceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120AMD A10-6800K APU @ 4.70GHz (4 Cores)MSI FM2-A85XA-G65 (MS-7793) v1.0AMD Family 15h7168MB64GB OCZ AGILITYAMD Radeon HD 8670D 768MB (844/1066MHz)ATI R6xx HDMISyncMasterRealtek RTL8111/8168Ubuntu 13.103.9.0-6-generic (x86_64)Xfce 4.10X Server 1.13.3fglrx 13.10.104.2.12337GCC 4.8ext42560x1600Clang 3.3-2AMD FX-8120 Eight-Core @ 3.10GHz (8 Cores)ASRock 990FX Extreme3AMD ATI RD890 bridge8192MB20GB INTEL SSDSA2VP02 + 2 x 1000GB Western Digital WD10EADS-00L + 300GB Maxtor 6V300F0NVIDIA GeForce GTX 560 Ti 1024MB (835/2050MHz)NVIDIA GPU 16 HDMI/DPBroadcom NetLink BCM57781 Gigabit PCIeUbuntu 12.043.2.0-49-generic (x86_64)Openbox 3.5.0X Server 1.11.3NVIDIA 310.144.3.0 NVIDIA 310.14GCC 4.61920x1080OpenBenchmarking.orgCompiler Details- GCC 4.8.1: --build=x86_64-linux-gnu --disable-browser-plugin --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- 1306306-SO-RICHLANDG80-vs-8120: Scaling Governor: powernow-k8 ondemand

AMD A10-6800K Compiler GCC vs. LLVM/Clang Performancehmmer: Pfam Database Searchmafft: Multiple Sequence Alignmentblake2: Phoronix Test Suite v4.8.0m1botan: Tigerbotan: KASUMIbotan: AES-256botan: Twofishbotan: CAST-256botan: X9.19-MACscimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationx264: H.264 Video Encodinggraphics-magick: Blurgraphics-magick: Sharpengraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverc-ray: Total Timeprimesieve: 1e12 Prime Number Generationsmallpt: Global Illumination Renderer; 100 Samplesencode-flac: WAV To FLACencode-mp3: WAV To MP3ffmpeg: H.264 HD To NTSC DVapache: Static Web Page ServingGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812016.109.238.99342.4477.91134.10219.4688.8882.94505.8967.221008.881221.85859.3997.881169215215983835.4633.10298.53576.8116.8115.5721944.4016.3510.919.68359.4278.76180.25216.68115.2076.16490.8977.301049.751236.141358.1294.4873268612142779.1158.32696.122127.1116.4615.4722273.0016.779.117.43231.9550.2492.13148.2869.0853.71316.9865.70820.031450.28618.7977479812370519.3463.68245.091379.5624.5629.3811268.30OpenBenchmarking.org

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812048121620SE +/- 0.12, N = 3SE +/- 0.18, N = 3SE +/- 0.04, N = 316.1016.3516.77-O3 -march=bdver2-O3 -march=bdver2-O21. (CC) gcc options: -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence AlignmentGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81203691215SE +/- 0.20, N = 6SE +/- 0.08, N = 3SE +/- 0.15, N = 69.2310.919.111. (CC) gcc options: -O3 -lm -lpthread

BLAKE2

Phoronix Test Suite v4.8.0m1

OpenBenchmarking.orgCycles Per Byte, Fewer Is BetterBLAKE2 20121223Phoronix Test Suite v4.8.0m1GCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81203691215SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.62, N = 68.999.687.431. (CC) gcc options: -std=gnu99 -O3 -march=native -lcrypto -lz

Botan

Test: Tiger

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: TigerGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812080160240320400342.44359.42231.951. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: KASUMI

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: KASUMIGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81202040608010077.9178.7650.241. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: AES-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: AES-256GCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81204080120160200134.10180.2592.131. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: Twofish

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: TwofishGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812050100150200250219.46216.68148.281. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: CAST-256

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: CAST-256GCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812030609012015088.88115.2069.081. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

Botan

Test: X9.19-MAC

OpenBenchmarking.orgMbytes/s, More Is BetterBotan 1.10.3Test: X9.19-MACGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81202040608010082.9476.1653.711. (CXX) g++ options: -m64 -ldl -lpthread -lrt -O2

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120110220330440550SE +/- 0.59, N = 4SE +/- 2.47, N = 4SE +/- 0.59, N = 4505.89490.89316.98-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options:

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812020406080100SE +/- 0.10, N = 4SE +/- 0.37, N = 4SE +/- 0.73, N = 467.2277.3065.70-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options:

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81202004006008001000SE +/- 2.03, N = 4SE +/- 7.73, N = 4SE +/- 1.57, N = 41008.881049.75820.03-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options:

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812030060090012001500SE +/- 1.39, N = 4SE +/- 7.60, N = 4SE +/- 2.48, N = 41221.851236.141450.28-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options:

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812030060090012001500SE +/- 0.61, N = 4SE +/- 4.52, N = 4SE +/- 3.29, N = 4859.391358.12618.79-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options:

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2013-06-08H.264 Video EncodingGCC 4.8.1LLVM Clang 3.320406080100SE +/- 0.12, N = 5SE +/- 0.39, N = 597.8894.481. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -ffast-math -march=bdver2 -std=gnu99 -fomit-frame-pointer -fno-tree-vectorize

GraphicsMagick

Operation: Blur

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: BlurGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 31167377-std=gnu99 -fopenmp -O3 -march=bdver2 -lgomp-O3 -march=bdver2-std=gnu99 -fopenmp -O2 -lgomp1. (CC) gcc options: -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: SharpenGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812020406080100SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3922647-std=gnu99 -fopenmp -O3 -march=bdver2 -lgomp-O3 -march=bdver2-std=gnu99 -fopenmp -O2 -lgomp1. (CC) gcc options: -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: ResizingGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 31528698-std=gnu99 -fopenmp -O3 -march=bdver2 -lgomp-O3 -march=bdver2-std=gnu99 -fopenmp -O2 -lgomp1. (CC) gcc options: -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: HWB Color SpaceGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81204080120160200SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.58, N = 3159121123-std=gnu99 -fopenmp -O3 -march=bdver2 -lgomp-O3 -march=bdver2-std=gnu99 -fopenmp -O2 -lgomp1. (CC) gcc options: -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.16Operation: Local Adaptive ThresholdingGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812020406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3834270-std=gnu99 -fopenmp -O3 -march=bdver2 -lgomp-O3 -march=bdver2-std=gnu99 -fopenmp -O2 -lgomp1. (CC) gcc options: -pthread -lXext -lSM -lICE -lX11 -lz -lm -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81202004006008001000SE +/- 0.75, N = 3SE +/- 3.27, N = 3SE +/- 0.43, N = 3835.46779.11519.34-march=bdver2-march=bdver21. (CC) gcc options: -O3

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81201428425670SE +/- 0.05, N = 3SE +/- 0.91, N = 3SE +/- 0.01, N = 333.1058.3263.68-march=bdver2-march=bdver21. (CC) gcc options: -lm -lpthread -O3

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 4.21e12 Prime Number GenerationGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120150300450600750SE +/- 4.88, N = 4SE +/- 0.66, N = 3SE +/- 1.70, N = 3298.53696.12245.09-fopenmp-fopenmp1. (CXX) g++ options: -O2

Smallpt

Global Illumination Renderer; 100 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 100 SamplesGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-812050100150200250SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 2.67, N = 357212137-O3 -march=bdver2-O3 -march=bdver21. (CXX) g++ options: -fopenmp

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.0WAV To FLACGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81203691215SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 56.817.119.56-O3 -march=bdver2 -logg-O3 -march=bdver2 -logg-O21. (CXX) g++ options: -fvisibility=hidden -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3GCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120612182430SE +/- 0.03, N = 5SE +/- 0.03, N = 5SE +/- 0.04, N = 516.8116.4624.56-march=bdver2-march=bdver2-fomit-frame-pointer -ffast-math1. (CC) gcc options: -pipe -O3 -lm

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 1.1H.264 HD To NTSC DVGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-8120714212835SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 315.5715.4729.38-lasound -lSDL -march=bdver2 -fno-tree-vectorize -MF -MT-lasound -lSDL -march=bdver2 -Qunused-arguments-lrt -fno-tree-vectorize -MF -MT1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -ldl -lm -pthread -O3 -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -MMD

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.3Static Web Page ServingGCC 4.8.1LLVM Clang 3.31306306-SO-RICHLANDG80-vs-81205K10K15K20K25KSE +/- 142.89, N = 3SE +/- 91.99, N = 3SE +/- 75.07, N = 321944.4022273.0011268.30-O3 -march=bdver2-O3 -march=bdver2-O21. (CC) gcc options: -shared -fPIC -pthread


Phoronix Test Suite v10.8.4