Intel Xeon Compiler Tests

LLVM Clang 3.7 benchmarks against the open-source GCC compiler on Ubuntu Linux with an Intel Xeon CPU. Benchmarks by Michael Larabel for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1508311-HA-INTELXEON12&sro.

System configuration (as reported for GCC 4.9.2; the other configurations differ only in the compiler):

Processor: Intel Xeon E5-2687W v3 @ 3.50GHz (20 Cores)
Motherboard: MSI X99S SLI PLUS (MS-7885) v1.0
Chipset: Intel Xeon E5 v3/Core
Memory: 16384MB
Disk: 80GB INTEL SSDSCKGW08
Graphics: NVIDIA GeForce GTX Titan 6144MB
Audio: Intel C610/X99
Monitor: ASUS PB278
Network: Intel Connection
OS: Ubuntu 15.04
Kernel: 3.19.0-26-generic (x86_64)
Desktop: Unity 7.3.2
Display Server: X Server 1.17.1
Display Driver: nouveau 1.0.11
OpenGL: 3.3 Mesa 10.5.2 Gallium 0.4
Compiler: GCC 4.9.2
File-System: ext4
Screen Resolution: 1920x1080

Compiler per configuration:
- GCC 4.9.2: GCC 4.9.2
- GCC 5.2.0: GCC 5.2.0
- Clang 3.6.0: Clang 3.6.0-2ubuntu1
- Clang 3.7.0 SVN: Clang 3.7.0 (SVN 246033) + LLVM 3.7.0

Compiler Details
- GCC 4.9.2: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- GCC 5.2.0: --disable-multilib --enable-checking=release --enable-languages=c,c++
- Clang 3.7.0 SVN: Optimized build; Built Aug 26 2015 (10:38:52); Default target: x86_64-unknown-linux-gnu; Host CPU: haswell

Disk Details
- GCC 4.9.2, Clang 3.6.0, Clang 3.7.0 SVN: DEADLINE / data=ordered,errors=remount-ro,relatime,rw

Processor Details
- Scaling Governor: intel_pstate powersave

System Details
- GCC 4.9.2, Clang 3.6.0, Clang 3.7.0 SVN: Disk Scheduler: DEADLINE.

Results summary

Test                                                GCC 4.9.2   GCC 5.2.0   Clang 3.6.0   Clang 3.7.0 SVN
scimark2: Composite                                   1409.73     1412.04       1832.41           1908.50
scimark2: Monte Carlo                                  555.46      550.59        548.82            551.34
scimark2: Fast Fourier Transform                       366.23      372.00        364.58            365.53
scimark2: Sparse Matrix Multiply                      2542.02     2546.83       2472.40           2794.85
scimark2: Dense LU Matrix Factorization               2541.56     2542.11       4271.12           4328.64
scimark2: Jacobi Successive Over-Relaxation           1043.38     1048.67       1505.13           1502.14
himeno: Poisson Pressure Solver                       1876.65     1875.72       1582.88           1601.74
build-apache: Time To Compile                           33.72       33.96         27.13             27.81
c-ray: Total Time                                       10.75       10.75         13.28             13.25
bullet: Raytests                                         3.44        3.41          3.40              3.43
bullet: 3000 Fall                                        5.99        5.96          5.86              6.05
bullet: 1000 Stack                                       6.37        6.33          6.29              6.47
bullet: 1000 Convex                                      6.06        5.86          5.92              6.00
bullet: 136 Ragdolls                                     3.95        3.85          3.82              3.86
bullet: Prim Trimesh                                     1.28        1.26          1.24              1.27
bullet: Convex Trimesh                                   1.56        1.52          1.51              1.52
encode-flac: WAV To FLAC                                 7.12        6.99          8.73              8.56
encode-mp3: WAV To MP3                                  12.91       12.70         12.54             14.20
pgbench: Buffer Test - Normal Load - Read Write       1148.95     1205.11       1163.78           1169.79
pgbench: Buffer Test - Single Thread - Read Write       72.05       74.28         72.17             73.29
apache: Static Web Page Serving                      16639.16    18776.91      16605.39          18791.30
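Because some tests report throughput (more is better) and others report elapsed time (fewer is better), comparing the raw numbers directly can mislead. The following is a minimal sketch of one way to normalize a handful of the summary figures into a signed percentage for Clang 3.7.0 SVN relative to GCC 5.2.0. The function name `clang_advantage_percent` and the choice of which rows to include are illustrative assumptions, not part of the Phoronix Test Suite output.

```python
# Figures copied from the results summary:
# (GCC 5.2.0 result, Clang 3.7.0 SVN result, higher_is_better)
results = {
    "scimark2: Composite": (1412.04, 1908.50, True),
    "scimark2: Dense LU Matrix Factorization": (2542.11, 4328.64, True),
    "himeno: Poisson Pressure Solver": (1875.72, 1601.74, True),
    "build-apache: Time To Compile": (33.96, 27.81, False),
    "c-ray: Total Time": (10.75, 13.25, False),
}


def clang_advantage_percent(gcc, clang, higher_is_better):
    """Hypothetical helper: positive means Clang did better, negative means worse.

    For throughput metrics the ratio is clang/gcc; for timing metrics it is
    inverted (gcc/clang) so that a shorter Clang time yields a positive value.
    """
    ratio = clang / gcc if higher_is_better else gcc / clang
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    for name, (gcc, clang, higher_is_better) in results.items():
        pct = clang_advantage_percent(gcc, clang, higher_is_better)
        print(f"{name}: {pct:+.1f}%")
```

Inverting the ratio for timing tests keeps the sign convention uniform, so every row can be read the same way regardless of the unit.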

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite (Mflops, more is better)
Clang 3.6.0: 1832.41 (SE +/- 2.51, N = 4)
Clang 3.7.0 SVN: 1908.50 (SE +/- 4.60, N = 4)
GCC 4.9.2: 1409.73 (SE +/- 11.02, N = 4)
GCC 5.2.0: 1412.04 (SE +/- 1.72, N = 4)

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo (Mflops, more is better)
Clang 3.6.0: 548.82 (SE +/- 6.01, N = 4)
Clang 3.7.0 SVN: 551.34 (SE +/- 1.31, N = 4)
GCC 4.9.2: 555.46 (SE +/- 0.07, N = 4)
GCC 5.2.0: 550.59 (SE +/- 2.60, N = 4)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, more is better)
Clang 3.6.0: 364.58 (SE +/- 0.71, N = 4)
Clang 3.7.0 SVN: 365.53 (SE +/- 0.54, N = 4)
GCC 4.9.2: 366.23 (SE +/- 4.15, N = 4)
GCC 5.2.0: 372.00 (SE +/- 0.76, N = 4)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, more is better)
Clang 3.6.0: 2472.40 (SE +/- 11.31, N = 4)
Clang 3.7.0 SVN: 2794.85 (SE +/- 14.19, N = 4)
GCC 4.9.2: 2542.02 (SE +/- 10.46, N = 4)
GCC 5.2.0: 2546.83 (SE +/- 7.17, N = 4)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, more is better)
Clang 3.6.0: 4271.12 (SE +/- 1.02, N = 4)
Clang 3.7.0 SVN: 4328.64 (SE +/- 11.71, N = 4)
GCC 4.9.2: 2541.56 (SE +/- 47.20, N = 4)
GCC 5.2.0: 2542.11 (SE +/- 10.91, N = 4)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, more is better)
Clang 3.6.0: 1505.13 (SE +/- 0.20, N = 4)
Clang 3.7.0 SVN: 1502.14 (SE +/- 3.00, N = 4)
GCC 4.9.2: 1043.38 (SE +/- 11.58, N = 4)
GCC 5.2.0: 1048.67 (SE +/- 3.56, N = 4)
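The SciMark composite figures reported above are consistent with being the arithmetic mean of the five kernel scores (Monte Carlo, Fast Fourier Transform, Sparse Matrix Multiply, Dense LU Matrix Factorization, Jacobi Successive Over-Relaxation). The sketch below checks that relationship using the numbers from this result file. The helper name `kernel_mean` and the 0.01 tolerance (to allow for two-decimal rounding in the export) are assumptions made for illustration.

```python
# Kernel scores in Mflops, in the order: Monte Carlo, FFT, Sparse Matrix
# Multiply, Dense LU Matrix Factorization, Jacobi Successive Over-Relaxation.
kernels = {
    "GCC 4.9.2": [555.46, 366.23, 2542.02, 2541.56, 1043.38],
    "GCC 5.2.0": [550.59, 372.00, 2546.83, 2542.11, 1048.67],
    "Clang 3.6.0": [548.82, 364.58, 2472.40, 4271.12, 1505.13],
    "Clang 3.7.0 SVN": [551.34, 365.53, 2794.85, 4328.64, 1502.14],
}

# Composite scores as reported in this result file.
composite = {
    "GCC 4.9.2": 1409.73,
    "GCC 5.2.0": 1412.04,
    "Clang 3.6.0": 1832.41,
    "Clang 3.7.0 SVN": 1908.50,
}


def kernel_mean(scores):
    """Hypothetical helper: arithmetic mean of a list of kernel scores."""
    return sum(scores) / len(scores)


if __name__ == "__main__":
    for name, scores in kernels.items():
        mean = kernel_mean(scores)
        # Tolerance of 0.01 is an assumption covering two-decimal rounding.
        matches = abs(mean - composite[name]) < 0.01
        print(f"{name}: mean of kernels = {mean:.2f}, "
              f"reported composite = {composite[name]:.2f}, match = {matches}")
```

Since the composite is a plain mean, a large swing in one kernel (here Dense LU and Jacobi SOR) moves the composite substantially even when the other kernels are nearly tied.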

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0, Poisson Pressure Solver (MFLOPS, more is better)
Clang 3.6.0: 1582.88 (SE +/- 1.38, N = 3)
Clang 3.7.0 SVN: 1601.74 (SE +/- 1.95, N = 3)
GCC 4.9.2: 1876.65 (SE +/- 1.81, N = 3)
GCC 5.2.0: 1875.72 (SE +/- 3.01, N = 3)
(CC) gcc options: -O3

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.7, Time To Compile (seconds, fewer is better)
Clang 3.6.0: 27.13 (SE +/- 0.23, N = 3)
Clang 3.7.0 SVN: 27.81 (SE +/- 0.27, N = 3)
GCC 4.9.2: 33.72 (SE +/- 0.34, N = 3)
GCC 5.2.0: 33.96 (SE +/- 0.27, N = 3)

C-Ray

Total Time

C-Ray 1.1, Total Time (seconds, fewer is better)
Clang 3.6.0: 13.28 (SE +/- 0.00, N = 3)
Clang 3.7.0 SVN: 13.25 (SE +/- 0.01, N = 3)
GCC 4.9.2: 10.75 (SE +/- 0.00, N = 3)
GCC 5.2.0: 10.75 (SE +/- 0.00, N = 3)
(CC) gcc options: -lm -lpthread -O3

Bullet Physics Engine

Test: Raytests

Bullet Physics Engine 2.81, Test: Raytests (seconds, fewer is better)
Clang 3.6.0: 3.40 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 3.43 (SE +/- 0.01, N = 3)
GCC 4.9.2: 3.44 (SE +/- 0.00, N = 3)
GCC 5.2.0: 3.41 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81, Test: 3000 Fall (seconds, fewer is better)
Clang 3.6.0: 5.86 (SE +/- 0.21, N = 3)
Clang 3.7.0 SVN: 6.05 (SE +/- 0.16, N = 3)
GCC 4.9.2: 5.99 (SE +/- 0.01, N = 3)
GCC 5.2.0: 5.96 (SE +/- 0.08, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81, Test: 1000 Stack (seconds, fewer is better)
Clang 3.6.0: 6.29 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 6.47 (SE +/- 0.06, N = 3)
GCC 4.9.2: 6.37 (SE +/- 0.01, N = 3)
GCC 5.2.0: 6.33 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81, Test: 1000 Convex (seconds, fewer is better)
Clang 3.6.0: 5.92 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 6.00 (SE +/- 0.08, N = 3)
GCC 4.9.2: 6.06 (SE +/- 0.00, N = 3)
GCC 5.2.0: 5.86 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81, Test: 136 Ragdolls (seconds, fewer is better)
Clang 3.6.0: 3.82 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 3.86 (SE +/- 0.00, N = 3)
GCC 4.9.2: 3.95 (SE +/- 0.01, N = 3)
GCC 5.2.0: 3.85 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81, Test: Prim Trimesh (seconds, fewer is better)
Clang 3.6.0: 1.24 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 1.27 (SE +/- 0.02, N = 3)
GCC 4.9.2: 1.28 (SE +/- 0.00, N = 3)
GCC 5.2.0: 1.26 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81, Test: Convex Trimesh (seconds, fewer is better)
Clang 3.6.0: 1.51 (SE +/- 0.01, N = 3)
Clang 3.7.0 SVN: 1.52 (SE +/- 0.02, N = 3)
GCC 4.9.2: 1.56 (SE +/- 0.00, N = 3)
GCC 5.2.0: 1.52 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -rdynamic -lglut -lGL -lGLU

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.1, WAV To FLAC (seconds, fewer is better)
Clang 3.6.0: 8.73 (SE +/- 0.02, N = 5)
Clang 3.7.0 SVN: 8.56 (SE +/- 0.05, N = 5)
GCC 4.9.2: 7.12 (SE +/- 0.04, N = 5)
GCC 5.2.0: 6.99 (SE +/- 0.04, N = 5)
Additional flags shown on the chart for individual configurations: -fvisibility=hidden
(CXX) g++ options: -O2 -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3, WAV To MP3 (seconds, fewer is better)
Clang 3.6.0: 12.54 (SE +/- 0.03, N = 5)
Clang 3.7.0 SVN: 14.20 (SE +/- 0.04, N = 5)
GCC 4.9.2: 12.91 (SE +/- 0.04, N = 5)
GCC 5.2.0: 12.70 (SE +/- 0.02, N = 5)
Additional flags shown on the chart for individual configurations: -funroll-loops, -fomit-frame-pointer
(CC) gcc options: -O3 -ffast-math -pipe -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 9.4.3, Scaling: Buffer Test - Test: Normal Load - Mode: Read Write (TPS, more is better)
Clang 3.6.0: 1163.78 (SE +/- 13.91, N = 3)
Clang 3.7.0 SVN: 1169.79 (SE +/- 17.02, N = 4)
GCC 4.9.2: 1148.95 (SE +/- 15.02, N = 3)
GCC 5.2.0: 1205.11 (SE +/- 15.80, N = 3)
Additional flags shown on the chart for individual configurations: -pthreads, -mthreads
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

PostgreSQL pgbench 9.4.3, Scaling: Buffer Test - Test: Single Thread - Mode: Read Write (TPS, more is better)
Clang 3.6.0: 72.17 (SE +/- 0.85, N = 3)
Clang 3.7.0 SVN: 73.29 (SE +/- 0.80, N = 3)
GCC 4.9.2: 72.05 (SE +/- 0.26, N = 3)
GCC 5.2.0: 74.28 (SE +/- 0.14, N = 3)
Additional flags shown on the chart for individual configurations: -pthreads, -mthreads
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.7, Static Web Page Serving (requests per second, more is better)
Clang 3.6.0: 16605.39 (SE +/- 132.05, N = 3)
Clang 3.7.0 SVN: 18791.30 (SE +/- 117.31, N = 3)
GCC 4.9.2: 16639.16 (SE +/- 48.20, N = 3)
GCC 5.2.0: 18776.91 (SE +/- 150.86, N = 3)
(CC) gcc options: -shared -fPIC -O2 -pthread


Phoronix Test Suite v10.8.4