LLVM Clang 4.0 vs. GCC 7 January 2017 Compiler Benchmarks

Intel Core i7-6800K GCC 7 vs. LLVM Clang compiler benchmarks. Tests by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1701270-PTS-CLANG4GC49&grt.

System configuration (identical for all results except the compiler):

Processor: Intel Core i7-6800K @ 3.80GHz (12 Cores)
Motherboard: MSI X99A WORKSTATION (MS-7A54) v1.0
Chipset: Intel Xeon E7 v4/Xeon
Memory: 16384MB
Disk: 120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150
Graphics: NVIDIA GeForce GTX TITAN X 12288MB
Audio: Realtek ALC1150
Network: Intel Connection
OS: Ubuntu 16.04
Kernel: 4.4.0-59-generic (x86_64)
Desktop: Unity 7.4.0
Display Server: X Server 1.18.4
Display Driver: NVIDIA 375.27.03
OpenGL: 4.5.0
Vulkan: 1.0.8
File-System: ext4
Screen Resolution: 2560x1440

Compiler per result:

GCC 4.9.4: GCC 4.9.4
GCC 5.4.0: GCC 5.4.0
GCC 6.3.0: GCC 6.3.0
GCC 7.0.0 20170108: GCC 7.0.0 20170108
Clang 3.9.1: Clang 3.9.1-svn288847-1~exp1 + CUDA 8.0
Clang 4.0 SVN: Clang 4.0.0-svn293074-1~exp1 + CUDA 8.0

Compiler Details - GCC 4.9.4, GCC 5.4.0, GCC 6.3.0, GCC 7.0.0 20170108: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortran
Disk Details - DEADLINE / data=ordered,errors=remount-ro,relatime,rw
Processor Details - Scaling Governor: intel_pstate powersave
OpenCL Details - GPU Compute Cores: 3072
System Details - Python 2.7.12. GPU Compute Cores: 3072.

Result overview table (columns: GCC 4.9.4, GCC 5.4.0, GCC 6.3.0, GCC 7.0.0 20170108, Clang 3.9.1, Clang 4.0 SVN; one row per test). The individual values appear, with their standard errors, in the per-test sections below.

7-Zip Compression

Compress Speed Test

7-Zip Compression 9.20.1 (MIPS, more is better)
GCC 4.9.4: 32469 (SE +/- 179.79, N = 3)
GCC 5.4.0: 32184 (SE +/- 117.90, N = 3)
GCC 6.3.0: 32665 (SE +/- 154.09, N = 3)
GCC 7.0.0 20170108: 33875 (SE +/- 303.27, N = 3)
Clang 3.9.1: 34173 (SE +/- 158.21, N = 3)
Clang 4.0 SVN: 34054 (SE +/- 274.11, N = 3)
(CXX) g++ options: -pipe -lpthread
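Each result on this page is reported as a mean with a standard error (SE) over N runs. One rough way to judge whether two compilers differ by more than run-to-run noise is to divide the gap between their means by the combined standard error. The sketch below applies that idea to three of the 7-Zip figures; the function name and the threshold of 2 are illustrative assumptions rather than anything the Phoronix Test Suite defines, and with N = 3 the outcome is only indicative.

```python
import math


def noise_ratio(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# 7-Zip Compress Speed Test (MIPS): (mean, SE) pairs copied from this page.
gcc_494 = (32469, 179.79)
gcc_540 = (32184, 117.90)
clang_391 = (34173, 158.21)

THRESHOLD = 2.0  # assumed rule of thumb, not a Phoronix Test Suite setting

for label, other in (("GCC 5.4.0", gcc_540), ("Clang 3.9.1", clang_391)):
    ratio = noise_ratio(*gcc_494, *other)
    verdict = "likely real" if ratio > THRESHOLD else "within noise"
    print(f"GCC 4.9.4 vs {label}: {ratio:.1f} combined SEs ({verdict})")
```

Under these assumptions, a ratio well above the threshold suggests a genuine difference, while a ratio near or below it means the two results are hard to tell apart.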

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.7 (Requests Per Second, more is better)
GCC 4.9.4: 45057.24 (SE +/- 153.19, N = 3)
GCC 5.4.0: 44610.32 (SE +/- 161.65, N = 3)
GCC 6.3.0: 44730.44 (SE +/- 77.65, N = 3)
GCC 7.0.0 20170108: 45489.99 (SE +/- 31.18, N = 3)
Clang 3.9.1: 45813.18 (SE +/- 165.88, N = 3)
Clang 4.0 SVN: 45498.68 (SE +/- 586.43, N = 3)
(CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Bullet Physics Engine

Test: Raytests

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 2.86 (SE +/- 0.00, N = 3)
GCC 5.4.0: 2.85 (SE +/- 0.00, N = 3)
GCC 6.3.0: 2.86 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 2.96 (SE +/- 0.01, N = 3)
Clang 3.9.1: 2.97 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 2.96 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 4.82 (SE +/- 0.08, N = 3)
GCC 5.4.0: 4.68 (SE +/- 0.00, N = 3)
GCC 6.3.0: 4.67 (SE +/- 0.01, N = 3)
GCC 7.0.0 20170108: 4.62 (SE +/- 0.01, N = 3)
Clang 3.9.1: 4.93 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 4.91 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 5.29 (SE +/- 0.03, N = 3)
GCC 5.4.0: 5.21 (SE +/- 0.00, N = 3)
GCC 6.3.0: 5.23 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 5.21 (SE +/- 0.02, N = 3)
Clang 3.9.1: 5.64 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 5.64 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 4.99 (SE +/- 0.00, N = 3)
GCC 5.4.0: 4.96 (SE +/- 0.01, N = 3)
GCC 6.3.0: 5.00 (SE +/- 0.01, N = 3)
GCC 7.0.0 20170108: 5.62 (SE +/- 0.02, N = 3)
Clang 3.9.1: 5.07 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 5.16 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 3.25 (SE +/- 0.01, N = 3)
GCC 5.4.0: 3.20 (SE +/- 0.01, N = 3)
GCC 6.3.0: 3.20 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 3.17 (SE +/- 0.00, N = 3)
Clang 3.9.1: 3.38 (SE +/- 0.00, N = 3)
Clang 4.0 SVN: 3.39 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 1.15 (SE +/- 0.05, N = 3)
GCC 5.4.0: 1.10 (SE +/- 0.00, N = 3)
GCC 6.3.0: 1.09 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 1.07 (SE +/- 0.00, N = 3)
Clang 3.9.1: 1.10 (SE +/- 0.00, N = 3)
Clang 4.0 SVN: 1.10 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 (Seconds, fewer is better)
GCC 4.9.4: 1.31 (SE +/- 0.00, N = 3)
GCC 5.4.0: 1.31 (SE +/- 0.00, N = 3)
GCC 6.3.0: 1.30 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 1.37 (SE +/- 0.01, N = 3)
Clang 3.9.1: 1.29 (SE +/- 0.00, N = 3)
Clang 4.0 SVN: 1.31 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

C-Ray

Total Time

C-Ray 1.1 (Seconds, fewer is better)
GCC 4.9.4: 12.37 (SE +/- 0.00, N = 3)
GCC 5.4.0: 12.28 (SE +/- 0.01, N = 3)
GCC 6.3.0: 12.23 (SE +/- 0.01, N = 3)
GCC 7.0.0 20170108: 13.94 (SE +/- 0.01, N = 3)
Clang 3.9.1: 18.12 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 17.90 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm -lpthread -O3 -march=native
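For "fewer is better" results such as C-Ray, the spread between compilers is easier to read when each time is expressed relative to the fastest build. The following is a minimal sketch of that normalization using the C-Ray times from this page; the function and variable names are hypothetical and chosen only for illustration.

```python
def percent_slower_than_best(times):
    """Map each label to how much slower it is than the fastest entry, in percent."""
    best = min(times.values())
    return {label: (t / best - 1.0) * 100.0 for label, t in times.items()}


# C-Ray 1.1 total time in seconds, copied from the result on this page.
c_ray_seconds = {
    "GCC 4.9.4": 12.37,
    "GCC 5.4.0": 12.28,
    "GCC 6.3.0": 12.23,
    "GCC 7.0.0 20170108": 13.94,
    "Clang 3.9.1": 18.12,
    "Clang 4.0 SVN": 17.90,
}

slowdowns = percent_slower_than_best(c_ray_seconds)
for label, pct in slowdowns.items():
    print(f"{label:<20} {pct:5.1f}% slower than the fastest build")
```

For "more is better" results the same idea applies with the maximum as the reference and the ratio inverted.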

Caffe

Build: CPU AlexNet

Caffe 2016-12-29 (Milliseconds, fewer is better; no GCC 4.9.4 result)
GCC 5.4.0: 397203 (SE +/- 268.06, N = 3)
GCC 6.3.0: 360816 (SE +/- 302.65, N = 3)
GCC 7.0.0 20170108: 363428 (SE +/- 66.71, N = 3)
Clang 3.9.1: 397543 (SE +/- 263.90, N = 3)
Clang 4.0 SVN: 395083 (SE +/- 101.92, N = 3)
(CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Caffe

Build: CPU Googlenet

Caffe 2016-12-29 (Milliseconds, fewer is better; no GCC 4.9.4 result)
GCC 5.4.0: 764929 (SE +/- 606.01, N = 3)
GCC 6.3.0: 780157 (SE +/- 248.34, N = 3)
GCC 7.0.0 20170108: 784263 (SE +/- 634.40, N = 3)
Clang 3.9.1: 775067 (SE +/- 2777.87, N = 3)
Clang 4.0 SVN: 790014 (SE +/- 13446.41, N = 6)
(CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Crafty

Elapsed Time

Crafty 23.4 (Seconds, fewer is better)
GCC 4.9.4: 70.12 (SE +/- 0.05, N = 3)
GCC 5.4.0: 70.20 (SE +/- 0.05, N = 3)
GCC 6.3.0: 68.99 (SE +/- 0.06, N = 3)
GCC 7.0.0 20170108: 68.14 (SE +/- 0.06, N = 3)
Clang 3.9.1: 68.01 (SE +/- 0.03, N = 3)
Clang 4.0 SVN: 68.14 (SE +/- 0.01, N = 3)
(CC) gcc options: -lstdc++ -lm

ebizzy

Phoronix Test Suite v7.0.0m1

ebizzy 0.3 (Records/s, more is better)
GCC 4.9.4: 194802 (SE +/- 5743.30, N = 6)
GCC 5.4.0: 186104 (SE +/- 3471.19, N = 3)
GCC 6.3.0: 195641 (SE +/- 3146.69, N = 3)
GCC 7.0.0 20170108: 184635 (SE +/- 4871.23, N = 6)
Clang 3.9.1: 215021 (SE +/- 1941.51, N = 3)
Clang 4.0 SVN: 218214 (SE +/- 511.25, N = 3)
(CC) gcc options: -pthread -lpthread -O3 -march=native

FFmpeg

H.264 HD To NTSC DV

FFmpeg 2.8.1 (Seconds, fewer is better)
GCC 4.9.4: 17.62 (SE +/- 0.13, N = 3)
GCC 5.4.0: 17.58 (SE +/- 0.10, N = 3)
GCC 6.3.0: 17.66 (SE +/- 0.13, N = 3)
GCC 7.0.0 20170108: 16.98 (SE +/- 0.10, N = 3)
Clang 3.9.1: 19.15 (SE +/- 0.01, N = 3)
Clang 4.0 SVN: 17.37 (SE +/- 0.08, N = 3)
Per-result flags shown on the chart, in order: -fno-tree-vectorize (x4), -Qunused-arguments -MMD -MF -MT (x2)
(CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lxcb -lxcb-shm -lxcb-xfixes -lxcb-render -lxcb-shape -lasound -lSDL -lm -llzma -lbz2 -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

FFTW 3.3.4 (Mflops, more is better)
GCC 4.9.4: 13368 (SE +/- 43.00, N = 5)
GCC 5.4.0: 13835 (SE +/- 47.65, N = 5)
GCC 6.3.0: 14111 (SE +/- 56.20, N = 5)
GCC 7.0.0 20170108: 14021 (SE +/- 102.73, N = 5)
Clang 3.9.1: 13337 (SE +/- 32.87, N = 5)
Clang 4.0 SVN: 13353 (SE +/- 46.10, N = 5)
Per-result flags shown on the chart: -std=gnu99 (x1)
(CC) gcc options: -O3 -march=native -lm

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.1 (Seconds, fewer is better)
GCC 4.9.4: 6.27 (SE +/- 0.08, N = 7)
GCC 5.4.0: 6.75 (SE +/- 0.08, N = 5)
GCC 6.3.0: 6.66 (SE +/- 0.08, N = 5)
GCC 7.0.0 20170108: 6.64 (SE +/- 0.04, N = 5)
Clang 3.9.1: 6.45 (SE +/- 0.04, N = 5)
Clang 4.0 SVN: 6.45 (SE +/- 0.09, N = 5)
Per-result flags shown on the chart: -fvisibility=hidden (x4)
(CXX) g++ options: -O3 -march=native -logg -lm

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0 (MFLOPS, more is better)
GCC 4.9.4: 1803.23 (SE +/- 0.82, N = 3)
GCC 5.4.0: 1816.48 (SE +/- 0.56, N = 3)
GCC 6.3.0: 2179.24 (SE +/- 3.74, N = 3)
GCC 7.0.0 20170108: 2187.01 (SE +/- 1.23, N = 3)
Clang 3.9.1: 1707.22 (SE +/- 0.30, N = 3)
Clang 4.0 SVN: 1705.68 (SE +/- 1.40, N = 3)
(CC) gcc options: -O3 -march=native -mavx2

HPC Challenge

Test / Class: G-HPL

HPC Challenge 1.4.3 (GFLOPS, more is better)
GCC 4.9.4: 73.20 (SE +/- 0.26, N = 3)
GCC 5.4.0: 73.70 (SE +/- 0.03, N = 3)
GCC 6.3.0: 73.88 (SE +/- 0.26, N = 3)
GCC 7.0.0 20170108: 73.72 (SE +/- 0.14, N = 3)
Clang 3.9.1: 136.48 (SE +/- 0.32, N = 3)
Clang 4.0 SVN: 136.66 (SE +/- 0.43, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops
Built with: BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: G-Ffte

HPC Challenge 1.4.3 (GFLOPS, more is better)
GCC 4.9.4: 6.38503 (SE +/- 0.03120, N = 3)
GCC 5.4.0: 6.44751 (SE +/- 0.05490, N = 3)
GCC 6.3.0: 6.56845 (SE +/- 0.01518, N = 3)
GCC 7.0.0 20170108: 6.39207 (SE +/- 0.07723, N = 3)
Clang 3.9.1: 6.49640 (SE +/- 0.02520, N = 3)
Clang 4.0 SVN: 6.48360 (SE +/- 0.04097, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops
Built with: BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: EP-DGEMM

HPC Challenge 1.4.3 (GFLOPS, more is better)
GCC 4.9.4: 6.73993 (SE +/- 0.05234, N = 3)
GCC 5.4.0: 6.78137 (SE +/- 0.01720, N = 3)
GCC 6.3.0: 6.80147 (SE +/- 0.00120, N = 3)
GCC 7.0.0 20170108: 6.79701 (SE +/- 0.00215, N = 3)
Clang 3.9.1: 17.72063 (SE +/- 0.01624, N = 3)
Clang 4.0 SVN: 18.07377 (SE +/- 0.38223, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops
Built with: BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: G-Ptrans

HPC Challenge 1.4.3 (GB/s, more is better)
GCC 4.9.4: 4.15159 (SE +/- 0.03617, N = 3)
GCC 5.4.0: 4.08727 (SE +/- 0.02941, N = 3)
GCC 6.3.0: 4.16873 (SE +/- 0.03550, N = 3)
GCC 7.0.0 20170108: 4.15386 (SE +/- 0.02840, N = 3)
Clang 3.9.1: 4.20306 (SE +/- 0.00163, N = 3)
Clang 4.0 SVN: 4.18834 (SE +/- 0.02010, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops
Built with: BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: EP-STREAM Triad

HPC Challenge 1.4.3 (GB/s, more is better)
GCC 4.9.4: 3.31431 (SE +/- 0.13310, N = 3)
GCC 5.4.0: 2.91637 (SE +/- 0.16749, N = 3)
GCC 6.3.0: 2.75787 (SE +/- 0.10335, N = 3)
GCC 7.0.0 20170108: 2.69651 (SE +/- 0.03993, N = 3)
Clang 3.9.1: 2.92642 (SE +/- 0.05203, N = 3)
Clang 4.0 SVN: 3.17431 (SE +/- 0.09067, N = 3)
(CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops
Built with: BLAS + Open MPI 1.10.2

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.99.3 (Seconds, fewer is better)
GCC 4.9.4: 10.29 (SE +/- 0.00, N = 5)
GCC 5.4.0: 10.68 (SE +/- 0.12, N = 5)
GCC 6.3.0: 10.45 (SE +/- 0.02, N = 5)
GCC 7.0.0 20170108: 10.60 (SE +/- 0.01, N = 5)
Clang 3.9.1: 13.58 (SE +/- 0.01, N = 5)
Clang 4.0 SVN: 11.42 (SE +/- 0.03, N = 5)
Per-result flags shown on the chart, in order: -fomit-frame-pointer (x1), -funroll-loops (x3), -funroll-loops -lncurses (x2)
(CC) gcc options: -O3 -ffast-math -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 1.5.1 (Megapixels/sec, more is better)
GCC 4.9.4: 156.87 (SE +/- 0.21, N = 3)
GCC 5.4.0: 156.14 (SE +/- 0.09, N = 3)
GCC 6.3.0: 157.87 (SE +/- 0.39, N = 3)
GCC 7.0.0 20170108: 158.08 (SE +/- 1.01, N = 3)
Clang 3.9.1: 165.90 (SE +/- 3.31, N = 3)
Clang 4.0 SVN: 162.16 (SE +/- 0.63, N = 3)
(CC) gcc options: -O3 -march=native -lm

Minion

Benchmark: Graceful

Minion 1.8 (Seconds, fewer is better)
GCC 4.9.4: 64.66 (SE +/- 0.24, N = 3)
GCC 5.4.0: 64.30 (SE +/- 0.56, N = 3)
GCC 6.3.0: 65.24 (SE +/- 0.10, N = 3)
GCC 7.0.0 20170108: 65.51 (SE +/- 0.13, N = 3)
Clang 3.9.1: 62.39 (SE +/- 0.04, N = 3)
Clang 4.0 SVN: 62.26 (SE +/- 0.20, N = 3)
(CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

Minion

Benchmark: Solitaire

Minion 1.8 (Seconds, fewer is better)
GCC 4.9.4: 85.22 (SE +/- 0.30, N = 3)
GCC 5.4.0: 84.99 (SE +/- 0.28, N = 3)
GCC 6.3.0: 83.48 (SE +/- 0.22, N = 3)
GCC 7.0.0 20170108: 85.76 (SE +/- 0.20, N = 3)
Clang 3.9.1: 85.55 (SE +/- 0.07, N = 3)
Clang 4.0 SVN: 85.44 (SE +/- 0.20, N = 3)
(CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

Minion

Benchmark: Quasigroup

Minion 1.8 (Seconds, fewer is better)
GCC 4.9.4: 136.80 (SE +/- 0.18, N = 3)
GCC 5.4.0: 139.77 (SE +/- 0.15, N = 3)
GCC 6.3.0: 138.36 (SE +/- 0.04, N = 3)
GCC 7.0.0 20170108: 140.72 (SE +/- 1.92, N = 3)
Clang 3.9.1: 138.85 (SE +/- 1.05, N = 3)
Clang 4.0 SVN: 138.77 (SE +/- 0.92, N = 3)
(CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

Multichase Pointer Chaser

Test: 4MB Array, 64 Byte Stride

Multichase Pointer Chaser (ns, fewer is better)
GCC 4.9.4: 6.98 (SE +/- 0.00, N = 3)
GCC 5.4.0: 6.97 (SE +/- 0.01, N = 3)
GCC 6.3.0: 6.98 (SE +/- 0.00, N = 3)
GCC 7.0.0 20170108: 6.97 (SE +/- 0.00, N = 3)
Clang 3.9.1: 6.97 (SE +/- 0.00, N = 3)
Clang 4.0 SVN: 6.97 (SE +/- 0.01, N = 3)
(CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 256MB Array, 256 Byte Stride

Multichase Pointer Chaser (ns, fewer is better)
GCC 4.9.4: 56.82 (SE +/- 0.83, N = 3)
GCC 5.4.0: 58.42 (SE +/- 0.93, N = 3)
GCC 6.3.0: 57.38 (SE +/- 1.35, N = 6)
GCC 7.0.0 20170108: 58.49 (SE +/- 0.98, N = 3)
Clang 3.9.1: 59.13 (SE +/- 0.57, N = 3)
Clang 4.0 SVN: 57.48 (SE +/- 1.27, N = 6)
(CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 4 Threads

Multichase Pointer Chaser (ns, fewer is better)
GCC 4.9.4: 65.35 (SE +/- 0.95, N = 3)
GCC 5.4.0: 66.14 (SE +/- 0.17, N = 3)
GCC 6.3.0: 64.79 (SE +/- 1.11, N = 4)
GCC 7.0.0 20170108: 65.63 (SE +/- 0.45, N = 3)
Clang 3.9.1: 64.36 (SE +/- 0.95, N = 3)
Clang 4.0 SVN: 65.14 (SE +/- 0.88, N = 3)
(CC) gcc options: -O2 -static -pthread -lrt

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.0.1g (Signs Per Second, more is better)
GCC 4.9.4: 981.57 (SE +/- 0.75, N = 3)
GCC 5.4.0: 984.53 (SE +/- 1.26, N = 3)
GCC 6.3.0: 983.13 (SE +/- 0.55, N = 3)
GCC 7.0.0 20170108: 982.70 (SE +/- 1.10, N = 3)
Clang 3.9.1: 976.43 (SE +/- 0.26, N = 3)
Clang 4.0 SVN: 980.40 (SE +/- 1.04, N = 3)
(CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 9.4.3 (TPS, more is better)
GCC 4.9.4: 7986.34 (SE +/- 28.13, N = 3)
GCC 5.4.0: 7963.88 (SE +/- 15.65, N = 3)
GCC 6.3.0: 7986.01 (SE +/- 27.48, N = 3)
GCC 7.0.0 20170108: 7969.85 (SE +/- 23.18, N = 3)
Clang 3.9.1: 7975.19 (SE +/- 32.40, N = 3)
Clang 4.0 SVN: 7954.24 (SE +/- 16.04, N = 3)
Per-result flags shown on the chart: -pthreads -mthreads (x2)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Redis

Test: GET

Redis 3.0.1 (Requests Per Second, more is better)
GCC 4.9.4: 2054039.04 (SE +/- 93997.54, N = 6)
GCC 5.4.0: 2237143.92 (SE +/- 2889.57, N = 3)
GCC 6.3.0: 2198076.58 (SE +/- 17316.52, N = 3)
GCC 7.0.0 20170108: 2158090.65 (SE +/- 62256.75, N = 6)
Clang 3.9.1: 2138355.67 (SE +/- 9281.03, N = 3)
Clang 4.0 SVN: 2039510.77 (SE +/- 64445.11, N = 6)
Per-result flags shown on the chart: -std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native (x1)
(CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SET

Redis 3.0.1 (Requests Per Second, more is better)
GCC 4.9.4: 1650185.92 (SE +/- 4150.84, N = 3)
GCC 5.4.0: 1661193.92 (SE +/- 7318.36, N = 3)
GCC 6.3.0: 1528154.69 (SE +/- 62672.40, N = 6)
GCC 7.0.0 20170108: 1684492.46 (SE +/- 6181.11, N = 3)
Clang 3.9.1: 1522549.12 (SE +/- 42383.29, N = 6)
Clang 4.0 SVN: 1491974.11 (SE +/- 70447.16, N = 6)
Per-result flags shown on the chart: -std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native (x1)
(CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

SciMark

Computational Test: Composite

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 1504.18 (SE +/- 5.44, N = 4)
GCC 5.4.0: 1422.54 (SE +/- 1.39, N = 4)
GCC 6.3.0: 1497.82 (SE +/- 0.76, N = 4)
GCC 7.0.0 20170108: 1627.73 (SE +/- 5.61, N = 4)
Clang 3.9.1: 2189.66 (SE +/- 1.08, N = 4)
Clang 4.0 SVN: 2219.84 (SE +/- 2.82, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 613.36 (SE +/- 33.75, N = 4)
GCC 5.4.0: 641.57 (SE +/- 2.50, N = 4)
GCC 6.3.0: 643.99 (SE +/- 0.03, N = 4)
GCC 7.0.0 20170108: 647.15 (SE +/- 3.03, N = 4)
Clang 3.9.1: 268.97 (SE +/- 0.02, N = 4)
Clang 4.0 SVN: 685.52 (SE +/- 0.04, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 339.43 (SE +/- 0.48, N = 4)
GCC 5.4.0: 340.07 (SE +/- 0.51, N = 4)
GCC 6.3.0: 340.01 (SE +/- 0.15, N = 4)
GCC 7.0.0 20170108: 341.03 (SE +/- 0.83, N = 4)
Clang 3.9.1: 348.53 (SE +/- 0.61, N = 4)
Clang 4.0 SVN: 340.46 (SE +/- 0.12, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 2565.53 (SE +/- 2.79, N = 4)
GCC 5.4.0: 2172.90 (SE +/- 6.54, N = 4)
GCC 6.3.0: 2567.75 (SE +/- 4.07, N = 4)
GCC 7.0.0 20170108: 2587.40 (SE +/- 8.80, N = 4)
Clang 3.9.1: 2815.31 (SE +/- 7.94, N = 4)
Clang 4.0 SVN: 2624.13 (SE +/- 2.00, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 2791.88 (SE +/- 6.68, N = 4)
GCC 5.4.0: 2747.54 (SE +/- 6.90, N = 4)
GCC 6.3.0: 2726.62 (SE +/- 0.34, N = 4)
GCC 7.0.0 20170108: 3349.49 (SE +/- 12.21, N = 4)
Clang 3.9.1: 5699.09 (SE +/- 2.77, N = 4)
Clang 4.0 SVN: 5632.91 (SE +/- 12.29, N = 4)
(CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0 (Mflops, more is better)
GCC 4.9.4: 1210.68 (SE +/- 0.30, N = 4)
GCC 5.4.0: 1210.65 (SE +/- 0.24, N = 4)
GCC 6.3.0: 1210.76 (SE +/- 0.21, N = 4)
GCC 7.0.0 20170108: 1213.54 (SE +/- 3.46, N = 4)
Clang 3.9.1: 1816.41 (SE +/- 0.06, N = 4)
Clang 4.0 SVN: 1816.21 (SE +/- 0.05, N = 4)
(CXX) g++ options: -O3 -march=native
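The SciMark 2.0 Composite score is the arithmetic mean of the five kernel scores (Monte Carlo, Fast Fourier Transform, Sparse Matrix Multiply, Dense LU Matrix Factorization, Jacobi Successive Over-Relaxation), so the Composite result can be cross-checked against the five kernel results on this page. The sketch below does that; the variable names are illustrative, and the small residuals come from the published kernel scores being rounded to two decimals.

```python
# Kernel scores in Mflops, in the order: Monte Carlo, FFT, Sparse, Dense LU, Jacobi SOR.
kernels = {
    "GCC 4.9.4": [613.36, 339.43, 2565.53, 2791.88, 1210.68],
    "GCC 5.4.0": [641.57, 340.07, 2172.90, 2747.54, 1210.65],
    "GCC 6.3.0": [643.99, 340.01, 2567.75, 2726.62, 1210.76],
    "GCC 7.0.0 20170108": [647.15, 341.03, 2587.40, 3349.49, 1213.54],
    "Clang 3.9.1": [268.97, 348.53, 2815.31, 5699.09, 1816.41],
    "Clang 4.0 SVN": [685.52, 340.46, 2624.13, 5632.91, 1816.21],
}

# Composite scores as reported on this page.
reported_composite = {
    "GCC 4.9.4": 1504.18,
    "GCC 5.4.0": 1422.54,
    "GCC 6.3.0": 1497.82,
    "GCC 7.0.0 20170108": 1627.73,
    "Clang 3.9.1": 2189.66,
    "Clang 4.0 SVN": 2219.84,
}


def mean(values):
    """Arithmetic mean of a non-empty sequence of numbers."""
    return sum(values) / len(values)


for compiler, scores in kernels.items():
    recomputed = mean(scores)
    delta = recomputed - reported_composite[compiler]
    print(f"{compiler:<20} recomputed {recomputed:8.2f}  reported "
          f"{reported_composite[compiler]:8.2f}  delta {delta:+.3f}")
```

Because the composite is a plain average, a large swing in a single kernel, such as the Dense LU gap between GCC and Clang, moves the composite substantially.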

Stockfish

Total Time

Stockfish 2014-11-26 (ms, fewer is better)
GCC 4.9.4: 3386 (SE +/- 9.24, N = 3)
GCC 5.4.0: 3479 (SE +/- 24.50, N = 3)
GCC 6.3.0: 3462 (SE +/- 0.88, N = 3)
GCC 7.0.0 20170108: 3419 (SE +/- 19.50, N = 3)
Clang 3.9.1: 3438 (SE +/- 9.54, N = 3)
Clang 4.0 SVN: 3364 (SE +/- 42.15, N = 3)
Per-result flags shown on the chart: -flto (x4)
(CXX) g++ options: -lpthread -O3 -march=native -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt

t-test1

Threads: 1

t-test1 2017-01-13 (Seconds, fewer is better)
GCC 4.9.4: 24.35 (SE +/- 0.49, N = 3)
GCC 5.4.0: 25.38 (SE +/- 0.39, N = 5)
GCC 6.3.0: 23.89 (SE +/- 0.26, N = 3)
GCC 7.0.0 20170108: 24.66 (SE +/- 0.10, N = 3)
Clang 3.9.1: 25.00 (SE +/- 0.36, N = 6)
Clang 4.0 SVN: 23.78 (SE +/- 0.42, N = 6)
(CC) gcc options: -pthread -O3 -march=native

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 (Seconds, fewer is better)
GCC 4.9.4: 7.24 (SE +/- 0.04, N = 3)
GCC 5.4.0: 7.24 (SE +/- 0.04, N = 3)
GCC 6.3.0: 7.24 (SE +/- 0.06, N = 3)
GCC 7.0.0 20170108: 7.17 (SE +/- 0.03, N = 3)
Clang 3.9.1: 7.23 (SE +/- 0.03, N = 3)
Clang 4.0 SVN: 7.21 (SE +/- 0.05, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed ImageMagick Compilation

Time To Compile

Timed ImageMagick Compilation 6.9.0 (Seconds, fewer is better; no GCC 4.9.4 result)
GCC 5.4.0: 43.18 (SE +/- 0.06, N = 3)
GCC 6.3.0: 63.93 (SE +/- 0.11, N = 3)
GCC 7.0.0 20170108: 51.16 (SE +/- 0.09, N = 3)
Clang 3.9.1: 41.91 (SE +/- 0.15, N = 3)
Clang 4.0 SVN: 43.07 (SE +/- 0.10, N = 3)

Timed MAFFT Alignment

Multiple Sequence Alignment

Timed MAFFT Alignment 6.864 (Seconds, fewer is better)
GCC 4.9.4: 3.52 (SE +/- 0.02, N = 3)
GCC 5.4.0: 3.59 (SE +/- 0.06, N = 6)
GCC 6.3.0: 3.65 (SE +/- 0.09, N = 6)
GCC 7.0.0 20170108: 3.71 (SE +/- 0.06, N = 5)
Clang 3.9.1: 3.78 (SE +/- 0.02, N = 3)
Clang 4.0 SVN: 3.90 (SE +/- 0.07, N = 3)
(CC) gcc options: -O3 -lm -lpthread

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 5.2.9 (Seconds, fewer is better)
GCC 4.9.4: 19.46 (SE +/- 0.05, N = 3)
GCC 5.4.0: 18.96 (SE +/- 0.05, N = 3)
GCC 6.3.0: 19.53 (SE +/- 0.05, N = 3)
GCC 7.0.0 20170108: 19.57 (SE +/- 0.02, N = 3)
Clang 3.9.1: 20.39 (SE +/- 0.12, N = 3)
Clang 4.0 SVN: 15.80 (SE +/- 0.01, N = 3)
(CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

TSCP

AI Chess Performance

TSCP 1.81 (Nodes Per Second, more is better)
GCC 4.9.4: 1239375 (SE +/- 682.67, N = 5)
GCC 5.4.0: 1231615 (SE +/- 550.00, N = 5)
GCC 6.3.0: 1269073 (SE +/- 0.00, N = 5)
GCC 7.0.0 20170108: 1226677 (SE +/- 0.00, N = 4)
Clang 3.9.1: 1126215 (SE +/- 9791.09, N = 5)
Clang 4.0 SVN: 1180385 (SE +/- 17284.55, N = 5)
(CC) gcc options: -O3 -march=native


Phoronix Test Suite v10.8.4