LLVM Clang 4.0 vs. GCC 7 January 2017 Compiler Benchmarks

Intel Core i7-6800K GCC7 vs. LLVM Clang compiler benchmarks. Tests by Michael Larabel for a future article on phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1701270-PTS-CLANG4GC49&grr&sro.

LLVM Clang 4.0 vs. GCC 7 January 2017 Compiler BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108Clang 3.9.1Clang 4.0 SVNIntel Core i7-6800K @ 3.80GHz (12 Cores)MSI X99A WORKSTATION (MS-7A54) v1.0Intel Xeon E7 v4/Xeon16384MB120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150NVIDIA GeForce GTX TITAN X 12288MBRealtek ALC1150Intel ConnectionUbuntu 16.044.4.0-59-generic (x86_64)Unity 7.4.0X Server 1.18.4NVIDIA 375.27.034.5.01.0.8GCC 4.9.4ext42560x1440GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108Clang 3.9.1-svn288847-1~exp1 + CUDA 8.0Clang 4.0.0-svn293074-1~exp1 + CUDA 8.0OpenBenchmarking.orgCompiler Details- GCC 4.9.4, GCC 5.4.0, GCC 6.3.0, GCC 7.0.0 20170108: --disable-multilib --enable-checking=release --enable-languages=c,c++,fortranDisk Details- DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: intel_pstate powersaveOpenCL Details- GPU Compute Cores: 3072System Details- Python 2.7.12. GPU Compute Cores: 3072.

LLVM Clang 4.0 vs. GCC 7 January 2017 Compiler Benchmarksapache: Static Web Page Servingredis: SETredis: GETpgbench: Buffer Test - Normal Load - Read Writetjbench: Decompression Throughputcaffe: CPU Googlenetcaffe: CPU AlexNetmultichase: 1GB Array, 256 Byte Stride, 4 Threadsmultichase: 256MB Array, 256 Byte Stridemultichase: 4MB Array, 64 Byte Strideopenssl: RSA 4096-bit Performanceminion: Quasigroupminion: Solitaireminion: Gracefulffmpeg: H.264 HD To NTSC DVencode-mp3: WAV To MP3encode-flac: WAV To FLACcrafty: Elapsed Timebullet: Convex Trimeshbullet: Prim Trimeshbullet: 136 Ragdollsbullet: 1000 Convexbullet: 1000 Stackbullet: 3000 Fallbullet: Raytestsstockfish: Total Timec-ray: Total Timebuild-php: Time To Compilebuild-imagemagick: Time To Compileebizzy: Phoronix Test Suite v7.0.0m1compress-7zip: Compress Speed Testhimeno: Poisson Pressure Solvertscp: AI Chess Performancescimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte Carloscimark2: Compositemafft: Multiple Sequence Alignmenthmmer: Pfam Database Searchfftw: Float + SSE - 2D FFT Size 2048hpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Fftehpcc: G-HPLt-test1: 1GCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108Clang 3.9.1Clang 4.0 SVN45057.241650185.922054039.047986.34156.8765.3556.826.98981.57136.8085.2264.6617.6210.296.2770.121.311.153.254.995.294.822.86338612.3719.46194802324691803.2312393751210.682791.882565.53339.43613.361504.183.527.24133683.314314.151596.739936.3850373.2020324.3544610.321661193.922237143.927963.88156.1476492939720366.1458.426.97984.53139.7784.9964.3017.5810.686.7570.201.311.103.204.965.214.682.85347912.2818.9643.18186104321841816.4812316151210.652747.542172.90340.07641.571422.543.597.24138352.916374.087276.781376.4475173.6986025.3844730.441528154.692198076.587986.01157.8778015736081664.7957.386.98983.13138.3683.4865.2417.6610.456.6668.991.301.093.205.005.234.672.86346212.2319.5363.93195641326652179.2412690731210.762726.622567.75340.01643.991497.823.657.24141112.757874.168736.801476.5684573.8831723.8945489.991684492.462158090.657969.85158.0878426336342865.6358.496.97982.70140.7285.7665.5116.9810.606.6468.141.371.073.175.625.214.622.96341913.9419.5751.16184635338752187.0112266771213.543349.492587.40341.03647.151627.733.717.17140212.696514.153866.797016.3920773.7190024.6645813.181522549.122138355.677975.19165.9077506739754364.3659.136.97976.43138.8585.5562.3919.1513.586.4568.011.291.103.385.075.644.932.97343818.1220.3941.91215021341731707.2211262151816.415699.092815.31348.53268.972189.663.787.23133372.926424.2030617.720636.49640136.4760025.0045498.681491974.112039510.777954.24162.1679001439508365.1457.486.97980.40138.7785.4462.2617.3711.426.4568.141.311.103.395.165.644.912.96336417.9015.8043.07218214340541705.6811803851816.215632.912624.13340.46685.522219.843.907.21133533.174314.1883418.073776.48360136.6636723.78OpenBenchmarking.org

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010810K20K30K40K50KSE +/- 165.88, N = 3SE +/- 586.43, N = 3SE +/- 153.19, N = 3SE +/- 161.65, N = 3SE +/- 77.65, N = 3SE +/- 31.18, N = 345813.1845498.6845057.2444610.3244730.4445489.991. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SETClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108400K800K1200K1600K2000KSE +/- 42383.29, N = 6SE +/- 70447.16, N = 6SE +/- 4150.84, N = 3SE +/- 7318.36, N = 3SE +/- 62672.40, N = 6SE +/- 6181.11, N = 31522549.121491974.111650185.921661193.921528154.691684492.46-std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108500K1000K1500K2000K2500KSE +/- 9281.03, N = 3SE +/- 64445.11, N = 6SE +/- 93997.54, N = 6SE +/- 2889.57, N = 3SE +/- 17316.52, N = 3SE +/- 62256.75, N = 62138355.672039510.772054039.042237143.922198076.582158090.65-std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701082K4K6K8K10KSE +/- 32.40, N = 3SE +/- 16.04, N = 3SE +/- 28.13, N = 3SE +/- 15.65, N = 3SE +/- 27.48, N = 3SE +/- 23.18, N = 37975.197954.247986.347963.887986.017969.85-pthreads -mthreads-pthreads -mthreads1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.1Test: Decompression ThroughputClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701084080120160200SE +/- 3.31, N = 3SE +/- 0.63, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.39, N = 3SE +/- 1.01, N = 3165.90162.16156.87156.14157.87158.081. (CC) gcc options: -O3 -march=native -lm

Caffe

Build: CPU Googlenet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2016-12-29Build: CPU GooglenetClang 3.9.1Clang 4.0 SVNGCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108200K400K600K800K1000KSE +/- 2777.87, N = 3SE +/- 13446.41, N = 6SE +/- 606.01, N = 3SE +/- 248.34, N = 3SE +/- 634.40, N = 37750677900147649297801577842631. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Caffe

Build: CPU AlexNet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2016-12-29Build: CPU AlexNetClang 3.9.1Clang 4.0 SVNGCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010890K180K270K360K450KSE +/- 263.90, N = 3SE +/- 101.92, N = 3SE +/- 268.06, N = 3SE +/- 302.65, N = 3SE +/- 66.71, N = 33975433950833972033608163634281. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Multichase Pointer Chaser

Test: 1GB Array, 256 Byte Stride, 4 Threads

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 1GB Array, 256 Byte Stride, 4 ThreadsClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081530456075SE +/- 0.95, N = 3SE +/- 0.88, N = 3SE +/- 0.95, N = 3SE +/- 0.17, N = 3SE +/- 1.11, N = 4SE +/- 0.45, N = 364.3665.1465.3566.1464.7965.631. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 256MB Array, 256 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 256MB Array, 256 Byte StrideClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081326395265SE +/- 0.57, N = 3SE +/- 1.27, N = 6SE +/- 0.83, N = 3SE +/- 0.93, N = 3SE +/- 1.35, N = 6SE +/- 0.98, N = 359.1357.4856.8258.4257.3858.491. (CC) gcc options: -O2 -static -pthread -lrt

Multichase Pointer Chaser

Test: 4MB Array, 64 Byte Stride

OpenBenchmarking.orgns, Fewer Is BetterMultichase Pointer ChaserTest: 4MB Array, 64 Byte StrideClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.976.976.986.976.986.971. (CC) gcc options: -O2 -static -pthread -lrt

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit PerformanceClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701082004006008001000SE +/- 0.26, N = 3SE +/- 1.04, N = 3SE +/- 0.75, N = 3SE +/- 1.26, N = 3SE +/- 0.55, N = 3SE +/- 1.10, N = 3976.43980.40981.57984.53983.13982.701. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

Minion

Benchmark: Quasigroup

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: QuasigroupClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108306090120150SE +/- 1.05, N = 3SE +/- 0.92, N = 3SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 1.92, N = 3138.85138.77136.80139.77138.36140.721. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

Minion

Benchmark: Solitaire

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: SolitaireClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010820406080100SE +/- 0.07, N = 3SE +/- 0.20, N = 3SE +/- 0.30, N = 3SE +/- 0.28, N = 3SE +/- 0.22, N = 3SE +/- 0.20, N = 385.5585.4485.2284.9983.4885.761. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

Minion

Benchmark: Graceful

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: GracefulClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081530456075SE +/- 0.04, N = 3SE +/- 0.20, N = 3SE +/- 0.24, N = 3SE +/- 0.56, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 362.3962.2664.6664.3065.2465.511. (CXX) g++ options: -std=gnu++11 -O3 -fomit-frame-pointer -rdynamic

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 2.8.1H.264 HD To NTSC DVClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108510152025SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 319.1517.3717.6217.5817.6616.98-Qunused-arguments -MMD -MF -MT-Qunused-arguments -MMD -MF -MT-fno-tree-vectorize-fno-tree-vectorize-fno-tree-vectorize-fno-tree-vectorize1. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lxcb -lxcb-shm -lxcb-xfixes -lxcb-render -lxcb-shape -lasound -lSDL -lm -llzma -lbz2 -pthread -O3 -march=native -std=c99 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3Clang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701083691215SE +/- 0.01, N = 5SE +/- 0.03, N = 5SE +/- 0.00, N = 5SE +/- 0.12, N = 5SE +/- 0.02, N = 5SE +/- 0.01, N = 513.5811.4210.2910.6810.4510.60-funroll-loops -lncurses-funroll-loops -lncurses-fomit-frame-pointer-funroll-loops-funroll-loops-funroll-loops1. (CC) gcc options: -O3 -ffast-math -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.1WAV To FLACClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108246810SE +/- 0.04, N = 5SE +/- 0.09, N = 5SE +/- 0.08, N = 7SE +/- 0.08, N = 5SE +/- 0.08, N = 5SE +/- 0.04, N = 56.456.456.276.756.666.64-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

Crafty

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterCrafty 23.4Elapsed TimeClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081632486480SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 368.0168.1470.1270.2068.9968.141. (CC) gcc options: -lstdc++ -lm

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.30830.61660.92491.23321.5415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.291.311.311.311.301.371. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.25880.51760.77641.03521.294SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.101.101.151.101.091.071. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.76281.52562.28843.05123.814SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.383.393.253.203.203.171. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081.26452.5293.79355.0586.3225SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.075.164.994.965.005.621. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081.2692.5383.8075.0766.345SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 35.645.645.295.215.235.211. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081.10932.21863.32794.43725.5465SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.934.914.824.684.674.621. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.66831.33662.00492.67323.3415SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.972.962.862.852.862.961. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Stockfish

Total Time

OpenBenchmarking.orgms, Fewer Is BetterStockfish 2014-11-26Total TimeClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701087001400210028003500SE +/- 9.54, N = 3SE +/- 42.15, N = 3SE +/- 9.24, N = 3SE +/- 24.50, N = 3SE +/- 0.88, N = 3SE +/- 19.50, N = 3343833643386347934623419-flto-flto-flto-flto1. (CXX) g++ options: -lpthread -O3 -march=native -fno-exceptions -fno-rtti -ansi -pedantic -msse -msse3 -mpopcnt

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010848121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 318.1217.9012.3712.2812.2313.941. (CC) gcc options: -lm -lpthread -O3 -march=native

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To CompileClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108510152025SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 320.3915.8019.4618.9619.5319.571. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileClang 3.9.1Clang 4.0 SVNGCC 5.4.0GCC 6.3.0GCC 7.0.0 201701081428425670SE +/- 0.15, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 341.9143.0743.1863.9351.16

ebizzy

Phoronix Test Suite v7.0.0m1

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3Phoronix Test Suite v7.0.0m1Clang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010850K100K150K200K250KSE +/- 1941.51, N = 3SE +/- 511.25, N = 3SE +/- 5743.30, N = 6SE +/- 3471.19, N = 3SE +/- 3146.69, N = 3SE +/- 4871.23, N = 62150212182141948021861041956411846351. (CC) gcc options: -pthread -lpthread -O3 -march=native

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 9.20.1Compress Speed TestClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701087K14K21K28K35KSE +/- 158.21, N = 3SE +/- 274.11, N = 3SE +/- 179.79, N = 3SE +/- 117.90, N = 3SE +/- 154.09, N = 3SE +/- 303.27, N = 33417334054324693218432665338751. (CXX) g++ options: -pipe -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701085001000150020002500SE +/- 0.30, N = 3SE +/- 1.40, N = 3SE +/- 0.82, N = 3SE +/- 0.56, N = 3SE +/- 3.74, N = 3SE +/- 1.23, N = 31707.221705.681803.231816.482179.242187.011. (CC) gcc options: -O3 -march=native -mavx2

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108300K600K900K1200K1500KSE +/- 9791.09, N = 5SE +/- 17284.55, N = 5SE +/- 682.67, N = 5SE +/- 550.00, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 41126215118038512393751231615126907312266771. (CC) gcc options: -O3 -march=native

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108400800120016002000SE +/- 0.06, N = 4SE +/- 0.05, N = 4SE +/- 0.30, N = 4SE +/- 0.24, N = 4SE +/- 0.21, N = 4SE +/- 3.46, N = 41816.411816.211210.681210.651210.761213.541. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010812002400360048006000SE +/- 2.77, N = 4SE +/- 12.29, N = 4SE +/- 6.68, N = 4SE +/- 6.90, N = 4SE +/- 0.34, N = 4SE +/- 12.21, N = 45699.095632.912791.882747.542726.623349.491. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701086001200180024003000SE +/- 7.94, N = 4SE +/- 2.00, N = 4SE +/- 2.79, N = 4SE +/- 6.54, N = 4SE +/- 4.07, N = 4SE +/- 8.80, N = 42815.312624.132565.532172.902567.752587.401. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010880160240320400SE +/- 0.61, N = 4SE +/- 0.12, N = 4SE +/- 0.48, N = 4SE +/- 0.51, N = 4SE +/- 0.15, N = 4SE +/- 0.83, N = 4348.53340.46339.43340.07340.01341.031. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108150300450600750SE +/- 0.02, N = 4SE +/- 0.04, N = 4SE +/- 33.75, N = 4SE +/- 2.50, N = 4SE +/- 0.03, N = 4SE +/- 3.03, N = 4268.97685.52613.36641.57643.99647.151. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701085001000150020002500SE +/- 1.08, N = 4SE +/- 2.82, N = 4SE +/- 5.44, N = 4SE +/- 1.39, N = 4SE +/- 0.76, N = 4SE +/- 5.61, N = 42189.662219.841504.181422.541497.821627.731. (CXX) g++ options: -O3 -march=native

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence AlignmentClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.87751.7552.63253.514.3875SE +/- 0.02, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 6SE +/- 0.09, N = 6SE +/- 0.06, N = 53.783.903.523.593.653.711. (CC) gcc options: -O3 -lm -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108246810SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 37.237.217.247.247.247.171. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Float + SSE - Size: 2D FFT Size 2048Clang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701083K6K9K12K15KSE +/- 32.87, N = 5SE +/- 46.10, N = 5SE +/- 43.00, N = 5SE +/- 47.65, N = 5SE +/- 56.20, N = 5SE +/- 102.73, N = 5133371335313368138351411114021-std=gnu991. (CC) gcc options: -O3 -march=native -lm

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.4.3Test / Class: EP-STREAM TriadClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.74571.49142.23712.98283.7285SE +/- 0.05203, N = 3SE +/- 0.09067, N = 3SE +/- 0.13310, N = 3SE +/- 0.16749, N = 3SE +/- 0.10335, N = 3SE +/- 0.03993, N = 32.926423.174313.314312.916372.757872.696511. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.4.3Test / Class: G-PtransClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 201701080.94571.89142.83713.78284.7285SE +/- 0.00163, N = 3SE +/- 0.02010, N = 3SE +/- 0.03617, N = 3SE +/- 0.02941, N = 3SE +/- 0.03550, N = 3SE +/- 0.02840, N = 34.203064.188344.151594.087274.168734.153861. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: EP-DGEMMClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 2017010848121620SE +/- 0.01624, N = 3SE +/- 0.38223, N = 3SE +/- 0.05234, N = 3SE +/- 0.01720, N = 3SE +/- 0.00120, N = 3SE +/- 0.00215, N = 317.7206318.073776.739936.781376.801476.797011. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: G-FfteClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108246810SE +/- 0.02520, N = 3SE +/- 0.04097, N = 3SE +/- 0.03120, N = 3SE +/- 0.05490, N = 3SE +/- 0.01518, N = 3SE +/- 0.07723, N = 36.496406.483606.385036.447516.568456.392071. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.4.3Test / Class: G-HPLClang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108306090120150SE +/- 0.32, N = 3SE +/- 0.43, N = 3SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.26, N = 3SE +/- 0.14, N = 3136.48136.6673.2073.7073.8873.721. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -march=native -funroll-loops2. BLAS + Open MPI 1.10.2

t-test1

Threads: 1

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1Clang 3.9.1Clang 4.0 SVNGCC 4.9.4GCC 5.4.0GCC 6.3.0GCC 7.0.0 20170108612182430SE +/- 0.36, N = 6SE +/- 0.42, N = 6SE +/- 0.49, N = 3SE +/- 0.39, N = 5SE +/- 0.26, N = 3SE +/- 0.10, N = 325.0023.7824.3525.3823.8924.661. (CC) gcc options: -pthread -O3 -march=native


Phoronix Test Suite v10.8.4