Early GCC 7.0 Benchmarking On Intel x86_64 Linux

Intel Core i7-6800K testing of GCC 7.0 snapshot versus GCC 6.2 and GCC 5.4. Tests by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1612058-TA-GCCBENCHM34.

Early GCC 7.0 Benchmarking On Intel x86_64 LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127Intel Core i7-6800K @ 3.80GHz (12 Cores)MSI X99A WORKSTATION (MS-7A54) v1.0Intel Xeon E7 v4/Xeon16384MB120GB Samsung SSD 850 + 4 x 120GB TOSHIBA-TR150NVIDIA GeForce GTX TITAN X 12288MBRealtek ALC1150ASUS PB278Intel ConnectionUbuntu 16.104.8.0-28-generic (x86_64)Unity 7.5.0X Server 1.18.4nouveau 1.0.123.3 Mesa 12.0.3 Gallium 0.4GCC 5.4.0ext42560x1440GCC 6.2.0GCC 7.0.0 20161127OpenBenchmarking.orgCompiler Details- --disable-multilib --enable-checking=release --enable-languages=c,c++Disk Details- DEADLINE / data=ordered,errors=remount-ro,relatime,rwProcessor Details- Scaling Governor: intel_pstate powersave

Early GCC 7.0 Benchmarking On Intel x86_64 Linuxsqlite: Default Test Directoryfftw: Float + SSE - 2D FFT Size 4096hmmer: Pfam Database Searchmafft: Multiple Sequence Alignmentfhourstones: Complex Connect-4 Solvingscimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancejohn-the-ripper: Blowfishttsiod-renderer: Phong Rendering With Soft-Shadow Mappinggraphics-magick: Resizinggraphics-magick: HWB Color Spacegraphics-magick: Local Adaptive Thresholdinghimeno: Poisson Pressure Solverebizzy: Phoronix Test Suite v6.8.0m3build-imagemagick: Time To Compilebuild-linux-kernel: Time To Compilebuild-php: Time To Compilec-ray: Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshcrafty: Elapsed Timeencode-mp3: WAV To MP3n-queens: Elapsed Timeopenssl: RSA 4096-bit Performancecaffe: CPU AlexNetcaffe: CPU Googlenetpgbench: Buffer Test - Normal Load - Read Writepgbench: Buffer Test - Single Thread - Read Writepgbench: Buffer Test - Heavy Contention - Read Writeredis: GETredis: SETapache: Static Web Page ServingGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112725.82138026.793.5613169.231500.47642.13338.822590.282720.201210.94126785710214293.17189196931816.2918969443.8478.5118.9112.212.864.625.194.963.201.101.3169.2610.5325.66981.537958.56627.378761.902063098.711576321.2137233.7125.86144486.833.8213055.501482.31578.41339.062562.902719.951211.24132373210238294.36187199932213.1318894464.5283.7919.7012.152.864.655.194.983.201.091.3068.7210.5325.13981.104373619815497946.48622.568683.692152314.251412803.0037371.5025.73141306.833.8013265.501596.12601.84341.962475.933232.731208.50121462510066305.94190198922190.5118637751.8985.1419.7213.862.994.645.185.353.161.071.3666.3010.48214.13983.037988.08636.238677.261906819.191338386.5037345.18OpenBenchmarking.org

SQLite

Test Target: Default Test Directory

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.8.10.2Test Target: Default Test DirectoryGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127612182430SE +/- 0.15, N = 3SE +/- 0.17, N = 3SE +/- 0.26, N = 325.8225.8625.731. (CC) gcc options: -O3 -march=native -ldl -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.4Build: Float + SSE - Size: 2D FFT Size 4096GCC 5.4.0GCC 6.2.0GCC 7.0.0 201611273K6K9K12K15KSE +/- 54.71, N = 5SE +/- 147.31, N = 5SE +/- 41.05, N = 51380214448141301. (CC) gcc options: -O3 -march=native -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 2.3.2Pfam Database SearchGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.796.836.831. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Timed MAFFT Alignment

Multiple Sequence Alignment

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 6.864Multiple Sequence AlignmentGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611270.85951.7192.57853.4384.2975SE +/- 0.04, N = 3SE +/- 0.08, N = 6SE +/- 0.07, N = 63.563.823.801. (CC) gcc options: -O3 -lm -lpthread

Fhourstones

Complex Connect-4 Solving

OpenBenchmarking.orgKpos / sec, More Is BetterFhourstones 3.1Complex Connect-4 SolvingGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611273K6K9K12K15KSE +/- 71.14, N = 3SE +/- 11.67, N = 3SE +/- 11.21, N = 313169.2313055.5013265.501. (CC) gcc options: -O3

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112730060090012001500SE +/- 1.02, N = 4SE +/- 6.64, N = 4SE +/- 25.27, N = 81500.471482.311596.121. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127140280420560700SE +/- 0.01, N = 4SE +/- 36.88, N = 4SE +/- 32.30, N = 4642.13578.41601.841. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112770140210280350SE +/- 0.66, N = 4SE +/- 0.70, N = 4SE +/- 0.91, N = 4338.82339.06341.961. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611276001200180024003000SE +/- 2.85, N = 4SE +/- 3.29, N = 4SE +/- 116.86, N = 42590.282562.902475.931. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611277001400210028003500SE +/- 2.83, N = 4SE +/- 2.50, N = 4SE +/- 105.82, N = 42720.202719.953232.731. (CXX) g++ options: -O3 -march=native

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112730060090012001500SE +/- 0.27, N = 4SE +/- 0.08, N = 4SE +/- 1.69, N = 41210.941211.241208.501. (CXX) g++ options: -O3 -march=native

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127300K600K900K1200K1500KSE +/- 13238.20, N = 5SE +/- 22912.67, N = 3SE +/- 23083.56, N = 101267857132373212146251. (CC) gcc options: -O3 -march=native

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.8.0Test: BlowfishGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611272K4K6K8K10KSE +/- 5.00, N = 3SE +/- 0.00, N = 3SE +/- 7.00, N = 31021410238100661. (CC) gcc options: -fopenmp -lcrypt

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3aPhong Rendering With Soft-Shadow MappingGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112770140210280350SE +/- 1.10, N = 3SE +/- 1.23, N = 3SE +/- 0.75, N = 3293.17294.36305.941. (CXX) g++ options: -O3 -march=native -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -lstdc++

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: ResizingGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611274080120160200SE +/- 2.03, N = 3SE +/- 0.88, N = 3SE +/- 1.15, N = 31891871901. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: HWB Color SpaceGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611274080120160200SE +/- 0.88, N = 3SE +/- 1.33, N = 3SE +/- 1.00, N = 31961991981. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -ldl -lpthread

GraphicsMagick

Operation: Local Adaptive Thresholding

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.19Operation: Local Adaptive ThresholdingGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112720406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 39393921. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lxml2 -lz -lm -ldl -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611275001000150020002500SE +/- 1.37, N = 3SE +/- 28.84, N = 3SE +/- 0.19, N = 31816.292213.132190.511. (CC) gcc options: -O3 -march=native -mavx2

ebizzy

Phoronix Test Suite v6.8.0m3

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3Phoronix Test Suite v6.8.0m3GCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112740K80K120K160K200KSE +/- 5429.90, N = 6SE +/- 472.32, N = 3SE +/- 4882.08, N = 61896941889441863771. (CC) gcc options: -pthread -lpthread -O3 -march=native

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To CompileGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611271428425670SE +/- 0.21, N = 3SE +/- 0.14, N = 3SE +/- 0.17, N = 343.8464.5251.89

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.3Time To CompileGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112720406080100SE +/- 1.07, N = 3SE +/- 0.94, N = 3SE +/- 0.90, N = 378.5183.7985.14

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 5.2.9Time To CompileGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127510152025SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 318.9119.7019.721. (CC) gcc options: -O3 -march=native -pedantic -ldl -lz -lm

C-Ray

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total TimeGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112748121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 312.2112.1513.861. (CC) gcc options: -lm -lpthread -O3 -march=native

Bullet Physics Engine

Test: Raytests

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611270.67281.34562.01842.69123.364SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.862.862.991. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 3000 Fall

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611271.04632.09263.13894.18525.2315SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 34.624.654.641. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Stack

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611271.16782.33563.50344.67125.839SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.195.195.181. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 1000 Convex

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611271.20382.40763.61144.81526.019SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 34.964.985.351. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: 136 Ragdolls

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611270.721.442.162.883.6SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.203.203.161. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Prim Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611270.24750.4950.74250.991.2375SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.101.091.071. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Bullet Physics Engine

Test: Convex Trimesh

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611270.3060.6120.9181.2241.53SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.311.301.361. (CXX) g++ options: -O3 -march=native -rdynamic -lglut -lGL -lGLU

Crafty

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterCrafty 23.4Elapsed TimeGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611271530456075SE +/- 0.83, N = 3SE +/- 0.10, N = 3SE +/- 0.80, N = 369.2668.7266.301. (CC) gcc options: -lstdc++ -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.99.3WAV To MP3GCC 5.4.0GCC 6.2.0GCC 7.0.0 201611273691215SE +/- 0.14, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 510.5310.5310.481. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed TimeGCC 5.4.0GCC 6.2.0GCC 7.0.0 2016112750100150200250SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 3.76, N = 325.6625.13214.131. (CC) gcc options: -static -fopenmp -O3 -march=native

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.0.1gRSA 4096-bit PerformanceGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611272004006008001000SE +/- 3.40, N = 3SE +/- 0.35, N = 3SE +/- 0.29, N = 3981.53981.10983.031. (CC) gcc options: -m64 -O3 -lssl -lcrypto -ldl

Caffe AlexNet

Build: CPU AlexNet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU AlexNetGCC 6.2.090K180K270K360K450KSE +/- 8860.52, N = 64373611. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Caffe AlexNet

Build: CPU Googlenet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CPU GooglenetGCC 6.2.0200K400K600K800K1000KSE +/- 9671.46, N = 39815491. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611272K4K6K8K10KSE +/- 29.99, N = 3SE +/- 41.91, N = 3SE +/- 31.36, N = 37958.567946.487988.081. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Single Thread - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Single Thread - Mode: Read WriteGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127140280420560700SE +/- 5.49, N = 3SE +/- 3.56, N = 3SE +/- 2.77, N = 3627.37622.56636.231. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Heavy Contention - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 9.4.3Scaling: Buffer Test - Test: Heavy Contention - Mode: Read WriteGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611272K4K6K8K10KSE +/- 77.01, N = 3SE +/- 48.41, N = 3SE +/- 31.43, N = 38761.908683.698677.261. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -pthread -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: GETGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127500K1000K1500K2000K2500KSE +/- 101610.92, N = 6SE +/- 43300.91, N = 3SE +/- 70859.43, N = 62063098.712152314.251906819.19-std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 3.0.1Test: SETGCC 5.4.0GCC 6.2.0GCC 7.0.0 20161127300K600K900K1200K1500KSE +/- 57176.48, N = 6SE +/- 74386.96, N = 6SE +/- 14086.60, N = 31576321.211412803.001338386.50-std=gnu99 -pipe -g3 -O3 -funroll-loops -march=native1. (CC) gcc options: -ggdb -rdynamic -lm -pthread -ldl

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.7Static Web Page ServingGCC 5.4.0GCC 6.2.0GCC 7.0.0 201611278K16K24K32K40KSE +/- 118.53, N = 3SE +/- 222.33, N = 3SE +/- 137.08, N = 337233.7137371.5037345.181. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native


Phoronix Test Suite v10.8.4