GCC 10 Compiler Optimization Benchmarks

Intel Cascade Lake compiler optimization benchmarks on GCC 10.

HTML result view exported from: https://openbenchmarking.org/result/2005237-PTS-COMPS13296&grr&rdt.

System configuration (runs compared: Baseline, PGO)

  Processor:          Intel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads)
  Motherboard:        ASRock X299 Steel Legend (P1.30 BIOS)
  Chipset:            Intel Sky Lake-E DMI3 Registers
  Memory:             32GB
  Disk:               Samsung SSD 970 PRO 512GB
  Graphics:           NVIDIA NV132 11GB
  Audio:              Realtek ALC1220
  Monitor:            ASUS MG28U
  Network:            Intel I219-V + Intel I211
  OS:                 Ubuntu 20.04
  Kernel:             5.4.0-29-generic (x86_64)
  Desktop:            GNOME Shell 3.36.1
  Display Server:     X Server 1.20.8
  Display Driver:     modesetting 1.20.8
  OpenGL:             4.3 Mesa 20.0.4
  Compiler:           GCC 10.1.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Compiler Details
  - --disable-multilib --enable-checking=release

Processor Details
  - Scaling Governor: intel_pstate powersave
  - CPU Microcode: 0x500012c

Python Details
  - Python 3.8.2

Security Details
  - itlb_multihit: KVM: Mitigation of Split huge pages
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  - spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  - spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  - tsx_async_abort: Mitigation of TSX disabled

Results overview

  Test                                                      Baseline         PGO
  mysqlslap: 32                                             483              494
  cryptopp: Keyed Algorithms                                709.732181       711.799869
  lczero: Eigen                                             923              932
  lczero: Rand                                              140468           142358
  fftw: Float + SSE - 2D FFT Size 4096                      18968            19285
  cryptopp: Integer + Elliptic Curve Public Key Algorithms  5646.590874      5691.239547
  mysqlslap: 1                                              1054             1056
  leveldb: Seq Fill                                         337.731          337.839
  leveldb: Seq Fill                                         11.8             11.8
  yafaray: Total Time For Sample Scene                      111.722          103.524
  dav1d: Chimera 1080p 10-bit                               98.26            97.43
  gromacs: Water Benchmark                                  1.572            1.572
  rocksdb: Rand Fill                                        1361002          1405311
  pgbench: Buffer Test - Normal Load - Read Only            483158.104898    484081.121701
  pgbench: Buffer Test - Normal Load - Read Write           9701.427214      9684.569063
  cryptopp: Unkeyed Algorithms                              357.553630       359.417000
  stockfish: Total Time                                     51086797         51588693
  rocksdb: Rand Read                                        102124258        101999919
  graphics-magick: Rotate                                   794              822
  compress-zstd: 19                                         60.3             60.3
  himeno: Poisson Pressure Solver                           4006.143617      4021.250600
  leveldb: Rand Read                                        27.420           27.241
  basis: ETC1S                                              45.097           44.875
  nginx: Static Web Page Serving                            51257.90         51875.82
  basis: UASTC Level 3                                      37.308           37.216
  leveldb: Seek Rand                                        33.527           32.987
  aom-av1: Speed 6 Realtime                                 19.08            19.18
  povray: Trace Time                                        27.540           27.602
  compress-zstd: 3                                          4661.7           4671.8
  aom-av1: Speed 6 Two-Pass                                 3.69             3.70
  leveldb: Hot Read                                         27.707           27.609
  tungsten: Water Caustic                                   21.1061          21.0440
  dav1d: Chimera 1080p                                      611.80           601.36
  aom-av1: Speed 8 Realtime                                 34.38            34.23
  dav1d: Summer Nature 4K                                   227.36           228.27
  tungsten: Hair                                            14.4868          14.4526
  tungsten: Volumetric Caustic                              7.33237          7.31794
  dav1d: Summer Nature 1080p                                559.10           554.23
  tungsten: Non-Exponential                                 6.15057          6.05293
  tjbench: Decompression Throughput                         216.004283       217.441236
  lammps: Rhodopsin Protein                                 11.647           11.859
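The overview figures are easier to compare as relative changes, with the sign flipped for metrics where fewer is better. The following is a minimal sketch and not part of the exported result: the helper name `pgo_gain_percent` and the choice of three sample rows are assumptions made for illustration, while the numbers and the more/fewer-is-better directions are copied from the per-test results in this file.

```python
def pgo_gain_percent(baseline, pgo, higher_is_better=True):
    """Return the PGO improvement over Baseline in percent.

    A positive value means the PGO build did better, regardless of
    whether the metric is "more is better" or "fewer is better".
    """
    if higher_is_better:
        return (pgo - baseline) / baseline * 100.0
    return (baseline - pgo) / baseline * 100.0


# (test name, Baseline, PGO, higher_is_better), values taken from this file
samples = [
    ("mysqlslap: 32", 483, 494, True),                                # Queries Per Second
    ("yafaray: Total Time For Sample Scene", 111.72, 103.52, False),  # Seconds
    ("dav1d: Chimera 1080p", 611.80, 601.36, True),                   # FPS
]

for name, baseline, pgo, higher_is_better in samples:
    gain = pgo_gain_percent(baseline, pgo, higher_is_better)
    print(f"{name}: {gain:+.2f}%")
```

A positive percentage favours the PGO build and a negative one favours the Baseline build; small values should be read together with the standard errors listed for each test.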

MariaDB

Clients: 32

MariaDB 10.5.2. Unit: Queries Per Second (More Is Better).
  Baseline: 483 (SE +/- 0.27, N = 3)
  PGO:      494 (SE +/- 0.89, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

Crypto++

Test: Keyed Algorithms

Crypto++ 8.2. Unit: MiB/second (More Is Better).
  Baseline: 709.73 (SE +/- 0.02, N = 3)
  PGO:      711.80 (SE +/- 0.18, N = 3)
(CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.25. Unit: Nodes Per Second (More Is Better).
  Baseline: 923 (SE +/- 8.39, N = 3)
  PGO:      932 (SE +/- 10.26, N = 3)
(CXX) g++ options: -pthread

LeelaChessZero

Backend: Random

LeelaChessZero 0.25. Unit: Nodes Per Second (More Is Better).
  Baseline: 140468 (SE +/- 137.29, N = 3)
  PGO:      142358 (SE +/- 310.95, N = 3)
(CXX) g++ options: -pthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6. Unit: Mflops (More Is Better).
  Baseline: 18968 (SE +/- 251.56, N = 3)
  PGO:      19285 (SE +/- 87.35, N = 3)
(CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

Crypto++ 8.2. Unit: MiB/second (More Is Better).
  Baseline: 5646.59 (SE +/- 22.82, N = 3)
  PGO:      5691.24 (SE +/- 5.24, N = 3)
(CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

MariaDB

Clients: 1

MariaDB 10.5.2. Unit: Queries Per Second (More Is Better).
  Baseline: 1054 (SE +/- 9.41, N = 3)
  PGO:      1056 (SE +/- 6.85, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

LevelDB

Benchmark: Sequential Fill

LevelDB 1.22. Unit: Microseconds Per Op (Fewer Is Better).
  Baseline: 337.73 (SE +/- 0.47, N = 3)
  PGO:      337.84 (SE +/- 0.15, N = 3)
(CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Sequential Fill

LevelDB 1.22. Unit: MB/s (More Is Better).
  Baseline: 11.8 (SE +/- 0.00, N = 3)
  PGO:      11.8 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -lsnappy -lpthread

YafaRay

Total Time For Sample Scene

YafaRay 3.4.1. Unit: Seconds (Fewer Is Better).
  Baseline: 111.72 (SE +/- 1.41, N = 5)
  PGO:      103.52 (SE +/- 1.40, N = 4)
(CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
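YafaRay shows one of the larger gaps between the two runs, which makes it a convenient case for checking whether a difference stands out from run-to-run noise. The sketch below is a rough illustration only: it assumes the two runs are independent and that the reported SE values can be combined in quadrature, which is a common rule of thumb rather than anything the exported result states. The helper name `difference_in_se_units` is hypothetical.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Express |mean_a - mean_b| in units of the combined standard error.

    Assumption: the two runs are independent, so their standard errors
    add in quadrature.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# YafaRay values from this file: Baseline 111.72 (SE 1.41), PGO 103.52 (SE 1.40)
ratio = difference_in_se_units(111.72, 1.41, 103.52, 1.40)
print(f"YafaRay difference: {ratio:.1f} combined standard errors")
```

The same helper can be applied to any other Baseline/PGO pair in this file; a ratio near or below 1 suggests the two results are hard to tell apart given the reported variation.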

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.7.0. Unit: FPS (More Is Better).
  Baseline: 98.26 (SE +/- 0.27, N = 3; MIN: 67.46 / MAX: 205.81)
  PGO:      97.43 (SE +/- 0.48, N = 3; MIN: 67.27 / MAX: 198.49)
(CC) gcc options: -pthread

GROMACS

Water Benchmark

GROMACS 2020.1. Unit: Ns Per Day (More Is Better).
  Baseline: 1.572 (SE +/- 0.001, N = 3)
  PGO:      1.572 (SE +/- 0.001, N = 3)
(CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Facebook RocksDB

Test: Random Fill

Facebook RocksDB 6.3.6. Unit: Op/s (More Is Better).
  Baseline: 1361002 (SE +/- 16050.51, N = 5)
  PGO:      1405311 (SE +/- 10617.89, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

PostgreSQL pgbench 12.0. Unit: TPS (More Is Better).
  Baseline: 483158.10 (SE +/- 610.63, N = 3)
  PGO:      484081.12 (SE +/- 510.02, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

PostgreSQL pgbench 12.0. Unit: TPS (More Is Better).
  Baseline: 9701.43 (SE +/- 53.12, N = 3)
  PGO:      9684.57 (SE +/- 101.31, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2. Unit: MiB/second (More Is Better).
  Baseline: 357.55 (SE +/- 0.32, N = 3)
  PGO:      359.42 (SE +/- 0.17, N = 3)
(CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Stockfish

Total Time

Stockfish 9. Unit: Nodes Per Second (More Is Better).
  Baseline: 51086797 (SE +/- 274511.80, N = 3)
  PGO:      51588693 (SE +/- 511603.78, N = 3)
(CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Facebook RocksDB

Test: Random Read

Facebook RocksDB 6.3.6. Unit: Op/s (More Is Better).
  Baseline: 102124258 (SE +/- 62784.49, N = 3)
  PGO:      101999919 (SE +/- 129511.98, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33. Unit: Iterations Per Minute (More Is Better).
  Baseline: 794 (SE +/- 12.50, N = 3)
  PGO:      822 (SE +/- 9.26, N = 3)
(CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Zstd Compression

Compression Level: 19

Zstd Compression 1.4.5. Unit: MB/s (More Is Better).
  Baseline: 60.3 (SE +/- 0.07, N = 3)
  PGO:      60.3 (SE +/- 0.21, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

Himeno Benchmark

Poisson Pressure Solver

Himeno Benchmark 3.0. Unit: MFLOPS (More Is Better).
  Baseline: 4006.14 (SE +/- 10.20, N = 3)
  PGO:      4021.25 (SE +/- 5.54, N = 3)
(CC) gcc options: -O3 -mavx2

LevelDB

Benchmark: Random Read

LevelDB 1.22. Unit: Microseconds Per Op (Fewer Is Better).
  Baseline: 27.42 (SE +/- 0.31, N = 6)
  PGO:      27.24 (SE +/- 0.36, N = 4)
(CXX) g++ options: -O3 -lsnappy -lpthread

Basis Universal

Settings: ETC1S

Basis Universal 1.12. Unit: Seconds (Fewer Is Better).
  Baseline: 45.10 (SE +/- 0.02, N = 3)
  PGO:      44.88 (SE +/- 0.07, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NGINX Benchmark

Static Web Page Serving

NGINX Benchmark 1.9.9. Unit: Requests Per Second (More Is Better).
  Baseline: 51257.90 (SE +/- 110.28, N = 3)
  PGO:      51875.82 (SE +/- 210.12, N = 3)
(CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.12. Unit: Seconds (Fewer Is Better).
  Baseline: 37.31 (SE +/- 0.01, N = 3)
  PGO:      37.22 (SE +/- 0.01, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LevelDB

Benchmark: Seek Random

LevelDB 1.22. Unit: Microseconds Per Op (Fewer Is Better).
  Baseline: 33.53 (SE +/- 0.18, N = 3)
  PGO:      32.99 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -lsnappy -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

AOM AV1 2.0. Unit: Frames Per Second (More Is Better).
  Baseline: 19.08 (SE +/- 0.01, N = 3)
  PGO:      19.18 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

POV-Ray

Trace Time

POV-Ray 3.7.0.7. Unit: Seconds (Fewer Is Better).
  Baseline: 27.54 (SE +/- 0.05, N = 3)
  PGO:      27.60 (SE +/- 0.11, N = 3)
(CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Zstd Compression

Compression Level: 3

Zstd Compression 1.4.5. Unit: MB/s (More Is Better).
  Baseline: 4661.7 (SE +/- 24.65, N = 3)
  PGO:      4671.8 (SE +/- 34.24, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 6 Two-Pass

AOM AV1 2.0. Unit: Frames Per Second (More Is Better).
  Baseline: 3.69 (SE +/- 0.00, N = 3)
  PGO:      3.70 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LevelDB

Benchmark: Hot Read

LevelDB 1.22. Unit: Microseconds Per Op (Fewer Is Better).
  Baseline: 27.71 (SE +/- 0.16, N = 3)
  PGO:      27.61 (SE +/- 0.08, N = 3)
(CXX) g++ options: -O3 -lsnappy -lpthread

Tungsten Renderer

Scene: Water Caustic

Tungsten Renderer 0.2.2. Unit: Seconds (Fewer Is Better).
  Baseline: 21.11 (SE +/- 0.06, N = 3)
  PGO:      21.04 (SE +/- 0.07, N = 3)
(CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

dav1d

Video Input: Chimera 1080p

dav1d 0.7.0. Unit: FPS (More Is Better).
  Baseline: 611.80 (SE +/- 0.27, N = 3; MIN: 471.5 / MAX: 752.98)
  PGO:      601.36 (SE +/- 6.52, N = 3; MIN: 404.59 / MAX: 750.79)
(CC) gcc options: -pthread

AOM AV1

Encoder Mode: Speed 8 Realtime

AOM AV1 2.0. Unit: Frames Per Second (More Is Better).
  Baseline: 34.38 (SE +/- 0.08, N = 3)
  PGO:      34.23 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Video Input: Summer Nature 4K

dav1d 0.7.0. Unit: FPS (More Is Better).
  Baseline: 227.36 (SE +/- 0.28, N = 3; MIN: 170.37 / MAX: 247.69)
  PGO:      228.27 (SE +/- 0.49, N = 3; MIN: 177.26 / MAX: 251.18)
(CC) gcc options: -pthread

Tungsten Renderer

Scene: Hair

Tungsten Renderer 0.2.2. Unit: Seconds (Fewer Is Better).
  Baseline: 14.49 (SE +/- 0.00, N = 3)
  PGO:      14.45 (SE +/- 0.02, N = 3)
(CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer

Scene: Volumetric Caustic

Tungsten Renderer 0.2.2. Unit: Seconds (Fewer Is Better).
  Baseline: 7.33237 (SE +/- 0.06704, N = 3)
  PGO:      7.31794 (SE +/- 0.02011, N = 3)
(CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

dav1d

Video Input: Summer Nature 1080p

dav1d 0.7.0. Unit: FPS (More Is Better).
  Baseline: 559.10 (SE +/- 0.59, N = 3; MIN: 375.87 / MAX: 606.19)
  PGO:      554.23 (SE +/- 3.23, N = 3; MIN: 356.54 / MAX: 606.99)
(CC) gcc options: -pthread

Tungsten Renderer

Scene: Non-Exponential

Tungsten Renderer 0.2.2. Unit: Seconds (Fewer Is Better).
  Baseline: 6.15057 (SE +/- 0.07592, N = 3)
  PGO:      6.05293 (SE +/- 0.05126, N = 3)
(CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.0.2. Unit: Megapixels/sec (More Is Better).
  Baseline: 216.00 (SE +/- 0.50, N = 3)
  PGO:      217.44 (SE +/- 0.21, N = 3)
(CC) gcc options: -O3 -rdynamic

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 9Jan2020. Unit: ns/day (More Is Better).
  Baseline: 11.65 (SE +/- 0.14, N = 3)
  PGO:      11.86 (SE +/- 0.07, N = 3)
(CXX) g++ options: -O3 -rdynamic -ljpeg -lpng -lz -lfftw3 -lm


Phoronix Test Suite v10.8.5