GCC 10 Compiler Optimization Benchmarks

Intel Cascade Lake compiler optimization benchmarks on GCC 10.

HTML result view exported from: https://openbenchmarking.org/result/2005237-PTS-COMPS13296&sor&grr.

GCC 10 Compiler Optimization BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionBaselinePGOIntel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads)ASRock X299 Steel Legend (P1.30 BIOS)Intel Sky Lake-E DMI3 Registers32GBSamsung SSD 970 PRO 512GBNVIDIA NV132 11GBRealtek ALC1220ASUS MG28UIntel I219-V + Intel I211Ubuntu 20.045.4.0-29-generic (x86_64)GNOME Shell 3.36.1X Server 1.20.8modesetting 1.20.84.3 Mesa 20.0.4GCC 10.1.0ext43840x2160OpenBenchmarking.orgCompiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500012cPython Details- Python 3.8.2Security Details- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Mitigation of TSX disabled

GCC 10 Compiler Optimization Benchmarksmysqlslap: 32cryptopp: Keyed Algorithmslczero: Eigenlczero: Randfftw: Float + SSE - 2D FFT Size 4096cryptopp: Integer + Elliptic Curve Public Key Algorithmsmysqlslap: 1leveldb: Seq Fillleveldb: Seq Fillyafaray: Total Time For Sample Scenedav1d: Chimera 1080p 10-bitgromacs: Water Benchmarkrocksdb: Rand Fillpgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writecryptopp: Unkeyed Algorithmsstockfish: Total Timerocksdb: Rand Readgraphics-magick: Rotatecompress-zstd: 19himeno: Poisson Pressure Solverleveldb: Rand Readbasis: ETC1Snginx: Static Web Page Servingbasis: UASTC Level 3leveldb: Seek Randaom-av1: Speed 6 Realtimepovray: Trace Timecompress-zstd: 3aom-av1: Speed 6 Two-Passleveldb: Hot Readtungsten: Water Causticdav1d: Chimera 1080paom-av1: Speed 8 Realtimedav1d: Summer Nature 4Ktungsten: Hairtungsten: Volumetric Causticdav1d: Summer Nature 1080ptungsten: Non-Exponentialtjbench: Decompression Throughputlammps: Rhodopsin ProteinBaselinePGO483709.732181923140468189685646.5908741054337.73111.8111.72298.261.5721361002483158.1048989701.427214357.5536305108679710212425879460.34006.14361727.42045.09751257.9037.30833.52719.0827.5404661.73.6927.70721.1061611.8034.38227.3614.48687.33237559.106.15057216.00428311.647494711.799869932142358192855691.2395471056337.83911.8103.52497.431.5721405311484081.1217019684.569063359.4170005158869310199991982260.34021.25060027.24144.87551875.8237.21632.98719.1827.6024671.83.7027.60921.0440601.3634.23228.2714.45267.31794554.236.05293217.44123611.859OpenBenchmarking.org

MariaDB

Clients: 32

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 32PGOBaseline110220330440550SE +/- 0.89, N = 3SE +/- 0.27, N = 34944831. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsPGOBaseline150300450600750SE +/- 0.18, N = 3SE +/- 0.02, N = 3711.80709.731. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.25Backend: EigenPGOBaseline2004006008001000SE +/- 10.26, N = 3SE +/- 8.39, N = 39329231. (CXX) g++ options: -pthread

LeelaChessZero

Backend: Random

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.25Backend: RandomPGOBaseline30K60K90K120K150KSE +/- 310.95, N = 3SE +/- 137.29, N = 31423581404681. (CXX) g++ options: -pthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096PGOBaseline4K8K12K16K20KSE +/- 87.35, N = 3SE +/- 251.56, N = 319285189681. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key AlgorithmsPGOBaseline12002400360048006000SE +/- 5.24, N = 3SE +/- 22.82, N = 35691.245646.591. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

MariaDB

Clients: 1

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1PGOBaseline2004006008001000SE +/- 6.85, N = 3SE +/- 9.41, N = 3105610541. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential FillBaselinePGO70140210280350SE +/- 0.47, N = 3SE +/- 0.15, N = 3337.73337.841. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential FillPGOBaseline3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 311.811.81. (CXX) g++ options: -O3 -lsnappy -lpthread

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample ScenePGOBaseline306090120150SE +/- 1.40, N = 4SE +/- 1.41, N = 5103.52111.721. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bitBaselinePGO20406080100SE +/- 0.27, N = 3SE +/- 0.48, N = 398.2697.43MIN: 67.46 / MAX: 205.81MIN: 67.27 / MAX: 198.491. (CC) gcc options: -pthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.1Water BenchmarkPGOBaseline0.35370.70741.06111.41481.7685SE +/- 0.001, N = 3SE +/- 0.001, N = 31.5721.5721. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Facebook RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random FillPGOBaseline300K600K900K1200K1500KSE +/- 10617.89, N = 3SE +/- 16050.51, N = 5140531113610021. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyPGOBaseline100K200K300K400K500KSE +/- 510.02, N = 3SE +/- 610.63, N = 3484081.12483158.101. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteBaselinePGO2K4K6K8K10KSE +/- 53.12, N = 3SE +/- 101.31, N = 39701.439684.571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsPGOBaseline80160240320400SE +/- 0.17, N = 3SE +/- 0.32, N = 3359.42357.551. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimePGOBaseline11M22M33M44M55MSE +/- 511603.78, N = 3SE +/- 274511.80, N = 351588693510867971. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random ReadBaselinePGO20M40M60M80M100MSE +/- 62784.49, N = 3SE +/- 129511.98, N = 31021242581019999191. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotatePGOBaseline2004006008001000SE +/- 9.26, N = 3SE +/- 12.50, N = 38227941. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 19PGOBaseline1326395265SE +/- 0.21, N = 3SE +/- 0.07, N = 360.360.31. (CC) gcc options: -O3 -pthread -lz -llzma

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverPGOBaseline9001800270036004500SE +/- 5.54, N = 3SE +/- 10.20, N = 34021.254006.141. (CC) gcc options: -O3 -mavx2

LevelDB

Benchmark: Random Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random ReadPGOBaseline612182430SE +/- 0.36, N = 4SE +/- 0.31, N = 627.2427.421. (CXX) g++ options: -O3 -lsnappy -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1SPGOBaseline1020304050SE +/- 0.07, N = 3SE +/- 0.02, N = 344.8845.101. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingPGOBaseline11K22K33K44K55KSE +/- 210.12, N = 3SE +/- 110.28, N = 351875.8251257.901. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3PGOBaseline918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 337.2237.311. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LevelDB

Benchmark: Seek Random

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek RandomPGOBaseline816243240SE +/- 0.14, N = 3SE +/- 0.18, N = 332.9933.531. (CXX) g++ options: -O3 -lsnappy -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 RealtimePGOBaseline510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 319.1819.081. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeBaselinePGO612182430SE +/- 0.05, N = 3SE +/- 0.11, N = 327.5427.601. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3PGOBaseline10002000300040005000SE +/- 34.24, N = 3SE +/- 24.65, N = 34671.84661.71. (CC) gcc options: -O3 -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-PassPGOBaseline0.83251.6652.49753.334.1625SE +/- 0.01, N = 3SE +/- 0.00, N = 33.703.691. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LevelDB

Benchmark: Hot Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot ReadPGOBaseline714212835SE +/- 0.08, N = 3SE +/- 0.16, N = 327.6127.711. (CXX) g++ options: -O3 -lsnappy -lpthread

Tungsten Renderer

Scene: Water Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticPGOBaseline510152025SE +/- 0.07, N = 3SE +/- 0.06, N = 321.0421.111. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080pBaselinePGO130260390520650SE +/- 0.27, N = 3SE +/- 6.52, N = 3611.80601.36MIN: 471.5 / MAX: 752.98MIN: 404.59 / MAX: 750.791. (CC) gcc options: -pthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 RealtimeBaselinePGO816243240SE +/- 0.08, N = 3SE +/- 0.05, N = 334.3834.231. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 4KPGOBaseline50100150200250SE +/- 0.49, N = 3SE +/- 0.28, N = 3228.27227.36MIN: 177.26 / MAX: 251.18MIN: 170.37 / MAX: 247.691. (CC) gcc options: -pthread

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairPGOBaseline48121620SE +/- 0.02, N = 3SE +/- 0.00, N = 314.4514.491. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer

Scene: Volumetric Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticPGOBaseline246810SE +/- 0.02011, N = 3SE +/- 0.06704, N = 37.317947.332371. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080pBaselinePGO120240360480600SE +/- 0.59, N = 3SE +/- 3.23, N = 3559.10554.23MIN: 375.87 / MAX: 606.19MIN: 356.54 / MAX: 606.991. (CC) gcc options: -pthread

Tungsten Renderer

Scene: Non-Exponential

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialPGOBaseline246810SE +/- 0.05126, N = 3SE +/- 0.07592, N = 36.052936.150571. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputPGOBaseline50100150200250SE +/- 0.21, N = 3SE +/- 0.50, N = 3217.44216.001. (CC) gcc options: -O3 -rdynamic

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 9Jan2020Model: Rhodopsin ProteinPGOBaseline3691215SE +/- 0.07, N = 3SE +/- 0.14, N = 311.8611.651. (CXX) g++ options: -O3 -rdynamic -ljpeg -lpng -lz -lfftw3 -lm


Phoronix Test Suite v10.8.5