GCC 10 Compiler Optimization Benchmarks

Intel Cascade Lake compiler optimization benchmarks on GCC 10.

HTML result view exported from: https://openbenchmarking.org/result/2005237-PTS-COMPS13296&sor&gru.

GCC 10 Compiler Optimization BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionBaselinePGOIntel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads)ASRock X299 Steel Legend (P1.30 BIOS)Intel Sky Lake-E DMI3 Registers32GBSamsung SSD 970 PRO 512GBNVIDIA NV132 11GBRealtek ALC1220ASUS MG28UIntel I219-V + Intel I211Ubuntu 20.045.4.0-29-generic (x86_64)GNOME Shell 3.36.1X Server 1.20.8modesetting 1.20.84.3 Mesa 20.0.4GCC 10.1.0ext43840x2160OpenBenchmarking.orgCompiler Details- --disable-multilib --enable-checking=releaseProcessor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500012cPython Details- Python 3.8.2Security Details- itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + tsx_async_abort: Mitigation of TSX disabled

GCC 10 Compiler Optimization Benchmarksdav1d: Summer Nature 1080pdav1d: Summer Nature 4Kdav1d: Chimera 1080pdav1d: Chimera 1080p 10-bitaom-av1: Speed 8 Realtimeaom-av1: Speed 6 Realtimeaom-av1: Speed 6 Two-Passgraphics-magick: Rotatecompress-zstd: 3compress-zstd: 19leveldb: Seq Filltjbench: Decompression Throughputhimeno: Poisson Pressure Solverfftw: Float + SSE - 2D FFT Size 4096cryptopp: Keyed Algorithmscryptopp: Unkeyed Algorithmscryptopp: Integer + Elliptic Curve Public Key Algorithmslczero: Eigenlczero: Randstockfish: Total Timegromacs: Water Benchmarklammps: Rhodopsin Proteinrocksdb: Rand Fillrocksdb: Rand Readmysqlslap: 1mysqlslap: 32nginx: Static Web Page Servingpgbench: Buffer Test - Normal Load - Read Onlypgbench: Buffer Test - Normal Load - Read Writeleveldb: Seq Fillleveldb: Rand Readleveldb: Hot Readleveldb: Seek Randpovray: Trace Timeyafaray: Total Time For Sample Scenetungsten: Hairtungsten: Water Caustictungsten: Non-Exponentialtungsten: Volumetric Causticbasis: ETC1Sbasis: UASTC Level 3BaselinePGO559.10227.36611.8098.2634.3819.083.697944661.760.311.8216.0042834006.14361718968709.732181357.5536305646.590874923140468510867971.57211.6471361002102124258105448351257.90483158.1048989701.427214337.73127.42027.70733.52727.540111.72214.486821.10616.150577.3323745.09737.308554.23228.27601.3697.4334.2319.183.708224671.860.311.8217.4412364021.25060019285711.799869359.4170005691.239547932142358515886931.57211.8591405311101999919105649451875.82484081.1217019684.569063337.83927.24127.60932.98727.602103.52414.452621.04406.052937.3179444.87537.216OpenBenchmarking.org

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 1080pBaselinePGO120240360480600SE +/- 0.59, N = 3SE +/- 3.23, N = 3559.10554.23MIN: 375.87 / MAX: 606.19MIN: 356.54 / MAX: 606.991. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Summer Nature 4KPGOBaseline50100150200250SE +/- 0.49, N = 3SE +/- 0.28, N = 3228.27227.36MIN: 177.26 / MAX: 251.18MIN: 170.37 / MAX: 247.691. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080pBaselinePGO130260390520650SE +/- 0.27, N = 3SE +/- 6.52, N = 3611.80601.36MIN: 471.5 / MAX: 752.98MIN: 404.59 / MAX: 750.791. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.7.0Video Input: Chimera 1080p 10-bitBaselinePGO20406080100SE +/- 0.27, N = 3SE +/- 0.48, N = 398.2697.43MIN: 67.46 / MAX: 205.81MIN: 67.27 / MAX: 198.491. (CC) gcc options: -pthread

AOM AV1

Encoder Mode: Speed 8 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 8 RealtimeBaselinePGO816243240SE +/- 0.08, N = 3SE +/- 0.05, N = 334.3834.231. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 RealtimePGOBaseline510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 319.1819.081. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 2.0Encoder Mode: Speed 6 Two-PassPGOBaseline0.83251.6652.49753.334.1625SE +/- 0.01, N = 3SE +/- 0.00, N = 33.703.691. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotatePGOBaseline2004006008001000SE +/- 9.26, N = 3SE +/- 12.50, N = 38227941. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3PGOBaseline10002000300040005000SE +/- 34.24, N = 3SE +/- 24.65, N = 34671.84661.71. (CC) gcc options: -O3 -pthread -lz -llzma

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 19PGOBaseline1326395265SE +/- 0.21, N = 3SE +/- 0.07, N = 360.360.31. (CC) gcc options: -O3 -pthread -lz -llzma

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential FillPGOBaseline3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 311.811.81. (CXX) g++ options: -O3 -lsnappy -lpthread

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputPGOBaseline50100150200250SE +/- 0.21, N = 3SE +/- 0.50, N = 3217.44216.001. (CC) gcc options: -O3 -rdynamic

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverPGOBaseline9001800270036004500SE +/- 5.54, N = 3SE +/- 10.20, N = 34021.254006.141. (CC) gcc options: -O3 -mavx2

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096PGOBaseline4K8K12K16K20KSE +/- 87.35, N = 3SE +/- 251.56, N = 319285189681. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed AlgorithmsPGOBaseline150300450600750SE +/- 0.18, N = 3SE +/- 0.02, N = 3711.80709.731. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed AlgorithmsPGOBaseline80160240320400SE +/- 0.17, N = 3SE +/- 0.32, N = 3359.42357.551. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key AlgorithmsPGOBaseline12002400360048006000SE +/- 5.24, N = 3SE +/- 22.82, N = 35691.245646.591. (CXX) g++ options: -g2 -O3 -fPIC -pthread -pipe

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.25Backend: EigenPGOBaseline2004006008001000SE +/- 10.26, N = 3SE +/- 8.39, N = 39329231. (CXX) g++ options: -pthread

LeelaChessZero

Backend: Random

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.25Backend: RandomPGOBaseline30K60K90K120K150KSE +/- 310.95, N = 3SE +/- 137.29, N = 31423581404681. (CXX) g++ options: -pthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimePGOBaseline11M22M33M44M55MSE +/- 511603.78, N = 3SE +/- 274511.80, N = 351588693510867971. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.1Water BenchmarkPGOBaseline0.35370.70741.06111.41481.7685SE +/- 0.001, N = 3SE +/- 0.001, N = 31.5721.5721. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 9Jan2020Model: Rhodopsin ProteinPGOBaseline3691215SE +/- 0.07, N = 3SE +/- 0.14, N = 311.8611.651. (CXX) g++ options: -O3 -rdynamic -ljpeg -lpng -lz -lfftw3 -lm

Facebook RocksDB

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random FillPGOBaseline300K600K900K1200K1500KSE +/- 10617.89, N = 3SE +/- 16050.51, N = 5140531113610021. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.3.6Test: Random ReadBaselinePGO20M40M60M80M100MSE +/- 62784.49, N = 3SE +/- 129511.98, N = 31021242581019999191. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

MariaDB

Clients: 1

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1PGOBaseline2004006008001000SE +/- 6.85, N = 3SE +/- 9.41, N = 3105610541. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

MariaDB

Clients: 32

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 32PGOBaseline110220330440550SE +/- 0.89, N = 3SE +/- 0.27, N = 34944831. (CXX) g++ options: -pie -fPIC -fstack-protector -O2 -lpthread -llzma -lbz2 -lsnappy -laio -lnuma -lcrypt -lz -lm -lssl -lcrypto -ldl

NGINX Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterNGINX Benchmark 1.9.9Static Web Page ServingPGOBaseline11K22K33K44K55KSE +/- 210.12, N = 3SE +/- 110.28, N = 351875.8251257.901. (CC) gcc options: -lpthread -lcrypt -lcrypto -lz -O3 -march=native

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read OnlyPGOBaseline100K200K300K400K500KSE +/- 510.02, N = 3SE +/- 610.63, N = 3484081.12483158.101. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 12.0Scaling: Buffer Test - Test: Normal Load - Mode: Read WriteBaselinePGO2K4K6K8K10KSE +/- 53.12, N = 3SE +/- 101.31, N = 39701.439684.571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential FillBaselinePGO70140210280350SE +/- 0.47, N = 3SE +/- 0.15, N = 3337.73337.841. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Random Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random ReadPGOBaseline612182430SE +/- 0.36, N = 4SE +/- 0.31, N = 627.2427.421. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Hot Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot ReadPGOBaseline714212835SE +/- 0.08, N = 3SE +/- 0.16, N = 327.6127.711. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Seek Random

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek RandomPGOBaseline816243240SE +/- 0.14, N = 3SE +/- 0.18, N = 332.9933.531. (CXX) g++ options: -O3 -lsnappy -lpthread

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeBaselinePGO612182430SE +/- 0.05, N = 3SE +/- 0.11, N = 327.5427.601. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample ScenePGOBaseline306090120150SE +/- 1.40, N = 4SE +/- 1.41, N = 5103.52111.721. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: HairPGOBaseline48121620SE +/- 0.02, N = 3SE +/- 0.00, N = 314.4514.491. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer

Scene: Water Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Water CausticPGOBaseline510152025SE +/- 0.07, N = 3SE +/- 0.06, N = 321.0421.111. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer

Scene: Non-Exponential

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Non-ExponentialPGOBaseline246810SE +/- 0.05126, N = 3SE +/- 0.07592, N = 36.052936.150571. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Tungsten Renderer

Scene: Volumetric Caustic

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Volumetric CausticPGOBaseline246810SE +/- 0.02011, N = 3SE +/- 0.06704, N = 37.317947.332371. (CXX) g++ options: -std=c++0x -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mfma -mbmi2 -mavx512f -mavx512vl -mavx512cd -mavx512dq -mavx512bw -mno-sse4a -mno-avx -mno-avx2 -mno-xop -mno-fma4 -mno-avx512pf -mno-avx512er -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1SPGOBaseline1020304050SE +/- 0.07, N = 3SE +/- 0.02, N = 344.8845.101. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3PGOBaseline918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 337.2237.311. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread


Phoronix Test Suite v10.8.5