GCC 10 Compiler Optimization Benchmarks

Intel Cascade Lake compiler optimization benchmarks on GCC 10.

Configurations tested (hardware and software are identical for every configuration):

  -O0
  -O2
  -O3
  -O3 -march=native
  -O3 -march=native -flto
  -Ofast -march=native

System:

  Processor: Intel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads), Motherboard: ASRock X299 Steel Legend (P1.30 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 32GB, Disk: Samsung SSD 970 PRO 512GB, Graphics: NVIDIA NV132 11GB, Audio: Realtek ALC1220, Monitor: ASUS MG28U, Network: Intel I219-V + Intel I211

  OS: Ubuntu 20.04, Kernel: 5.4.0-29-generic (x86_64), Desktop: GNOME Shell 3.36.1, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.3 Mesa 20.0.4, Compiler: GCC 10.1.0, File-System: ext4, Screen Resolution: 3840x2160

LeelaChessZero 0.25
Backend: Random
Nodes Per Second > Higher Is Better
-O0 ..................... 18300 |=====
-O2 ..................... 157289 |===========================================
-O3 ..................... 142198 |=======================================
-O3 -march=native ....... 136608 |======================================
-O3 -march=native -flto . 163773 |=============================================
-Ofast -march=native .... 157629 |===========================================

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
-O0 ..................... 597.53 |=====
-O2 ..................... 4150.11 |====================================
-O3 ..................... 4015.86 |===================================
-O3 -march=native ....... 4771.42 |=========================================
-O3 -march=native -flto . 4774.03 |=========================================
-Ofast -march=native .... 5065.07 |============================================

Crypto++ 8.2
Test: Keyed Algorithms
MiB/second > Higher Is Better
-O0 ..................... 95.09 |======
-O2 ..................... 660.46 |========================================
-O3 ..................... 712.01 |===========================================
-O3 -march=native ....... 751.56 |=============================================
-O3 -march=native -flto . 699.80 |==========================================
-Ofast -march=native .... 751.01 |=============================================

FLAC Audio Encoding 1.3.2
WAV To FLAC
Seconds < Lower Is Better
-O0 ..................... 53.008 |=============================================
-O2 ..................... 9.331 |========
-O3 ..................... 9.368 |========
-O3 -march=native ....... 8.491 |=======
-O3 -march=native -flto . 8.355 |=======
-Ofast -march=native .... 8.335 |=======

C-Ray 1.1
Total Time - 4K, 16 Rays Per Pixel
Seconds < Lower Is Better
-O0 ..................... 113.58 |=============================================
-O2 ..................... 69.70 |============================
-O3 ..................... 38.00 |===============
-O3 -march=native ....... 30.46 |============
-O3 -march=native -flto . 30.24 |============
-Ofast -march=native .... 27.13 |===========

LeelaChessZero 0.25
Backend: Eigen
Nodes Per Second > Higher Is Better
-O0 ..................... 3 |
-O2 ..................... 924 |===============================================
-O3 ..................... 927 |===============================================
-O3 -march=native ....... 926 |===============================================
-O3 -march=native -flto . 948 |================================================
-Ofast -march=native .... 934 |===============================================

Crypto++ 8.2
Test: Unkeyed Algorithms
MiB/second > Higher Is Better
-O0 ..................... 102.25 |============
-O2 ..................... 356.58 |=========================================
-O3 ..................... 359.28 |=========================================
-O3 -march=native ....... 392.24 |=============================================
-O3 -march=native -flto . 390.59 |=============================================
-Ofast -march=native .... 394.30 |=============================================

Crypto++ 8.2
Test: Integer + Elliptic Curve Public Key Algorithms
MiB/second > Higher Is Better
-O0 ..................... 1659.38 |============
-O2 ..................... 5718.13 |===========================================
-O3 ..................... 5679.41 |==========================================
-O3 -march=native ....... 5809.05 |===========================================
-O3 -march=native -flto . 5915.79 |============================================
-Ofast -march=native .... 5906.81 |============================================

LAME MP3 Encoding 3.100
WAV To MP3
Seconds < Lower Is Better
-O0 ..................... 25.150 |=============================================
-O2 ..................... 11.198 |====================
-O3 ..................... 9.412 |=================
-O3 -march=native ....... 8.612 |===============
-O3 -march=native -flto . 8.528 |===============
-Ofast -march=native .... 8.656 |===============

PostgreSQL pgbench 12.0
Scaling: Buffer Test - Test: Normal Load - Mode: Read Only
TPS > Higher Is Better
-O0 ..................... 317104.97 |===========================
-O2 ..................... 483412.75 |=========================================
-O3 ..................... 481599.72 |=========================================
-O3 -march=native ....... 476778.15 |=========================================
-O3 -march=native -flto . 491896.16 |==========================================
-Ofast -march=native .... 482929.70 |=========================================

PostgreSQL pgbench 12.0
Scaling: Buffer Test - Test: Normal Load - Mode: Read Write
TPS > Higher Is Better
-O0 ..................... 9648.29 |=========================================
-O2 ..................... 9637.54 |=========================================
-O3 ..................... 9922.91 |==========================================
-O3 -march=native ....... 8029.08 |==================================
-O3 -march=native -flto . 10194.03 |===========================================
-Ofast -march=native .... 9468.63 |========================================

GraphicsMagick 1.3.33
Operation: Rotate
Iterations Per Minute > Higher Is Better
-O0 ..................... 727 |=========================================
-O2 ..................... 843 |===============================================
-O3 ..................... 794 |============================================
-O3 -march=native ....... 836 |===============================================
-O3 -march=native -flto . 857 |================================================
-Ofast -march=native .... 813 |==============================================

NGINX Benchmark 1.9.9
Static Web Page Serving
Requests Per Second > Higher Is Better
-O0 ..................... 49414.81 |=========================================
-O2 ..................... 51647.99 |==========================================
-O3 ..................... 51596.82 |==========================================
-O3 -march=native ....... 52346.17 |===========================================
-O3 -march=native -flto . 52047.28 |===========================================
-Ofast -march=native .... 51501.62 |==========================================

C-Blosc 2.0 Beta 5
Compressor: blosclz
MB/s > Higher Is Better
-O0 ..................... 8216.9 |=============================================
-O2 ..................... 8234.3 |=============================================
-O3 ..................... 8202.7 |=============================================
-O3 -march=native ....... 7778.0 |===========================================
-O3 -march=native -flto . 7829.8 |===========================================
-Ofast -march=native .... 8204.7 |=============================================

POV-Ray 3.7.0.7
Trace Time
Seconds < Lower Is Better
-O2 ..................... 29.00 |==============================================
-O3 ..................... 27.48 |============================================
-O3 -march=native ....... 27.50 |============================================
-O3 -march=native -flto . 28.88 |==============================================
-Ofast -march=native .... 27.73 |============================================

LAMMPS Molecular Dynamics Simulator 9Jan2020
Model: Rhodopsin Protein
ns/day > Higher Is Better
-O0 ..................... 11.84 |============================================
-O2 ..................... 11.90 |============================================
-O3 ..................... 11.89 |============================================
-O3 -march=native ....... 12.22 |=============================================
-O3 -march=native -flto . 12.43 |==============================================
-Ofast -march=native .... 12.23 |=============================================

YafaRay 3.4.1
Total Time For Sample Scene
Seconds < Lower Is Better
-O0 ..................... 104.65 |============================================
-O2 ..................... 101.47 |===========================================
-O3 ..................... 105.64 |============================================
-O3 -march=native ....... 101.58 |===========================================
-O3 -march=native -flto . 106.39 |=============================================
-Ofast -march=native .... 107.12 |=============================================
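
The result rows above mix "Higher Is Better" and "Lower Is Better" metrics, so raw values are not directly comparable across tests. As a rough sketch of one way to normalize them, the Python below parses rows in the format used here and expresses each configuration as a speedup over -O0. The names parse_rows and speedup_vs are hypothetical helpers written for this illustration, not part of any benchmarking tool, and the sample data is the C-Ray 1.1 block copied from the results above.

```python
import re

# Matches one result row, e.g. "-O3 -march=native ....... 30.46 |============"
# Groups: the compiler flags, then the numeric result before the bar.
ROW = re.compile(r"^(?P<flags>-\S.*?)\s\.+\s(?P<value>[\d.]+)\s*\|")


def parse_rows(block):
    """Return {flags: value} for every result row found in a text block."""
    results = {}
    for line in block.splitlines():
        match = ROW.match(line.strip())
        if match:
            results[match.group("flags")] = float(match.group("value"))
    return results


def speedup_vs(results, baseline, lower_is_better):
    """Express each result as a speedup factor relative to `baseline`.

    A factor above 1.0 means faster than the baseline, regardless of
    whether the underlying metric is a time or a throughput.
    """
    base = results[baseline]
    return {
        flags: (base / value if lower_is_better else value / base)
        for flags, value in results.items()
    }


# C-Ray 1.1 rows from the results above (Seconds < Lower Is Better).
C_RAY = """
-O0 ..................... 113.58 |=============================================
-O2 ..................... 69.70 |============================
-O3 ..................... 38.00 |===============
-O3 -march=native ....... 30.46 |============
-O3 -march=native -flto . 30.24 |============
-Ofast -march=native .... 27.13 |===========
"""

c_ray_results = parse_rows(C_RAY)
c_ray_speedups = speedup_vs(c_ray_results, "-O0", lower_is_better=True)

for flags, factor in c_ray_speedups.items():
    print(f"{flags:<26} {factor:.2f}x")
```

For a "Higher Is Better" block such as Himeno or pgbench, the same helper would be called with lower_is_better=False. POV-Ray has no -O0 row in these results, so a different baseline (for example -O2) would have to be passed for that test.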