GCC Znver3 Compiler Optimization Levels

Tests for a future article.

-O3 -march=znver3, -O3 -march=znver2, -O3 -march=x86-64, -O3 -march=znver3 -flto, -Ofast -march=znver3 -flto, -O2 -march=znver3 -flto (same system for all six configurations):

  Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3003 BIOS), Chipset: AMD Starship/Matisse, Memory: 16GB, Disk: 2000GB Corsair Force MP600, Graphics: AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz), Audio: AMD Navi 10 HDMI Audio, Monitor: ASUS MG28U, Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200

  OS: Ubuntu 20.04, Kernel: 5.10.0-051000rc6daily20201205-generic (x86_64) 20201204, Desktop: GNOME Shell 3.36.4, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, OpenGL: 4.6 Mesa 21.0.0-devel (git-1a53572 2020-12-09 focal-oibaf-ppa) (LLVM 11.0.0), Vulkan: 1.2.145, Compiler: GCC 11.0.0 20201213, File-System: ext4, Screen Resolution: 3840x2160

FFTW 3.3.6
Build: Stock - Size: 1D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver3 .......... 13702.0 |====================================
-O3 -march=znver2 .......... 14096.0 |=====================================
-O3 -march=x86-64 .......... 12436.0 |=================================
-O3 -march=znver3 -flto .... 15584.0 |=========================================
-Ofast -march=znver3 -flto . 12149.0 |================================
-O2 -march=znver3 -flto .... 15232.0 |========================================

FFTW 3.3.6
Build: Stock - Size: 2D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver3 .......... 9314.1  |======================================
-O3 -march=znver2 .......... 8927.1  |====================================
-O3 -march=x86-64 .......... 8169.2  |=================================
-O3 -march=znver3 -flto .... 10125.0 |=========================================
-Ofast -march=znver3 -flto . 8755.9  |===================================
-O2 -march=znver3 -flto .... 9718.2  |=======================================

FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver3 .......... 69819 |=========================================
-O3 -march=znver2 .......... 71917 |===========================================
-O3 -march=znver3 -flto .... 64468 |======================================
-Ofast -march=znver3 -flto . 63882 |======================================
-O2 -march=znver3 -flto .... 72763 |===========================================

FFTW 3.3.6
Build: Float + SSE - Size: 2D FFT Size 4096
Mflops > Higher Is Better
-O3 -march=znver3 .......... 32774 |===========================================
-O3 -march=znver2 .......... 30461 |========================================
-O3 -march=znver3 -flto .... 31709 |==========================================
-Ofast -march=znver3 -flto . 31754 |==========================================
-O2 -march=znver3 -flto .... 31717 |==========================================

Timed MrBayes Analysis 3.2.7
Primate Phylogeny Analysis
Seconds < Lower Is Better
-O3 -march=znver3 .......... 76.32 |=========================================
-O3 -march=znver2 .......... 75.70 |=========================================
-O3 -march=x86-64 .......... 77.07 |==========================================
-O3 -march=znver3 -flto .... 73.92 |========================================
-Ofast -march=znver3 -flto . 45.27 |========================
-O2 -march=znver3 -flto .... 79.51 |===========================================

Timed HMMer Search 3.3.1
Pfam Database Search
Seconds < Lower Is Better
-O3 -march=znver3 .......... 85.02 |===========================================
-O3 -march=znver2 .......... 84.79 |===========================================
-O3 -march=x86-64 .......... 85.60 |===========================================
-O3 -march=znver3 -flto .... 84.12 |==========================================
-Ofast -march=znver3 -flto . 76.94 |=======================================
-O2 -march=znver3 -flto .... 84.93 |===========================================

Timed MAFFT Alignment 7.471
Multiple Sequence Alignment - LSU RNA
Seconds < Lower Is Better
-O3 -march=znver3 .......... 7.115 |=========================================
-O3 -march=znver2 .......... 7.324 |===========================================
-O3 -march=x86-64 .......... 7.399 |===========================================
-O3 -march=znver3 -flto .... 7.286 |==========================================
-Ofast -march=znver3 -flto . 7.341 |===========================================
-O2 -march=znver3 -flto .... 7.365 |===========================================

WebP Image Encode 1.1
Encode Settings: Quality 100
Encode Time - Seconds < Lower Is Better
-O3 -march=znver3 .......... 1.648 |=========================================
-O3 -march=znver2 .......... 1.684 |==========================================
-O3 -march=x86-64 .......... 1.726 |===========================================
-O3 -march=znver3 -flto .... 1.636 |=========================================
-Ofast -march=znver3 -flto . 1.660 |=========================================
-O2 -march=znver3 -flto .... 1.648 |=========================================

Zstd Compression 1.4.5
Compression Level: 3
MB/s > Higher Is Better
-O3 -march=znver3 .......... 5066.0 |==========================================
-O3 -march=znver2 .......... 5028.1 |==========================================
-O3 -march=x86-64 .......... 5042.5 |==========================================
-O3 -march=znver3 -flto .... 5043.1 |==========================================
-Ofast -march=znver3 -flto . 5030.2 |==========================================
-O2 -march=znver3 -flto .... 5045.8 |==========================================

Zstd Compression 1.4.5
Compression Level: 19
MB/s > Higher Is Better
-O3 -march=znver3 .......... 44.2 |============================================
-O3 -march=znver2 .......... 44.2 |============================================
-O3 -march=x86-64 .......... 44.3 |============================================
-O3 -march=znver3 -flto .... 44.2 |============================================
-Ofast -march=znver3 -flto . 44.2 |============================================
-O2 -march=znver3 -flto .... 44.1 |============================================

SciMark 2.0
Computational Test: Composite
Mflops > Higher Is Better
-O3 -march=znver3 .......... 3958.17 |=====================================
-O3 -march=znver2 .......... 4407.32 |=========================================
-O3 -march=x86-64 .......... 3660.74 |==================================
-O3 -march=znver3 -flto .... 3855.30 |====================================
-Ofast -march=znver3 -flto . 4101.89 |======================================
-O2 -march=znver3 -flto .... 2924.23 |===========================

SciMark 2.0
Computational Test: Monte Carlo
Mflops > Higher Is Better
-O3 -march=znver3 .......... 883.30  |===================
-O3 -march=znver2 .......... 867.28  |===================
-O3 -march=x86-64 .......... 897.30  |===================
-O3 -march=znver3 -flto .... 1625.41 |===================================
-Ofast -march=znver3 -flto . 1893.64 |=========================================
-O2 -march=znver3 -flto .... 891.29  |===================

SciMark 2.0
Computational Test: Fast Fourier Transform
Mflops > Higher Is Better
-O3 -march=znver3 .......... 467.20 |====================================
-O3 -march=znver2 .......... 531.10 |========================================
-O3 -march=x86-64 .......... 550.92 |==========================================
-O3 -march=znver3 -flto .... 460.20 |===================================
-Ofast -march=znver3 -flto . 539.06 |=========================================
-O2 -march=znver3 -flto .... 540.63 |=========================================

SciMark 2.0
Computational Test: Sparse Matrix Multiply
Mflops > Higher Is Better
-O3 -march=znver3 .......... 4779.43 |================================
-O3 -march=znver2 .......... 5128.66 |===================================
-O3 -march=x86-64 .......... 4634.54 |===============================
-O3 -march=znver3 -flto .... 4016.10 |===========================
-Ofast -march=znver3 -flto . 5235.17 |====================================
-O2 -march=znver3 -flto .... 6037.39 |=========================================

SciMark 2.0
Computational Test: Dense LU Matrix Factorization
Mflops > Higher Is Better
-O3 -march=znver3 .......... 10128.76 |================================
-O3 -march=znver2 .......... 12537.23 |========================================
-O3 -march=x86-64 .......... 9747.10  |===============================
-O3 -march=znver3 -flto .... 8601.61  |===========================
-Ofast -march=znver3 -flto . 9843.10  |===============================
-O2 -march=znver3 -flto .... 5574.17  |==================

SciMark 2.0
Computational Test: Jacobi Successive Over-Relaxation
Mflops > Higher Is Better
-O3 -march=znver3 .......... 3043.46 |=========================================
-O3 -march=znver2 .......... 2972.33 |========================================
-O3 -march=x86-64 .......... 2473.84 |=================================
-O3 -march=znver3 -flto .... 2552.11 |==================================
-Ofast -march=znver3 -flto . 2998.47 |========================================
-O2 -march=znver3 -flto .... 1577.67 |=====================

LibRaw 0.20
Post-Processing Benchmark
Mpix/sec > Higher Is Better
-O3 -march=znver3 .......... 73.83 |==========================================
-O3 -march=znver2 .......... 75.26 |===========================================
-O3 -march=x86-64 .......... 63.90 |=====================================
-O3 -march=znver3 -flto .... 73.73 |==========================================
-Ofast -march=znver3 -flto . 75.23 |===========================================
-O2 -march=znver3 -flto .... 49.32 |============================

Crafty 25.2
Elapsed Time
Nodes Per Second > Higher Is Better
-O3 -march=znver3 .......... 11396837 |=====================================
-O3 -march=znver2 .......... 11460428 |=====================================
-O3 -march=x86-64 .......... 11161599 |====================================
-O3 -march=znver3 -flto .... 12020364 |=======================================
-Ofast -march=znver3 -flto . 12258866 |========================================
-O2 -march=znver3 -flto .... 11650472 |======================================

TSCP 1.81
AI Chess Performance
Nodes Per Second > Higher Is Better
-O3 -march=znver3 .......... 2055431 |=======================================
-O3 -march=znver2 .......... 2010350 |======================================
-O3 -march=x86-64 .......... 2035599 |======================================
-O3 -march=znver3 -flto .... 2082271 |=======================================
-Ofast -march=znver3 -flto . 2187546 |=========================================
-O2 -march=znver3 -flto .... 2153845 |========================================

GraphicsMagick 1.3.33
Operation: Swirl
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 1196 |============================================
-O3 -march=znver2 .......... 1139 |==========================================
-O3 -march=x86-64 .......... 1094 |========================================
-O3 -march=znver3 -flto .... 1156 |==========================================
-Ofast -march=znver3 -flto . 1201 |============================================
-O2 -march=znver3 -flto .... 1153 |==========================================

GraphicsMagick 1.3.33
Operation: Rotate
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 1064 |============================================
-O3 -march=znver2 .......... 919  |======================================
-O3 -march=x86-64 .......... 987  |=========================================
-O3 -march=znver3 -flto .... 948  |=======================================
-Ofast -march=znver3 -flto . 961  |========================================
-O2 -march=znver3 -flto .... 888  |=====================================

GraphicsMagick 1.3.33
Operation: Sharpen
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 361 |===========================================
-O3 -march=znver2 .......... 377 |=============================================
-O3 -march=x86-64 .......... 225 |===========================
-O3 -march=znver3 -flto .... 370 |============================================
-Ofast -march=znver3 -flto . 328 |=======================================
-O2 -march=znver3 -flto .... 377 |=============================================

GraphicsMagick 1.3.33
Operation: Enhanced
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 432 |=========================================
-O3 -march=znver2 .......... 444 |==========================================
-O3 -march=x86-64 .......... 382 |====================================
-O3 -march=znver3 -flto .... 445 |==========================================
-Ofast -march=znver3 -flto . 449 |==========================================
-O2 -march=znver3 -flto .... 476 |=============================================

GraphicsMagick 1.3.33
Operation: Resizing
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 2135 |============================================
-O3 -march=znver2 .......... 1907 |=======================================
-O3 -march=x86-64 .......... 1721 |===================================
-O3 -march=znver3 -flto .... 1906 |=======================================
-Ofast -march=znver3 -flto . 1901 |=======================================
-O2 -march=znver3 -flto .... 1854 |======================================

GraphicsMagick 1.3.33
Operation: Noise-Gaussian
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 441 |=======================================
-O3 -march=znver2 .......... 434 |=======================================
-O3 -march=x86-64 .......... 426 |======================================
-O3 -march=znver3 -flto .... 430 |======================================
-Ofast -march=znver3 -flto . 504 |=============================================
-O2 -march=znver3 -flto .... 445 |========================================

GraphicsMagick 1.3.33
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
-O3 -march=znver3 .......... 1115 |============================================
-O3 -march=znver2 .......... 1064 |==========================================
-O3 -march=x86-64 .......... 1067 |==========================================
-O3 -march=znver3 -flto .... 1110 |============================================
-Ofast -march=znver3 -flto . 1119 |============================================
-O2 -march=znver3 -flto .... 1111 |============================================

dav1d 0.7.0
Video Input: Chimera 1080p
FPS > Higher Is Better
-O3 -march=znver3 . 879.20 |==================================================
-O3 -march=znver2 . 893.72 |==================================================
-O3 -march=x86-64 . 903.91 |===================================================

dav1d 0.7.0
Video Input: Summer Nature 4K
FPS > Higher Is Better
-O3 -march=znver3 . 240.24 |==================================================
-O3 -march=znver2 . 241.87 |===================================================
-O3 -march=x86-64 . 243.36 |===================================================

dav1d 0.7.0
Video Input: Summer Nature 1080p
FPS > Higher Is Better
-O3 -march=znver3 . 914.92 |==================================================
-O3 -march=znver2 . 932.67 |===================================================
-O3 -march=x86-64 . 921.07 |==================================================

dav1d 0.7.0
Video Input: Chimera 1080p 10-bit
FPS > Higher Is Better
-O3 -march=znver3 . 261.39 |===================================================
-O3 -march=znver2 . 260.36 |===================================================
-O3 -march=x86-64 . 187.65 |=====================================

AOM AV1 2.0
Encoder Mode: Speed 4 Realtime
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 2.64 |============================================
-O3 -march=znver2 .......... 2.53 |==========================================
-O3 -march=x86-64 .......... 2.62 |===========================================
-O3 -march=znver3 -flto .... 2.63 |===========================================
-Ofast -march=znver3 -flto . 2.66 |============================================
-O2 -march=znver3 -flto .... 2.67 |============================================

AOM AV1 2.0
Encoder Mode: Speed 5 Two-Pass
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 4.69 |==========================================
-O3 -march=znver2 .......... 4.79 |===========================================
-O3 -march=x86-64 .......... 4.84 |===========================================
-O3 -march=znver3 -flto .... 4.90 |============================================
-Ofast -march=znver3 -flto . 4.81 |===========================================
-O2 -march=znver3 -flto .... 4.87 |============================================

AOM AV1 2.0
Encoder Mode: Speed 8 Realtime
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 62.33 |==========================================
-O3 -march=znver2 .......... 62.28 |==========================================
-O3 -march=x86-64 .......... 60.07 |=========================================
-O3 -march=znver3 -flto .... 62.52 |==========================================
-Ofast -march=znver3 -flto . 63.76 |===========================================
-O2 -march=znver3 -flto .... 62.08 |==========================================

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Slow
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 11.58 |=========================================
-O3 -march=znver2 .......... 11.59 |=========================================
-O3 -march=x86-64 .......... 11.69 |=========================================
-O3 -march=znver3 -flto .... 11.88 |==========================================
-Ofast -march=znver3 -flto . 12.12 |===========================================
-O2 -march=znver3 -flto .... 11.68 |=========================================

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Medium
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 11.74 |=========================================
-O3 -march=znver2 .......... 11.72 |=========================================
-O3 -march=x86-64 .......... 11.86 |==========================================
-O3 -march=znver3 -flto .... 12.03 |==========================================
-Ofast -march=znver3 -flto . 12.27 |===========================================
-O2 -march=znver3 -flto .... 11.84 |=========================================

Kvazaar 2.0
Video Input: Bosphorus 1080p - Video Preset: Slow
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 44.91 |=========================================
-O3 -march=znver2 .......... 44.97 |=========================================
-O3 -march=x86-64 .......... 45.75 |==========================================
-O3 -march=znver3 -flto .... 46.30 |==========================================
-Ofast -march=znver3 -flto . 47.25 |===========================================
-O2 -march=znver3 -flto .... 45.68 |==========================================

Kvazaar 2.0
Video Input: Bosphorus 1080p - Video Preset: Medium
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 46.42 |=========================================
-O3 -march=znver2 .......... 46.38 |=========================================
-O3 -march=x86-64 .......... 46.74 |=========================================
-O3 -march=znver3 -flto .... 47.32 |==========================================
-Ofast -march=znver3 -flto . 48.54 |===========================================
-O2 -march=znver3 -flto .... 47.04 |==========================================

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 30.31 |========================================
-O3 -march=znver2 .......... 30.34 |========================================
-O3 -march=x86-64 .......... 30.48 |========================================
-O3 -march=znver3 -flto .... 31.44 |==========================================
-Ofast -march=znver3 -flto . 32.40 |===========================================
-O2 -march=znver3 -flto .... 30.76 |=========================================

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 54.73 |========================================
-O3 -march=znver2 .......... 54.75 |========================================
-O3 -march=x86-64 .......... 54.43 |========================================
-O3 -march=znver3 -flto .... 56.12 |=========================================
-Ofast -march=znver3 -flto . 58.69 |===========================================
-O2 -march=znver3 -flto .... 54.91 |========================================

Kvazaar 2.0
Video Input: Bosphorus 1080p - Video Preset: Very Fast
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 111.50 |======================================
-O3 -march=znver2 .......... 111.54 |======================================
-O3 -march=x86-64 .......... 111.77 |=======================================
-O3 -march=znver3 -flto .... 116.07 |========================================
-Ofast -march=znver3 -flto . 121.74 |==========================================
-O2 -march=znver3 -flto .... 113.41 |=======================================

Kvazaar 2.0
Video Input: Bosphorus 1080p - Video Preset: Ultra Fast
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 200.12 |======================================
-O3 -march=znver2 .......... 199.35 |======================================
-O3 -march=x86-64 .......... 198.84 |======================================
-O3 -march=znver3 -flto .... 203.89 |=======================================
-Ofast -march=znver3 -flto . 220.99 |==========================================
-O2 -march=znver3 -flto .... 201.85 |======================================

SVT-AV1 0.8
Encoder Mode: Enc Mode 4 - Input: 1080p
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 7.014 |===========================================
-O3 -march=znver2 .......... 6.956 |===========================================
-O3 -march=x86-64 .......... 6.960 |===========================================
-O3 -march=znver3 -flto .... 6.972 |===========================================
-Ofast -march=znver3 -flto . 6.934 |===========================================
-O2 -march=znver3 -flto .... 6.957 |===========================================

SVT-AV1 0.8
Encoder Mode: Enc Mode 8 - Input: 1080p
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 60.38 |===========================================
-O3 -march=znver2 .......... 59.94 |===========================================
-O3 -march=x86-64 .......... 59.84 |===========================================
-O3 -march=znver3 -flto .... 60.22 |===========================================
-Ofast -march=znver3 -flto . 59.76 |===========================================
-O2 -march=znver3 -flto .... 59.68 |===========================================

SVT-VP9 0.1
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 239.32 |==========================================
-O3 -march=znver2 .......... 234.58 |=========================================
-O3 -march=x86-64 .......... 235.02 |=========================================
-O3 -march=znver3 -flto .... 235.83 |=========================================
-Ofast -march=znver3 -flto . 237.74 |==========================================
-O2 -march=znver3 -flto .... 236.33 |=========================================

SVT-VP9 0.1
Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 230.15 |==========================================
-O3 -march=znver2 .......... 223.97 |=========================================
-O3 -march=x86-64 .......... 224.11 |=========================================
-O3 -march=znver3 -flto .... 224.25 |=========================================
-Ofast -march=znver3 -flto . 228.43 |==========================================
-O2 -march=znver3 -flto .... 225.36 |=========================================

VP9 libvpx Encoding 1.8.2
Speed: Speed 0
Frames Per Second > Higher Is Better
-O3 -march=znver3 . 10.03 |=================================================
-O3 -march=znver2 . 10.75 |====================================================
-O3 -march=x86-64 . 9.44  |==============================================

VP9 libvpx Encoding 1.8.2
Speed: Speed 5
Frames Per Second > Higher Is Better
-O3 -march=znver3 . 39.78 |====================================================
-O3 -march=znver2 . 39.66 |====================================================
-O3 -march=x86-64 . 33.71 |============================================

x264 2019-12-17
H.264 Video Encoding
Frames Per Second > Higher Is Better
-O3 -march=znver3 . 199.02 |==================================================
-O3 -march=znver2 . 200.66 |==================================================
-O3 -march=x86-64 . 202.78 |===================================================

x265 3.4
Video Input: Bosphorus 4K
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 27.54 |===========================================
-O3 -march=znver2 .......... 27.81 |===========================================
-O3 -march=x86-64 .......... 27.63 |===========================================
-O3 -march=znver3 -flto .... 27.75 |===========================================
-Ofast -march=znver3 -flto . 27.32 |==========================================
-O2 -march=znver3 -flto .... 27.48 |==========================================

x265 3.4
Video Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=znver3 .......... 89.13 |===========================================
-O3 -march=znver2 .......... 88.80 |===========================================
-O3 -march=x86-64 .......... 88.92 |===========================================
-O3 -march=znver3 -flto .... 89.08 |===========================================
-Ofast -march=znver3 -flto . 88.21 |===========================================
-O2 -march=znver3 -flto .... 87.71 |==========================================

ACES DGEMM 1.0
Sustained Floating-Point Rate
GFLOP/s > Higher Is Better
-O3 -march=znver3 .......... 7.307093 |======================================
-O3 -march=znver2 .......... 7.512189 |=======================================
-O3 -march=x86-64 .......... 6.484557 |=================================
-O3 -march=znver3 -flto .... 7.696089 |========================================
-Ofast -march=znver3 -flto . 7.758952 |========================================
-O2 -march=znver3 -flto .... 7.345638 |======================================

Coremark 1.0
CoreMark Size 666 - Iterations Per Second
Iterations/Sec > Higher Is Better
-O3 -march=znver3 .......... 716026.62 |======================================
-O3 -march=znver2 .......... 721620.07 |======================================
-O3 -march=x86-64 .......... 717480.24 |======================================
-O3 -march=znver3 -flto .... 736850.43 |=======================================
-Ofast -march=znver3 -flto . 728279.79 |=======================================
-O2 -march=znver3 -flto .... 681281.71 |====================================

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
-O3 -march=znver3 .......... 4918.51 |======================================
-O3 -march=znver2 .......... 5341.80 |=========================================
-O3 -march=x86-64 .......... 4972.63 |======================================
-O3 -march=znver3 -flto .... 5235.08 |========================================
-Ofast -march=znver3 -flto . 4974.88 |======================================
-O2 -march=znver3 -flto .... 5357.65 |=========================================

7-Zip Compression 16.02
Compress Speed Test
MIPS > Higher Is Better
-O3 -march=znver3 .......... 95844 |===========================================
-O3 -march=znver2 .......... 95715 |===========================================
-O3 -march=x86-64 .......... 95316 |===========================================
-O3 -march=znver3 -flto .... 95172 |===========================================
-Ofast -march=znver3 -flto . 95872 |===========================================
-O2 -march=znver3 -flto .... 95588 |===========================================

Timed Apache Compilation 2.4.41
Time To Compile
Seconds < Lower Is Better
-O3 -march=znver3 . 15.14 |====================================================
-O3 -march=znver2 . 15.19 |====================================================
-O3 -march=x86-64 . 15.18 |====================================================

Timed FFmpeg Compilation 4.2.2
Time To Compile
Seconds < Lower Is Better
-O3 -march=znver3 . 30.87 |====================================================
-O3 -march=znver2 . 30.81 |====================================================
-O3 -march=x86-64 . 30.71 |====================================================

Timed ImageMagick Compilation 6.9.0
Time To Compile
Seconds < Lower Is Better
-O3 -march=znver3 .......... 18.86 |============
-O3 -march=znver2 .......... 19.37 |============
-O3 -march=x86-64 .......... 19.00 |============
-O3 -march=znver3 -flto .... 63.48 |=========================================
-Ofast -march=znver3 -flto . 66.84 |===========================================
-O2 -march=znver3 -flto .... 50.41 |================================

Timed MPlayer Compilation 1.4
Time To Compile
Seconds < Lower Is Better
-O3 -march=znver3 . 19.80 |====================================================
-O3 -march=znver2 . 19.60 |===================================================
-O3 -march=x86-64 . 19.82 |====================================================

Timed PHP Compilation 7.4.2
Time To Compile
Seconds < Lower Is Better
-O3 -march=znver3 . 36.68 |====================================================
-O3 -march=znver2 . 36.75 |====================================================
-O3 -march=x86-64 . 35.35 |==================================================

C-Ray 1.1
Total Time - 4K, 16 Rays Per Pixel
Seconds < Lower Is Better
-O3 -march=znver3 .......... 25.25 |=======================
-O3 -march=znver2 .......... 25.25 |=======================
-O3 -march=x86-64 .......... 31.36 |=============================
-O3 -march=znver3 -flto .... 25.48 |========================
-Ofast -march=znver3 -flto . 27.34 |=========================
-O2 -march=znver3 -flto .... 46.42 |===========================================

Smallpt 1.0
Global Illumination Renderer; 128 Samples
Seconds < Lower Is Better
-O3 -march=znver3 .......... 4.673 |========================================
-O3 -march=znver2 .......... 4.677 |========================================
-O3 -march=x86-64 .......... 5.068 |===========================================
-O3 -march=znver3 -flto .... 4.656 |========================================
-Ofast -march=znver3 -flto . 4.602 |=======================================
-O2 -march=znver3 -flto .... 4.607 |=======================================

AOBench
Size: 2048 x 2048 - Total Time
Seconds < Lower Is Better
-O3 -march=znver3 .......... 25.07 |====================================
-O3 -march=znver2 .......... 25.02 |====================================
-O3 -march=x86-64 .......... 29.99 |===========================================
-O3 -march=znver3 -flto .... 25.85 |=====================================
-Ofast -march=znver3 -flto . 24.51 |===================================
-O2 -march=znver3 -flto .... 27.36 |=======================================

Bullet Physics Engine 2.81
Test: Raytests
Seconds < Lower Is Better
-O3 -march=znver3 .......... 1.663293 |=====================================
-O3 -march=znver2 .......... 1.785458 |========================================
-O3 -march=x86-64 .......... 1.792863 |========================================
-O3 -march=znver3 -flto .... 1.632244 |====================================
-Ofast -march=znver3 -flto . 1.627688 |====================================
-O2 -march=znver3 -flto .... 1.594243 |====================================

Bullet Physics Engine 2.81
Test: 3000 Fall
Seconds < Lower Is Better
-O3 -march=znver3 .......... 2.651887 |================================
-O3 -march=znver2 .......... 3.164171 |======================================
-O3 -march=x86-64 .......... 3.331015 |========================================
-O3 -march=znver3 -flto .... 2.692810 |================================
-Ofast -march=znver3 -flto . 2.692827 |================================
-O2 -march=znver3 -flto .... 2.632108 |================================

Bullet Physics Engine 2.81
Test: 1000 Stack
Seconds < Lower Is Better
-O3 -march=znver3 .......... 3.265452 |================================
-O3 -march=znver2 .......... 3.890664 |======================================
-O3 -march=x86-64 .......... 4.077866 |========================================
-O3 -march=znver3 -flto .... 3.319278 |=================================
-Ofast -march=znver3 -flto . 3.316208 |=================================
-O2 -march=znver3 -flto .... 3.255847 |================================

Bullet Physics Engine 2.81
Test: 1000 Convex
Seconds < Lower Is Better
-O3 -march=znver3 .......... 
2.672805 |============================= -O3 -march=znver2 .......... 3.197418 |=================================== -O3 -march=x86-64 .......... 3.627370 |======================================== -O3 -maech=znver3 -flto .... 2.625405 |============================= -Ofast -march=znver3 -flto . 2.629167 |============================= -O2 -march=znver3 -flto .... 2.573982 |============================ Bullet Physics Engine 2.81 Test: 136 Ragdolls Seconds < Lower Is Better -O3 -march=znver3 .......... 1.710504 |=============================== -O3 -march=znver2 .......... 2.041453 |===================================== -O3 -march=x86-64 .......... 2.184048 |======================================== -O3 -maech=znver3 -flto .... 1.771787 |================================ -Ofast -march=znver3 -flto . 1.769900 |================================ -O2 -march=znver3 -flto .... 1.737217 |================================ Bullet Physics Engine 2.81 Test: Prim Trimesh Seconds < Lower Is Better -O3 -march=znver3 .......... 0.656643 |================================ -O3 -march=znver2 .......... 0.786183 |======================================= -O3 -march=x86-64 .......... 0.815518 |======================================== -O3 -maech=znver3 -flto .... 0.640668 |=============================== -Ofast -march=znver3 -flto . 0.640757 |=============================== -O2 -march=znver3 -flto .... 0.627687 |=============================== Bullet Physics Engine 2.81 Test: Convex Trimesh Seconds < Lower Is Better -O3 -march=znver3 .......... 0.731618 |=============================== -O3 -march=znver2 .......... 0.876382 |====================================== -O3 -march=x86-64 .......... 0.930682 |======================================== -O3 -maech=znver3 -flto .... 0.708918 |============================== -Ofast -march=znver3 -flto . 0.708037 |============================== -O2 -march=znver3 -flto .... 
0.693470 |============================== XZ Compression 5.2.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9 Seconds < Lower Is Better -O3 -march=znver3 .......... 23.60 |========================================== -O3 -march=znver2 .......... 23.42 |========================================== -O3 -march=x86-64 .......... 23.17 |========================================= -O3 -maech=znver3 -flto .... 24.09 |=========================================== -Ofast -march=znver3 -flto . 23.71 |========================================== -O2 -march=znver3 -flto .... 23.34 |========================================== FLAC Audio Encoding 1.3.2 WAV To FLAC Seconds < Lower Is Better -O3 -march=znver3 .......... 6.534 |========================================== -O3 -march=znver2 .......... 6.616 |=========================================== -O3 -march=x86-64 .......... 6.441 |========================================== -O3 -maech=znver3 -flto .... 6.606 |=========================================== -Ofast -march=znver3 -flto . 6.233 |========================================= -O2 -march=znver3 -flto .... 6.186 |======================================== LAME MP3 Encoding 3.100 WAV To MP3 Seconds < Lower Is Better -O3 -march=znver3 .......... 5.445 |==================================== -O3 -march=znver2 .......... 5.673 |====================================== -O3 -march=x86-64 .......... 5.787 |====================================== -O3 -maech=znver3 -flto .... 5.789 |====================================== -Ofast -march=znver3 -flto . 5.282 |=================================== -O2 -march=znver3 -flto .... 6.473 |=========================================== RNNoise 2020-06-28 Seconds < Lower Is Better -O3 -march=znver3 ....... 16.56 |============================================== -O3 -march=znver2 ....... 14.07 |======================================= -O3 -march=x86-64 ....... 14.52 |======================================== -O3 -maech=znver3 -flto . 
13.86 |======================================= -O2 -march=znver3 -flto . 15.65 |=========================================== Tachyon 0.99b6 Total Time Seconds < Lower Is Better -O3 -march=znver3 .......... 49.48 |=========================================== -O3 -march=znver2 .......... 49.29 |=========================================== -O3 -march=x86-64 .......... 49.50 |=========================================== -O3 -maech=znver3 -flto .... 49.43 |=========================================== -Ofast -march=znver3 -flto . 49.54 |=========================================== -O2 -march=znver3 -flto .... 49.52 |=========================================== OpenSSL 1.1.1 RSA 4096-bit Performance Signs Per Second > Higher Is Better -O3 -march=znver3 .......... 4849.5 |========================================== -O3 -march=znver2 .......... 4857.9 |========================================== -O3 -march=x86-64 .......... 4847.3 |========================================== -O3 -maech=znver3 -flto .... 4836.6 |========================================== -Ofast -march=znver3 -flto . 4839.5 |========================================== -O2 -march=znver3 -flto .... 4852.6 |========================================== Aircrack-ng 1.5.2 k/s > Higher Is Better -O3 -march=znver3 .......... 75202.34 |======================================== -O3 -march=znver2 .......... 75330.45 |======================================== -O3 -march=x86-64 .......... 75091.64 |======================================== -O3 -maech=znver3 -flto .... 75160.07 |======================================== -Ofast -march=znver3 -flto . 75105.17 |======================================== -O2 -march=znver3 -flto .... 75257.16 |======================================== Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping Test Cases Per Minute > Higher Is Better -O3 -march=znver3 .......... 971.31 |========================================== -O3 -march=znver2 .......... 
964.72 |========================================== -O3 -march=x86-64 .......... 931.03 |======================================== -O3 -maech=znver3 -flto .... 971.45 |========================================== -Ofast -march=znver3 -flto . 969.91 |========================================== -O2 -march=znver3 -flto .... 970.46 |========================================== Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image Test Cases Per Minute > Higher Is Better -O3 -march=znver3 .......... 30333.66 |======================================== -O3 -march=znver2 .......... 30133.17 |======================================== -O3 -march=x86-64 .......... 29933.96 |======================================= -O3 -maech=znver3 -flto .... 29966.05 |======================================== -Ofast -march=znver3 -flto . 29964.69 |======================================== -O2 -march=znver3 -flto .... 29486.00 |======================================= CppPerformanceBenchmarks 9 Test: Atol Seconds < Lower Is Better -O3 -march=znver3 .......... 35.81 |========================================== -O3 -march=znver2 .......... 35.33 |========================================= -O3 -march=x86-64 .......... 36.38 |=========================================== -O3 -maech=znver3 -flto .... 35.33 |========================================= -Ofast -march=znver3 -flto . 36.78 |=========================================== -O2 -march=znver3 -flto .... 35.39 |========================================= CppPerformanceBenchmarks 9 Test: Ctype Seconds < Lower Is Better -O3 -march=znver3 .......... 40.08 |=================================== -O3 -march=znver2 .......... 40.71 |==================================== -O3 -march=x86-64 .......... 41.94 |===================================== -O3 -maech=znver3 -flto .... 39.87 |=================================== -Ofast -march=znver3 -flto . 40.21 |==================================== -O2 -march=znver3 -flto .... 
48.61 |=========================================== CppPerformanceBenchmarks 9 Test: Math Library Seconds < Lower Is Better -O3 -march=znver3 .......... 222.25 |================================= -O3 -march=znver2 .......... 220.55 |================================= -O3 -march=x86-64 .......... 223.87 |================================= -O3 -maech=znver3 -flto .... 219.48 |================================ -Ofast -march=znver3 -flto . 217.51 |================================ -O2 -march=znver3 -flto .... 283.69 |========================================== CppPerformanceBenchmarks 9 Test: Stepanov Vector Seconds < Lower Is Better -O3 -march=znver3 .......... 42.38 |=========================================== -O3 -march=znver2 .......... 41.72 |========================================== -O3 -march=x86-64 .......... 42.38 |=========================================== -O3 -maech=znver3 -flto .... 41.58 |========================================== -Ofast -march=znver3 -flto . 41.83 |========================================== -O2 -march=znver3 -flto .... 42.03 |=========================================== CppPerformanceBenchmarks 9 Test: Function Objects Seconds < Lower Is Better -O3 -march=znver3 .......... 11.20 |==================================== -O3 -march=znver2 .......... 11.46 |===================================== -O3 -march=x86-64 .......... 11.33 |==================================== -O3 -maech=znver3 -flto .... 11.47 |===================================== -Ofast -march=znver3 -flto . 11.26 |==================================== -O2 -march=znver3 -flto .... 13.45 |=========================================== CppPerformanceBenchmarks 9 Test: Stepanov Abstraction Seconds < Lower Is Better -O3 -march=znver3 .......... 18.86 |=============================== -O3 -march=znver2 .......... 20.64 |================================== -O3 -march=x86-64 .......... 19.85 |================================= -O3 -maech=znver3 -flto .... 
18.88 |=============================== -Ofast -march=znver3 -flto . 18.55 |=============================== -O2 -march=znver3 -flto .... 25.80 |=========================================== SQLite Speedtest 3.30 Timed Time - Size 1,000 Seconds < Lower Is Better -O3 -march=znver3 .......... 40.64 |====================================== -O3 -march=znver2 .......... 41.91 |======================================== -O3 -march=x86-64 .......... 41.95 |======================================== -O3 -maech=znver3 -flto .... 41.31 |======================================= -Ofast -march=znver3 -flto . 41.13 |======================================= -O2 -march=znver3 -flto .... 45.49 |=========================================== Hierarchical INTegration 1.0 Test: FLOAT QUIPs > Higher Is Better -O3 -march=znver3 .......... 530415720.67 |==================================== -O3 -march=znver2 .......... 535659290.51 |==================================== -O3 -march=x86-64 .......... 536140751.39 |==================================== -O3 -maech=znver3 -flto .... 525844746.57 |=================================== -Ofast -march=znver3 -flto . 483531659.14 |================================ -O2 -march=znver3 -flto .... 533589946.00 |==================================== NGINX Benchmark 1.9.9 Static Web Page Serving Requests Per Second > Higher Is Better -O3 -march=znver3 .......... 51446.92 |======================================== -O3 -march=znver2 .......... 48905.01 |====================================== -O3 -march=x86-64 .......... 51463.87 |======================================== -O3 -maech=znver3 -flto .... 51640.99 |======================================== -Ofast -march=znver3 -flto . 49006.08 |====================================== -O2 -march=znver3 -flto .... 51831.76 |======================================== Apache Benchmark 2.4.29 Static Web Page Serving Requests Per Second > Higher Is Better -O3 -march=znver3 . 
46267.92 |================================================= -O3 -march=znver2 . 46212.92 |================================================= -O3 -march=x86-64 . 45996.80 |=================================================