11900K Compiler Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite. GCC 11.1: -O3 -march=native: Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411 OS: Fedora 34, Kernel: 5.11.20-300.fc34.x86_64 (x86_64), Desktop: GNOME Shell 40.1, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.0.3 (LLVM 12.0.0), Compiler: GCC 11.1.1 20210428, File-System: btrfs, Screen Resolution: 3840x2160 GCC 11.1: -O3 -march=native -flto: Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411 OS: Fedora 34, Kernel: 5.11.20-300.fc34.x86_64 (x86_64), Desktop: GNOME Shell 40.1, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.0.3 (LLVM 12.0.0), Compiler: GCC 11.1.1 20210428, File-System: btrfs, Screen Resolution: 3840x2160 GCC 11.1: -O2: Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 500GB Western Digital WDS500G3X0C-00SJG0 + 15GB Ultra USB 3.0, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411 OS: Fedora 34, Kernel: 5.11.20-300.fc34.x86_64 (x86_64), Desktop: GNOME Shell 40.1, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.0.3 (LLVM 12.0.0), Compiler: GCC 11.1.1 20210428, File-System: btrfs, Screen Resolution: 3840x2160 Crypto++ 8.2 Test: Unkeyed Algorithms MiB/second > Higher Is Better GCC 11.1: -O3 -march=native ....... 489.76 |=================================== GCC 11.1: -O3 -march=native -flto . 488.63 |=================================== GCC 11.1: -O2 ..................... 491.64 |=================================== Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 86.70 |==================================== GCC 11.1: -O3 -march=native -flto . 84.93 |=================================== GCC 11.1: -O2 ..................... 87.30 |==================================== Timed HMMer Search 3.3.2 Pfam Database Search Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 100.74 |================================== GCC 11.1: -O3 -march=native -flto . 99.97 |================================== GCC 11.1: -O2 ..................... 103.29 |=================================== Quantum ESPRESSO 6.7 Input: AUSURF112 Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 2576.97 |================================== GCC 11.1: -O3 -march=native -flto . 2540.19 |================================== GCC 11.1: -O2 ..................... 2538.25 |================================= LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein ns/day > Higher Is Better GCC 11.1: -O3 -march=native ....... 8.067 |=================================== GCC 11.1: -O3 -march=native -flto . 8.328 |==================================== GCC 11.1: -O2 ..................... 8.023 |=================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Encode Time - Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 12.90 |================================== GCC 11.1: -O3 -march=native -flto . 12.71 |================================= GCC 11.1: -O2 ..................... 13.76 |==================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Encode Time - Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.127 |================================== GCC 11.1: -O3 -march=native -flto . 5.103 |================================== GCC 11.1: -O2 ..................... 5.360 |==================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Encode Time - Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 27.26 |=================================== GCC 11.1: -O3 -march=native -flto . 27.07 |=================================== GCC 11.1: -O2 ..................... 27.84 |==================================== GNU GMP GMPbench 6.2.1 Total Time GMPbench Score > Higher Is Better GCC 11.1: -O3 -march=native ....... 6172.9 |=================================== GCC 11.1: -O3 -march=native -flto . 6171.6 |=================================== Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 35.4 |===================================== GCC 11.1: -O3 -march=native -flto . 34.8 |==================================== GCC 11.1: -O2 ..................... 34.5 |==================================== Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 4514.8 |================================= GCC 11.1: -O3 -march=native -flto . 4503.1 |================================= GCC 11.1: -O2 ..................... 4718.1 |=================================== Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 285.3 |=================================== GCC 11.1: -O3 -march=native -flto . 281.1 |================================== GCC 11.1: -O2 ..................... 296.0 |==================================== Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 5546.0 |================================== GCC 11.1: -O3 -march=native -flto . 5477.9 |================================= GCC 11.1: -O2 ..................... 5760.9 |=================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 33.0 |===================================== GCC 11.1: -O3 -march=native -flto . 32.8 |===================================== GCC 11.1: -O2 ..................... 32.7 |===================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 4582.3 |================================== GCC 11.1: -O3 -march=native -flto . 4579.8 |================================== GCC 11.1: -O2 ..................... 4777.3 |=================================== GraphicsMagick 1.3.33 Operation: Rotate Iterations Per Minute > Higher Is Better GCC 11.1: -O3 -march=native ....... 1141 |===================================== GCC 11.1: -O3 -march=native -flto . 1072 |=================================== GCC 11.1: -O2 ..................... 1066 |=================================== GraphicsMagick 1.3.33 Operation: Sharpen Iterations Per Minute > Higher Is Better GCC 11.1: -O3 -march=native ....... 195 |====================================== GCC 11.1: -O3 -march=native -flto . 195 |====================================== GCC 11.1: -O2 ..................... 164 |================================ GraphicsMagick 1.3.33 Operation: Enhanced Iterations Per Minute > Higher Is Better GCC 11.1: -O3 -march=native ....... 270 |====================================== GCC 11.1: -O3 -march=native -flto . 269 |====================================== GCC 11.1: -O2 ..................... 219 |=============================== GraphicsMagick 1.3.33 Operation: Resizing Iterations Per Minute > Higher Is Better GCC 11.1: -O3 -march=native ....... 1198 |==================================== GCC 11.1: -O3 -march=native -flto . 1229 |===================================== GCC 11.1: -O2 ..................... 1091 |================================= dav1d 0.8.2 Video Input: Chimera 1080p FPS > Higher Is Better GCC 11.1: -O3 -march=native . 763.05 |======================================== GCC 11.1: -O2 ............... 773.93 |========================================= dav1d 0.8.2 Video Input: Summer Nature 4K FPS > Higher Is Better GCC 11.1: -O3 -march=native . 190.31 |========================================= GCC 11.1: -O2 ............... 186.75 |======================================== dav1d 0.8.2 Video Input: Summer Nature 1080p FPS > Higher Is Better GCC 11.1: -O3 -march=native . 717.31 |======================================== GCC 11.1: -O2 ............... 727.60 |========================================= dav1d 0.8.2 Video Input: Chimera 1080p 10-bit FPS > Higher Is Better GCC 11.1: -O3 -march=native . 223.02 |========================================= GCC 11.1: -O2 ............... 148.40 |=========================== SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 139.13 |================================== GCC 11.1: -O3 -march=native -flto . 141.83 |=================================== GCC 11.1: -O2 ..................... 136.31 |================================== SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 278.72 |=================================== GCC 11.1: -O3 -march=native -flto . 278.59 |=================================== GCC 11.1: -O2 ..................... 273.60 |================================== SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 195.87 |=================================== GCC 11.1: -O3 -march=native -flto . 195.07 |=================================== GCC 11.1: -O2 ..................... 191.83 |================================== SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 201.70 |=================================== GCC 11.1: -O3 -march=native -flto . 201.10 |=================================== GCC 11.1: -O2 ..................... 198.01 |================================== SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 164.77 |=================================== GCC 11.1: -O3 -march=native -flto . 166.05 |=================================== GCC 11.1: -O2 ..................... 160.65 |================================== x265 3.4 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 15.81 |==================================== GCC 11.1: -O3 -march=native -flto . 15.40 |=================================== GCC 11.1: -O2 ..................... 15.64 |==================================== Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better GCC 11.1: -O3 -march=native ....... 432583.96 |================================ GCC 11.1: -O3 -march=native -flto . 435901.44 |================================ GCC 11.1: -O2 ..................... 430127.50 |================================ Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better GCC 11.1: -O3 -march=native ....... 6878.51 |================================= GCC 11.1: -O3 -march=native -flto . 7079.88 |================================== GCC 11.1: -O2 ..................... 6305.48 |============================== Stockfish 13 Total Time Nodes Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 29932441 |================================= GCC 11.1: -O3 -march=native -flto . 29086394 |================================ GCC 11.1: -O2 ..................... 29094819 |================================ PJSIP 2.11 Method: INVITE Responses Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 4959 |==================================== GCC 11.1: -O3 -march=native -flto . 5058 |===================================== GCC 11.1: -O2 ..................... 5001 |===================================== PJSIP 2.11 Method: OPTIONS, Stateful Responses Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 9389 |===================================== GCC 11.1: -O3 -march=native -flto . 9395 |===================================== GCC 11.1: -O2 ..................... 9381 |===================================== PJSIP 2.11 Method: OPTIONS, Stateless Responses Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 241439 |=================================== GCC 11.1: -O3 -march=native -flto . 239892 |=================================== GCC 11.1: -O2 ..................... 239792 |=================================== C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 47.35 |================ GCC 11.1: -O3 -march=native -flto . 47.61 |================ GCC 11.1: -O2 ..................... 106.52 |=================================== Smallpt 1.0 Global Illumination Renderer; 128 Samples Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 8.405 |================================== GCC 11.1: -O3 -march=native -flto . 8.454 |=================================== GCC 11.1: -O2 ..................... 8.771 |==================================== oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 4.06617 |================================== GCC 11.1: -O3 -march=native -flto . 4.04481 |================================== GCC 11.1: -O2 ..................... 4.04477 |================================== oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 11.24 |==================================== GCC 11.1: -O3 -march=native -flto . 11.23 |==================================== GCC 11.1: -O2 ..................... 10.75 |================================== oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 0.722430 |================================= GCC 11.1: -O3 -march=native -flto . 0.720482 |================================= GCC 11.1: -O2 ..................... 0.717882 |================================= oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3.17026 |================================== GCC 11.1: -O3 -march=native -flto . 3.13941 |================================== GCC 11.1: -O2 ..................... 3.13532 |================================== oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 8.57548 |================================== GCC 11.1: -O3 -march=native -flto . 8.57248 |================================== GCC 11.1: -O2 ..................... 8.57623 |================================== oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.28726 |================================= GCC 11.1: -O3 -march=native -flto . 5.40080 |================================== GCC 11.1: -O2 ..................... 5.01199 |================================ oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 14.29 |==================================== GCC 11.1: -O3 -march=native -flto . 14.25 |==================================== GCC 11.1: -O2 ..................... 14.17 |==================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 4.86522 |================================== GCC 11.1: -O3 -march=native -flto . 4.74611 |================================= GCC 11.1: -O2 ..................... 4.87601 |================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 4.30798 |================================== GCC 11.1: -O3 -march=native -flto . 4.25176 |================================== GCC 11.1: -O2 ..................... 4.27077 |================================== oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 12.51 |==================================== GCC 11.1: -O3 -march=native -flto . 12.52 |==================================== GCC 11.1: -O2 ..................... 12.37 |==================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 0.829637 |================================= GCC 11.1: -O3 -march=native -flto . 0.831699 |================================= GCC 11.1: -O2 ..................... 0.829564 |================================= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1.45788 |================================== GCC 11.1: -O3 -march=native -flto . 1.47524 |================================== GCC 11.1: -O2 ..................... 1.46726 |================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3173.47 |================================== GCC 11.1: -O3 -march=native -flto . 3152.89 |================================== GCC 11.1: -O2 ..................... 3124.56 |================================= oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1887.61 |================================== GCC 11.1: -O3 -march=native -flto . 1876.25 |================================== GCC 11.1: -O2 ..................... 1842.14 |================================= oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3172.19 |================================== GCC 11.1: -O3 -march=native -flto . 3148.67 |================================== GCC 11.1: -O2 ..................... 3123.64 |================================= oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 16.18 |==================================== GCC 11.1: -O3 -march=native -flto . 16.19 |==================================== GCC 11.1: -O2 ..................... 16.17 |==================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 16.50 |================================== GCC 11.1: -O3 -march=native -flto . 17.42 |==================================== GCC 11.1: -O2 ..................... 16.69 |================================== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 17.06 |==================================== GCC 11.1: -O3 -march=native -flto . 17.13 |==================================== GCC 11.1: -O2 ..................... 17.06 |==================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1890.59 |================================== GCC 11.1: -O3 -march=native -flto . 1874.70 |================================== GCC 11.1: -O2 ..................... 1845.74 |================================= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3.52791 |================================== GCC 11.1: -O3 -march=native -flto . 3.52381 |================================== GCC 11.1: -O2 ..................... 3.53315 |================================== oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3171.46 |================================== GCC 11.1: -O3 -march=native -flto . 3154.69 |================================== GCC 11.1: -O2 ..................... 3123.95 |================================= oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1891.71 |================================== GCC 11.1: -O3 -march=native -flto . 1877.51 |================================== GCC 11.1: -O2 ..................... 1841.63 |================================= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1.32271 |================================== GCC 11.1: -O3 -march=native -flto . 1.32311 |================================== GCC 11.1: -O2 ..................... 1.32100 |================================== oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3.53708 |================================== GCC 11.1: -O3 -march=native -flto . 3.53500 |================================== GCC 11.1: -O2 ..................... 3.54019 |================================== AOBench Size: 2048 x 2048 - Total Time Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 21.54 |================================ GCC 11.1: -O3 -march=native -flto . 21.58 |================================ GCC 11.1: -O2 ..................... 24.46 |==================================== FLAC Audio Encoding 1.3.2 WAV To FLAC Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.931 |=================================== GCC 11.1: -O3 -march=native -flto . 5.936 |=================================== GCC 11.1: -O2 ..................... 6.086 |==================================== LAME MP3 Encoding 3.100 WAV To MP3 Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.479 |=========================== GCC 11.1: -O3 -march=native -flto . 5.376 |========================== GCC 11.1: -O2 ..................... 7.304 |==================================== Opus Codec Encoding 1.3.1 WAV To Opus Encode Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.587 |=============================== GCC 11.1: -O3 -march=native -flto . 5.575 |=============================== GCC 11.1: -O2 ..................... 6.467 |==================================== eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 21.71 |=================================== GCC 11.1: -O3 -march=native -flto . 22.60 |==================================== GCC 11.1: -O2 ..................... 21.33 |================================== Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 686530000 |================================ GCC 11.1: -O3 -march=native -flto . 684356667 |================================ GCC 11.1: -O2 ..................... 635506667 |============================== Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better GCC 11.1: -O3 -march=native ....... 722893333 |================================ GCC 11.1: -O3 -march=native -flto . 722393333 |================================ GCC 11.1: -O2 ..................... 711343333 |=============================== libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput Megapixels/sec > Higher Is Better GCC 11.1: -O3 -march=native ....... 273.10 |=================================== GCC 11.1: -O3 -march=native -flto . 272.60 |=================================== GCC 11.1: -O2 ..................... 261.03 |================================= ASTC Encoder 2.4 Preset: Medium Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 5.1820 |=================================== GCC 11.1: -O3 -march=native -flto . 5.1705 |================================== GCC 11.1: -O2 ..................... 5.2481 |=================================== ASTC Encoder 2.4 Preset: Thorough Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 11.38 |================================== GCC 11.1: -O3 -march=native -flto . 11.40 |================================== GCC 11.1: -O2 ..................... 12.09 |==================================== ASTC Encoder 2.4 Preset: Exhaustive Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 85.42 |================================== GCC 11.1: -O3 -march=native -flto . 85.42 |================================== GCC 11.1: -O2 ..................... 91.38 |==================================== SQLite Speedtest 3.30 Timed Time - Size 1,000 Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 44.09 |==================================== GCC 11.1: -O3 -march=native -flto . 43.78 |==================================== GCC 11.1: -O2 ..................... 43.62 |==================================== Redis 6.0.9 Test: GET Requests Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 4036791.92 |=============================== GCC 11.1: -O3 -march=native -flto . 4060369.08 |=============================== GCC 11.1: -O2 ..................... 4051463.17 |=============================== Redis 6.0.9 Test: SET Requests Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 2980192.00 |=============================== GCC 11.1: -O3 -march=native -flto . 2990164.92 |=============================== GCC 11.1: -O2 ..................... 2936296.08 |============================== NCNN 20201218 Target: CPU - Model: mobilenet ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 11.83 |============================ GCC 11.1: -O3 -march=native -flto . 13.34 |================================ GCC 11.1: -O2 ..................... 15.15 |==================================== NCNN 20201218 Target: CPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3.24 |============================= GCC 11.1: -O3 -march=native -flto . 3.25 |============================= GCC 11.1: -O2 ..................... 4.20 |===================================== NCNN 20201218 Target: CPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 2.55 |============================== GCC 11.1: -O3 -march=native -flto . 2.52 |============================= GCC 11.1: -O2 ..................... 3.18 |===================================== NCNN 20201218 Target: CPU - Model: shufflenet-v2 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 3.24 |===================== GCC 11.1: -O3 -march=native -flto . 5.60 |===================================== GCC 11.1: -O2 ..................... 3.48 |======================= NCNN 20201218 Target: CPU - Model: mnasnet ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 2.30 |=========================== GCC 11.1: -O3 -march=native -flto . 2.27 |=========================== GCC 11.1: -O2 ..................... 3.11 |===================================== NCNN 20201218 Target: CPU - Model: efficientnet-b0 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 4.38 |=============================== GCC 11.1: -O3 -march=native -flto . 4.32 |=============================== GCC 11.1: -O2 ..................... 5.23 |===================================== NCNN 20201218 Target: CPU - Model: blazeface ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 1.19 |========================== GCC 11.1: -O3 -march=native -flto . 1.69 |===================================== GCC 11.1: -O2 ..................... 1.19 |========================== NCNN 20201218 Target: CPU - Model: googlenet ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 10.20 |================================= GCC 11.1: -O3 -march=native -flto . 10.27 |================================= GCC 11.1: -O2 ..................... 11.11 |==================================== NCNN 20201218 Target: CPU - Model: vgg16 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 54.50 |==================================== GCC 11.1: -O3 -march=native -flto . 54.13 |==================================== GCC 11.1: -O2 ..................... 54.80 |==================================== NCNN 20201218 Target: CPU - Model: resnet18 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 11.08 |=================================== GCC 11.1: -O3 -march=native -flto . 11.39 |==================================== GCC 11.1: -O2 ..................... 11.30 |==================================== NCNN 20201218 Target: CPU - Model: alexnet ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 9.63 |===================================== GCC 11.1: -O3 -march=native -flto . 9.70 |===================================== GCC 11.1: -O2 ..................... 9.63 |===================================== NCNN 20201218 Target: CPU - Model: resnet50 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 18.23 |============================== GCC 11.1: -O3 -march=native -flto . 18.43 |============================== GCC 11.1: -O2 ..................... 22.07 |==================================== NCNN 20201218 Target: CPU - Model: yolov4-tiny ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 20.23 |=============================== GCC 11.1: -O3 -march=native -flto . 23.45 |==================================== GCC 11.1: -O2 ..................... 21.07 |================================ NCNN 20201218 Target: CPU - Model: squeezenet_ssd ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 15.53 |=================================== GCC 11.1: -O3 -march=native -flto . 15.92 |=================================== GCC 11.1: -O2 ..................... 16.15 |==================================== NCNN 20201218 Target: CPU - Model: regnety_400m ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 8.62 |================================= GCC 11.1: -O3 -march=native -flto . 8.91 |================================== GCC 11.1: -O2 ..................... 9.61 |===================================== TNN 0.2.3 Target: CPU - Model: MobileNet v2 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 230.02 |================================ GCC 11.1: -O3 -march=native -flto . 247.89 |=================================== GCC 11.1: -O2 ..................... 243.42 |================================== TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 ms < Lower Is Better GCC 11.1: -O3 -march=native ....... 227.66 |================================= GCC 11.1: -O3 -march=native -flto . 242.55 |=================================== GCC 11.1: -O2 ..................... 236.05 |================================== Sysbench 1.0.20 Test: CPU Events Per Second > Higher Is Better GCC 11.1: -O3 -march=native ....... 34776.08 |================================= GCC 11.1: -O3 -march=native -flto . 34751.01 |================================= GCC 11.1: -O2 ..................... 34799.70 |================================= WavPack Audio Encoding 5.3 WAV To WavPack Seconds < Lower Is Better GCC 11.1: -O3 -march=native ....... 11.08 |==================================== GCC 11.1: -O3 -march=native -flto . 11.10 |==================================== GCC 11.1: -O2 ..................... 11.08 |====================================