Compiler Optimization Levels

Intel Core i9-11900K testing with an ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

-O3 -march=native:

  Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 2000GB Corsair Force MP600 + 257GB Flash Drive, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

  OS: Fedora 34, Kernel: 5.12.9-300.fc34.x86_64 (x86_64), Desktop: GNOME Shell 40.1, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.1.1 (LLVM 12.0.0), Compiler: GCC 11.1.1 20210531, File-System: btrfs, Screen Resolution: 3840x2160

-O1:

  Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Tiger Lake-H, Memory: 32GB, Disk: 2000GB Corsair Force MP600 + 257GB Flash Drive, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Tiger Lake-H HD Audio, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

  OS: Fedora 34, Kernel: 5.12.9-300.fc34.x86_64 (x86_64), Desktop: GNOME Shell 40.1, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.1.1 (LLVM 12.0.0), Compiler: GCC 11.1.1 20210531, File-System: btrfs, Screen Resolution: 3840x2160

PostMark 1.51
Disk Transaction Performance
TPS > Higher Is Better
-O3 -march=native . 9496
-O1 ............... 9259

Crypto++ 8.2
Test: All Algorithms
MiB/second > Higher Is Better
-O3 -march=native . 2346.36
-O1 ............... 2114.62

Crypto++ 8.2
Test: Keyed Algorithms
MiB/second > Higher Is Better
-O3 -march=native . 924.21
-O1 ............... 751.48

Crypto++ 8.2
Test: Unkeyed Algorithms
MiB/second > Higher Is Better
-O3 -march=native . 491.45
-O1 ............... 472.95

Crypto++ 8.2
Test: Integer + Elliptic Curve Public Key Algorithms
MiB/second > Higher Is Better
-O3 -march=native . 7194.86
-O1 ............... 6862.79

CLOMP 1.2
Static OMP Speedup
Speedup > Higher Is Better
-O3 -march=native . 4.8
-O1 ............... 5.1

Timed MrBayes Analysis 3.2.7
Primate Phylogeny Analysis
Seconds < Lower Is Better
-O3 -march=native . 83.43
-O1 ............... 88.53

Timed HMMer Search 3.3.2
Pfam Database Search
Seconds < Lower Is Better
-O3 -march=native . 99.48
-O1 ............... 103.74

Quantum ESPRESSO 6.7
Input: AUSURF112
Seconds < Lower Is Better
-O3 -march=native . 2609.02
-O1 ............... 2525.86

LAMMPS Molecular Dynamics Simulator 29Oct2020
Model: 20k Atoms
ns/day > Higher Is Better
-O3 -march=native . 8.737
-O1 ............... 8.345

LAMMPS Molecular Dynamics Simulator 29Oct2020
Model: Rhodopsin Protein
ns/day > Higher Is Better
-O3 -march=native . 8.513
-O1 ............... 8.184

GNU GMP GMPbench 6.2.1
Total Time
GMPbench Score > Higher Is Better
-O3 -march=native . 6171.8

Chia Blockchain VDF 1.0.1
Test: Square Plain C++
IPS > Higher Is Better
-O3 -march=native . 208400
-O1 ............... 209233

Chia Blockchain VDF 1.0.1
Test: Square Assembly Optimized
IPS > Higher Is Better
-O3 -march=native . 250633
-O1 ............... 247933

Zstd Compression 1.5.0
Compression Level: 3 - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 2731.5
-O1 ............... 2568.0

Zstd Compression 1.5.0
Compression Level: 3 - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 4997.8
-O1 ............... 4847.5

Zstd Compression 1.5.0
Compression Level: 8 - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 192.6
-O1 ............... 189.2

Zstd Compression 1.5.0
Compression Level: 8 - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 5189.9
-O1 ............... 5075.8

Zstd Compression 1.5.0
Compression Level: 19 - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 35.4
-O1 ............... 35.4

Zstd Compression 1.5.0
Compression Level: 19 - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 4506.5
-O1 ............... 4406.4

Zstd Compression 1.5.0
Compression Level: 3, Long Mode - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 1451.0
-O1 ............... 1542.8

Zstd Compression 1.5.0
Compression Level: 3, Long Mode - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 5346.0
-O1 ............... 5215.3

Zstd Compression 1.5.0
Compression Level: 8, Long Mode - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 285.9
-O1 ............... 281.5

Zstd Compression 1.5.0
Compression Level: 8, Long Mode - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 5542.9
-O1 ............... 5385.7

Zstd Compression 1.5.0
Compression Level: 19, Long Mode - Compression Speed
MB/s > Higher Is Better
-O3 -march=native . 32.8
-O1 ............... 32.9

Zstd Compression 1.5.0
Compression Level: 19, Long Mode - Decompression Speed
MB/s > Higher Is Better
-O3 -march=native . 4540.6
-O1 ............... 4506.0

Botan 2.17.3
Test: KASUMI
MiB/s > Higher Is Better
-O3 -march=native . 115.82
-O1 ............... 108.28

Botan 2.17.3
Test: KASUMI - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 112.03
-O1 ............... 106.48

Botan 2.17.3
Test: AES-256
MiB/s > Higher Is Better
-O3 -march=native . 8401.85
-O1 ............... 8879.33

Botan 2.17.3
Test: AES-256 - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 8412.96
-O1 ............... 8885.13

Botan 2.17.3
Test: Twofish
MiB/s > Higher Is Better
-O3 -march=native . 464.47
-O1 ............... 430.95

Botan 2.17.3
Test: Twofish - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 451.66
-O1 ............... 427.26

Botan 2.17.3
Test: Blowfish
MiB/s > Higher Is Better
-O3 -march=native . 552.46
-O1 ............... 533.96

Botan 2.17.3
Test: Blowfish - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 553.52
-O1 ............... 532.56

Botan 2.17.3
Test: CAST-256
MiB/s > Higher Is Better
-O3 -march=native . 168.76
-O1 ............... 149.44

Botan 2.17.3
Test: CAST-256 - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 168.85
-O1 ............... 149.81

Botan 2.17.3
Test: ChaCha20Poly1305
MiB/s > Higher Is Better
-O3 -march=native . 1012.73
-O1 ............... 1019.91

Botan 2.17.3
Test: ChaCha20Poly1305 - Decrypt
MiB/s > Higher Is Better
-O3 -march=native . 1010.79
-O1 ............... 1004.65

GraphicsMagick 1.3.33
Operation: Swirl
Iterations Per Minute > Higher Is Better
-O3 -march=native . 689
-O1 ............... 592

GraphicsMagick 1.3.33
Operation: Rotate
Iterations Per Minute > Higher Is Better
-O3 -march=native . 1094
-O1 ............... 1078

GraphicsMagick 1.3.33
Operation: Sharpen
Iterations Per Minute > Higher Is Better
-O3 -march=native . 195
-O1 ............... 162

GraphicsMagick 1.3.33
Operation: Enhanced
Iterations Per Minute > Higher Is Better
-O3 -march=native . 270
-O1 ............... 218

GraphicsMagick 1.3.33
Operation: Resizing
Iterations Per Minute > Higher Is Better
-O3 -march=native . 1222
-O1 ............... 1021

GraphicsMagick 1.3.33
Operation: Noise-Gaussian
Iterations Per Minute > Higher Is Better
-O3 -march=native . 310
-O1 ............... 306

GraphicsMagick 1.3.33
Operation: HWB Color Space
Iterations Per Minute > Higher Is Better
-O3 -march=native . 1285
-O1 ............... 1207

dav1d 0.9.0
Video Input: Summer Nature 4K
FPS > Higher Is Better
-O3 -march=native . 195.94
-O1 ............... 185.95

SVT-HEVC 1.5.0
Tuning: 1 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 9.48
-O1 ............... 9.20

SVT-HEVC 1.5.0
Tuning: 7 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 140.40
-O1 ............... 137.23

SVT-HEVC 1.5.0
Tuning: 10 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 279.12
-O1 ............... 271.99

SVT-VP9 0.3
Tuning: VMAF Optimized - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 198.73
-O1 ............... 191.41

SVT-VP9 0.3
Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 204.96
-O1 ............... 198.18

SVT-VP9 0.3
Tuning: Visual Quality Optimized - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 166.43
-O1 ............... 160.73

x265 3.4
Video Input: Bosphorus 4K
Frames Per Second > Higher Is Better
-O3 -march=native . 16.02
-O1 ............... 15.72

x265 3.4
Video Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
-O3 -march=native . 67.85
-O1 ............... 67.85

ACES DGEMM 1.0
Sustained Floating-Point Rate
GFLOP/s > Higher Is Better
-O3 -march=native . 3.604641
-O1 ............... 3.922224

Coremark 1.0
CoreMark Size 666 - Iterations Per Second
Iterations/Sec > Higher Is Better
-O3 -march=native . 434724.85
-O1 ............... 366951.48

Stockfish 13
Total Time
Nodes Per Second > Higher Is Better
-O3 -march=native . 29443112
-O1 ............... 29448017

PJSIP 2.11
Method: INVITE
Responses Per Second > Higher Is Better
-O3 -march=native . 5060
-O1 ............... 4993

PJSIP 2.11
Method: OPTIONS, Stateful
Responses Per Second > Higher Is Better
-O3 -march=native . 9375
-O1 ............... 9333

PJSIP 2.11
Method: OPTIONS, Stateless
Responses Per Second > Higher Is Better
-O3 -march=native . 254610
-O1 ............... 247106

C-Ray 1.1
Total Time - 4K, 16 Rays Per Pixel
Seconds < Lower Is Better
-O3 -march=native . 47.34
-O1 ............... 128.91

Smallpt 1.0
Global Illumination Renderer; 128 Samples
Seconds < Lower Is Better
-O3 -march=native . 8.401
-O1 ............... 9.133

oneDNN 2.1.2
Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 4.03781
-O1 ............... 4.04828

oneDNN 2.1.2
Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 11.20
-O1 ............... 11.03

oneDNN 2.1.2
Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 14.28
-O1 ............... 14.17

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 4.98281
-O1 ............... 4.97288

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 4.28984
-O1 ............... 4.28224

oneDNN 2.1.2
Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 3165.60
-O1 ............... 3133.28

oneDNN 2.1.2
Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 1876.42
-O1 ............... 1854.40

oneDNN 2.1.2
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms < Lower Is Better
-O3 -march=native . 3.52485
-O1 ............... 3.52499

AOBench
Size: 2048 x 2048 - Total Time
Seconds < Lower Is Better
-O3 -march=native . 21.56
-O1 ............... 24.61

FLAC Audio Encoding 1.3.2
WAV To FLAC
Seconds < Lower Is Better
-O3 -march=native . 5.937
-O1 ............... 6.590

LAME MP3 Encoding 3.100
WAV To MP3
Seconds < Lower Is Better
-O3 -march=native . 5.473
-O1 ............... 7.675

Opus Codec Encoding 1.3.1
WAV To Opus Encode
Seconds < Lower Is Better
-O3 -march=native . 5.595
-O1 ............... 6.828

eSpeak-NG Speech Engine 20200907
Text-To-Speech Synthesis
Seconds < Lower Is Better
-O3 -march=native . 21.77
-O1 ............... 24.00

Liquid-DSP 2021.01.31
Threads: 1 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
-O3 -march=native . 99844333
-O1 ............... 88411000

Liquid-DSP 2021.01.31
Threads: 2 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
-O3 -march=native . 188003333
-O1 ............... 162046667

Liquid-DSP 2021.01.31
Threads: 4 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
-O3 -march=native . 363760000
-O1 ............... 316710000

Liquid-DSP 2021.01.31
Threads: 8 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
-O3 -march=native . 687846667
-O1 ............... 595816667

Liquid-DSP 2021.01.31
Threads: 16 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
-O3 -march=native . 722756667
-O1 ............... 672296667

libjpeg-turbo tjbench 2.1.0
Test: Decompression Throughput
Megapixels/sec > Higher Is Better
-O3 -march=native . 271.68
-O1 ............... 260.26

ASTC Encoder 3.0
Preset: Medium
Seconds < Lower Is Better
-O3 -march=native . 4.2153
-O1 ............... 4.3606

ASTC Encoder 3.0
Preset: Thorough
Seconds < Lower Is Better
-O3 -march=native . 9.3601
-O1 ............... 9.7734

ASTC Encoder 3.0
Preset: Exhaustive
Seconds < Lower Is Better
-O3 -march=native . 51.49
-O1 ............... 53.25

Basis Universal 1.13
Settings: ETC1S
Seconds < Lower Is Better
-O3 -march=native . 20.81
-O1 ............... 20.85

Basis Universal 1.13
Settings: UASTC Level 0
Seconds < Lower Is Better
-O3 -march=native . 6.106
-O1 ............... 6.114

Basis Universal 1.13
Settings: UASTC Level 2
Seconds < Lower Is Better
-O3 -march=native . 29.14
-O1 ............... 29.11

Basis Universal 1.13
Settings: UASTC Level 3
Seconds < Lower Is Better
-O3 -march=native . 54.59
-O1 ............... 54.56

SQLite Speedtest 3.30
Timed Time - Size 1,000
Seconds < Lower Is Better
-O3 -march=native . 46.09
-O1 ............... 49.01

Redis 6.0.9
Test: GET
Requests Per Second > Higher Is Better
-O3 -march=native . 4049394.67
-O1 ............... 3982525.83

Redis 6.0.9
Test: SET
Requests Per Second > Higher Is Better
-O3 -march=native . 2956462.00
-O1 ............... 2962660.83

Caffe 2020-02-13
Model: AlexNet - Acceleration: CPU - Iterations: 100
Milli-Seconds < Lower Is Better
-O3 -march=native . 36558
-O1 ............... 36622

Caffe 2020-02-13
Model: GoogleNet - Acceleration: CPU - Iterations: 100
Milli-Seconds < Lower Is Better
-O3 -march=native . 83625
-O1 ............... 84729

Mobile Neural Network 1.1.3
Model: SqueezeNetV1.0
ms < Lower Is Better
-O3 -march=native . 3.748
-O1 ............... 3.848

Mobile Neural Network 1.1.3
Model: resnet-v2-50
ms < Lower Is Better
-O3 -march=native . 19.22
-O1 ............... 19.51

Mobile Neural Network 1.1.3
Model: MobileNetV2_224
ms < Lower Is Better
-O3 -march=native . 1.916
-O1 ............... 1.982

Mobile Neural Network 1.1.3
Model: mobilenet-v1-1.0
ms < Lower Is Better
-O3 -march=native . 1.883
-O1 ............... 1.921

Mobile Neural Network 1.1.3
Model: inception-v3
ms < Lower Is Better
-O3 -march=native . 22.51
-O1 ............... 22.94

NCNN 20201218
Target: CPU - Model: mobilenet
ms < Lower Is Better
-O3 -march=native . 11.76
-O1 ............... 15.02

NCNN 20201218
Target: CPU-v2-v2 - Model: mobilenet-v2
ms < Lower Is Better
-O3 -march=native . 3.21
-O1 ............... 4.19

NCNN 20201218
Target: CPU-v3-v3 - Model: mobilenet-v3
ms < Lower Is Better
-O3 -march=native . 2.49
-O1 ............... 3.19

NCNN 20201218
Target: CPU - Model: shufflenet-v2
ms < Lower Is Better
-O3 -march=native . 3.26
-O1 ............... 3.45

NCNN 20201218
Target: CPU - Model: mnasnet
ms < Lower Is Better
-O3 -march=native . 2.22
-O1 ............... 3.17

NCNN 20201218
Target: CPU - Model: efficientnet-b0
ms < Lower Is Better
-O3 -march=native . 4.24
-O1 ............... 5.24

NCNN 20201218
Target: CPU - Model: blazeface
ms < Lower Is Better
-O3 -march=native . 1.15
-O1 ............... 1.24

NCNN 20201218
Target: CPU - Model: googlenet
ms < Lower Is Better
-O3 -march=native . 10.09
-O1 ............... 11.40

NCNN 20201218
Target: CPU - Model: vgg16
ms < Lower Is Better
-O3 -march=native . 54.36
-O1 ............... 54.91

NCNN 20201218
Target: CPU - Model: resnet18
ms < Lower Is Better
-O3 -march=native . 11.08
-O1 ............... 11.47

NCNN 20201218
Target: CPU - Model: alexnet
ms < Lower Is Better
-O3 -march=native . 9.64
-O1 ............... 9.62

NCNN 20201218
Target: CPU - Model: resnet50
ms < Lower Is Better
-O3 -march=native . 18.23
-O1 ............... 22.29

NCNN 20201218
Target: CPU - Model: yolov4-tiny
ms < Lower Is Better
-O3 -march=native . 20.21
-O1 ............... 21.26

NCNN 20201218
Target: CPU - Model: squeezenet_ssd
ms < Lower Is Better
-O3 -march=native . 15.29
-O1 ............... 16.18

NCNN 20201218
Target: CPU - Model: regnety_400m
ms < Lower Is Better
-O3 -march=native . 8.57
-O1 ............... 9.73

TNN 0.2.3
Target: CPU - Model: MobileNet v2
ms < Lower Is Better
-O3 -march=native . 230.11
-O1 ............... 243.16

TNN 0.2.3
Target: CPU - Model: SqueezeNet v1.1
ms < Lower Is Better
-O3 -march=native . 227.46
-O1 ............... 235.96

Sysbench 1.0.20
Test: CPU
Events Per Second > Higher Is Better
-O3 -march=native . 34770.14
-O1 ............... 34882.14

WavPack Audio Encoding 5.3
WAV To WavPack
Seconds < Lower Is Better
-O3 -march=native . 11.10
-O1 ............... 11.13

Kripke 1.2.4
Throughput FoM > Higher Is Better
-O3 -march=native . 33544357
-O1 ............... 33790753
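
Because the listing mixes "Higher Is Better" and "Lower Is Better" metrics, the raw numbers are not directly comparable across tests. One way to read them, sketched below, is to normalize each pair into a ratio where a value above 1.0 means the -O3 -march=native build was faster than -O1. The helper names (`speedup`, `geometric_mean`) and the choice of four sample rows are illustrative assumptions, not part of the Phoronix Test Suite output, and the geometric mean is used here only as a common way to summarize ratios; the source does not report one.

```python
import math

# (o3_native, o1, higher_is_better) for a few results copied from the listing above.
# The selection of rows is arbitrary and only meant as an illustration.
SAMPLE = {
    "PostMark (TPS)": (9496, 9259, True),
    "CLOMP Static OMP Speedup": (4.8, 5.1, True),
    "C-Ray (seconds)": (47.34, 128.91, False),
    "LAME MP3 Encoding (seconds)": (5.473, 7.675, False),
}


def speedup(o3_native, o1, higher_is_better):
    """Ratio > 1.0 means the -O3 -march=native build beat the -O1 build."""
    if higher_is_better:
        return o3_native / o1
    # For time-based metrics a smaller value is better, so invert the ratio.
    return o1 / o3_native


def geometric_mean(values):
    """Geometric mean of positive ratios, computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    ratios = {name: speedup(*row) for name, row in SAMPLE.items()}
    for name, ratio in ratios.items():
        print(f"{name}: {ratio:.3f}x")
    print(f"geometric mean of sample: {geometric_mean(ratios.values()):.3f}x")
```

A ratio below 1.0, as for CLOMP in this sample, marks a test where the -O1 build came out ahead.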