Intel Core i9-10980XE GCC compiler benchmarking by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2107032-IB-10980XECO53 Intel 10980XE GCC Compiler Benchmarks - Phoronix Test Suite Intel 10980XE GCC Compiler Benchmarks Intel Core i9-10980XE GCC compiler benchmarking by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2107032-IB-10980XECO53&export=pdf .
Intel 10980XE GCC Compiler Benchmarks Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 Intel Core i9-10980XE @ 4.80GHz (18 Cores / 36 Threads) ASRock X299 Steel Legend (P1.30 BIOS) Intel Sky Lake-E DMI3 Registers 32GB Samsung SSD 970 PRO 512GB NVIDIA NV132 11GB Realtek ALC1220 ASUS VP28U Intel I219-V + Intel I211 Ubuntu 21.04 5.11.0-22-generic (x86_64) GNOME Shell 3.38.4 X Server + Wayland nouveau 4.3 Mesa 21.0.1 1.0.2 GCC 8.5.0 ext4 2560x1600 GCC 9.4.0 GCC 10.3.0 GCC 11.1.0 GCC 12.0.0 20210701 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" Compiler Details - --disable-multilib --enable-checking=release --enable-languages=c,c++ Processor Details - Scaling Governor: intel_cpufreq schedutil - CPU Microcode: 0x5003102 Python Details - Python 3.9.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Intel 10980XE GCC Compiler Benchmarks blosc: blosclz quantlib: cryptopp: Keyed Algorithms cryptopp: Unkeyed Algorithms cryptopp: Integer + Elliptic Curve Public Key Algorithms etcpak: DXT1 etcpak: ETC2 etcpak: ETC1 + Dithering mrbayes: Primate Phylogeny Analysis hmmer: Pfam Database Search webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression libgav1: Summer Nature 4K libgav1: Chimera 1080p 10-bit compress-zstd: 8 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed botan: KASUMI botan: KASUMI - Decrypt botan: AES-256 botan: AES-256 - Decrypt botan: Twofish botan: Twofish - Decrypt botan: Blowfish botan: Blowfish - Decrypt botan: CAST-256 botan: CAST-256 - Decrypt botan: ChaCha20Poly1305 botan: ChaCha20Poly1305 - Decrypt graphics-magick: Swirl graphics-magick: Rotate graphics-magick: Sharpen graphics-magick: Enhanced graphics-magick: Resizing graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space dav1d: Summer Nature 4K dav1d: Chimera 1080p 10-bit aom-av1: Speed 6 Realtime - Bosphorus 4K aom-av1: Speed 6 Two-Pass - Bosphorus 4K aom-av1: Speed 8 Realtime - Bosphorus 4K aom-av1: Speed 9 Realtime - Bosphorus 4K kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast svt-av1: Preset 4 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-hevc: 1 - Bosphorus 1080p svt-hevc: 7 - Bosphorus 1080p svt-hevc: 10 - Bosphorus 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p svt-vp9: Visual Quality Optimized - Bosphorus 1080p vpxenc: Speed 0 - Bosphorus 4K vpxenc: Speed 5 - Bosphorus 4K x265: Bosphorus 4K coremark: CoreMark Size 666 - Iterations Per Second himeno: Poisson Pressure Solver compress-7zip: Compress Speed Test stockfish: Total Time pjsip: INVITE pjsip: OPTIONS, Stateful pjsip: OPTIONS, Stateless c-ray: Total Time - 4K, 16 Rays Per Pixel smallpt: Global Illumination Renderer; 128 Samples onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: IP Shapes 1D - bf16bf16bf16 - CPU onednn: IP Shapes 3D - bf16bf16bf16 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU encode-flac: WAV To FLAC encode-mp3: WAV To MP3 encode-opus: WAV To Opus Encode espeak: Text-To-Speech Synthesis gcrypt: ngspice: C2670 ngspice: C7552 tachyon: Total Time vosk: securemark: SecureMark-TLS liquid-dsp: 32 - 256 - 57 liquid-dsp: 36 - 256 - 57 financebench: Repo OpenMP financebench: Bonds OpenMP viennacl: CPU BLAS - sCOPY viennacl: CPU BLAS - sAXPY viennacl: CPU BLAS - sDOT viennacl: CPU BLAS - dCOPY viennacl: CPU BLAS - dAXPY viennacl: CPU BLAS - dDOT viennacl: CPU BLAS - dGEMV-N viennacl: CPU BLAS - dGEMV-T viennacl: CPU BLAS - dGEMM-NN viennacl: CPU BLAS - dGEMM-NT viennacl: CPU BLAS - dGEMM-TN viennacl: CPU BLAS - dGEMM-TT tjbench: Decompression Throughput sqlite-speedtest: Timed Time - Size 1,000 mnn: mobilenetV3 mnn: squeezenetv1.1 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m tnn: CPU - DenseNet tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v2 tnn: CPU - SqueezeNet v1.1 encode-wavpack: WAV To WavPack gnupg: 2.7GB Sample File Encryption GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 11800.7 2586.3 714.487669 377.780902 5519.144212 1445.235 198.109 329.128 146.096 126.204 17.202 6.642 37.695 425.7 3361.7 60.3 2676.8 335.5 3547.4 43.3 2819.2 99.716 97.644 3998.627 3991.201 416.184 420.366 491.138 482.780 152.235 152.429 984.019 977.165 792 809 197 424 1444 390 903 197.46 215.16 7.51 4.46 19.40 27.18 21.35 40.13 1.340 11.925 12.88 190.03 374.62 306.08 310.79 247.45 4.75 8.22 21.53 618050.830009 4522.731170 99711 51140013 3307 5801 135302 29.818 5.286 0.528416 1.23239 5.54062 2.94903 9.34766 0.461459 0.679482 1573.03 7.89032 9.82858 11.0180 937.607 1573.18 939.923 0.441500 1.76308 8.436 8.716 8.283 26.836 194.121 134.346 126.200 48.1611 20.721 259869 924630000 916790000 42354.898437 74820.187500 45.9 68.8 77.0 38.1 57.1 63.6 69.5 79.9 55.5 54.7 56.0 55.2 218.813581 57.606 2.403 4.564 27.297 5.594 3.704 2.433 30.154 14.16 5.36 4.72 5.05 4.82 6.69 2.56 12.73 36.20 10.54 9.05 18.33 20.84 15.63 13.77 3505.931 321.387 70.124 289.963 13.355 64.239 11802.6 2568.8 719.762484 375.231492 5538.547039 1450.560 199.097 336.106 157.332 126.506 16.823 6.560 36.869 429.4 3436.8 60.2 2802.7 375.6 3628.4 43.5 3017.6 100.411 98.557 3987.080 3993.890 414.445 411.573 486.315 475.408 150.585 150.879 951.152 945.449 752 776 265 424 1617 387 864 199.89 219.69 7.43 4.44 19.34 27.24 21.20 40.16 1.355 12.021 12.91 190.62 376.89 295.07 302.25 243.48 4.87 8.61 21.64 650499.474888 4609.398563 98298 50622552 3252 5729 135903 30.035 6.045 0.526798 1.27991 5.59533 2.86008 9.55455 0.459456 0.828866 1636.96 8.10038 9.79185 10.9596 960.128 1639.78 961.383 0.467594 1.79150 8.500 8.525 8.241 28.107 193.973 134.222 127.688 47.8849 20.993 263457 930236667 921670000 42795.053385 76452.778646 47.1 71.2 78.0 38.1 57.4 63.7 71.8 79.7 56.1 54.1 56.2 54.9 220.617787 57.034 2.297 4.420 31.452 5.540 3.594 2.455 32.412 13.94 5.10 4.57 4.97 4.73 6.57 2.54 12.44 36.01 10.58 9.15 17.59 21.35 15.08 13.87 3527.682 347.436 73.938 296.142 13.375 64.228 11713.4 2529.8 717.165559 372.583041 5503.078606 1484.230 197.629 329.486 154.629 126.687 17.251 6.866 37.869 424.0 3285.5 60.1 2701.9 370.1 3479.0 43.9 2876.9 98.200 96.879 3999.248 3998.479 404.157 411.522 486.288 474.804 151.184 151.248 788.573 780.123 761 765 317 427 1585 403 864 195.15 222.27 7.38 4.48 19.28 27.86 21.09 40.64 1.359 12.062 13.04 189.88 372.69 299.97 306.89 246.75 4.84 8.59 21.84 630485.588510 4538.661961 97426 49942276 3281 5744 138222 30.430 6.130 0.526014 1.22703 5.53334 2.93978 9.35095 0.460482 0.680522 1565.41 7.90163 9.39062 10.9199 935.962 1566.75 938.220 0.426902 1.75520 8.369 8.730 8.283 32.672 193.411 133.912 126.877 47.9045 20.753 263472 939716667 940660000 35458.662761 49799.115885 46.4 70.3 77.4 38.3 57.3 54.7 71.6 79.6 58.7 56.4 58.9 57.1 219.542799 57.303 2.341 4.271 28.255 5.612 3.669 2.467 30.804 13.81 5.12 4.64 5.02 4.80 6.62 2.61 13.06 36.35 11.16 9.18 17.87 21.26 15.26 13.63 3508.290 311.604 69.297 286.215 13.340 64.424 11926.8 2749.1 692.986399 360.414192 5593.130439 1468.968 194.759 327.329 142.934 126.616 16.803 6.780 36.471 419.5 3351.9 61.0 2642.2 469.4 3553.2 44.3 2775.7 100.697 97.001 3985.266 3995.232 367.677 373.562 442.670 439.555 141.031 140.850 779.376 774.645 924 852 319 432 1571 403 916 192.95 223.06 7.51 4.33 19.64 28.07 21.01 40.31 1.347 12.090 12.96 191.02 375.65 297.32 305.72 244.05 4.87 8.63 21.16 597455.160812 4592.947401 98149 52206963 3240 5769 136444 29.960 6.201 0.528133 1.23012 5.54544 2.93428 9.35550 0.459956 0.697080 1563.05 7.90477 9.41346 10.9199 938.043 1566.25 936.880 0.430676 1.76264 8.411 8.732 8.456 35.086 208.343 134.711 129.396 47.8659 20.894 259565 951170000 954536667 34558.223958 48802.755208 46.7 70.7 77.6 38.3 57.4 63.7 71.7 79.3 51.0 49.8 51.9 50.5 218.618162 57.411 2.412 4.609 28.445 5.723 3.754 2.477 31.275 13.79 5.06 4.66 5.05 4.72 6.56 2.55 12.84 36.56 10.97 9.08 17.77 21.37 15.23 13.93 3508.390 314.561 69.802 288.910 13.353 64.204 11889.4 2773.6 714.205269 374.640795 5532.467940 1419.365 197.128 325.278 145.174 129.454 16.892 6.548 37.161 28.24 21.32 432.7 3332.7 60.8 2773.9 385.5 3531.4 43.9 2782.7 100.609 96.610 3972.020 3993.173 366.648 374.707 442.265 442.410 140.309 140.480 781.383 775.488 903 794 319 429 1607 403 884 194.06 221.94 7.49 4.40 19.67 28.10 21.15 40.20 1.348 11.966 12.94 191.33 375.32 293.71 303.54 244.56 4.92 8.65 21.08 601830.263292 4580.188073 98493 50734571 3304 5763 137966 29.973 5.991 0.527939 1.22496 5.53672 2.94670 9.34206 0.459660 0.678626 1567.41 7.91258 9.59452 10.9330 936.491 1564.71 935.378 0.425675 1.74471 8.379 8.599 8.186 27.281 196.222 135.428 128.320 49.3237 20.574 264514 944433333 937553333 34223.303386 48317.579427 46.4 70.1 77.1 38.2 57.2 63.7 71.4 79.5 51.2 49.8 51.9 51.5 217.425816 56.715 2.372 4.283 28.272 5.506 3.729 2.477 30.797 13.81 4.89 4.54 4.98 4.58 6.27 2.54 12.21 36.70 10.93 8.99 17.74 22.83 15.28 13.48 3524.746 318.056 69.231 287.490 13.331 64.613 OpenBenchmarking.org
C-Blosc Compressor: blosclz OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.0 Compressor: blosclz GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3K 6K 9K 12K 15K SE +/- 11.42, N = 3 SE +/- 18.15, N = 3 SE +/- 21.15, N = 3 SE +/- 69.56, N = 3 SE +/- 37.20, N = 3 11800.7 11802.6 11713.4 11926.8 11889.4 1. (CC) gcc options: -std=gnu99 -O3 -pthread -lrt -lm
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 600 1200 1800 2400 3000 SE +/- 19.67, N = 3 SE +/- 35.29, N = 3 SE +/- 18.68, N = 3 SE +/- 33.86, N = 4 SE +/- 0.85, N = 3 2586.3 2568.8 2529.8 2749.1 2773.6 1. (CXX) g++ options: -O3 -march=native -rdynamic
Crypto++ Test: Keyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Keyed Algorithms GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 160 320 480 640 800 SE +/- 0.10, N = 3 SE +/- 0.35, N = 3 SE +/- 0.48, N = 3 SE +/- 0.20, N = 3 SE +/- 0.24, N = 3 714.49 719.76 717.17 692.99 714.21 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 80 160 240 320 400 SE +/- 0.02, N = 3 SE +/- 0.65, N = 3 SE +/- 0.17, N = 3 SE +/- 0.04, N = 3 SE +/- 0.68, N = 3 377.78 375.23 372.58 360.41 374.64 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Crypto++ Test: Integer + Elliptic Curve Public Key Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Integer + Elliptic Curve Public Key Algorithms GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1200 2400 3600 4800 6000 SE +/- 1.33, N = 3 SE +/- 6.16, N = 3 SE +/- 1.85, N = 3 SE +/- 1.99, N = 3 SE +/- 5.19, N = 3 5519.14 5538.55 5503.08 5593.13 5532.47 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Etcpak Configuration: DXT1 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: DXT1 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 300 600 900 1200 1500 SE +/- 1.51, N = 3 SE +/- 0.45, N = 3 SE +/- 0.37, N = 3 SE +/- 0.47, N = 3 SE +/- 1.30, N = 3 1445.24 1450.56 1484.23 1468.97 1419.37 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC2 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 40 80 120 160 200 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 198.11 199.10 197.63 194.76 197.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Etcpak Configuration: ETC1 + Dithering OpenBenchmarking.org Mpx/s, More Is Better Etcpak 0.7 Configuration: ETC1 + Dithering GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 70 140 210 280 350 SE +/- 2.72, N = 3 SE +/- 0.09, N = 3 SE +/- 0.38, N = 3 SE +/- 0.20, N = 3 SE +/- 0.05, N = 3 329.13 336.11 329.49 327.33 325.28 1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 0.45, N = 3 SE +/- 1.65, N = 12 SE +/- 0.58, N = 3 SE +/- 0.32, N = 3 SE +/- 1.02, N = 3 146.10 157.33 154.63 142.93 145.17 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -march=native -lm
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.2 Pfam Database Search GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.10, N = 3 SE +/- 0.32, N = 3 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 126.20 126.51 126.69 126.62 129.45 1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -leasel -lm -lmpi
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 17.20 16.82 17.25 16.80 16.89 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.007, N = 3 SE +/- 0.007, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 SE +/- 0.043, N = 3 6.642 6.560 6.866 6.780 6.548 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 9 18 27 36 45 SE +/- 0.11, N = 3 SE +/- 0.13, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 37.70 36.87 37.87 36.47 37.16 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K GCC 12.0.0 20210701 7 14 21 28 35 SE +/- 0.01, N = 3 28.24 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 10-bit GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.01, N = 3 21.32 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Compression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 90 180 270 360 450 SE +/- 4.99, N = 3 SE +/- 4.23, N = 5 SE +/- 4.87, N = 3 SE +/- 5.64, N = 3 SE +/- 5.56, N = 3 425.7 429.4 424.0 419.5 432.7 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8 - Decompression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 700 1400 2100 2800 3500 SE +/- 2.28, N = 3 SE +/- 2.27, N = 5 SE +/- 3.06, N = 3 SE +/- 5.37, N = 3 SE +/- 2.91, N = 3 3361.7 3436.8 3285.5 3351.9 3332.7 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 14 28 42 56 70 SE +/- 0.19, N = 3 SE +/- 0.50, N = 3 SE +/- 0.56, N = 3 SE +/- 0.50, N = 9 SE +/- 0.47, N = 3 60.3 60.2 60.1 61.0 60.8 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 600 1200 1800 2400 3000 SE +/- 3.24, N = 3 SE +/- 14.28, N = 3 SE +/- 9.76, N = 3 SE +/- 7.69, N = 9 SE +/- 16.86, N = 3 2676.8 2802.7 2701.9 2642.2 2773.9 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 100 200 300 400 500 SE +/- 3.63, N = 3 SE +/- 2.56, N = 13 SE +/- 3.28, N = 15 SE +/- 3.45, N = 3 SE +/- 3.88, N = 15 335.5 375.6 370.1 469.4 385.5 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 800 1600 2400 3200 4000 SE +/- 5.55, N = 3 SE +/- 2.52, N = 13 SE +/- 3.49, N = 15 SE +/- 3.35, N = 3 SE +/- 3.04, N = 15 3547.4 3628.4 3479.0 3553.2 3531.4 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.19, N = 3 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 43.3 43.5 43.9 44.3 43.9 1. (CC) gcc options: -O3 -march=native -pthread -lz
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 600 1200 1800 2400 3000 SE +/- 10.70, N = 3 SE +/- 4.29, N = 3 SE +/- 2.75, N = 3 SE +/- 14.84, N = 3 SE +/- 2.25, N = 3 2819.2 3017.6 2876.9 2775.7 2782.7 1. (CC) gcc options: -O3 -march=native -pthread -lz
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 SE +/- 0.25, N = 3 99.72 100.41 98.20 100.70 100.61 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 97.64 98.56 96.88 97.00 96.61 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 900 1800 2700 3600 4500 SE +/- 0.87, N = 3 SE +/- 0.82, N = 3 SE +/- 4.08, N = 3 SE +/- 2.67, N = 3 SE +/- 7.30, N = 3 3998.63 3987.08 3999.25 3985.27 3972.02 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 900 1800 2700 3600 4500 SE +/- 1.28, N = 3 SE +/- 0.39, N = 3 SE +/- 0.87, N = 3 SE +/- 0.79, N = 3 SE +/- 4.17, N = 3 3991.20 3993.89 3998.48 3995.23 3993.17 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.12, N = 3 SE +/- 0.23, N = 3 SE +/- 1.13, N = 3 SE +/- 0.30, N = 3 416.18 414.45 404.16 367.68 366.65 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 90 180 270 360 450 SE +/- 0.07, N = 3 SE +/- 0.05, N = 3 SE +/- 0.26, N = 3 SE +/- 0.56, N = 3 SE +/- 0.17, N = 3 420.37 411.57 411.52 373.56 374.71 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 110 220 330 440 550 SE +/- 0.01, N = 3 SE +/- 0.29, N = 3 SE +/- 0.22, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 491.14 486.32 486.29 442.67 442.27 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 100 200 300 400 500 SE +/- 0.04, N = 3 SE +/- 0.21, N = 3 SE +/- 0.30, N = 3 SE +/- 0.13, N = 3 SE +/- 0.07, N = 3 482.78 475.41 474.80 439.56 442.41 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.30, N = 3 152.24 150.59 151.18 141.03 140.31 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 SE +/- 0.50, N = 3 SE +/- 0.28, N = 3 152.43 150.88 151.25 140.85 140.48 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 2.79, N = 3 SE +/- 1.36, N = 3 SE +/- 0.15, N = 3 SE +/- 1.05, N = 3 SE +/- 0.73, N = 3 984.02 951.15 788.57 779.38 781.38 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 2.38, N = 3 SE +/- 0.08, N = 3 SE +/- 0.79, N = 3 SE +/- 0.89, N = 3 SE +/- 0.47, N = 3 977.17 945.45 780.12 774.65 775.49 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 1.20, N = 3 792 752 761 924 903 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 7.07, N = 15 SE +/- 3.51, N = 3 SE +/- 5.24, N = 3 SE +/- 2.52, N = 3 SE +/- 7.25, N = 15 809 776 765 852 794 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 70 140 210 280 350 SE +/- 0.33, N = 3 197 265 317 319 319 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 90 180 270 360 450 424 424 427 432 429 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 300 600 900 1200 1500 SE +/- 2.91, N = 3 SE +/- 4.93, N = 3 SE +/- 8.67, N = 3 SE +/- 3.51, N = 3 SE +/- 7.21, N = 3 1444 1617 1585 1571 1607 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 90 180 270 360 450 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 SE +/- 0.33, N = 3 SE +/- 0.58, N = 3 390 387 403 403 403 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 1.15, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 903 864 864 916 884 1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 40 80 120 160 200 SE +/- 0.89, N = 3 SE +/- 1.65, N = 3 SE +/- 2.16, N = 3 SE +/- 1.13, N = 3 SE +/- 1.97, N = 6 197.46 199.89 195.15 192.95 194.06 -lm - MIN: 150.4 / MAX: 226.05 -lm - MIN: 143.79 / MAX: 228.44 MIN: 149.2 / MAX: 222.59 MIN: 132.83 / MAX: 217.9 MIN: 131.48 / MAX: 225.93 1. (CC) gcc options: -O3 -march=native -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 10-bit GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 50 100 150 200 250 SE +/- 0.39, N = 3 SE +/- 0.34, N = 3 SE +/- 0.50, N = 3 SE +/- 1.18, N = 3 SE +/- 0.48, N = 3 215.16 219.69 222.27 223.06 221.94 -lm - MIN: 151.62 / MAX: 411.26 -lm - MIN: 156.35 / MAX: 406.23 MIN: 157.09 / MAX: 436.51 MIN: 157.45 / MAX: 397.96 MIN: 157.38 / MAX: 404.98 1. (CC) gcc options: -O3 -march=native -pthread
AOM AV1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.08, N = 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.08, N = 12 7.51 7.43 7.38 7.51 7.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.008 2.016 3.024 4.032 5.04 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 10 4.46 4.44 4.48 4.33 4.40 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 19.40 19.34 19.28 19.64 19.67 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 7 14 21 28 35 SE +/- 0.04, N = 3 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 27.18 27.24 27.86 28.07 28.10 1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 21.35 21.20 21.09 21.01 21.15 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 9 18 27 36 45 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.15, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 40.13 40.16 40.64 40.31 40.20 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.3058 0.6116 0.9174 1.2232 1.529 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 1.340 1.355 1.359 1.347 1.348 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 11.93 12.02 12.06 12.09 11.97 1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 12.88 12.91 13.04 12.96 12.94 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.18, N = 3 SE +/- 0.44, N = 3 SE +/- 0.31, N = 3 SE +/- 0.32, N = 3 190.03 190.62 189.88 191.02 191.33 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 80 160 240 320 400 SE +/- 0.95, N = 3 SE +/- 0.96, N = 3 SE +/- 1.74, N = 3 SE +/- 1.99, N = 3 SE +/- 0.75, N = 3 374.62 376.89 372.69 375.65 375.32 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 70 140 210 280 350 SE +/- 2.56, N = 13 SE +/- 4.16, N = 3 SE +/- 1.76, N = 14 SE +/- 2.83, N = 6 SE +/- 3.01, N = 5 306.08 295.07 299.97 297.32 293.71 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 70 140 210 280 350 SE +/- 0.21, N = 3 SE +/- 1.42, N = 3 SE +/- 1.79, N = 3 SE +/- 2.35, N = 3 SE +/- 0.68, N = 3 310.79 302.25 306.89 305.72 303.54 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 50 100 150 200 250 SE +/- 1.34, N = 3 SE +/- 1.22, N = 3 SE +/- 3.33, N = 3 SE +/- 3.10, N = 3 SE +/- 1.58, N = 3 247.45 243.48 246.75 244.05 244.56 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.107 2.214 3.321 4.428 5.535 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 4.75 4.87 4.84 4.87 4.92 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.22 8.61 8.59 8.63 8.65 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 SE +/- 0.13, N = 3 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 21.53 21.64 21.84 21.16 21.08 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 140K 280K 420K 560K 700K SE +/- 3624.80, N = 3 SE +/- 1621.16, N = 3 SE +/- 2003.80, N = 3 SE +/- 1267.23, N = 3 SE +/- 2406.19, N = 3 618050.83 650499.47 630485.59 597455.16 601830.26 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1000 2000 3000 4000 5000 SE +/- 0.61, N = 3 SE +/- 5.70, N = 3 SE +/- 13.08, N = 3 SE +/- 0.64, N = 3 SE +/- 2.82, N = 3 4522.73 4609.40 4538.66 4592.95 4580.19 1. (CC) gcc options: -O3 -march=native -mavx2
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 20K 40K 60K 80K 100K SE +/- 69.57, N = 3 SE +/- 343.09, N = 3 SE +/- 46.23, N = 3 SE +/- 304.44, N = 3 SE +/- 228.39, N = 3 99711 98298 97426 98149 98493 1. (CXX) g++ options: -pipe -lpthread
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 11M 22M 33M 44M 55M SE +/- 562135.06, N = 15 SE +/- 508293.85, N = 15 SE +/- 623930.99, N = 3 SE +/- 211973.29, N = 3 SE +/- 432778.20, N = 8 51140013 50622552 49942276 52206963 50734571 1. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 700 1400 2100 2800 3500 SE +/- 25.16, N = 15 SE +/- 27.02, N = 3 SE +/- 6.36, N = 3 SE +/- 7.22, N = 3 SE +/- 25.40, N = 15 3307 3252 3281 3240 3304 1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1200 2400 3600 4800 6000 SE +/- 16.26, N = 3 SE +/- 24.67, N = 3 SE +/- 8.67, N = 3 SE +/- 53.69, N = 3 SE +/- 23.13, N = 3 5801 5729 5744 5769 5763 1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30K 60K 90K 120K 150K SE +/- 578.10, N = 3 SE +/- 1635.40, N = 4 SE +/- 1006.09, N = 3 SE +/- 946.90, N = 3 SE +/- 734.59, N = 3 135302 135903 138222 136444 137966 1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread -O3 -march=native
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 29.82 30.04 30.43 29.96 29.97 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.007, N = 3 SE +/- 0.011, N = 3 SE +/- 0.033, N = 3 SE +/- 0.004, N = 3 SE +/- 0.017, N = 3 5.286 6.045 6.130 6.201 5.991 1. (CXX) g++ options: -fopenmp -O3 -march=native
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.1189 0.2378 0.3567 0.4756 0.5945 SE +/- 0.003001, N = 3 SE +/- 0.003361, N = 3 SE +/- 0.003324, N = 3 SE +/- 0.003036, N = 3 SE +/- 0.003225, N = 3 0.528416 0.526798 0.526014 0.528133 0.527939 MIN: 0.5 MIN: 0.5 MIN: 0.5 MIN: 0.5 MIN: 0.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.288 0.576 0.864 1.152 1.44 SE +/- 0.00683, N = 3 SE +/- 0.00348, N = 3 SE +/- 0.00554, N = 3 SE +/- 0.00400, N = 3 SE +/- 0.00255, N = 3 1.23239 1.27991 1.22703 1.23012 1.22496 MIN: 1.18 MIN: 1.23 MIN: 1.18 MIN: 1.19 MIN: 1.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.2589 2.5178 3.7767 5.0356 6.2945 SE +/- 0.02371, N = 3 SE +/- 0.02285, N = 3 SE +/- 0.01959, N = 3 SE +/- 0.02335, N = 3 SE +/- 0.02089, N = 3 5.54062 5.59533 5.53334 5.54544 5.53672 MIN: 5.4 MIN: 5.45 MIN: 5.38 MIN: 5.4 MIN: 5.38 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.6635 1.327 1.9905 2.654 3.3175 SE +/- 0.01360, N = 3 SE +/- 0.01601, N = 3 SE +/- 0.01261, N = 3 SE +/- 0.01584, N = 3 SE +/- 0.01797, N = 3 2.94903 2.86008 2.93978 2.93428 2.94670 MIN: 2.85 MIN: 2.77 MIN: 2.83 MIN: 2.83 MIN: 2.84 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.01311, N = 3 SE +/- 0.01517, N = 3 SE +/- 0.01276, N = 3 SE +/- 0.01164, N = 3 SE +/- 0.00808, N = 3 9.34766 9.55455 9.35095 9.35550 9.34206 MIN: 9.29 MIN: 9.5 MIN: 9.29 MIN: 9.29 MIN: 9.28 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.1038 0.2076 0.3114 0.4152 0.519 SE +/- 0.000631, N = 3 SE +/- 0.001062, N = 3 SE +/- 0.001585, N = 3 SE +/- 0.000687, N = 3 SE +/- 0.001427, N = 3 0.461459 0.459456 0.460482 0.459956 0.459660 MIN: 0.45 MIN: 0.45 MIN: 0.45 MIN: 0.45 MIN: 0.45 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.1865 0.373 0.5595 0.746 0.9325 SE +/- 0.003298, N = 3 SE +/- 0.008442, N = 15 SE +/- 0.005954, N = 8 SE +/- 0.005568, N = 3 SE +/- 0.008934, N = 3 0.679482 0.828866 0.680522 0.697080 0.678626 MIN: 0.66 MIN: 0.74 MIN: 0.63 MIN: 0.67 MIN: 0.65 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 400 800 1200 1600 2000 SE +/- 1.09, N = 3 SE +/- 2.09, N = 3 SE +/- 0.93, N = 3 SE +/- 2.29, N = 3 SE +/- 0.75, N = 3 1573.03 1636.96 1565.41 1563.05 1567.41 MIN: 1566.77 MIN: 1629.18 MIN: 1559.12 MIN: 1555.64 MIN: 1561.17 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.02602, N = 3 SE +/- 0.17046, N = 14 SE +/- 0.03159, N = 3 SE +/- 0.03513, N = 3 SE +/- 0.03413, N = 3 7.89032 8.10038 7.90163 7.90477 7.91258 MIN: 7.58 MIN: 7.58 MIN: 7.61 MIN: 7.56 MIN: 7.63 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.02091, N = 3 SE +/- 0.00795, N = 3 SE +/- 0.02137, N = 3 SE +/- 0.01742, N = 3 SE +/- 0.02038, N = 3 9.82858 9.79185 9.39062 9.41346 9.59452 MIN: 9.53 MIN: 9.62 MIN: 9.29 MIN: 9.27 MIN: 9.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 11.02 10.96 10.92 10.92 10.93 MIN: 10.79 MIN: 10.74 MIN: 10.74 MIN: 10.74 MIN: 10.77 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 0.50, N = 3 SE +/- 0.63, N = 3 SE +/- 0.69, N = 3 SE +/- 0.60, N = 3 SE +/- 0.46, N = 3 937.61 960.13 935.96 938.04 936.49 MIN: 932.91 MIN: 955.07 MIN: 930.6 MIN: 933.04 MIN: 932.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 400 800 1200 1600 2000 SE +/- 0.81, N = 3 SE +/- 1.53, N = 3 SE +/- 1.06, N = 3 SE +/- 3.72, N = 3 SE +/- 1.54, N = 3 1573.18 1639.78 1566.75 1566.25 1564.71 MIN: 1566.72 MIN: 1630.86 MIN: 1558.77 MIN: 1553.36 MIN: 1557.51 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200 400 600 800 1000 SE +/- 1.32, N = 3 SE +/- 1.46, N = 3 SE +/- 0.16, N = 3 SE +/- 0.70, N = 3 SE +/- 0.15, N = 3 939.92 961.38 938.22 936.88 935.38 MIN: 933.27 MIN: 955.17 MIN: 930.52 MIN: 931.54 MIN: 931.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.1052 0.2104 0.3156 0.4208 0.526 SE +/- 0.003487, N = 3 SE +/- 0.002588, N = 3 SE +/- 0.004576, N = 4 SE +/- 0.004665, N = 3 SE +/- 0.000391, N = 3 0.441500 0.467594 0.426902 0.430676 0.425675 MIN: 0.41 MIN: 0.44 MIN: 0.4 MIN: 0.41 MIN: 0.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.4031 0.8062 1.2093 1.6124 2.0155 SE +/- 0.00719, N = 3 SE +/- 0.00782, N = 3 SE +/- 0.00494, N = 3 SE +/- 0.00508, N = 3 SE +/- 0.00613, N = 3 1.76308 1.79150 1.75520 1.76264 1.74471 MIN: 1.7 MIN: 1.74 MIN: 1.69 MIN: 1.69 MIN: 1.66 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.012, N = 5 SE +/- 0.006, N = 5 SE +/- 0.004, N = 5 SE +/- 0.009, N = 5 SE +/- 0.014, N = 5 8.436 8.500 8.369 8.411 8.379 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.002, N = 3 SE +/- 0.011, N = 3 SE +/- 0.004, N = 3 SE +/- 0.005, N = 3 SE +/- 0.003, N = 3 8.716 8.525 8.730 8.732 8.599 1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -march=native -lm
Opus Codec Encoding WAV To Opus Encode OpenBenchmarking.org Seconds, Fewer Is Better Opus Codec Encoding 1.3.1 WAV To Opus Encode GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.011, N = 5 SE +/- 0.021, N = 5 SE +/- 0.031, N = 5 SE +/- 0.013, N = 5 SE +/- 0.011, N = 5 8.283 8.241 8.283 8.456 8.186 1. (CXX) g++ options: -O3 -march=native -fvisibility=hidden -logg -lm
eSpeak-NG Speech Engine Text-To-Speech Synthesis OpenBenchmarking.org Seconds, Fewer Is Better eSpeak-NG Speech Engine 20200907 Text-To-Speech Synthesis GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 8 16 24 32 40 SE +/- 0.19, N = 4 SE +/- 0.23, N = 4 SE +/- 0.09, N = 4 SE +/- 0.16, N = 4 SE +/- 0.18, N = 4 26.84 28.11 32.67 35.09 27.28 1. (CC) gcc options: -O3 -march=native -std=c99
Gcrypt Library OpenBenchmarking.org Seconds, Fewer Is Better Gcrypt Library 1.9 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 50 100 150 200 250 SE +/- 0.27, N = 3 SE +/- 0.21, N = 3 SE +/- 0.44, N = 3 SE +/- 0.18, N = 3 SE +/- 0.33, N = 3 194.12 193.97 193.41 208.34 196.22 1. (CC) gcc options: -O3 -march=native -fvisibility=hidden -lgpg-error
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 0.81, N = 3 SE +/- 0.71, N = 3 SE +/- 1.13, N = 3 SE +/- 0.92, N = 3 SE +/- 0.83, N = 3 134.35 134.22 133.91 134.71 135.43 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 30 60 90 120 150 SE +/- 1.35, N = 3 SE +/- 1.54, N = 3 SE +/- 1.51, N = 3 SE +/- 0.26, N = 3 SE +/- 1.05, N = 3 126.20 127.69 126.88 129.40 128.32 1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 SE +/- 0.18, N = 3 48.16 47.88 47.90 47.87 49.32 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
VOSK Speech Recognition Toolkit OpenBenchmarking.org Seconds, Fewer Is Better VOSK Speech Recognition Toolkit 0.3.21 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.12, N = 3 SE +/- 0.26, N = 3 SE +/- 0.07, N = 3 20.72 20.99 20.75 20.89 20.57
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 60K 120K 180K 240K 300K SE +/- 106.27, N = 3 SE +/- 247.29, N = 3 SE +/- 95.66, N = 3 SE +/- 68.80, N = 3 SE +/- 101.43, N = 3 259869 263457 263472 259565 264514 1. (CC) gcc options: -pedantic -O3
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200M 400M 600M 800M 1000M SE +/- 353836.12, N = 3 SE +/- 539269.05, N = 3 SE +/- 2904171.33, N = 3 SE +/- 4781007.56, N = 3 SE +/- 4623189.13, N = 3 924630000 930236667 939716667 951170000 944433333 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 36 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 36 - Buffer Length: 256 - Filter Length: 57 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 200M 400M 600M 800M 1000M SE +/- 120554.28, N = 3 SE +/- 588132.64, N = 3 SE +/- 272213.15, N = 3 SE +/- 1013283.99, N = 3 SE +/- 1056729.76, N = 3 916790000 921670000 940660000 954536667 937553333 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 9K 18K 27K 36K 45K SE +/- 34.41, N = 3 SE +/- 10.60, N = 3 SE +/- 120.35, N = 3 SE +/- 43.71, N = 3 SE +/- 22.03, N = 3 42354.90 42795.05 35458.66 34558.22 34223.30 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 16K 32K 48K 64K 80K SE +/- 5.53, N = 3 SE +/- 1061.39, N = 3 SE +/- 11.94, N = 3 SE +/- 36.89, N = 3 SE +/- 48.36, N = 3 74820.19 76452.78 49799.12 48802.76 48317.58 1. (CXX) g++ options: -O3 -march=native -fopenmp
ViennaCL Test: CPU BLAS - sCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sCOPY GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 11 22 33 44 55 SE +/- 0.27, N = 3 SE +/- 0.17, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 SE +/- 0.09, N = 3 45.9 47.1 46.4 46.7 46.4 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sAXPY GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 16 32 48 64 80 SE +/- 1.48, N = 3 SE +/- 0.12, N = 3 SE +/- 0.30, N = 3 SE +/- 0.31, N = 3 SE +/- 0.35, N = 3 68.8 71.2 70.3 70.7 70.1 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - sDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - sDOT GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 20 40 60 80 100 SE +/- 0.42, N = 3 SE +/- 0.26, N = 3 SE +/- 0.33, N = 3 SE +/- 0.43, N = 3 SE +/- 0.49, N = 3 77.0 78.0 77.4 77.6 77.1 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dCOPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dCOPY GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 9 18 27 36 45 SE +/- 0.00, N = 3 SE +/- 0.15, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 38.1 38.1 38.3 38.3 38.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dAXPY OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dAXPY GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.12, N = 3 57.1 57.4 57.3 57.4 57.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dDOT OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dDOT GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 14 28 42 56 70 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 9.10, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 63.6 63.7 54.7 63.7 63.7 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-N OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-N GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 16 32 48 64 80 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.31, N = 3 69.5 71.8 71.6 71.7 71.4 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMV-T OpenBenchmarking.org GB/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMV-T GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 20 40 60 80 100 SE +/- 0.25, N = 3 SE +/- 0.10, N = 3 SE +/- 0.07, N = 3 SE +/- 0.38, N = 3 SE +/- 0.09, N = 3 79.9 79.7 79.6 79.3 79.5 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NN GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 0.75, N = 3 SE +/- 0.13, N = 3 SE +/- 0.63, N = 3 SE +/- 0.15, N = 3 SE +/- 0.44, N = 3 55.5 56.1 58.7 51.0 51.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-NT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-NT GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 0.25, N = 3 SE +/- 0.27, N = 3 SE +/- 0.48, N = 3 SE +/- 0.15, N = 3 SE +/- 0.37, N = 3 54.7 54.1 56.4 49.8 49.8 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TN OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TN GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 1.55, N = 2 SE +/- 0.36, N = 3 SE +/- 1.17, N = 3 SE +/- 0.10, N = 3 SE +/- 0.10, N = 3 56.0 56.2 58.9 51.9 51.9 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
ViennaCL Test: CPU BLAS - dGEMM-TT OpenBenchmarking.org GFLOPs/s, More Is Better ViennaCL 1.7.1 Test: CPU BLAS - dGEMM-TT GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 0.32, N = 3 SE +/- 0.07, N = 3 SE +/- 0.70, N = 2 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 55.2 54.9 57.1 50.5 51.5 1. (CXX) g++ options: -O3 -march=native -fopenmp -rdynamic -lOpenCL
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 50 100 150 200 250 SE +/- 0.03, N = 3 SE +/- 0.84, N = 3 SE +/- 0.30, N = 3 SE +/- 0.47, N = 3 SE +/- 0.26, N = 3 218.81 220.62 219.54 218.62 217.43 1. (CC) gcc options: -O3 -march=native -rdynamic
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 13 26 39 52 65 SE +/- 0.22, N = 3 SE +/- 0.09, N = 3 SE +/- 0.19, N = 3 SE +/- 0.10, N = 3 SE +/- 0.11, N = 3 57.61 57.03 57.30 57.41 56.72 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenetV3 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.5427 1.0854 1.6281 2.1708 2.7135 SE +/- 0.012, N = 3 SE +/- 0.015, N = 15 SE +/- 0.011, N = 3 SE +/- 0.032, N = 3 SE +/- 0.011, N = 3 2.403 2.297 2.341 2.412 2.372 MIN: 2.28 / MAX: 2.54 MIN: 1.96 / MAX: 2.53 MIN: 2.16 / MAX: 2.53 MIN: 2.23 / MAX: 2.61 MIN: 2.25 / MAX: 2.5 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: squeezenetv1.1 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.037 2.074 3.111 4.148 5.185 SE +/- 0.036, N = 3 SE +/- 0.061, N = 15 SE +/- 0.162, N = 3 SE +/- 0.007, N = 3 SE +/- 0.149, N = 3 4.564 4.420 4.271 4.609 4.283 MIN: 4.42 / MAX: 4.75 MIN: 3.98 / MAX: 4.76 MIN: 3.97 / MAX: 4.72 MIN: 4.51 / MAX: 4.78 MIN: 3.97 / MAX: 4.71 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: resnet-v2-50 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 7 14 21 28 35 SE +/- 0.10, N = 3 SE +/- 0.47, N = 15 SE +/- 0.24, N = 3 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 27.30 31.45 28.26 28.45 28.27 MIN: 26.86 / MAX: 27.93 MIN: 24.41 / MAX: 36.24 MIN: 27.6 / MAX: 28.76 MIN: 27.77 / MAX: 28.83 MIN: 27.7 / MAX: 28.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: SqueezeNetV1.0 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.2877 2.5754 3.8631 5.1508 6.4385 SE +/- 0.056, N = 3 SE +/- 0.039, N = 15 SE +/- 0.083, N = 3 SE +/- 0.020, N = 3 SE +/- 0.079, N = 3 5.594 5.540 5.612 5.723 5.506 MIN: 5.4 / MAX: 5.85 MIN: 5.06 / MAX: 6.72 MIN: 5.24 / MAX: 6.01 MIN: 5.47 / MAX: 6.72 MIN: 5.22 / MAX: 5.91 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: MobileNetV2_224 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.8447 1.6894 2.5341 3.3788 4.2235 SE +/- 0.069, N = 3 SE +/- 0.040, N = 15 SE +/- 0.090, N = 3 SE +/- 0.018, N = 3 SE +/- 0.058, N = 3 3.704 3.594 3.669 3.754 3.729 MIN: 3.31 / MAX: 3.95 MIN: 3.07 / MAX: 4.08 MIN: 3.42 / MAX: 4.19 MIN: 3.47 / MAX: 3.95 MIN: 3.41 / MAX: 3.94 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenet-v1-1.0 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.5573 1.1146 1.6719 2.2292 2.7865 SE +/- 0.025, N = 3 SE +/- 0.012, N = 15 SE +/- 0.025, N = 3 SE +/- 0.024, N = 3 SE +/- 0.029, N = 3 2.433 2.455 2.467 2.477 2.477 MIN: 2.32 / MAX: 2.62 MIN: 2.23 / MAX: 3.16 MIN: 2.32 / MAX: 2.68 MIN: 2.3 / MAX: 2.65 MIN: 2.3 / MAX: 2.74 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: inception-v3 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 8 16 24 32 40 SE +/- 0.10, N = 3 SE +/- 0.28, N = 15 SE +/- 0.42, N = 3 SE +/- 0.42, N = 3 SE +/- 0.43, N = 3 30.15 32.41 30.80 31.28 30.80 MIN: 29.77 / MAX: 30.53 MIN: 29.17 / MAX: 33.89 MIN: 30.12 / MAX: 31.88 MIN: 30.28 / MAX: 31.97 MIN: 30.14 / MAX: 31.87 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mobilenet GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 14.16 13.94 13.81 13.79 13.81 MIN: 13.93 / MAX: 14.76 MIN: 13.68 / MAX: 22.27 MIN: 13.51 / MAX: 20.36 MIN: 13.61 / MAX: 14.23 MIN: 13.64 / MAX: 14.54 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.206 2.412 3.618 4.824 6.03 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 SE +/- 0.04, N = 3 5.36 5.10 5.12 5.06 4.89 MIN: 4.96 / MAX: 8.4 MIN: 4.73 / MAX: 10.4 MIN: 4.76 / MAX: 8.89 MIN: 4.72 / MAX: 10.01 MIN: 4.72 / MAX: 10.07 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.062 2.124 3.186 4.248 5.31 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 4.72 4.57 4.64 4.66 4.54 MIN: 4.49 / MAX: 7.39 MIN: 4.37 / MAX: 9.05 MIN: 4.36 / MAX: 10.01 MIN: 4.46 / MAX: 12.92 MIN: 4.37 / MAX: 11.47 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: shufflenet-v2 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.1363 2.2726 3.4089 4.5452 5.6815 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 5.05 4.97 5.02 5.05 4.98 MIN: 4.83 / MAX: 14.14 MIN: 4.8 / MAX: 8.9 MIN: 4.83 / MAX: 15.94 MIN: 4.88 / MAX: 9.37 MIN: 4.88 / MAX: 8.6 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: mnasnet GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 1.0845 2.169 3.2535 4.338 5.4225 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 4.82 4.73 4.80 4.72 4.58 MIN: 4.55 / MAX: 12.4 MIN: 4.44 / MAX: 11.54 MIN: 4.42 / MAX: 16.22 MIN: 4.46 / MAX: 10.72 MIN: 4.39 / MAX: 10.87 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: efficientnet-b0 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.11, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 6.69 6.57 6.62 6.56 6.27 MIN: 6.33 / MAX: 10.85 MIN: 6.28 / MAX: 14.64 MIN: 6.24 / MAX: 24.41 MIN: 6.27 / MAX: 11.76 MIN: 6.05 / MAX: 14.37 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: blazeface GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 0.5873 1.1746 1.7619 2.3492 2.9365 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 2.56 2.54 2.61 2.55 2.54 MIN: 2.5 / MAX: 3.31 MIN: 2.45 / MAX: 3.32 MIN: 2.47 / MAX: 3.3 MIN: 2.47 / MAX: 3.17 MIN: 2.46 / MAX: 3.12 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: googlenet GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.23, N = 3 SE +/- 0.28, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.31, N = 3 12.73 12.44 13.06 12.84 12.21 MIN: 12.1 / MAX: 19.86 MIN: 12 / MAX: 13.22 MIN: 12.84 / MAX: 14.29 MIN: 12.68 / MAX: 16.73 MIN: 11.77 / MAX: 13.02 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: vgg16 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 8 16 24 32 40 SE +/- 0.37, N = 3 SE +/- 0.49, N = 3 SE +/- 0.47, N = 3 SE +/- 0.52, N = 3 SE +/- 0.53, N = 3 36.20 36.01 36.35 36.56 36.70 MIN: 35.36 / MAX: 47.25 MIN: 35.37 / MAX: 37.68 MIN: 35.3 / MAX: 37.7 MIN: 35.42 / MAX: 58.41 MIN: 35.5 / MAX: 41.99 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet18 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.23, N = 3 SE +/- 0.28, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 10.54 10.58 11.16 10.97 10.93 MIN: 10.19 / MAX: 17.98 MIN: 10.2 / MAX: 11.57 MIN: 11.03 / MAX: 11.45 MIN: 10.84 / MAX: 20.49 MIN: 10.84 / MAX: 11.27 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: alexnet GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 9.05 9.15 9.18 9.08 8.99 MIN: 8.96 / MAX: 19.48 MIN: 9.08 / MAX: 9.59 MIN: 9.11 / MAX: 9.74 MIN: 9 / MAX: 11.81 MIN: 8.73 / MAX: 9.39 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: resnet50 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.30, N = 3 SE +/- 0.29, N = 3 SE +/- 0.29, N = 3 SE +/- 0.24, N = 3 SE +/- 0.24, N = 3 18.33 17.59 17.87 17.77 17.74 MIN: 17.58 / MAX: 24.57 MIN: 17.07 / MAX: 18.69 MIN: 17.16 / MAX: 18.62 MIN: 17.07 / MAX: 28.68 MIN: 17.09 / MAX: 18.96 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: yolov4-tiny GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 5 10 15 20 25 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 SE +/- 0.10, N = 3 SE +/- 1.86, N = 3 20.84 21.35 21.26 21.37 22.83 MIN: 19.92 / MAX: 24.91 MIN: 20.42 / MAX: 33.9 MIN: 20 / MAX: 24.4 MIN: 20.44 / MAX: 22.72 MIN: 20.18 / MAX: 937.4 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: squeezenet_ssd GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 4 8 12 16 20 SE +/- 0.36, N = 3 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 SE +/- 0.08, N = 3 SE +/- 0.13, N = 3 15.63 15.08 15.26 15.23 15.28 MIN: 15.02 / MAX: 16.85 MIN: 14.88 / MAX: 21.62 MIN: 14.97 / MAX: 18.92 MIN: 14.88 / MAX: 17.07 MIN: 14.89 / MAX: 16.1 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210525 Target: CPU - Model: regnety_400m GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 13.77 13.87 13.63 13.93 13.48 MIN: 12.93 / MAX: 14.62 MIN: 13.18 / MAX: 15.01 MIN: 13.01 / MAX: 14.57 MIN: 13.17 / MAX: 14.48 MIN: 13.11 / MAX: 14.04 1. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread -pthread
TNN Target: CPU - Model: DenseNet OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: DenseNet GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 800 1600 2400 3200 4000 SE +/- 0.16, N = 3 SE +/- 2.55, N = 3 SE +/- 0.59, N = 3 SE +/- 0.77, N = 3 SE +/- 0.21, N = 3 3505.93 3527.68 3508.29 3508.39 3524.75 MIN: 3487.54 / MAX: 3535.34 MIN: 3508.67 / MAX: 3981.67 MIN: 3489.27 / MAX: 3603.98 MIN: 3486.98 / MAX: 3606.8 MIN: 3509.67 / MAX: 3548.51 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 80 160 240 320 400 SE +/- 0.29, N = 3 SE +/- 0.34, N = 3 SE +/- 0.31, N = 3 SE +/- 0.19, N = 3 SE +/- 0.17, N = 3 321.39 347.44 311.60 314.56 318.06 MIN: 319.29 / MAX: 341.28 MIN: 345.68 / MAX: 356.59 MIN: 309.73 / MAX: 322.67 MIN: 312.66 / MAX: 328.44 MIN: 316.44 / MAX: 326.16 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v2 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 16 32 48 64 80 SE +/- 0.03, N = 3 SE +/- 1.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 70.12 73.94 69.30 69.80 69.23 MIN: 69.44 / MAX: 71.63 MIN: 72.33 / MAX: 77.08 MIN: 68.65 / MAX: 70.61 MIN: 69.16 / MAX: 70.99 MIN: 68.59 / MAX: 70.34 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 60 120 180 240 300 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 SE +/- 0.10, N = 3 SE +/- 0.63, N = 3 SE +/- 0.19, N = 3 289.96 296.14 286.22 288.91 287.49 MIN: 288.43 / MAX: 291.61 MIN: 294.66 / MAX: 298.56 MIN: 285.02 / MAX: 287.82 MIN: 286.05 / MAX: 294.45 MIN: 285.88 / MAX: 299.62 1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 3 6 9 12 15 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 SE +/- 0.02, N = 5 SE +/- 0.00, N = 5 SE +/- 0.00, N = 5 13.36 13.38 13.34 13.35 13.33 1. (CXX) g++ options: -O3 -march=native -rdynamic
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption GCC 8.5 GCC 9.4 GCC 10.3 GCC 11.1 GCC 12.0.0 20210701 14 28 42 56 70 SE +/- 0.17, N = 3 SE +/- 0.19, N = 3 SE +/- 0.36, N = 3 SE +/- 0.23, N = 3 SE +/- 0.56, N = 3 64.24 64.23 64.42 64.20 64.61 1. (CC) gcc options: -O3 -march=native
Phoronix Test Suite v10.8.4