Compiler benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2112135-PTS-COREI51282

Intel Core i5 12600K Alder Lake Compilers
HTML result view exported from: https://openbenchmarking.org/result/2112135-PTS-COREI51282&grt&sro .
System configuration (GCC 11.2 and GCC 12.0 Git runs; only the compiler differs):

Processor: Intel Core i5-12600K @ 6.30GHz (10 Cores / 16 Threads)
Motherboard: ASUS PRIME Z690-P WIFI D4 (0605 BIOS)
Chipset: Intel Device 7aa7
Memory: 16GB
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0
Graphics: llvmpipe
Audio: Realtek ALC897
Network: Realtek RTL8125 2.5GbE + Intel Device 7af0
OS: Ubuntu 21.10
Kernel: 5.15.7-051507-generic (x86_64)
Desktop: GNOME Shell 40.5
Display Server: X Server 1.20.13
OpenGL: 4.5 Mesa 22.0.0-devel (git-d80c7f3 2021-11-14 impish-oibaf-ppa) (LLVM 13.0.0 256 bits)
Vulkan: 1.2.197
Compiler: GCC 11.2.0 (GCC 11.2 run), GCC 12.0.0 20211212 (GCC 12.0 Git run)
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: Transparent Huge Pages: madvise
Environment Details: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
Compiler Details: --disable-multilib --enable-checking=release
Processor Details: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x12 - Thermald 2.4.6
Python Details: Python 3.9.7
Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Result overview:

Test | GCC 11.2 | GCC 12.0 Git
compress-7zip: Compression Rating | 74682 | 74313
compress-7zip: Decompression Rating | 60738 | 61439
aircrack-ng: | 32259.670 | 33043.122
aobench: 2048 x 2048 - Total Time | 20.198 | 19.107
aom-av1: Speed 9 Realtime - Bosphorus 4K | 72.10 | 72.92
apache: 100 | 111283.95 | 117868.21
apache: 200 | 96536.17 | 96741.53
coremark: CoreMark Size 666 - Iterations Per Second | 448314.248044 | 445380.872792
crafty: Elapsed Time | 12620563 | 12479935
encode-flac: WAV To FLAC | 11.602 | 11.578
gmpbench: Total Time | 7975.2 | 7976.5
synthmark: VoiceMark_100 | 985.959 | 1010.93
gromacs: MPI CPU - water_GMX50_bare | 0.971 | 0.976
helsing: 12 digit | 5.389 | 5.414
hpcg: | 7.23358 | 7.18854
himeno: Poisson Pressure Solver | 9416.908233 | 9404.408297
toktx: UASTC 3 | 9.896 | 9.914
toktx: Zstd Compression 19 | 15.258 | 15.210
toktx: UASTC 3 + Zstd Compression 19 | 14.251 | 14.262
toktx: UASTC 4 + Zstd Compression 19 | 252.795 | 252.762
kvazaar: Bosphorus 4K - Very Fast | 22.15 | 25.85
kvazaar: Bosphorus 4K - Ultra Fast | 39.39 | 43.76
lammps: 20k Atoms | 6.303 | 6.677
lammps: Rhodopsin Protein | 6.473 | 6.746
avifenc: 10 | 2.831 | 2.749
avifenc: 6, Lossless | 44.170 | 44.236
tjbench: Decompression Throughput | 276.018649 | 274.782778
liquid-dsp: 1 - 256 - 57 | 99537000 | 100949667
liquid-dsp: 4 - 256 - 57 | 385676667 | 388656667
liquid-dsp: 8 - 256 - 57 | 674486667 | 670536667
liquid-dsp: 16 - 256 - 57 | 781553333 | 781873333
compress-lz4: 3 - Compression Speed | 72.74 | 74.05
compress-lz4: 3 - Decompression Speed | 12921.7 | 12913.2
compress-lz4: 9 - Compression Speed | 71.18 | 72.53
compress-lz4: 9 - Decompression Speed | 12900.4 | 12894.7
ncnn: CPU-v2-v2 - mobilenet-v2 | 2.98 | 2.99
ncnn: CPU-v3-v3 - mobilenet-v3 | 2.67 | 2.67
ncnn: CPU - mnasnet | 2.75 | 2.71
ncnn: CPU - efficientnet-b0 | 4.63 | 4.49
ncnn: CPU - blazeface | 1.37 | 1.36
ncnn: CPU - googlenet | 9.46 | 9.12
ncnn: CPU - vgg16 | 36.82 | 36.24
ncnn: CPU - resnet18 | 10.06 | 9.66
ncnn: CPU - alexnet | 8.96 | 8.56
ncnn: CPU - resnet50 | 18.88 | 18.06
ncnn: CPU - yolov4-tiny | 19.60 | 19.65
ncnn: CPU - regnety_400m | 6.07 | 6.03
ncnn: CPU - shufflenet-v2 | 3.08 | 3.08
ngspice: C2670 | 66.804 | 68.001
ngspice: C7552 | 62.523 | 63.077
nwchem: C240 Buckyball | 7304.2 | 5210.6
openfoam: Motorbike 30M | 148.88 | 149.28
openssl: SHA256 | 15094797463 | 15135893657
openssl: RSA4096 | 2786.8 | 2790.4
openssl: RSA4096 | 181230.9 | 181522.4
primesieve: 1e12 Prime Number Generation | 24.198 | 24.274
quantlib: | 4504.5 | 4550.1
redis: GET | 5350610.2 | 5445328.2
redis: SET | 4025423.00 | 4207347.08
scimark2: Fast Fourier Transform | 799.18 | 794.17
scimark2: Sparse Matrix Multiply | 4853.99 | 5086.85
scimark2: Jacobi Successive Over-Relaxation | 3335.69 | 3336.74
securemark: SecureMark-TLS | 376623 | 378883
svt-av1: Preset 8 - Bosphorus 4K | 18.958 | 18.579
svt-hevc: 7 - Bosphorus 1080p | 159.20 | 155.86
svt-hevc: 10 - Bosphorus 1080p | 326.15 | 314.09
svt-vp9: VMAF Optimized - Bosphorus 1080p | 215.05 | 204.90
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 224.61 | 211.02
swet: Average | 1235344228 | 1279053006
sysbench: RAM / Memory | 18566.17 | 18948.14
sysbench: CPU | 38019.77 | 37964.18
tachyon: Total Time | 66.8540 | 66.9649
mafft: Multiple Sequence Alignment - LSU RNA | 8.379 | 8.457
toybrot: OpenMP | 35455 | 35557
toybrot: C++ Tasks | 31167 | 31366
toybrot: C++ Threads | 30724 | 30752
ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping | 747.947 | 759.209
encode-wavpack: WAV To WavPack | 11.851 | 11.977
webp: Quality 100, Lossless | 13.156 | 12.494
webp: Quality 100, Lossless, Highest Compression | 26.673 | 25.849
x265: Bosphorus 4K | 17.78 | 18.04
x265: Bosphorus 1080p | 70.58 | 70.61
xsbench: | 3175399 | 3160393
compress-zstd: 3 - Compression Speed | 3476.6 | 3506.4
compress-zstd: 3 - Decompression Speed | 4706.7 | 4681.2
compress-zstd: 19 - Compression Speed | 33.9 | 34.4
compress-zstd: 19, Long Mode - Compression Speed | 29.6 | 28.7

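The two result columns are easier to compare as relative changes. The sketch below is illustrative only: it copies four rows from this result file, takes each test's "More Is Better" or "Fewer Is Better" direction from the per-test results, and uses hypothetical helper names (`speedup`, `geometric_mean`) that are not part of the Phoronix Test Suite.

```python
import math

# (test, GCC 11.2, GCC 12.0 Git, higher_is_better), values copied from this result file.
RESULTS = [
    ("kvazaar: Bosphorus 4K - Very Fast", 22.15, 25.85, True),                 # Frames Per Second
    ("nwchem: C240 Buckyball", 7304.2, 5210.6, False),                         # Seconds
    ("svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p", 224.61, 211.02, True),  # Frames Per Second
    ("webp: Quality 100, Lossless", 13.156, 12.494, False),                    # Seconds
]


def speedup(old, new, higher_is_better):
    """Ratio > 1 means the second compiler (GCC 12.0 Git) is faster."""
    return new / old if higher_is_better else old / new


def geometric_mean(values):
    """Geometric mean, the usual way to average ratios across unrelated tests."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


ratios = {name: speedup(a, b, hib) for name, a, b, hib in RESULTS}
for name, ratio in ratios.items():
    print(f"{name}: {100 * (ratio - 1):+.1f}%")
print(f"geometric mean speedup: {geometric_mean(list(ratios.values())):.3f}")
```

Dividing in the direction that makes "faster" greater than 1 keeps timing-based and throughput-based tests on the same scale. The four rows were chosen by hand for illustration, so their geometric mean says nothing about the full result set.
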
7-Zip Compression 21.06 - Test: Compression Rating
MIPS, More Is Better
GCC 11.2: 74682 (SE +/- 174.08, N = 3)
GCC 12.0 Git: 74313 (SE +/- 181.56, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 21.06 - Test: Decompression Rating
MIPS, More Is Better
GCC 11.2: 60738 (SE +/- 518.05, N = 3)
GCC 12.0 Git: 61439 (SE +/- 809.44, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

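The reported standard errors give a rough sense of whether a gap between the two compilers is larger than run-to-run noise. The sketch below is a heuristic under stated assumptions, not a formal test: it treats the two runs as independent, combines their standard errors in quadrature, and uses a threshold of 2 combined standard errors that is chosen here for illustration. With only N = 3 runs per side, the result is indicative at best. The function name `z_score` is hypothetical.

```python
import math


def z_score(mean_a, se_a, mean_b, se_b):
    """Difference of two means, in units of their combined standard error."""
    return (mean_b - mean_a) / math.sqrt(se_a ** 2 + se_b ** 2)


# 7-Zip results from this file: (GCC 11.2 mean, SE, GCC 12.0 Git mean, SE), in MIPS.
compression = z_score(74682, 174.08, 74313, 181.56)
decompression = z_score(60738, 518.05, 61439, 809.44)

for label, z in (("Compression", compression), ("Decompression", decompression)):
    verdict = "within noise" if abs(z) < 2 else "likely a real difference"
    print(f"{label} Rating: z = {z:+.2f} ({verdict})")
```

By this measure, neither 7-Zip difference stands clear of the measurement noise.
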
Aircrack-ng 1.5.2
k/s, More Is Better
GCC 11.2: 32259.67 (SE +/- 363.05, N = 3)
GCC 12.0 Git: 33043.12 (SE +/- 205.89, N = 3)
(CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

AOBench - Size: 2048 x 2048 - Total Time
Seconds, Fewer Is Better
GCC 11.2: 20.20 (SE +/- 0.08, N = 3)
GCC 12.0 Git: 19.11 (SE +/- 0.03, N = 3)
(CC) gcc options: -lm -O3 -march=native -flto

AOM AV1 3.2 - Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K
Frames Per Second, More Is Better
GCC 11.2: 72.10 (SE +/- 0.67, N = 13)
GCC 12.0 Git: 72.92 (SE +/- 0.12, N = 3)
(CXX) g++ options: -O3 -march=native -flto -std=c++11 -U_FORTIFY_SOURCE -lm

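The AOM AV1 runs use different sample counts (N = 13 versus N = 3), so their standard errors are not directly comparable as a measure of run-to-run spread. A minimal sketch of the conversion follows. It assumes the reported SE is the sample standard deviation divided by the square root of N (the textbook definition, not confirmed here against the Phoronix Test Suite source). The helper name `std_dev_from_se` is hypothetical.

```python
import math


def std_dev_from_se(se, n):
    """Recover the sample standard deviation from a standard error and run count."""
    return se * math.sqrt(n)


# AOM AV1 3.2, Speed 9 Realtime, Bosphorus 4K (Frames Per Second), from this file.
runs = {
    "GCC 11.2": {"mean": 72.10, "se": 0.67, "n": 13},
    "GCC 12.0 Git": {"mean": 72.92, "se": 0.12, "n": 3},
}

spread = {}
for label, r in runs.items():
    sd = std_dev_from_se(r["se"], r["n"])
    spread[label] = 100 * sd / r["mean"]  # relative standard deviation, in percent
    print(f"{label}: SD ~ {sd:.2f} FPS ({spread[label]:.2f}% of the mean)")
```

Under that assumption, the GCC 11.2 run shows noticeably more spread between runs than the GCC 12.0 Git run, which is worth keeping in mind when reading the small gap between their means.
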
Apache HTTP Server 2.4.48 - Concurrent Requests: 100
Requests Per Second, More Is Better
GCC 11.2: 111283.95 (SE +/- 164.15, N = 3)
GCC 12.0 Git: 117868.21 (SE +/- 276.65, N = 3)
(CC) gcc options: -shared -fPIC -O3 -march=native -flto

Apache HTTP Server 2.4.48 - Concurrent Requests: 200
Requests Per Second, More Is Better
GCC 11.2: 96536.17 (SE +/- 143.96, N = 3)
GCC 12.0 Git: 96741.53 (SE +/- 56.54, N = 3)
(CC) gcc options: -shared -fPIC -O3 -march=native -flto

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better
GCC 11.2: 448314.25 (SE +/- 736.27, N = 3)
GCC 12.0 Git: 445380.87 (SE +/- 611.70, N = 3)
(CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt

Crafty 25.2 - Elapsed Time
Nodes Per Second, More Is Better
GCC 11.2: 12620563 (SE +/- 81918.59, N = 3)
GCC 12.0 Git: 12479935 (SE +/- 85295.89, N = 3)
(CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

FLAC Audio Encoding 1.3.3 - WAV To FLAC
Seconds, Fewer Is Better
GCC 11.2: 11.60 (SE +/- 0.02, N = 5)
GCC 12.0 Git: 11.58 (SE +/- 0.01, N = 5)
(CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm

GNU GMP GMPbench 6.2.1 - Total Time
GMPbench Score, More Is Better
GCC 11.2: 7975.2
GCC 12.0 Git: 7976.5
(CC) gcc options: -O3 -march=native -flto -lm

Google SynthMark 20201109 - Test: VoiceMark_100
Voices, More Is Better
GCC 11.2: 985.96 (SE +/- 2.07, N = 3)
GCC 12.0 Git: 1010.93 (SE +/- 3.82, N = 3)
(CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

GROMACS 2021.2 - Implementation: MPI CPU - Input: water_GMX50_bare
Ns Per Day, More Is Better
GCC 11.2: 0.971 (SE +/- 0.009, N = 7)
GCC 12.0 Git: 0.976 (SE +/- 0.010, N = 6)
(CXX) g++ options: -O3 -march=native -flto

Helsing 1.0-beta - Digit Range: 12 digit
Seconds, Fewer Is Better
GCC 11.2: 5.389 (SE +/- 0.021, N = 3)
GCC 12.0 Git: 5.414 (SE +/- 0.022, N = 3)
(CC) gcc options: -O2 -pthread

High Performance Conjugate Gradient 3.1
GFLOP/s, More Is Better
GCC 11.2: 7.23358 (SE +/- 0.04428, N = 3)
GCC 12.0 Git: 7.18854 (SE +/- 0.05896, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Himeno Benchmark 3.0 - Poisson Pressure Solver
MFLOPS, More Is Better
GCC 11.2: 9416.91 (SE +/- 17.81, N = 3)
GCC 12.0 Git: 9404.41 (SE +/- 10.06, N = 3)
(CC) gcc options: -O3 -march=native -flto -mavx2

KTX-Software toktx 4.0 - Settings: UASTC 3
Seconds, Fewer Is Better
GCC 11.2: 9.896 (SE +/- 0.010, N = 3)
GCC 12.0 Git: 9.914 (SE +/- 0.013, N = 3)

KTX-Software toktx 4.0 - Settings: Zstd Compression 19
Seconds, Fewer Is Better
GCC 11.2: 15.26 (SE +/- 0.08, N = 3)
GCC 12.0 Git: 15.21 (SE +/- 0.03, N = 3)

KTX-Software toktx 4.0 - Settings: UASTC 3 + Zstd Compression 19
Seconds, Fewer Is Better
GCC 11.2: 14.25 (SE +/- 0.02, N = 3)
GCC 12.0 Git: 14.26 (SE +/- 0.01, N = 3)

KTX-Software toktx 4.0 - Settings: UASTC 4 + Zstd Compression 19
Seconds, Fewer Is Better
GCC 11.2: 252.80 (SE +/- 0.07, N = 3)
GCC 12.0 Git: 252.76 (SE +/- 0.02, N = 3)

Kvazaar 2.1 - Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second, More Is Better
GCC 11.2: 22.15 (SE +/- 0.19, N = 3)
GCC 12.0 Git: 25.85 (SE +/- 0.21, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt

Kvazaar 2.1 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second, More Is Better
GCC 11.2: 39.39 (SE +/- 0.04, N = 3)
GCC 12.0 Git: 43.76 (SE +/- 0.04, N = 3)
(CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: 20k Atoms
ns/day, More Is Better
GCC 11.2: 6.303 (SE +/- 0.117, N = 9)
GCC 12.0 Git: 6.677 (SE +/- 0.157, N = 9)
(CXX) g++ options: -O3 -march=native -flto -lm

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein
ns/day, More Is Better
GCC 11.2: 6.473 (SE +/- 0.068, N = 3)
GCC 12.0 Git: 6.746 (SE +/- 0.080, N = 4)
(CXX) g++ options: -O3 -march=native -flto -lm

libavif avifenc 0.9.0 - Encoder Speed: 10
Seconds, Fewer Is Better
GCC 11.2: 2.831 (SE +/- 0.039, N = 15)
GCC 12.0 Git: 2.749 (SE +/- 0.027, N = 15)
(CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.9.0 - Encoder Speed: 6, Lossless
Seconds, Fewer Is Better
GCC 11.2: 44.17 (SE +/- 0.32, N = 3)
GCC 12.0 Git: 44.24 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3 -fPIC -lm

libjpeg-turbo tjbench 2.1.0 - Test: Decompression Throughput
Megapixels/sec, More Is Better
GCC 11.2: 276.02 (SE +/- 1.33, N = 3)
GCC 12.0 Git: 274.78 (SE +/- 0.67, N = 3)
(CC) gcc options: -O3 -march=native -flto -rdynamic -lm

Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
GCC 11.2: 99537000 (SE +/- 1296527.03, N = 3)
GCC 12.0 Git: 100949667 (SE +/- 995538.44, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
GCC 11.2: 385676667 (SE +/- 2089771.39, N = 3)
GCC 12.0 Git: 388656667 (SE +/- 1083471.79, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
GCC 11.2: 674486667 (SE +/- 211843.76, N = 3)
GCC 12.0 Git: 670536667 (SE +/- 2965829.47, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
GCC 11.2: 781553333 (SE +/- 242578.74, N = 3)
GCC 12.0 Git: 781873333 (SE +/- 153767.50, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed
MB/s, More Is Better
GCC 11.2: 72.74 (SE +/- 0.13, N = 3)
GCC 12.0 Git: 74.05 (SE +/- 0.24, N = 3)
(CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed
MB/s, More Is Better
GCC 11.2: 12921.7 (SE +/- 3.58, N = 3)
GCC 12.0 Git: 12913.2 (SE +/- 1.03, N = 3)
(CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed
MB/s, More Is Better
GCC 11.2: 71.18 (SE +/- 0.14, N = 3)
GCC 12.0 Git: 72.53 (SE +/- 0.10, N = 3)
(CC) gcc options: -O3

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed
MB/s, More Is Better
GCC 11.2: 12900.4 (SE +/- 38.78, N = 3)
GCC 12.0 Git: 12894.7 (SE +/- 12.58, N = 3)
(CC) gcc options: -O3

NCNN 20210720 - Target: CPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better
GCC 11.2: 2.98 (SE +/- 0.02, N = 15; MIN: 2.77 / MAX: 3.58)
GCC 12.0 Git: 2.99 (SE +/- 0.04, N = 13; MIN: 2.73 / MAX: 9.17)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better
GCC 11.2: 2.67 (SE +/- 0.02, N = 15; MIN: 2.52 / MAX: 3.2)
GCC 12.0 Git: 2.67 (SE +/- 0.03, N = 13; MIN: 2.47 / MAX: 8.86)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: mnasnet
ms, Fewer Is Better
GCC 11.2: 2.75 (SE +/- 0.03, N = 15; MIN: 2.49 / MAX: 3.1)
GCC 12.0 Git: 2.71 (SE +/- 0.06, N = 13; MIN: 2.43 / MAX: 8.92)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: efficientnet-b0
ms, Fewer Is Better
GCC 11.2: 4.63 (SE +/- 0.06, N = 15; MIN: 4.29 / MAX: 5.12)
GCC 12.0 Git: 4.49 (SE +/- 0.06, N = 13; MIN: 4.24 / MAX: 10.94)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: blazeface
ms, Fewer Is Better
GCC 11.2: 1.37 (SE +/- 0.01, N = 15; MIN: 1.3 / MAX: 1.59)
GCC 12.0 Git: 1.36 (SE +/- 0.02, N = 13; MIN: 1.3 / MAX: 3.44)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: googlenet
ms, Fewer Is Better
GCC 11.2: 9.46 (SE +/- 0.21, N = 15; MIN: 8.89 / MAX: 11.34)
GCC 12.0 Git: 9.12 (SE +/- 0.15, N = 13; MIN: 8.8 / MAX: 16.01)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: vgg16
ms, Fewer Is Better
GCC 11.2: 36.82 (SE +/- 0.05, N = 15; MIN: 36.55 / MAX: 38.66)
GCC 12.0 Git: 36.24 (SE +/- 0.04, N = 13; MIN: 35.99 / MAX: 43.4)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: resnet18
ms, Fewer Is Better
GCC 11.2: 10.06 (SE +/- 0.19, N = 15; MIN: 9.51 / MAX: 14.28)
GCC 12.0 Git: 9.66 (SE +/- 0.14, N = 13; MIN: 9.41 / MAX: 16.78)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: alexnet
ms, Fewer Is Better
GCC 11.2: 8.96 (SE +/- 0.13, N = 15; MIN: 8.29 / MAX: 11.75)
GCC 12.0 Git: 8.56 (SE +/- 0.13, N = 13; MIN: 8.18 / MAX: 15)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: resnet50
ms, Fewer Is Better
GCC 11.2: 18.88 (SE +/- 0.23, N = 15; MIN: 18.22 / MAX: 21.16)
GCC 12.0 Git: 18.06 (SE +/- 0.16, N = 13; MIN: 17.76 / MAX: 20.55)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: yolov4-tiny
ms, Fewer Is Better
GCC 11.2: 19.60 (SE +/- 0.21, N = 15; MIN: 18.33 / MAX: 27.27)
GCC 12.0 Git: 19.65 (SE +/- 0.24, N = 13; MIN: 17.91 / MAX: 21.99)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: regnety_400m
ms, Fewer Is Better
GCC 11.2: 6.07 (SE +/- 0.03, N = 15; MIN: 5.81 / MAX: 9.25)
GCC 12.0 Git: 6.03 (SE +/- 0.05, N = 13; MIN: 5.81 / MAX: 81.91)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

NCNN 20210720 - Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better
GCC 11.2: 3.08 (SE +/- 0.01, N = 12; MIN: 3.01 / MAX: 3.29)
GCC 12.0 Git: 3.08 (SE +/- 0.01, N = 13; MIN: 2.95 / MAX: 9.11)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread

Ngspice 34 - Circuit: C2670
Seconds, Fewer Is Better
GCC 11.2: 66.80 (SE +/- 0.17, N = 3)
GCC 12.0 Git: 68.00 (SE +/- 0.22, N = 3)
(CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Ngspice 34 - Circuit: C7552
Seconds, Fewer Is Better
GCC 11.2: 62.52 (SE +/- 0.08, N = 3)
GCC 12.0 Git: 63.08 (SE +/- 0.57, N = 3)
(CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

NWChem 7.0.2 - Input: C240 Buckyball
Seconds, Fewer Is Better
GCC 11.2: 7304.2
GCC 12.0 Git: 5210.6
(F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenFOAM 8 - Input: Motorbike 30M
Seconds, Fewer Is Better
GCC 11.2: 148.88 (SE +/- 2.06, N = 3)
GCC 12.0 Git: 149.28 (SE +/- 1.84, N = 3)
Additional options listed beside the results: -lfoamToVTK -llagrangian -lfileFormats
(CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenSSL 3.0 - Algorithm: SHA256
byte/s, More Is Better
GCC 11.2: 15094797463 (SE +/- 15623989.67, N = 3)
GCC 12.0 Git: 15135893657 (SE +/- 6276939.55, N = 3)
(CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL 3.0 - Algorithm: RSA4096
sign/s, More Is Better
GCC 11.2: 2786.8 (SE +/- 0.78, N = 3)
GCC 12.0 Git: 2790.4 (SE +/- 0.10, N = 3)
(CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

OpenSSL 3.0 - Algorithm: RSA4096
verify/s, More Is Better
GCC 11.2: 181230.9 (SE +/- 73.97, N = 3)
GCC 12.0 Git: 181522.4 (SE +/- 29.80, N = 3)
(CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl

Primesieve 7.7 - 1e12 Prime Number Generation
Seconds, Fewer Is Better
GCC 11.2: 24.20 (SE +/- 0.00, N = 3)
GCC 12.0 Git: 24.27 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3 -march=native -flto

QuantLib 1.21
MFLOPS, More Is Better
GCC 11.2: 4504.5 (SE +/- 27.05, N = 3)
GCC 12.0 Git: 4550.1 (SE +/- 36.68, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic

Redis 6.0.9 - Test: GET
Requests Per Second, More Is Better
GCC 11.2: 5350610.2 (SE +/- 15711.16, N = 3)
GCC 12.0 Git: 5445328.2 (SE +/- 115300.99, N = 12)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto

Redis 6.0.9 - Test: SET
Requests Per Second, More Is Better
GCC 11.2: 4025423.00 (SE +/- 17947.57, N = 3)
GCC 12.0 Git: 4207347.08 (SE +/- 26067.57, N = 3)
(CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto

SciMark 2.0 - Computational Test: Fast Fourier Transform
Mflops, More Is Better
GCC 11.2: 799.18 (SE +/- 1.71, N = 3)
GCC 12.0 Git: 794.17 (SE +/- 9.66, N = 3)
(CC) gcc options: -O3 -march=native -flto -lm

SciMark 2.0 - Computational Test: Sparse Matrix Multiply
Mflops, More Is Better
GCC 11.2: 4853.99 (SE +/- 50.00, N = 3)
GCC 12.0 Git: 5086.85 (SE +/- 43.65, N = 3)
(CC) gcc options: -O3 -march=native -flto -lm

SciMark 2.0 - Computational Test: Jacobi Successive Over-Relaxation
Mflops, More Is Better
GCC 11.2: 3335.69 (SE +/- 11.93, N = 3)
GCC 12.0 Git: 3336.74 (SE +/- 10.75, N = 3)
(CC) gcc options: -O3 -march=native -flto -lm

SecureMark 1.0.4 - Benchmark: SecureMark-TLS
marks, More Is Better
GCC 11.2: 376623 (SE +/- 459.77, N = 3)
GCC 12.0 Git: 378883 (SE +/- 280.17, N = 3)
(CC) gcc options: -pedantic -O3

SVT-AV1 0.8.7 - Encoder Mode: Preset 8 - Input: Bosphorus 4K
Frames Per Second, More Is Better
GCC 11.2: 18.96 (SE +/- 0.12, N = 3)
GCC 12.0 Git: 18.58 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
GCC 11.2: 159.20 (SE +/- 0.84, N = 3)
GCC 12.0 Git: 155.86 (SE +/- 0.76, N = 3)
(CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
GCC 11.2: 326.15 (SE +/- 0.50, N = 3)
GCC 12.0 Git: 314.09 (SE +/- 1.38, N = 3)
(CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
GCC 11.2: 215.05 (SE +/- 1.34, N = 3)
GCC 12.0 Git: 204.90 (SE +/- 1.91, N = 3)
(CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p
Frames Per Second, More Is Better
GCC 11.2: 224.61 (SE +/- 0.08, N = 3)
GCC 12.0 Git: 211.02 (SE +/- 0.69, N = 3)
(CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Swet 1.5.16 - Average
Operations Per Second, More Is Better
GCC 11.2: 1235344228 (SE +/- 10065234.77, N = 15)
GCC 12.0 Git: 1279053006 (SE +/- 10922654.66, N = 8)
(CC) gcc options: -lm -lpthread -lcurses -lrt

Sysbench 1.0.20 - Test: RAM / Memory
MiB/sec, More Is Better
GCC 11.2: 18566.17 (SE +/- 23.22, N = 3)
GCC 12.0 Git: 18948.14 (SE +/- 79.87, N = 3)
(CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm

Sysbench 1.0.20 - Test: CPU
Events Per Second, More Is Better
GCC 11.2: 38019.77 (SE +/- 1.29, N = 3)
GCC 12.0 Git: 37964.18 (SE +/- 4.15, N = 3)
(CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm

Tachyon 0.99b6 - Total Time
Seconds, Fewer Is Better
GCC 11.2: 66.85 (SE +/- 0.06, N = 3)
GCC 12.0 Git: 66.96 (SE +/- 0.02, N = 3)
(CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA
Seconds, Fewer Is Better
GCC 11.2: 8.379 (SE +/- 0.035, N = 3)
GCC 12.0 Git: 8.457 (SE +/- 0.068, N = 9)
(CC) gcc options: -std=c99 -O3 -lm -lpthread

toyBrot Fractal Generator 2020-11-18 - Implementation: OpenMP
ms, Fewer Is Better
GCC 11.2: 35455 (SE +/- 33.38, N = 3)
GCC 12.0 Git: 35557 (SE +/- 1.20, N = 3)
(CXX) g++ options: -O3 -march=native -flto -lpthread

toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Tasks
ms, Fewer Is Better
GCC 11.2: 31167 (SE +/- 77.38, N = 3)
GCC 12.0 Git: 31366 (SE +/- 51.85, N = 3)
(CXX) g++ options: -O3 -march=native -flto -lpthread

toyBrot Fractal Generator 2020-11-18 - Implementation: C++ Threads
ms, Fewer Is Better
GCC 11.2: 30724 (SE +/- 56.69, N = 3)
GCC 12.0 Git: 30752 (SE +/- 48.72, N = 3)
(CXX) g++ options: -O3 -march=native -flto -lpthread

TTSIOD 3D Renderer 2.3b - Phong Rendering With Soft-Shadow Mapping
FPS, More Is Better
GCC 11.2: 747.95 (SE +/- 1.23, N = 3)
GCC 12.0 Git: 759.21 (SE +/- 8.19, N = 4)
(CXX) g++ options: -O3 -march=native -flto -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

WavPack Audio Encoding 5.3 - WAV To WavPack
Seconds, Fewer Is Better
GCC 11.2: 11.85 (SE +/- 0.00, N = 5)
GCC 12.0 Git: 11.98 (SE +/- 0.02, N = 5)
(CXX) g++ options: -O3 -march=native -flto -rdynamic

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless
Encode Time - Seconds, Fewer Is Better
GCC 11.2: 13.16 (SE +/- 0.01, N = 3)
GCC 12.0 Git: 12.49 (SE +/- 0.03, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless, Highest Compression
Encode Time - Seconds, Fewer Is Better
GCC 11.2: 26.67 (SE +/- 0.11, N = 3)
GCC 12.0 Git: 25.85 (SE +/- 0.05, N = 3)
(CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg

x265 3.4 - Video Input: Bosphorus 4K
Frames Per Second, More Is Better
GCC 11.2: 17.78 (SE +/- 0.17, N = 3)
GCC 12.0 Git: 18.04 (SE +/- 0.23, N = 3)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma

x265 3.4 - Video Input: Bosphorus 1080p
Frames Per Second, More Is Better
GCC 11.2: 70.58 (SE +/- 0.72, N = 3)
GCC 12.0 Git: 70.61 (SE +/- 0.71, N = 6)
(CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma

Xsbench 2017-07-06
Lookups/s, More Is Better
GCC 11.2: 3175399 (SE +/- 382.23, N = 3)
GCC 12.0 Git: 3160393 (SE +/- 9899.00, N = 3)
(CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Zstd Compression 1.5.0 - Compression Level: 3 - Compression Speed
MB/s, More Is Better
GCC 11.2: 3476.6 (SE +/- 34.76, N = 3)
GCC 12.0 Git: 3506.4 (SE +/- 38.86, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Zstd Compression 1.5.0 - Compression Level: 3 - Decompression Speed
MB/s, More Is Better
GCC 11.2: 4706.7 (SE +/- 1.95, N = 3)
GCC 12.0 Git: 4681.2 (SE +/- 2.75, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Zstd Compression 1.5.0 - Compression Level: 19 - Compression Speed
MB/s, More Is Better
GCC 11.2: 33.9 (SE +/- 0.24, N = 15)
GCC 12.0 Git: 34.4 (SE +/- 0.30, N = 7)
(CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Zstd Compression 1.5.0 - Compression Level: 19, Long Mode - Compression Speed
MB/s, More Is Better
GCC 11.2: 29.6 (SE +/- 0.32, N = 5)
GCC 12.0 Git: 28.7 (SE +/- 0.12, N = 3)
(CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma

Phoronix Test Suite v10.8.4