Compiler benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2112135-PTS-COREI51282 Intel Core i5 12600K Alder Lake Compilers - Phoronix Test Suite Intel Core i5 12600K Alder Lake Compilers Compiler benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2112135-PTS-COREI51282&sor&grs .
Intel Core i5 12600K Alder Lake Compilers Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 11.2 GCC 12.0 Git Intel Core i5-12600K @ 6.30GHz (10 Cores / 16 Threads) ASUS PRIME Z690-P WIFI D4 (0605 BIOS) Intel Device 7aa7 16GB 1000GB Western Digital WDS100T1X0E-00AFY0 llvmpipe Realtek ALC897 Realtek RTL8125 2.5GbE + Intel Device 7af0 Ubuntu 21.10 5.15.7-051507-generic (x86_64) GNOME Shell 40.5 X Server 1.20.13 4.5 Mesa 22.0.0-devel (git-d80c7f3 2021-11-14 impish-oibaf-ppa) (LLVM 13.0.0 256 bits) 1.2.197 GCC 11.2.0 ext4 3840x2160 GCC 12.0.0 20211212 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto" Compiler Details - --disable-multilib --enable-checking=release Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x12 - Thermald 2.4.6 Python Details - Python 3.9.7 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Intel Core i5 12600K Alder Lake Compilers nwchem: C240 Buckyball kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p apache: 100 aobench: 2048 x 2048 - Total Time webp: Quality 100, Lossless svt-vp9: VMAF Optimized - Bosphorus 1080p scimark2: Sparse Matrix Multiply ncnn: CPU - alexnet ncnn: CPU - resnet50 redis: SET lammps: Rhodopsin Protein svt-hevc: 10 - Bosphorus 1080p swet: Average webp: Quality 100, Lossless, Highest Compression compress-zstd: 19, Long Mode - Compression Speed ncnn: CPU - efficientnet-b0 avifenc: 10 synthmark: VoiceMark_100 aircrack-ng: svt-hevc: 7 - Bosphorus 1080p sysbench: RAM / Memory svt-av1: Preset 8 - Bosphorus 4K compress-lz4: 9 - Compression Speed compress-lz4: 3 - Compression Speed ngspice: C2670 ncnn: CPU - vgg16 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping compress-zstd: 19 - Compression Speed x265: Bosphorus 4K liquid-dsp: 1 - 256 - 57 compress-7zip: Decompression Rating aom-av1: Speed 9 Realtime - Bosphorus 4K crafty: Elapsed Time encode-wavpack: WAV To WavPack quantlib: mafft: Multiple Sequence Alignment - LSU RNA ngspice: C7552 compress-zstd: 3 - Compression Speed liquid-dsp: 4 - 256 - 57 ncnn: CPU - blazeface ncnn: CPU - regnety_400m coremark: CoreMark Size 666 - Iterations Per Second toybrot: C++ Tasks scimark2: Fast Fourier Transform hpcg: securemark: SecureMark-TLS liquid-dsp: 8 - 256 - 57 compress-zstd: 3 - Decompression Speed gromacs: MPI CPU - water_GMX50_bare compress-7zip: Compression Rating xsbench: helsing: 12 digit tjbench: Decompression Throughput ncnn: CPU-v2-v2 - mobilenet-v2 toktx: Zstd Compression 19 primesieve: 1e12 Prime Number Generation toybrot: OpenMP openssl: SHA256 openfoam: Motorbike 30M ncnn: CPU - yolov4-tiny apache: 200 encode-flac: WAV To FLAC toktx: UASTC 3 tachyon: Total Time openssl: RSA4096 avifenc: 6, Lossless sysbench: CPU himeno: Poisson Pressure Solver openssl: RSA4096 toybrot: C++ Threads toktx: UASTC 3 + Zstd Compression 19 compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Decompression Speed x265: Bosphorus 1080p liquid-dsp: 16 - 256 - 57 scimark2: Jacobi Successive Over-Relaxation gmpbench: Total Time toktx: UASTC 4 + Zstd Compression 19 ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - resnet18 ncnn: CPU - googlenet ncnn: CPU - mnasnet redis: GET lammps: 20k Atoms GCC 11.2 GCC 12.0 Git 7304.2 22.15 39.39 224.61 111283.95 20.198 13.156 215.05 4853.99 8.96 18.88 4025423.00 6.473 326.15 1235344228 26.673 29.6 4.63 2.831 985.959 32259.670 159.20 18566.17 18.958 71.18 72.74 66.804 36.82 747.947 33.9 17.78 99537000 60738 72.10 12620563 11.851 4504.5 8.379 62.523 3476.6 385676667 1.37 6.07 448314.248044 31167 799.18 7.23358 376623 674486667 4706.7 0.971 74682 3175399 5.389 276.018649 2.98 15.258 24.198 35455 15094797463 148.88 19.60 96536.17 11.602 9.896 66.8540 181230.9 44.170 38019.77 9416.908233 2786.8 30724 14.251 12921.7 12900.4 70.58 781553333 3335.69 7975.2 252.795 3.08 2.67 10.06 9.46 2.75 5350610.2 6.303 5210.6 25.85 43.76 211.02 117868.21 19.107 12.494 204.90 5086.85 8.56 18.06 4207347.08 6.746 314.09 1279053006 25.849 28.7 4.49 2.749 1010.93 33043.122 155.86 18948.14 18.579 72.53 74.05 68.001 36.24 759.209 34.4 18.04 100949667 61439 72.92 12479935 11.977 4550.1 8.457 63.077 3506.4 388656667 1.36 6.03 445380.872792 31366 794.17 7.18854 378883 670536667 4681.2 0.976 74313 3160393 5.414 274.782778 2.99 15.210 24.274 35557 15135893657 149.28 19.65 96741.53 11.578 9.914 66.9649 181522.4 44.236 37964.18 9404.408297 2790.4 30752 14.262 12913.2 12894.7 70.61 781873333 3336.74 7976.5 252.762 3.08 2.67 9.66 9.12 2.71 5445328.2 6.677 OpenBenchmarking.org
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball GCC 12.0 Git GCC 11.2 1600 3200 4800 6400 8000 5210.6 7304.2 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 12.0 Git GCC 11.2 6 12 18 24 30 SE +/- 0.21, N = 3 SE +/- 0.19, N = 3 25.85 22.15 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 12.0 Git GCC 11.2 10 20 30 40 50 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 43.76 39.39 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.69, N = 3 224.61 211.02 1. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 GCC 12.0 Git GCC 11.2 30K 60K 90K 120K 150K SE +/- 276.65, N = 3 SE +/- 164.15, N = 3 117868.21 111283.95 1. (CC) gcc options: -shared -fPIC -O3 -march=native -flto
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 12.0 Git GCC 11.2 5 10 15 20 25 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 19.11 20.20 1. (CC) gcc options: -lm -O3 -march=native -flto
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 12.0 Git GCC 11.2 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 12.49 13.16 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 50 100 150 200 250 SE +/- 1.34, N = 3 SE +/- 1.91, N = 3 215.05 204.90 1. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 12.0 Git GCC 11.2 1100 2200 3300 4400 5500 SE +/- 43.65, N = 3 SE +/- 50.00, N = 3 5086.85 4853.99 1. (CC) gcc options: -O3 -march=native -flto -lm
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet GCC 12.0 Git GCC 11.2 3 6 9 12 15 SE +/- 0.13, N = 13 SE +/- 0.13, N = 15 8.56 8.96 MIN: 8.18 / MAX: 15 MIN: 8.29 / MAX: 11.75 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 GCC 12.0 Git GCC 11.2 5 10 15 20 25 SE +/- 0.16, N = 13 SE +/- 0.23, N = 15 18.06 18.88 MIN: 17.76 / MAX: 20.55 MIN: 18.22 / MAX: 21.16 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 12.0 Git GCC 11.2 900K 1800K 2700K 3600K 4500K SE +/- 26067.57, N = 3 SE +/- 17947.57, N = 3 4207347.08 4025423.00 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein GCC 12.0 Git GCC 11.2 2 4 6 8 10 SE +/- 0.080, N = 4 SE +/- 0.068, N = 3 6.746 6.473 1. (CXX) g++ options: -O3 -march=native -flto -lm
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 70 140 210 280 350 SE +/- 0.50, N = 3 SE +/- 1.38, N = 3 326.15 314.09 1. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
Swet Average OpenBenchmarking.org Operations Per Second, More Is Better Swet 1.5.16 Average GCC 12.0 Git GCC 11.2 300M 600M 900M 1200M 1500M SE +/- 10922654.66, N = 8 SE +/- 10065234.77, N = 15 1279053006 1235344228 1. (CC) gcc options: -lm -lpthread -lcurses -lrt
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 12.0 Git GCC 11.2 6 12 18 24 30 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 25.85 26.67 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed GCC 11.2 GCC 12.0 Git 7 14 21 28 35 SE +/- 0.32, N = 5 SE +/- 0.12, N = 3 29.6 28.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 GCC 12.0 Git GCC 11.2 1.0418 2.0836 3.1254 4.1672 5.209 SE +/- 0.06, N = 13 SE +/- 0.06, N = 15 4.49 4.63 MIN: 4.24 / MAX: 10.94 MIN: 4.29 / MAX: 5.12 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 12.0 Git GCC 11.2 0.637 1.274 1.911 2.548 3.185 SE +/- 0.027, N = 15 SE +/- 0.039, N = 15 2.749 2.831 1. (CXX) g++ options: -O3 -fPIC -lm
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 12.0 Git GCC 11.2 200 400 600 800 1000 SE +/- 3.82, N = 3 SE +/- 2.07, N = 3 1010.93 985.96 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 GCC 12.0 Git GCC 11.2 7K 14K 21K 28K 35K SE +/- 205.89, N = 3 SE +/- 363.05, N = 3 33043.12 32259.67 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 40 80 120 160 200 SE +/- 0.84, N = 3 SE +/- 0.76, N = 3 159.20 155.86 1. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory GCC 12.0 Git GCC 11.2 4K 8K 12K 16K 20K SE +/- 79.87, N = 3 SE +/- 23.22, N = 3 18948.14 18566.17 1. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 18.96 18.58 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 12.0 Git GCC 11.2 16 32 48 64 80 SE +/- 0.10, N = 3 SE +/- 0.14, N = 3 72.53 71.18 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 12.0 Git GCC 11.2 16 32 48 64 80 SE +/- 0.24, N = 3 SE +/- 0.13, N = 3 74.05 72.74 1. (CC) gcc options: -O3
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 11.2 GCC 12.0 Git 15 30 45 60 75 SE +/- 0.17, N = 3 SE +/- 0.22, N = 3 66.80 68.00 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 GCC 12.0 Git GCC 11.2 8 16 24 32 40 SE +/- 0.04, N = 13 SE +/- 0.05, N = 15 36.24 36.82 MIN: 35.99 / MAX: 43.4 MIN: 36.55 / MAX: 38.66 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping GCC 12.0 Git GCC 11.2 160 320 480 640 800 SE +/- 8.19, N = 4 SE +/- 1.23, N = 3 759.21 747.95 1. (CXX) g++ options: -O3 -march=native -flto -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed GCC 12.0 Git GCC 11.2 8 16 24 32 40 SE +/- 0.30, N = 7 SE +/- 0.24, N = 15 34.4 33.9 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 12.0 Git GCC 11.2 4 8 12 16 20 SE +/- 0.23, N = 3 SE +/- 0.17, N = 3 18.04 17.78 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 12.0 Git GCC 11.2 20M 40M 60M 80M 100M SE +/- 995538.44, N = 3 SE +/- 1296527.03, N = 3 100949667 99537000 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Decompression Rating GCC 12.0 Git GCC 11.2 13K 26K 39K 52K 65K SE +/- 809.44, N = 3 SE +/- 518.05, N = 3 61439 60738 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K GCC 12.0 Git GCC 11.2 16 32 48 64 80 SE +/- 0.12, N = 3 SE +/- 0.67, N = 13 72.92 72.10 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -U_FORTIFY_SOURCE -lm
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 11.2 GCC 12.0 Git 3M 6M 9M 12M 15M SE +/- 81918.59, N = 3 SE +/- 85295.89, N = 3 12620563 12479935 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.00, N = 5 SE +/- 0.02, N = 5 11.85 11.98 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 12.0 Git GCC 11.2 1000 2000 3000 4000 5000 SE +/- 36.68, N = 3 SE +/- 27.05, N = 3 4550.1 4504.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.035, N = 3 SE +/- 0.068, N = 9 8.379 8.457 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 11.2 GCC 12.0 Git 14 28 42 56 70 SE +/- 0.08, N = 3 SE +/- 0.57, N = 3 62.52 63.08 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed GCC 12.0 Git GCC 11.2 800 1600 2400 3200 4000 SE +/- 38.86, N = 3 SE +/- 34.76, N = 3 3506.4 3476.6 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 GCC 12.0 Git GCC 11.2 80M 160M 240M 320M 400M SE +/- 1083471.79, N = 3 SE +/- 2089771.39, N = 3 388656667 385676667 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface GCC 12.0 Git GCC 11.2 0.3083 0.6166 0.9249 1.2332 1.5415 SE +/- 0.02, N = 13 SE +/- 0.01, N = 15 1.36 1.37 MIN: 1.3 / MAX: 3.44 MIN: 1.3 / MAX: 1.59 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m GCC 12.0 Git GCC 11.2 2 4 6 8 10 SE +/- 0.05, N = 13 SE +/- 0.03, N = 15 6.03 6.07 MIN: 5.81 / MAX: 81.91 MIN: 5.81 / MAX: 9.25 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 11.2 GCC 12.0 Git 100K 200K 300K 400K 500K SE +/- 736.27, N = 3 SE +/- 611.70, N = 3 448314.25 445380.87 1. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks GCC 11.2 GCC 12.0 Git 7K 14K 21K 28K 35K SE +/- 77.38, N = 3 SE +/- 51.85, N = 3 31167 31366 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 11.2 GCC 12.0 Git 200 400 600 800 1000 SE +/- 1.71, N = 3 SE +/- 9.66, N = 3 799.18 794.17 1. (CC) gcc options: -O3 -march=native -flto -lm
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.04428, N = 3 SE +/- 0.05896, N = 3 7.23358 7.18854 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS GCC 12.0 Git GCC 11.2 80K 160K 240K 320K 400K SE +/- 280.17, N = 3 SE +/- 459.77, N = 3 378883 376623 1. (CC) gcc options: -pedantic -O3
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 GCC 11.2 GCC 12.0 Git 140M 280M 420M 560M 700M SE +/- 211843.76, N = 3 SE +/- 2965829.47, N = 3 674486667 670536667 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed GCC 11.2 GCC 12.0 Git 1000 2000 3000 4000 5000 SE +/- 1.95, N = 3 SE +/- 2.75, N = 3 4706.7 4681.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare GCC 12.0 Git GCC 11.2 0.2196 0.4392 0.6588 0.8784 1.098 SE +/- 0.010, N = 6 SE +/- 0.009, N = 7 0.976 0.971 1. (CXX) g++ options: -O3 -march=native -flto
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Compression Rating GCC 11.2 GCC 12.0 Git 16K 32K 48K 64K 80K SE +/- 174.08, N = 3 SE +/- 181.56, N = 3 74682 74313 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Xsbench OpenBenchmarking.org Lookups/s, More Is Better Xsbench 2017-07-06 GCC 11.2 GCC 12.0 Git 700K 1400K 2100K 2800K 3500K SE +/- 382.23, N = 3 SE +/- 9899.00, N = 3 3175399 3160393 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm
Helsing Digit Range: 12 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 12 digit GCC 11.2 GCC 12.0 Git 1.2182 2.4364 3.6546 4.8728 6.091 SE +/- 0.021, N = 3 SE +/- 0.022, N = 3 5.389 5.414 1. (CC) gcc options: -O2 -pthread
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput GCC 11.2 GCC 12.0 Git 60 120 180 240 300 SE +/- 1.33, N = 3 SE +/- 0.67, N = 3 276.02 274.78 1. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 11.2 GCC 12.0 Git 0.6728 1.3456 2.0184 2.6912 3.364 SE +/- 0.02, N = 15 SE +/- 0.04, N = 13 2.98 2.99 MIN: 2.77 / MAX: 3.58 MIN: 2.73 / MAX: 9.17 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
KTX-Software toktx Settings: Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 19 GCC 12.0 Git GCC 11.2 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 15.21 15.26
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.7 1e12 Prime Number Generation GCC 11.2 GCC 12.0 Git 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 24.20 24.27 1. (CXX) g++ options: -O3 -march=native -flto
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP GCC 11.2 GCC 12.0 Git 8K 16K 24K 32K 40K SE +/- 33.38, N = 3 SE +/- 1.20, N = 3 35455 35557 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.0 Algorithm: SHA256 GCC 12.0 Git GCC 11.2 3000M 6000M 9000M 12000M 15000M SE +/- 6276939.55, N = 3 SE +/- 15623989.67, N = 3 15135893657 15094797463 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 11.2 GCC 12.0 Git 30 60 90 120 150 SE +/- 2.06, N = 3 SE +/- 1.84, N = 3 148.88 149.28 -lfoamToVTK -llagrangian -lfileFormats 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.21, N = 15 SE +/- 0.24, N = 13 19.60 19.65 MIN: 18.33 / MAX: 27.27 MIN: 17.91 / MAX: 21.99 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 GCC 12.0 Git GCC 11.2 20K 40K 60K 80K 100K SE +/- 56.54, N = 3 SE +/- 143.96, N = 3 96741.53 96536.17 1. (CC) gcc options: -shared -fPIC -O3 -march=native -flto
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.3 WAV To FLAC GCC 12.0 Git GCC 11.2 3 6 9 12 15 SE +/- 0.01, N = 5 SE +/- 0.02, N = 5 11.58 11.60 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.010, N = 3 SE +/- 0.013, N = 3 9.896 9.914
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 11.2 GCC 12.0 Git 15 30 45 60 75 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 66.85 66.96 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 GCC 12.0 Git GCC 11.2 40K 80K 120K 160K 200K SE +/- 29.80, N = 3 SE +/- 73.97, N = 3 181522.4 181230.9 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 11.2 GCC 12.0 Git 10 20 30 40 50 SE +/- 0.32, N = 3 SE +/- 0.13, N = 3 44.17 44.24 1. (CXX) g++ options: -O3 -fPIC -lm
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU GCC 11.2 GCC 12.0 Git 8K 16K 24K 32K 40K SE +/- 1.29, N = 3 SE +/- 4.15, N = 3 38019.77 37964.18 1. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 11.2 GCC 12.0 Git 2K 4K 6K 8K 10K SE +/- 17.81, N = 3 SE +/- 10.06, N = 3 9416.91 9404.41 1. (CC) gcc options: -O3 -march=native -flto -mavx2
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 GCC 12.0 Git GCC 11.2 600 1200 1800 2400 3000 SE +/- 0.10, N = 3 SE +/- 0.78, N = 3 2790.4 2786.8 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads GCC 11.2 GCC 12.0 Git 7K 14K 21K 28K 35K SE +/- 56.69, N = 3 SE +/- 48.72, N = 3 30724 30752 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 GCC 11.2 GCC 12.0 Git 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.25 14.26
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 11.2 GCC 12.0 Git 3K 6K 9K 12K 15K SE +/- 3.58, N = 3 SE +/- 1.03, N = 3 12921.7 12913.2 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 11.2 GCC 12.0 Git 3K 6K 9K 12K 15K SE +/- 38.78, N = 3 SE +/- 12.58, N = 3 12900.4 12894.7 1. (CC) gcc options: -O3
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 12.0 Git GCC 11.2 16 32 48 64 80 SE +/- 0.71, N = 6 SE +/- 0.72, N = 3 70.61 70.58 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 12.0 Git GCC 11.2 200M 400M 600M 800M 1000M SE +/- 153767.50, N = 3 SE +/- 242578.74, N = 3 781873333 781553333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 12.0 Git GCC 11.2 700 1400 2100 2800 3500 SE +/- 10.75, N = 3 SE +/- 11.93, N = 3 3336.74 3335.69 1. (CC) gcc options: -O3 -march=native -flto -lm
GNU GMP GMPbench Total Time OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time GCC 12.0 Git GCC 11.2 2K 4K 6K 8K 10K 7976.5 7975.2 1. (CC) gcc options: -O3 -march=native -flto -lm
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 GCC 12.0 Git GCC 11.2 60 120 180 240 300 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 252.76 252.80
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 GCC 11.2 GCC 12.0 Git 0.693 1.386 2.079 2.772 3.465 SE +/- 0.01, N = 12 SE +/- 0.01, N = 13 3.08 3.08 MIN: 3.01 / MAX: 3.29 MIN: 2.95 / MAX: 9.11 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 11.2 GCC 12.0 Git 0.6008 1.2016 1.8024 2.4032 3.004 SE +/- 0.02, N = 15 SE +/- 0.03, N = 13 2.67 2.67 MIN: 2.52 / MAX: 3.2 MIN: 2.47 / MAX: 8.86 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 GCC 12.0 Git GCC 11.2 3 6 9 12 15 SE +/- 0.14, N = 13 SE +/- 0.19, N = 15 9.66 10.06 MIN: 9.41 / MAX: 16.78 MIN: 9.51 / MAX: 14.28 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet GCC 12.0 Git GCC 11.2 3 6 9 12 15 SE +/- 0.15, N = 13 SE +/- 0.21, N = 15 9.12 9.46 MIN: 8.8 / MAX: 16.01 MIN: 8.89 / MAX: 11.34 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet GCC 12.0 Git GCC 11.2 0.6188 1.2376 1.8564 2.4752 3.094 SE +/- 0.06, N = 13 SE +/- 0.03, N = 15 2.71 2.75 MIN: 2.43 / MAX: 8.92 MIN: 2.49 / MAX: 3.1 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 12.0 Git GCC 11.2 1.2M 2.4M 3.6M 4.8M 6M SE +/- 115300.99, N = 12 SE +/- 15711.16, N = 3 5445328.2 5350610.2 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms GCC 12.0 Git GCC 11.2 2 4 6 8 10 SE +/- 0.157, N = 9 SE +/- 0.117, N = 9 6.677 6.303 1. (CXX) g++ options: -O3 -march=native -flto -lm
Phoronix Test Suite v10.8.4