Compiler benchmarks by Michael Larabel for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2112135-PTS-COREI51282 Intel Core i5 12600K Alder Lake Compilers - Phoronix Test Suite Intel Core i5 12600K Alder Lake Compilers Compiler benchmarks by Michael Larabel for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2112135-PTS-COREI51282&gru&rdt .
Intel Core i5 12600K Alder Lake Compilers Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenGL Vulkan Compiler File-System Screen Resolution GCC 11.2 GCC 12.0 Git Intel Core i5-12600K @ 6.30GHz (10 Cores / 16 Threads) ASUS PRIME Z690-P WIFI D4 (0605 BIOS) Intel Device 7aa7 16GB 1000GB Western Digital WDS100T1X0E-00AFY0 llvmpipe Realtek ALC897 Realtek RTL8125 2.5GbE + Intel Device 7af0 Ubuntu 21.10 5.15.7-051507-generic (x86_64) GNOME Shell 40.5 X Server 1.20.13 4.5 Mesa 22.0.0-devel (git-d80c7f3 2021-11-14 impish-oibaf-ppa) (LLVM 13.0.0 256 bits) 1.2.197 GCC 11.2.0 ext4 3840x2160 GCC 12.0.0 20211212 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto" Compiler Details - --disable-multilib --enable-checking=release Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x12 - Thermald 2.4.6 Python Details - Python 3.9.7 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Intel Core i5 12600K Alder Lake Compilers openssl: SHA256 sysbench: CPU ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping aom-av1: Speed 9 Realtime - Bosphorus 4K kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast svt-av1: Preset 8 - Bosphorus 4K svt-hevc: 7 - Bosphorus 1080p svt-hevc: 10 - Bosphorus 1080p svt-vp9: VMAF Optimized - Bosphorus 1080p svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p x265: Bosphorus 4K x265: Bosphorus 1080p hpcg: gmpbench: Total Time coremark: CoreMark Size 666 - Iterations Per Second aircrack-ng: xsbench: securemark: SecureMark-TLS compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed compress-zstd: 3 - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19, Long Mode - Compression Speed tjbench: Decompression Throughput quantlib: scimark2: Fast Fourier Transform scimark2: Sparse Matrix Multiply scimark2: Jacobi Successive Over-Relaxation himeno: Poisson Pressure Solver sysbench: RAM / Memory compress-7zip: Compression Rating compress-7zip: Decompression Rating crafty: Elapsed Time gromacs: MPI CPU - water_GMX50_bare lammps: 20k Atoms lammps: Rhodopsin Protein swet: Average redis: GET redis: SET apache: 100 apache: 200 liquid-dsp: 1 - 256 - 57 liquid-dsp: 4 - 256 - 57 liquid-dsp: 8 - 256 - 57 liquid-dsp: 16 - 256 - 57 openssl: RSA4096 openssl: RSA4096 synthmark: VoiceMark_100 webp: Quality 100, Lossless webp: Quality 100, Lossless, Highest Compression toybrot: OpenMP toybrot: C++ Tasks toybrot: C++ Threads ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - regnety_400m ncnn: CPU - shufflenet-v2 nwchem: C240 Buckyball mafft: Multiple Sequence Alignment - LSU RNA openfoam: Motorbike 30M avifenc: 10 avifenc: 6, Lossless primesieve: 1e12 Prime Number Generation aobench: 2048 x 2048 - Total Time encode-flac: WAV To FLAC helsing: 12 digit ngspice: C2670 ngspice: C7552 tachyon: Total Time toktx: UASTC 3 toktx: Zstd Compression 19 toktx: UASTC 3 + Zstd Compression 19 toktx: UASTC 4 + Zstd Compression 19 encode-wavpack: WAV To WavPack GCC 11.2 GCC 12.0 Git 15094797463 38019.77 747.947 72.10 22.15 39.39 18.958 159.20 326.15 215.05 224.61 17.78 70.58 7.23358 7975.2 448314.248044 32259.670 3175399 376623 72.74 12921.7 71.18 12900.4 3476.6 4706.7 33.9 29.6 276.018649 4504.5 799.18 4853.99 3335.69 9416.908233 18566.17 74682 60738 12620563 0.971 6.303 6.473 1235344228 5350610.2 4025423.00 111283.95 96536.17 99537000 385676667 674486667 781553333 2786.8 181230.9 985.959 13.156 26.673 35455 31167 30724 2.98 2.67 2.75 4.63 1.37 9.46 36.82 10.06 8.96 18.88 19.60 6.07 3.08 7304.2 8.379 148.88 2.831 44.170 24.198 20.198 11.602 5.389 66.804 62.523 66.8540 9.896 15.258 14.251 252.795 11.851 15135893657 37964.18 759.209 72.92 25.85 43.76 18.579 155.86 314.09 204.90 211.02 18.04 70.61 7.18854 7976.5 445380.872792 33043.122 3160393 378883 74.05 12913.2 72.53 12894.7 3506.4 4681.2 34.4 28.7 274.782778 4550.1 794.17 5086.85 3336.74 9404.408297 18948.14 74313 61439 12479935 0.976 6.677 6.746 1279053006 5445328.2 4207347.08 117868.21 96741.53 100949667 388656667 670536667 781873333 2790.4 181522.4 1010.93 12.494 25.849 35557 31366 30752 2.99 2.67 2.71 4.49 1.36 9.12 36.24 9.66 8.56 18.06 19.65 6.03 3.08 5210.6 8.457 149.28 2.749 44.236 24.274 19.107 11.578 5.414 68.001 63.077 66.9649 9.914 15.210 14.262 252.762 11.977 OpenBenchmarking.org
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.0 Algorithm: SHA256 GCC 11.2 GCC 12.0 Git 3000M 6000M 9000M 12000M 15000M SE +/- 15623989.67, N = 3 SE +/- 6276939.55, N = 3 15094797463 15135893657 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
Sysbench Test: CPU OpenBenchmarking.org Events Per Second, More Is Better Sysbench 1.0.20 Test: CPU GCC 11.2 GCC 12.0 Git 8K 16K 24K 32K 40K SE +/- 1.29, N = 3 SE +/- 4.15, N = 3 38019.77 37964.18 1. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping GCC 11.2 GCC 12.0 Git 160 320 480 640 800 SE +/- 1.23, N = 3 SE +/- 8.19, N = 4 747.95 759.21 1. (CXX) g++ options: -O3 -march=native -flto -fomit-frame-pointer -ffast-math -mtune=native -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
AOM AV1 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.2 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K GCC 11.2 GCC 12.0 Git 16 32 48 64 80 SE +/- 0.67, N = 13 SE +/- 0.12, N = 3 72.10 72.92 1. (CXX) g++ options: -O3 -march=native -flto -std=c++11 -U_FORTIFY_SOURCE -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Very Fast GCC 11.2 GCC 12.0 Git 6 12 18 24 30 SE +/- 0.19, N = 3 SE +/- 0.21, N = 3 22.15 25.85 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast GCC 11.2 GCC 12.0 Git 10 20 30 40 50 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 39.39 43.76 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -flto -lpthread -lm -lrt
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 18.96 18.58 1. (CXX) g++ options: -O3 -march=native -flto -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 40 80 120 160 200 SE +/- 0.84, N = 3 SE +/- 0.76, N = 3 159.20 155.86 1. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 70 140 210 280 350 SE +/- 0.50, N = 3 SE +/- 1.38, N = 3 326.15 314.09 1. (CC) gcc options: -O3 -march=native -flto -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 50 100 150 200 250 SE +/- 1.34, N = 3 SE +/- 1.91, N = 3 215.05 204.90 1. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.69, N = 3 224.61 211.02 1. (CC) gcc options: -O3 -fcommon -march=native -flto -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K GCC 11.2 GCC 12.0 Git 4 8 12 16 20 SE +/- 0.17, N = 3 SE +/- 0.23, N = 3 17.78 18.04 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p GCC 11.2 GCC 12.0 Git 16 32 48 64 80 SE +/- 0.72, N = 3 SE +/- 0.71, N = 6 70.58 70.61 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lpthread -lrt -ldl -lnuma
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.04428, N = 3 SE +/- 0.05896, N = 3 7.23358 7.18854 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
GNU GMP GMPbench Total Time OpenBenchmarking.org GMPbench Score, More Is Better GNU GMP GMPbench 6.2.1 Total Time GCC 11.2 GCC 12.0 Git 2K 4K 6K 8K 10K 7975.2 7976.5 1. (CC) gcc options: -O3 -march=native -flto -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 11.2 GCC 12.0 Git 100K 200K 300K 400K 500K SE +/- 736.27, N = 3 SE +/- 611.70, N = 3 448314.25 445380.87 1. (CC) gcc options: -O2 -O3 -march=native -flto -lrt" -lrt
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 GCC 11.2 GCC 12.0 Git 7K 14K 21K 28K 35K SE +/- 363.05, N = 3 SE +/- 205.89, N = 3 32259.67 33043.12 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Xsbench OpenBenchmarking.org Lookups/s, More Is Better Xsbench 2017-07-06 GCC 11.2 GCC 12.0 Git 700K 1400K 2100K 2800K 3500K SE +/- 382.23, N = 3 SE +/- 9899.00, N = 3 3175399 3160393 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm
SecureMark Benchmark: SecureMark-TLS OpenBenchmarking.org marks, More Is Better SecureMark 1.0.4 Benchmark: SecureMark-TLS GCC 11.2 GCC 12.0 Git 80K 160K 240K 320K 400K SE +/- 459.77, N = 3 SE +/- 280.17, N = 3 376623 378883 1. (CC) gcc options: -pedantic -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 11.2 GCC 12.0 Git 16 32 48 64 80 SE +/- 0.13, N = 3 SE +/- 0.24, N = 3 72.74 74.05 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 11.2 GCC 12.0 Git 3K 6K 9K 12K 15K SE +/- 3.58, N = 3 SE +/- 1.03, N = 3 12921.7 12913.2 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 11.2 GCC 12.0 Git 16 32 48 64 80 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 71.18 72.53 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 11.2 GCC 12.0 Git 3K 6K 9K 12K 15K SE +/- 38.78, N = 3 SE +/- 12.58, N = 3 12900.4 12894.7 1. (CC) gcc options: -O3
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed GCC 11.2 GCC 12.0 Git 800 1600 2400 3200 4000 SE +/- 34.76, N = 3 SE +/- 38.86, N = 3 3476.6 3506.4 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed GCC 11.2 GCC 12.0 Git 1000 2000 3000 4000 5000 SE +/- 1.95, N = 3 SE +/- 2.75, N = 3 4706.7 4681.2 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed GCC 11.2 GCC 12.0 Git 8 16 24 32 40 SE +/- 0.24, N = 15 SE +/- 0.30, N = 7 33.9 34.4 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed GCC 11.2 GCC 12.0 Git 7 14 21 28 35 SE +/- 0.32, N = 5 SE +/- 0.12, N = 3 29.6 28.7 1. (CC) gcc options: -O3 -march=native -flto -pthread -lz -llzma
libjpeg-turbo tjbench Test: Decompression Throughput OpenBenchmarking.org Megapixels/sec, More Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput GCC 11.2 GCC 12.0 Git 60 120 180 240 300 SE +/- 1.33, N = 3 SE +/- 0.67, N = 3 276.02 274.78 1. (CC) gcc options: -O3 -march=native -flto -rdynamic -lm
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 GCC 11.2 GCC 12.0 Git 1000 2000 3000 4000 5000 SE +/- 27.05, N = 3 SE +/- 36.68, N = 3 4504.5 4550.1 1. (CXX) g++ options: -O3 -march=native -rdynamic
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 11.2 GCC 12.0 Git 200 400 600 800 1000 SE +/- 1.71, N = 3 SE +/- 9.66, N = 3 799.18 794.17 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 11.2 GCC 12.0 Git 1100 2200 3300 4400 5500 SE +/- 50.00, N = 3 SE +/- 43.65, N = 3 4853.99 5086.85 1. (CC) gcc options: -O3 -march=native -flto -lm
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 11.2 GCC 12.0 Git 700 1400 2100 2800 3500 SE +/- 11.93, N = 3 SE +/- 10.75, N = 3 3335.69 3336.74 1. (CC) gcc options: -O3 -march=native -flto -lm
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver GCC 11.2 GCC 12.0 Git 2K 4K 6K 8K 10K SE +/- 17.81, N = 3 SE +/- 10.06, N = 3 9416.91 9404.41 1. (CC) gcc options: -O3 -march=native -flto -mavx2
Sysbench Test: RAM / Memory OpenBenchmarking.org MiB/sec, More Is Better Sysbench 1.0.20 Test: RAM / Memory GCC 11.2 GCC 12.0 Git 4K 8K 12K 16K 20K SE +/- 23.22, N = 3 SE +/- 79.87, N = 3 18566.17 18948.14 1. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -flto -rdynamic -ldl -laio -lm
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Compression Rating GCC 11.2 GCC 12.0 Git 16K 32K 48K 64K 80K SE +/- 174.08, N = 3 SE +/- 181.56, N = 3 74682 74313 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 21.06 Test: Decompression Rating GCC 11.2 GCC 12.0 Git 13K 26K 39K 52K 65K SE +/- 518.05, N = 3 SE +/- 809.44, N = 3 60738 61439 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Crafty Elapsed Time OpenBenchmarking.org Nodes Per Second, More Is Better Crafty 25.2 Elapsed Time GCC 11.2 GCC 12.0 Git 3M 6M 9M 12M 15M SE +/- 81918.59, N = 3 SE +/- 85295.89, N = 3 12620563 12479935 1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare GCC 11.2 GCC 12.0 Git 0.2196 0.4392 0.6588 0.8784 1.098 SE +/- 0.009, N = 7 SE +/- 0.010, N = 6 0.971 0.976 1. (CXX) g++ options: -O3 -march=native -flto
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.117, N = 9 SE +/- 0.157, N = 9 6.303 6.677 1. (CXX) g++ options: -O3 -march=native -flto -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.068, N = 3 SE +/- 0.080, N = 4 6.473 6.746 1. (CXX) g++ options: -O3 -march=native -flto -lm
Swet Average OpenBenchmarking.org Operations Per Second, More Is Better Swet 1.5.16 Average GCC 11.2 GCC 12.0 Git 300M 600M 900M 1200M 1500M SE +/- 10065234.77, N = 15 SE +/- 10922654.66, N = 8 1235344228 1279053006 1. (CC) gcc options: -lm -lpthread -lcurses -lrt
Redis Test: GET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: GET GCC 11.2 GCC 12.0 Git 1.2M 2.4M 3.6M 4.8M 6M SE +/- 15711.16, N = 3 SE +/- 115300.99, N = 12 5350610.2 5445328.2 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Redis Test: SET OpenBenchmarking.org Requests Per Second, More Is Better Redis 6.0.9 Test: SET GCC 11.2 GCC 12.0 Git 900K 1800K 2700K 3600K 4500K SE +/- 17947.57, N = 3 SE +/- 26067.57, N = 3 4025423.00 4207347.08 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -march=native -flto
Apache HTTP Server Concurrent Requests: 100 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 100 GCC 11.2 GCC 12.0 Git 30K 60K 90K 120K 150K SE +/- 164.15, N = 3 SE +/- 276.65, N = 3 111283.95 117868.21 1. (CC) gcc options: -shared -fPIC -O3 -march=native -flto
Apache HTTP Server Concurrent Requests: 200 OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.48 Concurrent Requests: 200 GCC 11.2 GCC 12.0 Git 20K 40K 60K 80K 100K SE +/- 143.96, N = 3 SE +/- 56.54, N = 3 96536.17 96741.53 1. (CC) gcc options: -shared -fPIC -O3 -march=native -flto
Liquid-DSP Threads: 1 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 1 - Buffer Length: 256 - Filter Length: 57 GCC 11.2 GCC 12.0 Git 20M 40M 60M 80M 100M SE +/- 1296527.03, N = 3 SE +/- 995538.44, N = 3 99537000 100949667 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 4 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 4 - Buffer Length: 256 - Filter Length: 57 GCC 11.2 GCC 12.0 Git 80M 160M 240M 320M 400M SE +/- 2089771.39, N = 3 SE +/- 1083471.79, N = 3 385676667 388656667 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 8 - Buffer Length: 256 - Filter Length: 57 GCC 11.2 GCC 12.0 Git 140M 280M 420M 560M 700M SE +/- 211843.76, N = 3 SE +/- 2965829.47, N = 3 674486667 670536667 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 GCC 11.2 GCC 12.0 Git 200M 400M 600M 800M 1000M SE +/- 242578.74, N = 3 SE +/- 153767.50, N = 3 781553333 781873333 1. (CC) gcc options: -O3 -march=native -flto -pthread -lm -lc -lliquid
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 GCC 11.2 GCC 12.0 Git 600 1200 1800 2400 3000 SE +/- 0.78, N = 3 SE +/- 0.10, N = 3 2786.8 2790.4 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.0 Algorithm: RSA4096 GCC 11.2 GCC 12.0 Git 40K 80K 120K 160K 200K SE +/- 73.97, N = 3 SE +/- 29.80, N = 3 181230.9 181522.4 1. (CC) gcc options: -pthread -m64 -O3 -march=native -flto -lssl -lcrypto -ldl
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 11.2 GCC 12.0 Git 200 400 600 800 1000 SE +/- 2.07, N = 3 SE +/- 3.82, N = 3 985.96 1010.93 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 13.16 12.49 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 11.2 GCC 12.0 Git 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.05, N = 3 26.67 25.85 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -flto -lm -lpng16 -ljpeg
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP GCC 11.2 GCC 12.0 Git 8K 16K 24K 32K 40K SE +/- 33.38, N = 3 SE +/- 1.20, N = 3 35455 35557 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks GCC 11.2 GCC 12.0 Git 7K 14K 21K 28K 35K SE +/- 77.38, N = 3 SE +/- 51.85, N = 3 31167 31366 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads GCC 11.2 GCC 12.0 Git 7K 14K 21K 28K 35K SE +/- 56.69, N = 3 SE +/- 48.72, N = 3 30724 30752 1. (CXX) g++ options: -O3 -march=native -flto -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 11.2 GCC 12.0 Git 0.6728 1.3456 2.0184 2.6912 3.364 SE +/- 0.02, N = 15 SE +/- 0.04, N = 13 2.98 2.99 MIN: 2.77 / MAX: 3.58 MIN: 2.73 / MAX: 9.17 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 GCC 11.2 GCC 12.0 Git 0.6008 1.2016 1.8024 2.4032 3.004 SE +/- 0.02, N = 15 SE +/- 0.03, N = 13 2.67 2.67 MIN: 2.52 / MAX: 3.2 MIN: 2.47 / MAX: 8.86 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet GCC 11.2 GCC 12.0 Git 0.6188 1.2376 1.8564 2.4752 3.094 SE +/- 0.03, N = 15 SE +/- 0.06, N = 13 2.75 2.71 MIN: 2.49 / MAX: 3.1 MIN: 2.43 / MAX: 8.92 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 GCC 11.2 GCC 12.0 Git 1.0418 2.0836 3.1254 4.1672 5.209 SE +/- 0.06, N = 15 SE +/- 0.06, N = 13 4.63 4.49 MIN: 4.29 / MAX: 5.12 MIN: 4.24 / MAX: 10.94 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface GCC 11.2 GCC 12.0 Git 0.3083 0.6166 0.9249 1.2332 1.5415 SE +/- 0.01, N = 15 SE +/- 0.02, N = 13 1.37 1.36 MIN: 1.3 / MAX: 1.59 MIN: 1.3 / MAX: 3.44 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.21, N = 15 SE +/- 0.15, N = 13 9.46 9.12 MIN: 8.89 / MAX: 11.34 MIN: 8.8 / MAX: 16.01 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 GCC 11.2 GCC 12.0 Git 8 16 24 32 40 SE +/- 0.05, N = 15 SE +/- 0.04, N = 13 36.82 36.24 MIN: 36.55 / MAX: 38.66 MIN: 35.99 / MAX: 43.4 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.19, N = 15 SE +/- 0.14, N = 13 10.06 9.66 MIN: 9.51 / MAX: 14.28 MIN: 9.41 / MAX: 16.78 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.13, N = 15 SE +/- 0.13, N = 13 8.96 8.56 MIN: 8.29 / MAX: 11.75 MIN: 8.18 / MAX: 15 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.23, N = 15 SE +/- 0.16, N = 13 18.88 18.06 MIN: 18.22 / MAX: 21.16 MIN: 17.76 / MAX: 20.55 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.21, N = 15 SE +/- 0.24, N = 13 19.60 19.65 MIN: 18.33 / MAX: 27.27 MIN: 17.91 / MAX: 21.99 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.03, N = 15 SE +/- 0.05, N = 13 6.07 6.03 MIN: 5.81 / MAX: 9.25 MIN: 5.81 / MAX: 81.91 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 GCC 11.2 GCC 12.0 Git 0.693 1.386 2.079 2.772 3.465 SE +/- 0.01, N = 12 SE +/- 0.01, N = 13 3.08 3.08 MIN: 3.01 / MAX: 3.29 MIN: 2.95 / MAX: 9.11 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic -lgomp -lpthread
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball GCC 11.2 GCC 12.0 Git 1600 3200 4800 6400 8000 7304.2 5210.6 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA GCC 11.2 GCC 12.0 Git 2 4 6 8 10 SE +/- 0.035, N = 3 SE +/- 0.068, N = 9 8.379 8.457 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M GCC 11.2 GCC 12.0 Git 30 60 90 120 150 SE +/- 2.06, N = 3 SE +/- 1.84, N = 3 148.88 149.28 -lfoamToVTK -llagrangian -lfileFormats 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 GCC 11.2 GCC 12.0 Git 0.637 1.274 1.911 2.548 3.185 SE +/- 0.039, N = 15 SE +/- 0.027, N = 15 2.831 2.749 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless GCC 11.2 GCC 12.0 Git 10 20 30 40 50 SE +/- 0.32, N = 3 SE +/- 0.13, N = 3 44.17 44.24 1. (CXX) g++ options: -O3 -fPIC -lm
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.7 1e12 Prime Number Generation GCC 11.2 GCC 12.0 Git 6 12 18 24 30 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 24.20 24.27 1. (CXX) g++ options: -O3 -march=native -flto
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 11.2 GCC 12.0 Git 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 20.20 19.11 1. (CC) gcc options: -lm -O3 -march=native -flto
FLAC Audio Encoding WAV To FLAC OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.3 WAV To FLAC GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.02, N = 5 SE +/- 0.01, N = 5 11.60 11.58 1. (CXX) g++ options: -O3 -march=native -flto -fvisibility=hidden -logg -lm
Helsing Digit Range: 12 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 12 digit GCC 11.2 GCC 12.0 Git 1.2182 2.4364 3.6546 4.8728 6.091 SE +/- 0.021, N = 3 SE +/- 0.022, N = 3 5.389 5.414 1. (CC) gcc options: -O2 -pthread
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 GCC 11.2 GCC 12.0 Git 15 30 45 60 75 SE +/- 0.17, N = 3 SE +/- 0.22, N = 3 66.80 68.00 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 GCC 11.2 GCC 12.0 Git 14 28 42 56 70 SE +/- 0.08, N = 3 SE +/- 0.57, N = 3 62.52 63.08 1. (CC) gcc options: -O3 -march=native -flto -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 11.2 GCC 12.0 Git 15 30 45 60 75 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 66.85 66.96 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.010, N = 3 SE +/- 0.013, N = 3 9.896 9.914
KTX-Software toktx Settings: Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: Zstd Compression 19 GCC 11.2 GCC 12.0 Git 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 15.26 15.21
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 GCC 11.2 GCC 12.0 Git 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 14.25 14.26
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 GCC 11.2 GCC 12.0 Git 60 120 180 240 300 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 252.80 252.76
WavPack Audio Encoding WAV To WavPack OpenBenchmarking.org Seconds, Fewer Is Better WavPack Audio Encoding 5.3 WAV To WavPack GCC 11.2 GCC 12.0 Git 3 6 9 12 15 SE +/- 0.00, N = 5 SE +/- 0.02, N = 5 11.85 11.98 1. (CXX) g++ options: -O3 -march=native -flto -rdynamic
Phoronix Test Suite v10.8.4