LLVM Clang 13 Benchmarking Intel Xeon Ice Lake - Phoronix Test Suite

LLVM Clang benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:

phoronix-test-suite benchmark 2108236-TJ-LLVMCLANG38

HTML result view exported from: https://openbenchmarking.org/result/2108236-TJ-LLVMCLANG38&sro&grw .
Configurations: Clang 11, Clang 12, Clang 13 (only the Compiler field is listed separately for each)

Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
Chipset: Intel Device 0998
Memory: 504GB
Disk: 7682GB INTEL SSDPF2KX076TZ
Graphics: ASPEED
Monitor: VE228
Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
OS: Ubuntu 21.04
Kernel: 5.14.0-rc1-folio (x86_64) 20210715
Desktop: GNOME Shell 3.38.4
Display Server: X Server 1.20.11
File-System: ext4
Screen Resolution: 1920x1080
Compiler (Clang 11): Clang 11.0.1-2ubuntu4
Compiler (Clang 12): Clang 12.0.0-3ubuntu1~21.04.1
Compiler (Clang 13): Clang 13.0.0-++20210820072921+23ba3732246a-1~exp1~20210820174536.53

Kernel Details - Transparent Huge Pages: madvise
Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Details - Scaling Governor: intel_pstate performance - CPU Microcode: 0xd0002a0
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Python Details - Clang 12, Clang 13: Python 3.9.5
Test | Clang 11 | Clang 12 | Clang 13
tscp: AI Chess Performance | 1477410 | 1481388 | 1491816
botan: KASUMI | 74.395 | 76.916 | 77.094
botan: KASUMI - Decrypt | 73.561 | 76.243 | 76.248
botan: AES-256 | 5743.347 | 5735.246 | 5761.303
botan: AES-256 - Decrypt | 5714.916 | 5700.326 | 5753.637
botan: Twofish | 286.034 | 293.730 | 288.807
botan: Twofish - Decrypt | 279.963 | 288.611 | 290.469
botan: Blowfish | 326.769 | 329.534 | 327.933
botan: Blowfish - Decrypt | 325.704 | 326.798 | 333.278
botan: CAST-256 | 114.754 | 115.327 | 116.103
botan: CAST-256 - Decrypt | 115.648 | 114.829 | 114.500
botan: ChaCha20Poly1305 | 869.352 | 882.165 | 855.764
botan: ChaCha20Poly1305 - Decrypt | 866.095 | 874.911 | 849.956
encode-opus: WAV To Opus Encode | 9.423 | 9.908 | 9.632
draco: Lion | 5827 | 5870 | 5967
draco: Church Facade | 7000 | 7057 | 7202
libraw: Post-Processing Benchmark | 34.55 | 36.65 | 39.99
synthmark: VoiceMark_100 | 491.027 | 503.930 | 513.010
securemark: SecureMark-TLS | 252089 | 250849 | 240447
quantlib: | 2589.5 | 2606.3 | 2657.3
fftw: Float + SSE - 2D FFT Size 4096 | 18135 | 18443 | 18489
tnn: CPU - DenseNet | 6912.120 | 4353.389 | 4370.099
tnn: CPU - MobileNet v2 | 773.467 | 541.927 | 539.646
tnn: CPU - SqueezeNet v2 | 105.397 | 83.860 | 84.610
tnn: CPU - SqueezeNet v1.1 | 657.100 | 402.255 | 400.261
ncnn: CPU - mobilenet | 17.30 | 17.65 | 17.38
ncnn: CPU - shufflenet-v2 | 11.46 | 11.62 | 7.84
ncnn: CPU - mnasnet | 10.90 | 11.01 | 8.79
ncnn: CPU - efficientnet-b0 | 15.32 | 13.75 | 12.66
ncnn: CPU - blazeface | 6.07 | 6.94 | 4.86
ncnn: CPU - googlenet | 21.59 | 21.24 | 17.87
ncnn: CPU - alexnet | 9.97 | 11.04 | 9.64
ncnn: CPU - resnet50 | 22.36 | 21.48 | 21.71
ncnn: CPU - yolov4-tiny | 26.79 | 28.35 | 27.94
ncnn: CPU - squeezenet_ssd | 19.72 | 19.26 | 19.21
ncnn: CPU - regnety_400m | 36.26 | 37.11 | 23.98
onednn: IP Shapes 1D - bf16bf16bf16 - CPU | 2.78642 | 2.79658 | 2.78910
onednn: IP Shapes 3D - bf16bf16bf16 - CPU | 1.71640 | 1.72558 | 1.71770
onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU | 2.08089 | 2.06918 | 2.07168
onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU | 3.05315 | 3.05821 | 3.28114
onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU | 3.59463 | 3.57952 | 3.61489
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 608.604 | 611.926 | 598.020
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 368.248 | 368.449 | 365.727
coremark: CoreMark Size 666 - Iterations Per Second | 2103884.892521 | 2146447.380082 | 2185330.036774
aircrack-ng: | 211799.583 | 210157.651 | 212825.677
john-the-ripper: Blowfish | 113200 | 111335 | 112407
john-the-ripper: MD5 | 8880000 | 10275333 | 10430000
compress-zstd: 3 - Compression Speed | 6433.3 | 6321.0 | 6697.9
compress-zstd: 8 - Compression Speed | 2729.0 | 2560.6 | 2775.9
compress-zstd: 19 - Compression Speed | 84.8 | 80.4 | 83.6
compress-zstd: 3, Long Mode - Compression Speed | 826.7 | 865.6 | 863.1
compress-zstd: 8, Long Mode - Compression Speed | 798.8 | 833.6 | 828.0
compress-zstd: 19, Long Mode - Compression Speed | 45.5 | 45.7 | 47.0
aom-av1: Speed 6 Realtime - Bosphorus 4K | 19.32 | 19.39 | 19.60
aom-av1: Speed 8 Realtime - Bosphorus 4K | 44.06 | 44.84 | 45.30
aom-av1: Speed 9 Realtime - Bosphorus 4K | 54.44 | 54.47 | 56.65
aobench: 2048 x 2048 - Total Time | 36.167 | 36.428 | 36.000
vpxenc: Speed 0 - Bosphorus 4K | 5.60 | 5.62 | 5.71
vpxenc: Speed 5 - Bosphorus 4K | 14.32 | 14.11 | 15.35
graphics-magick: Swirl | 1860 | 1864 | 1874
graphics-magick: Rotate | 728 | 723 | 728
graphics-magick: Sharpen | 745 | 758 | 877
graphics-magick: Enhanced | 1104 | 1104 | 1174
graphics-magick: Resizing | 473 | 452 | 485
graphics-magick: Noise-Gaussian | 623 | 600 | 600
graphics-magick: HWB Color Space | 852 | 852 | 866
tachyon: Total Time | 14.2482 | 13.9533 | 13.4699
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 460.20 | 456.18 | 476.72
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 362.67 | 365.79 | 374.91
dav1d: Summer Nature 4K | 528.63 | 532.51 | 533.02
dav1d: Chimera 1080p 10-bit | 842.95 | 844.78 | 843.96
svt-av1: Preset 4 - Bosphorus 4K | 4.895 | 4.849 | 4.967
svt-av1: Preset 8 - Bosphorus 4K | 57.841 | 57.917 | 58.982
c-ray: Total Time - 4K, 16 Rays Per Pixel | 14.801 | 15.224 | 12.594
svt-hevc: 7 - Bosphorus 1080p | 344.59 | 340.21 | 353.72
svt-hevc: 10 - Bosphorus 1080p | 616.45 | 604.45 | 626.33
financebench: Repo OpenMP | 37221.003906 | 37760.885417 | 38153.404297
financebench: Bonds OpenMP | 58558.255208 | 58502.854167 | 58281.786458
blosc: blosclz | 23751.9 | 24249.3 | 24671.5
nginx: 1 | 74606.79 | 75015.29 | 75270.65
nginx: 20 | 254323.92 | 251352.56 | 254133.55
nginx: 100 | 205660.13 | 206297.92 | 208590.36
nginx: 200 | 196465.21 | 194567.38 | 196023.59
nginx: 500 | 204519.00 | 204058.28 | 204880.02
nginx: 1000 | 205224.87 | 205740.39 | 205977.68
apache: 500 | 122576.13 | 121643.80 | 145589.94
apache: 1000 | 120827.45 | 112287.17 | 112600.02
sqlite-speedtest: Timed Time - Size 1,000 | 64.415 | 65.294 | 64.635
pgbench: 100 - 250 - Read Only | 967214 | 981525 | 973432
pgbench: 100 - 250 - Read Only - Average Latency | 0.260 | 0.255 | 0.257
pgbench: 100 - 250 - Read Write | 85162 | 84132 | 84854
pgbench: 100 - 250 - Read Write - Average Latency | 2.938 | 2.973 | 2.948
mysqlslap: 2048 | 700 | 698 | 712
mysqlslap: 4096 | 319 | 319 | 321
pjsip: INVITE | 5289 | 5291 | 5271
pjsip: OPTIONS, Stateful | 10195 | 10185 | 10140
pjsip: OPTIONS, Stateless | 41305 | 40364 | 41871
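The overview figures mix "more is better" and "fewer is better" metrics, so comparing compilers needs the direction of each test. The following is a minimal sketch, not part of the Phoronix Test Suite: the function name `improvement_percent` is hypothetical, the values are copied from the overview, and the direction of each metric is taken from the per-test result headers.

```python
def improvement_percent(baseline: float, candidate: float, more_is_better: bool) -> float:
    """Percentage by which `candidate` improves on `baseline`.

    A negative result is a regression. For "fewer is better" metrics
    (times, latencies) the ratio is inverted so the result reads as a speedup.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    if more_is_better:
        return (candidate / baseline - 1.0) * 100.0
    return (baseline / candidate - 1.0) * 100.0


# TSCP, nodes per second (more is better): Clang 11 vs Clang 13.
tscp_gain = improvement_percent(1477410, 1491816, more_is_better=True)

# TNN DenseNet, milliseconds (fewer is better): Clang 11 vs Clang 12.
tnn_densenet_gain = improvement_percent(6912.120, 4353.389, more_is_better=False)

# SecureMark-TLS, marks (more is better): Clang 11 vs Clang 13.
securemark_gain = improvement_percent(252089, 240447, more_is_better=True)

print(f"TSCP, Clang 11 -> Clang 13: {tscp_gain:+.2f}%")
print(f"TNN DenseNet, Clang 11 -> Clang 12: {tnn_densenet_gain:+.2f}%")
print(f"SecureMark-TLS, Clang 11 -> Clang 13: {securemark_gain:+.2f}%")
```

Inverting the ratio for time-based metrics is one convention among several; reporting the reduction in run time instead would give a different percentage for the same pair of results.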
TSCP 1.81 (Nodes Per Second, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
AI Chess Performance | 1477410 (SE +/- 794.00, N = 5) | 1481388 (SE +/- 1489.72, N = 5) | 1491816 (SE +/- 806.80, N = 5)
1. (CC) gcc options: -O3 -march=native
Botan 2.17.3 (MiB/s, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
KASUMI | 74.40 (SE +/- 0.02, N = 3) | 76.92 (SE +/- 0.02, N = 3) | 77.09 (SE +/- 0.02, N = 3)
KASUMI - Decrypt | 73.56 (SE +/- 0.02, N = 3) | 76.24 (SE +/- 0.23, N = 3) | 76.25 (SE +/- 0.02, N = 3)
AES-256 | 5743.35 (SE +/- 0.30, N = 3) | 5735.25 (SE +/- 5.15, N = 3) | 5761.30 (SE +/- 0.33, N = 3)
AES-256 - Decrypt | 5714.92 (SE +/- 0.51, N = 3) | 5700.33 (SE +/- 0.79, N = 3) | 5753.64 (SE +/- 0.20, N = 3)
Twofish | 286.03 (SE +/- 0.21, N = 3) | 293.73 (SE +/- 0.97, N = 3) | 288.81 (SE +/- 0.22, N = 3)
Twofish - Decrypt | 279.96 (SE +/- 0.36, N = 3) | 288.61 (SE +/- 0.24, N = 3) | 290.47 (SE +/- 0.27, N = 3)
Blowfish | 326.77 (SE +/- 0.04, N = 3) | 329.53 (SE +/- 0.04, N = 3) | 327.93 (SE +/- 0.02, N = 3)
Blowfish - Decrypt | 325.70 (SE +/- 0.09, N = 3) | 326.80 (SE +/- 3.35, N = 3) | 333.28 (SE +/- 0.04, N = 3)
CAST-256 | 114.75 (SE +/- 0.01, N = 3) | 115.33 (SE +/- 0.00, N = 3) | 116.10 (SE +/- 0.02, N = 3)
CAST-256 - Decrypt | 115.65 (SE +/- 0.01, N = 3) | 114.83 (SE +/- 0.00, N = 3) | 114.50 (SE +/- 0.03, N = 3)
ChaCha20Poly1305 | 869.35 (SE +/- 0.40, N = 3) | 882.17 (SE +/- 0.87, N = 3) | 855.76 (SE +/- 4.73, N = 3)
ChaCha20Poly1305 - Decrypt | 866.10 (SE +/- 0.04, N = 3) | 874.91 (SE +/- 0.95, N = 3) | 849.96 (SE +/- 1.46, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Opus Codec Encoding 1.3.1 (Seconds, Fewer Is Better)
Test | Clang 11 | Clang 12 | Clang 13
WAV To Opus Encode | 9.423 (SE +/- 0.013, N = 5) | 9.908 (SE +/- 0.006, N = 5) | 9.632 (SE +/- 0.069, N = 5)
1. (CXX) g++ options: -O3 -march=native -logg -lm

Google Draco 1.4.1 (ms, Fewer Is Better)
Model | Clang 11 | Clang 12 | Clang 13 | SE as reported
Lion | 5827 | 5870 | 5967 | SE +/- 1.73, N = 3
Church Facade | 7000 | 7057 | 7202 | SE +/- 3.18, N = 3; SE +/- 4.26, N = 3
1. (CXX) g++ options: -O3 -march=native

LibRaw 0.20 (Mpix/sec, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
Post-Processing Benchmark | 34.55 (SE +/- 0.13, N = 3) | 36.65 (SE +/- 0.23, N = 3) | 39.99 (SE +/- 0.19, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

Google SynthMark 20201109 (Voices, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
VoiceMark_100 | 491.03 (SE +/- 0.01, N = 3) | 503.93 (SE +/- 0.04, N = 3) | 513.01 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

SecureMark 1.0.4 (marks, More Is Better)
Benchmark | Clang 11 | Clang 12 | Clang 13
SecureMark-TLS | 252089 (SE +/- 87.48, N = 3) | 250849 (SE +/- 339.82, N = 3) | 240447 (SE +/- 40.95, N = 3)
1. (CC) gcc options: -pedantic -O3

QuantLib 1.21 (MFLOPS, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
QuantLib | 2589.5 (SE +/- 5.39, N = 3) | 2606.3 (SE +/- 4.59, N = 3) | 2657.3 (SE +/- 8.72, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic

FFTW 3.3.6 (Mflops, More Is Better)
Build - Size | Clang 11 | Clang 12 | Clang 13
Float + SSE - 2D FFT Size 4096 | 18135 (SE +/- 187.54, N = 4) | 18443 (SE +/- 34.47, N = 3) | 18489 (SE +/- 43.73, N = 3)
1. (CC) gcc options: -pthread -O3 -march=native -lm
TNN 0.3, Target: CPU (ms, Fewer Is Better)
Model | Clang 11 | Clang 12 | Clang 13
DenseNet | 6912.12 (SE +/- 3.50, N = 3; MIN: 6888.33 / MAX: 6947.75) | 4353.39 (SE +/- 1.66, N = 3; MIN: 4332.87 / MAX: 4442) | 4370.10 (SE +/- 2.30, N = 3; MIN: 4351.72 / MAX: 4450.96)
MobileNet v2 | 773.47 (SE +/- 0.69, N = 3; MIN: 690.55 / MAX: 820.36) | 541.93 (SE +/- 0.70, N = 3; MIN: 536.56 / MAX: 554.17) | 539.65 (SE +/- 0.66, N = 3; MIN: 536.06 / MAX: 559.33)
SqueezeNet v2 | 105.40 (SE +/- 0.15, N = 3; MIN: 104.83 / MAX: 106.1) | 83.86 (SE +/- 0.05, N = 3; MIN: 83.28 / MAX: 84.63) | 84.61 (SE +/- 0.00, N = 3; MIN: 84.23 / MAX: 85)
SqueezeNet v1.1 | 657.10 (SE +/- 0.05, N = 3; MIN: 656.47 / MAX: 657.89) | 402.26 (SE +/- 0.19, N = 3; MIN: 401.13 / MAX: 403.41) | 400.26 (SE +/- 0.27, N = 3; MIN: 399.7 / MAX: 401.33)
1. (CXX) g++ options: -O3 -march=native -fopenmp=libomp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
NCNN 20210720, Target: CPU (ms, Fewer Is Better)
Model | Clang 11 | Clang 12 | Clang 13
mobilenet | 17.30 (SE +/- 0.27, N = 14; MIN: 15.79 / MAX: 37.29) | 17.65 (SE +/- 0.16, N = 3; MIN: 16.51 / MAX: 19.27) | 17.38 (SE +/- 0.11, N = 3; MIN: 16.17 / MAX: 40.05)
shufflenet-v2 | 11.46 (SE +/- 0.67, N = 14; MIN: 7.56 / MAX: 31.76) | 11.62 (SE +/- 1.05, N = 3; MIN: 7.71 / MAX: 25.56) | 7.84 (SE +/- 0.01, N = 3; MIN: 7.49 / MAX: 13.76)
mnasnet | 10.90 (SE +/- 0.66, N = 14; MIN: 6.9 / MAX: 32.84) | 11.01 (SE +/- 1.31, N = 3; MIN: 7.09 / MAX: 20.3) | 8.79 (SE +/- 1.33, N = 3; MIN: 6.69 / MAX: 26.58)
efficientnet-b0 | 15.32 (SE +/- 1.01, N = 14; MIN: 8.59 / MAX: 45.64) | 13.75 (SE +/- 2.00, N = 3; MIN: 9.12 / MAX: 41.76) | 12.66 (SE +/- 2.13, N = 3; MIN: 8.73 / MAX: 27.07)
blazeface | 6.07 (SE +/- 0.47, N = 14; MIN: 4.35 / MAX: 27.78) | 6.94 (SE +/- 1.08, N = 3; MIN: 4.49 / MAX: 19.88) | 4.86 (SE +/- 0.01, N = 3; MIN: 4.57 / MAX: 9.45)
googlenet | 21.59 (SE +/- 1.02, N = 14; MIN: 16.19 / MAX: 50.35) | 21.24 (SE +/- 1.87, N = 3; MIN: 17.28 / MAX: 39.77) | 17.87 (SE +/- 0.31, N = 3; MIN: 16.58 / MAX: 37.88)
alexnet | 9.97 (SE +/- 0.16, N = 14; MIN: 8.85 / MAX: 29.23) | 11.04 (SE +/- 1.31, N = 3; MIN: 9.24 / MAX: 14.73) | 9.64 (SE +/- 0.17, N = 3; MIN: 8.9 / MAX: 11.05)
resnet50 | 22.36 (SE +/- 0.61, N = 14; MIN: 18.98 / MAX: 71.34) | 21.48 (SE +/- 0.14, N = 3; MIN: 19.9 / MAX: 40.42) | 21.71 (SE +/- 0.29, N = 3; MIN: 19.54 / MAX: 44.52)
yolov4-tiny | 26.79 (SE +/- 0.26, N = 14; MIN: 23.92 / MAX: 56.1) | 28.35 (SE +/- 0.55, N = 3; MIN: 25.63 / MAX: 60.64) | 27.94 (SE +/- 0.54, N = 3; MIN: 25.71 / MAX: 39.58)
squeezenet_ssd | 19.72 (SE +/- 0.53, N = 14; MIN: 17.67 / MAX: 58.39) | 19.26 (SE +/- 0.22, N = 3; MIN: 18.05 / MAX: 35.83) | 19.21 (SE +/- 0.23, N = 3; MIN: 18.04 / MAX: 38.36)
regnety_400m | 36.26 (SE +/- 2.48, N = 14; MIN: 18.78 / MAX: 124.57) | 37.11 (SE +/- 3.84, N = 3; MIN: 19.88 / MAX: 96.76) | 23.98 (SE +/- 1.77, N = 3; MIN: 20.12 / MAX: 66.47)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lomp -lpthread -pthread
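Several NCNN results above carry standard errors that are large relative to the gaps between compilers. A rough way to judge whether a gap stands out from the run-to-run noise is to express it in units of the combined standard error. The sketch below is a heuristic under stated assumptions, not a formal significance test: the helper name `difference_in_se_units` is hypothetical, the runs are assumed independent, and the SE values are assumed to be listed in Clang 11, Clang 12, Clang 13 order.

```python
import math


def difference_in_se_units(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error.

    The combined SE is sqrt(se_a^2 + se_b^2), which assumes independent runs.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined_se == 0.0:
        # No reported spread: any nonzero gap is unbounded in SE units.
        return 0.0 if mean_a == mean_b else math.inf
    return abs(mean_a - mean_b) / combined_se


# NCNN regnety_400m (ms): Clang 11 = 36.26 +/- 2.48, Clang 13 = 23.98 +/- 1.77.
regnety = difference_in_se_units(36.26, 2.48, 23.98, 1.77)

# NCNN mobilenet (ms): Clang 11 = 17.30 +/- 0.27, Clang 13 = 17.38 +/- 0.11.
mobilenet = difference_in_se_units(17.30, 0.27, 17.38, 0.11)

print(f"regnety_400m gap: {regnety:.1f} combined SEs")
print(f"mobilenet gap: {mobilenet:.1f} combined SEs")
```

A gap of well under one combined SE, as for mobilenet, is hard to distinguish from noise, while a gap of several combined SEs is more likely to reflect a real difference. With only three runs per configuration, either reading should be treated with caution.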
oneDNN 2.1.2, Data Type: bf16bf16bf16, Engine: CPU (ms, Fewer Is Better)
Harness | Clang 11 | Clang 12 | Clang 13
IP Shapes 1D | 2.78642 (SE +/- 0.00451, N = 3; MIN: 2.65) | 2.79658 (SE +/- 0.00591, N = 3; MIN: 2.65) | 2.78910 (SE +/- 0.00435, N = 3; MIN: 2.65)
IP Shapes 3D | 1.71640 (SE +/- 0.00619, N = 3; MIN: 1.53) | 1.72558 (SE +/- 0.00427, N = 3; MIN: 1.57) | 1.71770 (SE +/- 0.00225, N = 3; MIN: 1.55)
Convolution Batch Shapes Auto | 2.08089 (SE +/- 0.00072, N = 3; MIN: 1.99) | 2.06918 (SE +/- 0.00197, N = 3; MIN: 1.99) | 2.07168 (SE +/- 0.00196, N = 3; MIN: 1.99)
Deconvolution Batch shapes_1d | 3.05315 (SE +/- 0.00886, N = 3; MIN: 2.87) | 3.05821 (SE +/- 0.00384, N = 3; MIN: 2.86) | 3.28114 (SE +/- 0.00367, N = 3; MIN: 3.08)
Deconvolution Batch shapes_3d | 3.59463 (SE +/- 0.00881, N = 3; MIN: 3.5) | 3.57952 (SE +/- 0.00151, N = 3; MIN: 3.5) | 3.61489 (SE +/- 0.00211, N = 3; MIN: 3.52)
Recurrent Neural Network Training | 608.60 (SE +/- 5.71, N = 13; MIN: 570.79) | 611.93 (SE +/- 6.83, N = 4; MIN: 569.44) | 598.02 (SE +/- 0.92, N = 3; MIN: 575.43)
Recurrent Neural Network Inference | 368.25 (SE +/- 2.59, N = 3; MIN: 349.99) | 368.45 (SE +/- 1.79, N = 3; MIN: 347.27) | 365.73 (SE +/- 0.40, N = 3; MIN: 355.4)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl
Coremark 1.0 (Iterations/Sec, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
CoreMark Size 666 - Iterations Per Second | 2103884.89 (SE +/- 2482.75, N = 3) | 2146447.38 (SE +/- 3711.92, N = 3) | 2185330.04 (SE +/- 1510.09, N = 3)
1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Aircrack-ng 1.5.2 (k/s, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
Aircrack-ng | 211799.58 (SE +/- 607.21, N = 3) | 210157.65 (SE +/- 504.25, N = 3) | 212825.68 (SE +/- 467.06, N = 3)
1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lsqlite3 -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread

John The Ripper 1.9.0-jumbo-1 (Real C/S, More Is Better)
Test | Clang 11 | Clang 12 | Clang 13
Blowfish | 113200 (SE +/- 249.93, N = 3) | 111335 (SE +/- 138.68, N = 3) | 112407 (SE +/- 41.83, N = 3)
MD5 | 8880000 (SE +/- 8504.90, N = 3) | 10275333 (SE +/- 16973.84, N = 3) | 10430000 (SE +/- 3785.94, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Zstd Compression 1.5.0, Compression Speed (MB/s, More Is Better)
Compression Level | Clang 11 | Clang 12 | Clang 13
3 | 6433.3 (SE +/- 78.33, N = 3) | 6321.0 (SE +/- 60.99, N = 3) | 6697.9 (SE +/- 47.01, N = 3)
8 | 2729.0 (SE +/- 32.58, N = 3) | 2560.6 (SE +/- 34.31, N = 15) | 2775.9 (SE +/- 7.16, N = 3)
19 | 84.8 (SE +/- 0.85, N = 15) | 80.4 (SE +/- 0.45, N = 3) | 83.6 (SE +/- 0.64, N = 10)
3, Long Mode | 826.7 (SE +/- 1.30, N = 3) | 865.6 (SE +/- 2.11, N = 3) | 863.1 (SE +/- 1.29, N = 3)
8, Long Mode | 798.8 (SE +/- 0.29, N = 3) | 833.6 (SE +/- 1.68, N = 3) | 828.0 (SE +/- 5.79, N = 3)
19, Long Mode | 45.5 (SE +/- 0.51, N = 4) | 45.7 (SE +/- 0.55, N = 15) | 47.0 (SE +/- 0.53, N = 15)
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
AOM AV1 3.1, Input: Bosphorus 4K (Frames Per Second, More Is Better)
Encoder Mode | Clang 11 | Clang 12 | Clang 13
Speed 6 Realtime | 19.32 (SE +/- 0.23, N = 3) | 19.39 (SE +/- 0.28, N = 3) | 19.60 (SE +/- 0.24, N = 3)
Speed 8 Realtime | 44.06 (SE +/- 0.36, N = 9) | 44.84 (SE +/- 0.40, N = 3) | 45.30 (SE +/- 0.42, N = 7)
Speed 9 Realtime | 54.44 (SE +/- 0.16, N = 3) | 54.47 (SE +/- 0.12, N = 3) | 56.65 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOBench (Seconds, Fewer Is Better)
Test | Clang 11 | Clang 12 | Clang 13
Size: 2048 x 2048 - Total Time | 36.17 (SE +/- 0.03, N = 3) | 36.43 (SE +/- 0.24, N = 3) | 36.00 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -lm -O3 -march=native

VP9 libvpx Encoding 1.10.0, Input: Bosphorus 4K (Frames Per Second, More Is Better)
Speed | Clang 11 | Clang 12 | Clang 13
Speed 0 | 5.60 (SE +/- 0.01, N = 3) | 5.62 (SE +/- 0.01, N = 3) | 5.71 (SE +/- 0.06, N = 3)
Speed 5 | 14.32 (SE +/- 0.13, N = 3) | 14.11 (SE +/- 0.09, N = 3) | 15.35 (SE +/- 0.14, N = 15)
1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
Operation | Clang 11 | Clang 12 | Clang 13
Swirl | 1860 (SE +/- 13.09, N = 3) | 1864 (SE +/- 8.51, N = 3) | 1874 (SE +/- 8.69, N = 3)
Rotate | 728 (SE +/- 2.40, N = 3) | 723 (SE +/- 2.65, N = 3) | 728 (SE +/- 7.80, N = 3)
Sharpen | 745 (SE +/- 3.18, N = 3) | 758 (SE +/- 0.67, N = 3) | 877 (SE +/- 5.24, N = 3)
Enhanced | 1104 (SE +/- 0.67, N = 3) | 1104 (SE +/- 5.49, N = 3) | 1174 (SE +/- 5.78, N = 3)
Resizing | 473 (SE +/- 14.35, N = 12) | 452 (SE +/- 9.74, N = 15) | 485 (SE +/- 8.56, N = 15)
Noise-Gaussian | 623 (SE +/- 2.33, N = 3) | 600 (SE +/- 4.10, N = 3) | 600 (SE +/- 3.06, N = 3)
HWB Color Space | 852 (SE +/- 6.84, N = 3) | 852 (SE +/- 5.60, N = 15) | 866 (SE +/- 8.76, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Clang 11 Clang 12 Clang 13 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 14.25 13.95 13.47 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Clang 11 Clang 12 Clang 13 100 200 300 400 500 SE +/- 4.56, N = 3 SE +/- 4.57, N = 3 SE +/- 5.26, N = 3 460.20 456.18 476.72 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Clang 11 Clang 12 Clang 13 80 160 240 320 400 SE +/- 1.62, N = 3 SE +/- 2.53, N = 3 SE +/- 4.27, N = 3 362.67 365.79 374.91 1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
dav1d 0.9.1 - Video Input: Summer Nature 4K
FPS, More Is Better
Clang 11: 528.63 (SE +/- 2.65, N = 3; MIN: 176.65 / MAX: 587.68)
Clang 12: 532.51 (SE +/- 0.77, N = 3; MIN: 186.64 / MAX: 587.12)
Clang 13: 533.02 (SE +/- 0.44, N = 3; MIN: 186.74 / MAX: 587.2)
1. (CC) gcc options: -O3 -march=native -pthread

dav1d 0.9.1 - Video Input: Chimera 1080p 10-bit
FPS, More Is Better
Clang 11: 842.95 (SE +/- 0.82, N = 3; MIN: 515.35 / MAX: 1115.74)
Clang 12: 844.78 (SE +/- 1.52, N = 3; MIN: 517.09 / MAX: 1121.33)
Clang 13: 843.96 (SE +/- 2.33, N = 3; MIN: 503.8 / MAX: 1131.86)
1. (CC) gcc options: -O3 -march=native -pthread

SVT-AV1 0.8.7 - Encoder Mode: Preset 4 - Input: Bosphorus 4K
Frames Per Second, More Is Better
Clang 11: 4.895 (SE +/- 0.020, N = 3)
Clang 12: 4.849 (SE +/- 0.004, N = 3)
Clang 13: 4.967 (SE +/- 0.027, N = 3)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-AV1 0.8.7 - Encoder Mode: Preset 8 - Input: Bosphorus 4K
Frames Per Second, More Is Better
Clang 11: 57.84 (SE +/- 0.15, N = 3)
Clang 12: 57.92 (SE +/- 0.14, N = 3)
Clang 13: 58.98 (SE +/- 0.38, N = 3)
1. (CXX) g++ options: -O3 -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better
Clang 11: 14.80 (SE +/- 0.00, N = 3)
Clang 12: 15.22 (SE +/- 0.01, N = 3)
Clang 13: 12.59 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
Clang 11: 344.59 (SE +/- 2.06, N = 3)
Clang 12: 340.21 (SE +/- 1.14, N = 3)
Clang 13: 353.72 (SE +/- 3.85, N = 3)
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
Clang 11: 616.45 (SE +/- 2.02, N = 3)
Clang 12: 604.45 (SE +/- 2.53, N = 3)
Clang 13: 626.33 (SE +/- 3.02, N = 3)
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

FinanceBench 2016-07-25 - Benchmark: Repo OpenMP
ms, Fewer Is Better
Clang 11: 37221.00 (SE +/- 19.15, N = 3)
Clang 12: 37760.89 (SE +/- 290.04, N = 3)
Clang 13: 38153.40 (SE +/- 450.24, N = 4)
1. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP
ms, Fewer Is Better
Clang 11: 58558.26 (SE +/- 38.93, N = 3)
Clang 12: 58502.85 (SE +/- 82.28, N = 3)
Clang 13: 58281.79 (SE +/- 17.71, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

C-Blosc 2.0 - Compressor: blosclz
MB/s, More Is Better
Clang 11: 23751.9 (SE +/- 34.48, N = 3)
Clang 12: 24249.3 (SE +/- 34.61, N = 3)
Clang 13: 24671.5 (SE +/- 33.59, N = 3)
1. (CXX) g++ options: -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 1
Requests Per Second, More Is Better
Clang 11: 74606.79 (SE +/- 131.50, N = 3)
Clang 12: 75015.29 (SE +/- 102.54, N = 3)
Clang 13: 75270.65 (SE +/- 160.67, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 20
Requests Per Second, More Is Better
Clang 11: 254323.92 (SE +/- 2398.86, N = 3)
Clang 12: 251352.56 (SE +/- 1222.93, N = 3)
Clang 13: 254133.55 (SE +/- 1379.02, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 100
Requests Per Second, More Is Better
Clang 11: 205660.13 (SE +/- 1506.43, N = 3)
Clang 12: 206297.92 (SE +/- 431.14, N = 3)
Clang 13: 208590.36 (SE +/- 987.48, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 200
Requests Per Second, More Is Better
Clang 11: 196465.21 (SE +/- 2823.39, N = 3)
Clang 12: 194567.38 (SE +/- 312.30, N = 3)
Clang 13: 196023.59 (SE +/- 911.83, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 500
Requests Per Second, More Is Better
Clang 11: 204519.00 (SE +/- 554.68, N = 3)
Clang 12: 204058.28 (SE +/- 63.72, N = 3)
Clang 13: 204880.02 (SE +/- 936.49, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

nginx 1.21.1 - Concurrent Requests: 1000
Requests Per Second, More Is Better
Clang 11: 205224.87 (SE +/- 947.34, N = 3)
Clang 12: 205740.39 (SE +/- 632.41, N = 3)
Clang 13: 205977.68 (SE +/- 1305.06, N = 3)
1. (CC) gcc options: -ldl -lpthread -lcrypt -lz -O3 -march=native

Apache HTTP Server 2.4.48 - Concurrent Requests: 500
Requests Per Second, More Is Better
Clang 11: 122576.13 (SE +/- 798.53, N = 3)
Clang 12: 121643.80 (SE +/- 1145.12, N = 15)
Clang 13: 145589.94 (SE +/- 843.46, N = 3)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

Apache HTTP Server 2.4.48 - Concurrent Requests: 1000
Requests Per Second, More Is Better
Clang 11: 120827.45 (SE +/- 924.52, N = 15)
Clang 12: 112287.17 (SE +/- 534.15, N = 3)
Clang 13: 112600.02 (SE +/- 485.15, N = 3)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

SQLite Speedtest 3.30 - Timed Time - Size 1,000
Seconds, Fewer Is Better
Clang 11: 64.42 (SE +/- 0.03, N = 3)
Clang 12: 65.29 (SE +/- 0.01, N = 3)
Clang 13: 64.64 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread

PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only
TPS, More Is Better
Clang 11: 967214 (SE +/- 16009.81, N = 15)
Clang 12: 981525 (SE +/- 3346.94, N = 3)
Clang 13: 973432 (SE +/- 10672.94, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency
ms, Fewer Is Better
Clang 11: 0.260 (SE +/- 0.005, N = 15)
Clang 12: 0.255 (SE +/- 0.001, N = 3)
Clang 13: 0.257 (SE +/- 0.003, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Write
TPS, More Is Better
Clang 11: 85162 (SE +/- 786.14, N = 3)
Clang 12: 84132 (SE +/- 85.43, N = 3)
Clang 13: 84854 (SE +/- 119.61, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench 13.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency
ms, Fewer Is Better
Clang 11: 2.938 (SE +/- 0.027, N = 3)
Clang 12: 2.973 (SE +/- 0.003, N = 3)
Clang 13: 2.948 (SE +/- 0.004, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

MariaDB 10.6.4 - Clients: 2048
Queries Per Second, More Is Better
Clang 11: 700 (SE +/- 7.30, N = 9)
Clang 12: 698 (SE +/- 6.96, N = 9)
Clang 13: 712 (SE +/- 1.71, N = 3)
Per-result options, as exported: -lbz2 -lsnappy -lpthread -lm -lstdc++ -lpthread -lm -lstdc++
1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -ldl -lz -lrt

MariaDB 10.6.4 - Clients: 4096
Queries Per Second, More Is Better
Clang 11: 319 (SE +/- 1.96, N = 3)
Clang 12: 319 (SE +/- 1.07, N = 3)
Clang 13: 321 (SE +/- 1.31, N = 3)
Per-result options, as exported: -lbz2 -lsnappy -lpthread -lm -lstdc++ -lpthread -lm -lstdc++
1. (CXX) g++ options: -fPIC -O3 -march=native -fstack-protector -shared -pthread -ldl -lz -lrt

PJSIP 2.11 - Method: INVITE
Responses Per Second, More Is Better
Clang 11: 5289 (SE +/- 8.50, N = 3)
Clang 12: 5291 (SE +/- 10.68, N = 3)
Clang 13: 5271 (SE +/- 11.10, N = 3)
1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP 2.11 - Method: OPTIONS, Stateful
Responses Per Second, More Is Better
Clang 11: 10195 (SE +/- 4.48, N = 3)
Clang 12: 10185 (SE +/- 11.92, N = 3)
Clang 13: 10140 (SE +/- 43.11, N = 3)
1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

PJSIP 2.11 - Method: OPTIONS, Stateless
Responses Per Second, More Is Better
Clang 11: 41305 (SE +/- 575.37, N = 3)
Clang 12: 40364 (SE +/- 510.13, N = 3)
Clang 13: 41871 (SE +/- 380.52, N = 3)
1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native

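When reading the results above, it can help to turn a pair of means into a percentage change and to compare the gap against the reported standard errors. The following is a minimal Python sketch of that arithmetic, not part of the Phoronix Test Suite itself. The helper names `percent_change` and `exceeds_noise` and the threshold `k = 2` are assumptions made for illustration. The input numbers are copied from the Apache HTTP Server (500 requests), C-Ray, and nginx (500 requests) results above.

```python
import math


def percent_change(baseline, candidate, higher_is_better=True):
    """Signed improvement of candidate over baseline, in percent.

    A positive result means the candidate is better, whichever
    direction the benchmark's metric runs.
    """
    change = (candidate - baseline) / baseline * 100.0
    return change if higher_is_better else -change


def exceeds_noise(baseline, se_base, candidate, se_cand, k=2.0):
    """Rough check: is the gap larger than k combined standard errors?

    k = 2.0 is an illustrative assumption, not a value taken from
    the result file or from the test suite.
    """
    combined = math.sqrt(se_base ** 2 + se_cand ** 2)
    return abs(candidate - baseline) > k * combined


# Apache HTTP Server, 500 concurrent requests: Clang 12 vs Clang 13
# (requests per second, more is better).
apache_500 = percent_change(121643.80, 145589.94)
apache_500_clear = exceeds_noise(121643.80, 1145.12, 145589.94, 843.46)

# C-Ray total time: Clang 12 vs Clang 13 (seconds, fewer is better).
c_ray = percent_change(15.22, 12.59, higher_is_better=False)

# nginx, 500 concurrent requests: Clang 12 vs Clang 13.
nginx_500_clear = exceeds_noise(204058.28, 63.72, 204880.02, 936.49)

print(f"Apache 500: {apache_500:+.1f}% (beyond noise: {apache_500_clear})")
print(f"C-Ray:      {c_ray:+.1f}%")
print(f"nginx 500 beyond noise: {nginx_500_clear}")
```

Under these assumptions the Apache and C-Ray gaps between Clang 12 and Clang 13 stand well clear of the reported run-to-run variation, while the nginx gap at 500 concurrent requests does not. With only 3 to 15 runs per result, this is a coarse screen rather than a formal significance test.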
Phoronix Test Suite v10.8.4