Xeon Platinum 8380 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108113-IB-XEONPLATI03&rdt&grs .
Xeon Platinum 8380 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenCL Compiler File-System Screen Resolution Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 3841GB Micron_9300_MTFDHAL3T8TDP + 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 20.04 5.14.0-rc1-folio (x86_64) 20210715 GNOME Shell 3.36.4 X Server 1.20.9 OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 2.1 LINUX Intel oneAPI DPC++/C++ Compiler 2021.3.0 (2021.3.0.20210619) + ICC ext4 1920x1080 GCC 9.3.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Python Details - Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Compiler Details - GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Xeon Platinum 8380 john-the-ripper: Blowfish john-the-ripper: MD5 ncnn: CPU - resnet18 ncnn: CPU - squeezenet_ssd ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - googlenet ncnn: CPU - regnety_400m ncnn: CPU - mobilenet c-ray: Total Time - 4K, 16 Rays Per Pixel tungsten: Non-Exponential toybrot: C++ Tasks toybrot: C++ Threads toybrot: TBB ncnn: CPU - blazeface financebench: Bonds OpenMP liquid-dsp: 160 - 256 - 57 botan: ChaCha20Poly1305 botan: ChaCha20Poly1305 - Decrypt compress-zstd: 8, Long Mode - Compression Speed liquid-dsp: 128 - 256 - 57 encode-mp3: WAV To MP3 svt-hevc: 7 - Bosphorus 1080p vpxenc: Speed 5 - Bosphorus 4K pjsip: OPTIONS, Stateful x265: Bosphorus 4K pjsip: INVITE financebench: Repo OpenMP botan: AES-256 - Decrypt svt-hevc: 10 - Bosphorus 1080p botan: Blowfish botan: AES-256 webp2: Quality 95, Compression Effort 7 liquid-dsp: 64 - 256 - 57 liquid-dsp: 32 - 256 - 57 ncnn: CPU-v3-v3 - mobilenet-v3 influxdb: 4 - 10000 - 2,5000,1 - 10000 webp: Quality 100, Highest Compression vpxenc: Speed 0 - Bosphorus 4K webp: Quality 100, Lossless avifenc: 6, Lossless quantlib: draco: Lion botan: Blowfish - Decrypt botan: Twofish - Decrypt basis: UASTC Level 0 draco: Church Facade tungsten: Water Caustic webp: Quality 100, Lossless, Highest Compression botan: Twofish tachyon: Total Time avifenc: 10 mt-dgemm: Sustained Floating-Point Rate aobench: 2048 x 2048 - Total Time compress-zstd: 19, Long Mode - Compression Speed kvazaar: Bosphorus 4K - Very Fast tungsten: Hair webp2: Quality 100, Lossless Compression botan: KASUMI - Decrypt compress-lz4: 3 - Compression Speed sqlite-speedtest: Timed Time - Size 1,000 botan: KASUMI compress-lz4: 9 - Compression Speed basis: UASTC Level 2 influxdb: 64 - 10000 - 2,5000,1 - 10000 ncnn: CPU - shufflenet-v2 compress-zstd: 19 - Decompression Speed svt-hevc: 1 - Bosphorus 1080p ncnn: CPU - mnasnet kvazaar: Bosphorus 4K - Ultra Fast compress-zstd: 19, Long Mode - Decompression Speed ncnn: CPU-v2-v2 - mobilenet-v2 compress-zstd: 19 - Compression Speed compress-lz4: 3 - Decompression Speed povray: Trace Time avifenc: 10, Lossless coremark: CoreMark Size 666 - Iterations Per Second compress-zstd: 8, Long Mode - Decompression Speed compress-lz4: 9 - Decompression Speed synthmark: VoiceMark_100 pjsip: OPTIONS, Stateless botan: CAST-256 asmfish: 1024 Hash Memory, 26 Depth cryptopp: Keyed Algorithms xmrig: Monero - 1M kvazaar: Bosphorus 4K - Medium aircrack-ng: botan: CAST-256 - Decrypt basis: UASTC Level 3 cryptopp: Unkeyed Algorithms xmrig: Wownero - 1M gnupg: 2.7GB Sample File Encryption libgav1: Chimera 1080p 10-bit libgav1: Summer Nature 1080p libgav1: Summer Nature 4K libgav1: Chimera 1080p ncnn: CPU - alexnet ncnn: CPU - vgg16 ncnn: CPU - efficientnet-b0 webp2: Quality 100, Compression Effort 5 yafaray: Total Time For Sample Scene tungsten: Volumetric Caustic mafft: Multiple Sequence Alignment - LSU RNA Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1441 184846 75.12 113.60 108.49 95.00 63.82 14.59 46.79 15.023 2.72185 7766 6805 6562 4.82 60281.306771 4166766667 865.867 866.953 232.7 4197466667 10.535 204.09 6.59 4559 15.52 3199 38237.468750 5677.238 304.17 320.928 5614.282 189.884 3627366667 1930066667 10.02 813674.4 7.363 3.40 20.923 33.568 2659.0 5884 332.678 280.691 10.084 7162 29.5066 43.856 286.013 14.7890 4.781 25.865503 36.050 44.3 15.38 6.02119 389.388 76.309 44.36 64.422 76.037 43.38 13.327 1217485.3 11.75 2551.8 32.50 11.44 28.14 2656.7 11.89 80.9 7192.7 9.025 8.455 2347780.427451 3204.4 7185.3 534.860 40962 115.287 166251925 608.426435 26057.3 7.40 210560.412 114.855 16.188 380.438710 42072.8 78.540 15.67 46.06 21.07 38.50 73.53 202.30 19.94 7.077 80.095 11.14004 17.077 116824 10158000 13.25 22.50 26.42 23.34 21.06 35.85 20.84 7.989 5.09923 5305 4740 4620 6.77 83257.941146 3167466667 679.571 685.526 288.6 3422600000 8.964 174.05 5.63 3897 13.31 2752 44139.001302 4979.655 266.96 364.907 4979.243 213.922 3232100000 1723700000 11.20 728618.4 8.207 3.07 19.046 36.849 2422.5 5386 361.431 304.849 9.315 6620 31.8414 40.729 307.571 13.8028 5.109 27.593344 33.865 47.0 14.50 6.38637 412.185 72.106 46.81 61.142 72.394 45.55 12.703 1162331.3 11.23 2656.1 31.32 11.05 27.19 2749.0 11.51 83.5 7423.6 9.314 8.722 2417894.893134 3299.4 7393.8 546.485 41775 113.050 169366836 598.416866 26441.1 7.31 209532.448 114.332 16.115 381.484647 42035.5 78.565 9.07 28.97 14.14 6.706 77.757 12.9638 16.131 OpenBenchmarking.org
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30K 60K 90K 120K 150K SE +/- 236.74, N = 3 1441 116824 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2M 4M 6M 8M 10M SE +/- 153.78, N = 3 SE +/- 36473.73, N = 3 184846 10158000 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.31, N = 3 SE +/- 0.41, N = 3 75.12 13.25 MIN: 74.2 / MAX: 92.38 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.14, N = 3 SE +/- 0.27, N = 3 113.60 22.50 MIN: 112.78 / MAX: 128.04 -lgomp -lpthread - MIN: 21.54 / MAX: 62.81 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.73, N = 3 SE +/- 0.74, N = 3 108.49 26.42 MIN: 106.38 / MAX: 119 -lgomp -lpthread - MIN: 24.26 / MAX: 43.04 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.42, N = 3 SE +/- 0.18, N = 3 95.00 23.34 MIN: 92.76 / MAX: 106.32 -lgomp -lpthread - MIN: 22.23 / MAX: 75.49 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.02, N = 3 63.82 21.06 MIN: 63.17 / MAX: 76.37 -lgomp -lpthread - MIN: 20.2 / MAX: 92.44 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.41, N = 3 14.59 35.85 MIN: 14.36 / MAX: 21.91 -lgomp -lpthread - MIN: 34.62 / MAX: 57.79 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.22, N = 3 SE +/- 0.23, N = 3 46.79 20.84 MIN: 45.73 / MAX: 62.24 -lgomp -lpthread - MIN: 19.62 / MAX: 212.02 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.097, N = 3 SE +/- 0.096, N = 3 15.023 7.989 1. (CC) gcc options: -lm -lpthread -O3 -march=native
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1473 2.2946 3.4419 4.5892 5.7365 SE +/- 0.01979, N = 3 SE +/- 0.02612, N = 3 2.72185 5.09923 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1700 3400 5100 6800 8500 SE +/- 125.31, N = 3 SE +/- 60.78, N = 6 7766 5305 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1500 3000 4500 6000 7500 SE +/- 111.86, N = 3 SE +/- 70.34, N = 4 6805 4740 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1400 2800 4200 5600 7000 SE +/- 89.55, N = 14 SE +/- 56.86, N = 5 6562 4620 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 4.82 6.77 MIN: 4.7 / MAX: 6.74 -lgomp -lpthread - MIN: 6.37 / MAX: 51.55 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20K 40K 60K 80K 100K SE +/- 708.19, N = 15 SE +/- 1019.88, N = 15 60281.31 83257.94 1. (CXX) g++ options: -O3 -march=native -fopenmp
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 13561014.38, N = 3 SE +/- 13945289.93, N = 3 4166766667 3167466667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 1.09, N = 3 SE +/- 4.19, N = 3 865.87 679.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 3.68, N = 3 SE +/- 4.35, N = 3 866.95 685.53 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 60 120 180 240 300 SE +/- 1.95, N = 3 SE +/- 2.75, N = 3 232.7 288.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 12898621.80, N = 3 SE +/- 9990161.83, N = 3 4197466667 3422600000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.142, N = 3 SE +/- 0.136, N = 3 10.535 8.964 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe 1. (CC) gcc options: -O3 -march=native -lncurses -lm
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40 80 120 160 200 SE +/- 0.64, N = 3 SE +/- 2.16, N = 3 204.09 174.05 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 6.59 5.63 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1000 2000 3000 4000 5000 SE +/- 18.45, N = 3 SE +/- 29.17, N = 3 4559 3897 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.09, N = 3 15.52 13.31 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 700 1400 2100 2800 3500 SE +/- 38.00, N = 3 SE +/- 35.41, N = 3 3199 2752 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 147.68, N = 3 SE +/- 143.23, N = 3 38237.47 44139.00 1. (CXX) g++ options: -O3 -march=native -fopenmp
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 3.33, N = 3 SE +/- 3.16, N = 3 5677.24 4979.66 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 4.07, N = 3 SE +/- 1.65, N = 3 304.17 266.96 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 3.06, N = 3 SE +/- 3.94, N = 3 320.93 364.91 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 55.64, N = 3 SE +/- 3.24, N = 3 5614.28 4979.24 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.40, N = 3 189.88 213.92 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 800M 1600M 2400M 3200M 4000M SE +/- 7338104.51, N = 3 SE +/- 19736514.38, N = 3 3627366667 3232100000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 400M 800M 1200M 1600M 2000M SE +/- 6519798.91, N = 3 SE +/- 5919459.43, N = 3 1930066667 1723700000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.12, N = 3 10.02 11.20 MIN: 9.87 / MAX: 10.62 -lgomp -lpthread - MIN: 10.61 / MAX: 17.17 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200K 400K 600K 800K 1000K SE +/- 3114.75, N = 3 SE +/- 1618.92, N = 3 813674.4 728618.4
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.120, N = 3 SE +/- 0.089, N = 3 7.363 8.207 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 0.765 1.53 2.295 3.06 3.825 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.40 3.07 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 5 10 15 20 25 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 20.92 19.05 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.34, N = 3 SE +/- 0.32, N = 3 33.57 36.85 1. (CXX) g++ options: -O3 -fPIC -lm
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 4.35, N = 3 SE +/- 6.01, N = 3 2659.0 2422.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Lion Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1300 2600 3900 5200 6500 SE +/- 15.30, N = 3 5884 5386 1. (CXX) g++ options: -O3 -march=native
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 1.63, N = 3 SE +/- 0.21, N = 3 332.68 361.43 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 280.69 304.85 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.038, N = 3 SE +/- 0.137, N = 4 10.084 9.315 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Church Facade Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1500 3000 4500 6000 7500 SE +/- 10.17, N = 3 SE +/- 7.69, N = 3 7162 6620 1. (CXX) g++ options: -O3 -march=native
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.20, N = 3 SE +/- 0.13, N = 3 29.51 31.84 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 10 20 30 40 50 SE +/- 0.22, N = 3 SE +/- 0.05, N = 3 43.86 40.73 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 286.01 307.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.22, N = 4 SE +/- 0.05, N = 3 14.79 13.80 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1495 2.299 3.4485 4.598 5.7475 SE +/- 0.060, N = 15 SE +/- 0.069, N = 15 4.781 5.109 1. (CXX) g++ options: -O3 -fPIC -lm
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 6 12 18 24 30 SE +/- 0.34, N = 3 SE +/- 0.35, N = 4 25.87 27.59 1. (CC) gcc options: -O3 -march=native -fopenmp
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 36.05 33.87 1. (CC) gcc options: -lm -O3 -march=native
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.67, N = 3 SE +/- 0.56, N = 15 44.3 47.0 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 15.38 14.50 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.07129, N = 15 SE +/- 0.07248, N = 15 6.02119 6.38637 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 90 180 270 360 450 SE +/- 0.12, N = 3 SE +/- 0.17, N = 3 389.39 412.19 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 76.31 72.11 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 44.36 46.81 1. (CC) gcc options: -O3
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 64.42 61.14 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.80, N = 3 76.04 72.39 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 10 20 30 40 50 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 43.38 45.55 1. (CC) gcc options: -O3
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.16, N = 3 13.33 12.70 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 300K 600K 900K 1200K 1500K SE +/- 2230.96, N = 3 SE +/- 7744.78, N = 3 1217485.3 1162331.3
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 11.75 11.23 MIN: 11.6 / MAX: 12.28 -lgomp -lpthread - MIN: 10.84 / MAX: 55.99 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 1.00, N = 3 SE +/- 15.68, N = 3 2551.8 2656.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 32.50 31.32 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 11.44 11.05 MIN: 11.33 / MAX: 11.98 -lgomp -lpthread - MIN: 10.47 / MAX: 19.28 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.31, N = 3 SE +/- 0.11, N = 3 28.14 27.19 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 1.35, N = 3 SE +/- 2.00, N = 15 2656.7 2749.0 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 11.89 11.51 MIN: 11.74 / MAX: 12.68 -lgomp -lpthread - MIN: 10.88 / MAX: 45.76 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.49, N = 3 SE +/- 0.61, N = 3 80.9 83.5 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1600 3200 4800 6400 8000 SE +/- 28.83, N = 3 SE +/- 39.65, N = 3 7192.7 7423.6 1. (CC) gcc options: -O3
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.121, N = 3 SE +/- 0.149, N = 3 9.025 9.314 -xHost 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.060, N = 3 SE +/- 0.133, N = 15 8.455 8.722 1. (CXX) g++ options: -O3 -fPIC -lm
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 500K 1000K 1500K 2000K 2500K SE +/- 18857.12, N = 3 SE +/- 1735.32, N = 3 2347780.43 2417894.89 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 700 1400 2100 2800 3500 SE +/- 17.52, N = 3 SE +/- 12.55, N = 3 3204.4 3299.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1600 3200 4800 6400 8000 SE +/- 7.48, N = 3 SE +/- 0.72, N = 3 7185.3 7393.8 1. (CC) gcc options: -O3
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 120 240 360 480 600 SE +/- 6.01, N = 3 SE +/- 0.47, N = 3 534.86 546.49 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 601.66, N = 3 SE +/- 632.78, N = 3 40962 41775 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 1.11, N = 3 115.29 113.05 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40M 80M 120M 160M 200M SE +/- 2171881.50, N = 3 SE +/- 1483247.98, N = 3 166251925 169366836
Crypto++ Test: Keyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Keyed Algorithms Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 130 260 390 520 650 SE +/- 0.23, N = 3 SE +/- 0.86, N = 3 608.43 598.42 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 6K 12K 18K 24K 30K SE +/- 270.05, N = 8 SE +/- 169.85, N = 3 26057.3 26441.1 -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 7.40 7.31 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50K 100K 150K 200K 250K SE +/- 494.94, N = 3 SE +/- 610.57, N = 3 210560.41 209532.45 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 114.86 114.33 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 16.19 16.12 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 0.19, N = 3 SE +/- 1.24, N = 3 380.44 381.48 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 156.73, N = 3 SE +/- 145.44, N = 3 42072.8 42035.5 -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.54, N = 3 SE +/- 0.16, N = 3 78.54 78.57 1. (CC) gcc options: -O3 -march=native
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 10-bit Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.03, N = 3 15.67 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 1080p Intel oneAPI DPC++ Compiler 2021.3 10 20 30 40 50 SE +/- 0.17, N = 3 46.06 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K Intel oneAPI DPC++ Compiler 2021.3 5 10 15 20 25 SE +/- 0.12, N = 3 21.07 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p Intel oneAPI DPC++ Compiler 2021.3 9 18 27 36 45 SE +/- 0.19, N = 3 38.50 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 16 32 48 64 80 SE +/- 0.19, N = 3 SE +/- 0.61, N = 3 73.53 9.07 MIN: 72.7 / MAX: 85.89 -lgomp -lpthread - MIN: 7.84 / MAX: 27.74 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40 80 120 160 200 SE +/- 1.68, N = 3 SE +/- 1.90, N = 3 202.30 28.97 MIN: 195.35 / MAX: 219.18 -lgomp -lpthread - MIN: 25.85 / MAX: 52.73 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 5 10 15 20 25 SE +/- 1.31, N = 3 SE +/- 0.04, N = 3 19.94 14.14 MIN: 18.2 / MAX: 40.6 -lgomp -lpthread - MIN: 13.35 / MAX: 34.49 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.089, N = 4 SE +/- 0.116, N = 14 7.077 6.706 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 2.07, N = 12 SE +/- 1.96, N = 15 80.10 77.76 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.37, N = 15 SE +/- 0.56, N = 15 11.14 12.96 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.17, N = 3 SE +/- 0.33, N = 12 17.08 16.13 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Phoronix Test Suite v10.8.4