Xeon Platinum 8380 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108113-IB-XEONPLATI03&sor .
Xeon Platinum 8380 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenCL Compiler File-System Screen Resolution Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 3841GB Micron_9300_MTFDHAL3T8TDP + 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 20.04 5.14.0-rc1-folio (x86_64) 20210715 GNOME Shell 3.36.4 X Server 1.20.9 OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 2.1 LINUX Intel oneAPI DPC++/C++ Compiler 2021.3.0 (2021.3.0.20210619) + ICC ext4 1920x1080 GCC 9.3.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Python Details - Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Compiler Details - GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Xeon Platinum 8380 quantlib: cryptopp: Keyed Algorithms cryptopp: Unkeyed Algorithms toybrot: TBB toybrot: C++ Tasks toybrot: C++ Threads mafft: Multiple Sequence Alignment - LSU RNA webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression libgav1: Chimera 1080p libgav1: Summer Nature 4K libgav1: Summer Nature 1080p libgav1: Chimera 1080p 10-bit xmrig: Monero - 1M xmrig: Wownero - 1M compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed botan: KASUMI botan: KASUMI - Decrypt botan: AES-256 botan: AES-256 - Decrypt botan: Twofish botan: Twofish - Decrypt botan: Blowfish botan: Blowfish - Decrypt botan: CAST-256 botan: CAST-256 - Decrypt botan: ChaCha20Poly1305 botan: ChaCha20Poly1305 - Decrypt john-the-ripper: Blowfish john-the-ripper: MD5 kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast svt-hevc: 1 - Bosphorus 1080p svt-hevc: 7 - Bosphorus 1080p svt-hevc: 10 - Bosphorus 1080p vpxenc: Speed 0 - Bosphorus 4K vpxenc: Speed 5 - Bosphorus 4K x265: Bosphorus 4K mt-dgemm: Sustained Floating-Point Rate coremark: CoreMark Size 666 - Iterations Per Second asmfish: 1024 Hash Memory, 26 Depth pjsip: INVITE pjsip: OPTIONS, Stateful pjsip: OPTIONS, Stateless avifenc: 10 avifenc: 6, Lossless avifenc: 10, Lossless c-ray: Total Time - 4K, 16 Rays Per Pixel povray: Trace Time tungsten: Hair tungsten: Water Caustic tungsten: Non-Exponential tungsten: Volumetric Caustic yafaray: Total Time For Sample Scene aobench: 2048 x 2048 - Total Time encode-mp3: WAV To MP3 tachyon: Total Time webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression synthmark: VoiceMark_100 aircrack-ng: liquid-dsp: 32 - 256 - 57 liquid-dsp: 64 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 160 - 256 - 57 financebench: Repo OpenMP financebench: Bonds OpenMP basis: UASTC Level 0 basis: UASTC Level 2 basis: UASTC Level 3 sqlite-speedtest: Timed Time - Size 1,000 draco: Lion draco: Church Facade ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m gnupg: 2.7GB Sample File Encryption influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2659.0 608.426435 380.438710 6562 7766 6805 17.077 20.923 7.363 43.856 38.50 21.07 46.06 15.67 26057.3 42072.8 44.36 7192.7 43.38 7185.3 80.9 2551.8 232.7 3204.4 44.3 2656.7 76.037 76.309 5614.282 5677.238 286.013 280.691 320.928 332.678 115.287 114.855 865.867 866.953 1441 184846 7.40 15.38 28.14 32.50 204.09 304.17 3.40 6.59 15.52 25.865503 2347780.427451 166251925 3199 4559 40962 4.781 33.568 8.455 15.023 9.025 6.02119 29.5066 2.72185 11.14004 80.095 36.050 10.535 14.7890 189.884 7.077 389.388 534.860 210560.412 1930066667 3627366667 4197466667 4166766667 38237.468750 60281.306771 10.084 13.327 16.188 64.422 5884 7162 46.79 11.89 10.02 11.75 11.44 19.94 4.82 63.82 202.30 75.12 73.53 108.49 95.00 113.60 14.59 78.540 813674.4 1217485.3 2422.5 598.416866 381.484647 4620 5305 4740 16.131 19.046 8.207 40.729 26441.1 42035.5 46.81 7423.6 45.55 7393.8 83.5 2656.1 288.6 3299.4 47.0 2749.0 72.394 72.106 4979.243 4979.655 307.571 304.849 364.907 361.431 113.050 114.332 679.571 685.526 116824 10158000 7.31 14.50 27.19 31.32 174.05 266.96 3.07 5.63 13.31 27.593344 2417894.893134 169366836 2752 3897 41775 5.109 36.849 8.722 7.989 9.314 6.38637 31.8414 5.09923 12.9638 77.757 33.865 8.964 13.8028 213.922 6.706 412.185 546.485 209532.448 1723700000 3232100000 3422600000 3167466667 44139.001302 83257.941146 9.315 12.703 16.115 61.142 5386 6620 20.84 11.51 11.20 11.23 11.05 14.14 6.77 21.06 28.97 13.25 9.07 26.42 23.34 22.50 35.85 78.565 728618.4 1162331.3 OpenBenchmarking.org
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 4.35, N = 3 SE +/- 6.01, N = 3 2659.0 2422.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
Crypto++ Test: Keyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Keyed Algorithms Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 130 260 390 520 650 SE +/- 0.23, N = 3 SE +/- 0.86, N = 3 608.43 598.42 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 80 160 240 320 400 SE +/- 1.24, N = 3 SE +/- 0.19, N = 3 381.48 380.44 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1400 2800 4200 5600 7000 SE +/- 56.86, N = 5 SE +/- 89.55, N = 14 4620 6562 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1700 3400 5100 6800 8500 SE +/- 60.78, N = 6 SE +/- 125.31, N = 3 5305 7766 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1500 3000 4500 6000 7500 SE +/- 70.34, N = 4 SE +/- 111.86, N = 3 4740 6805 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.33, N = 12 SE +/- 0.17, N = 3 16.13 17.08 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 5 10 15 20 25 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 19.05 20.92 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.120, N = 3 SE +/- 0.089, N = 3 7.363 8.207 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 10 20 30 40 50 SE +/- 0.05, N = 3 SE +/- 0.22, N = 3 40.73 43.86 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p Intel oneAPI DPC++ Compiler 2021.3 9 18 27 36 45 SE +/- 0.19, N = 3 38.50 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K Intel oneAPI DPC++ Compiler 2021.3 5 10 15 20 25 SE +/- 0.12, N = 3 21.07 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 1080p Intel oneAPI DPC++ Compiler 2021.3 10 20 30 40 50 SE +/- 0.17, N = 3 46.06 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 10-bit Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.03, N = 3 15.67 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 6K 12K 18K 24K 30K SE +/- 169.85, N = 3 SE +/- 270.05, N = 8 26441.1 26057.3 -static-libgcc -static-libstdc++ -funroll-loops 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 156.73, N = 3 SE +/- 145.44, N = 3 42072.8 42035.5 -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 11 22 33 44 55 SE +/- 0.03, N = 3 SE +/- 0.26, N = 3 46.81 44.36 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1600 3200 4800 6400 8000 SE +/- 39.65, N = 3 SE +/- 28.83, N = 3 7423.6 7192.7 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 10 20 30 40 50 SE +/- 0.19, N = 3 SE +/- 0.11, N = 3 45.55 43.38 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1600 3200 4800 6400 8000 SE +/- 0.72, N = 3 SE +/- 7.48, N = 3 7393.8 7185.3 1. (CC) gcc options: -O3
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.49, N = 3 83.5 80.9 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 600 1200 1800 2400 3000 SE +/- 15.68, N = 3 SE +/- 1.00, N = 3 2656.1 2551.8 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 60 120 180 240 300 SE +/- 2.75, N = 3 SE +/- 1.95, N = 3 288.6 232.7 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 700 1400 2100 2800 3500 SE +/- 12.55, N = 3 SE +/- 17.52, N = 3 3299.4 3204.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 11 22 33 44 55 SE +/- 0.56, N = 15 SE +/- 0.67, N = 3 47.0 44.3 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 600 1200 1800 2400 3000 SE +/- 2.00, N = 15 SE +/- 1.35, N = 3 2749.0 2656.7 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.80, N = 3 76.04 72.39 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 76.31 72.11 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 55.64, N = 3 SE +/- 3.24, N = 3 5614.28 4979.24 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 3.33, N = 3 SE +/- 3.16, N = 3 5677.24 4979.66 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 70 140 210 280 350 SE +/- 0.15, N = 3 SE +/- 0.10, N = 3 307.57 286.01 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 70 140 210 280 350 SE +/- 0.20, N = 3 SE +/- 0.12, N = 3 304.85 280.69 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 80 160 240 320 400 SE +/- 3.94, N = 3 SE +/- 3.06, N = 3 364.91 320.93 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 80 160 240 320 400 SE +/- 0.21, N = 3 SE +/- 1.63, N = 3 361.43 332.68 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 1.11, N = 3 115.29 113.05 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 114.86 114.33 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 1.09, N = 3 SE +/- 4.19, N = 3 865.87 679.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 3.68, N = 3 SE +/- 4.35, N = 3 866.95 685.53 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 30K 60K 90K 120K 150K SE +/- 236.74, N = 3 116824 1441 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 2M 4M 6M 8M 10M SE +/- 36473.73, N = 3 SE +/- 153.78, N = 3 10158000 184846 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 7.40 7.31 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 15.38 14.50 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.31, N = 3 SE +/- 0.11, N = 3 28.14 27.19 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 32.50 31.32 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40 80 120 160 200 SE +/- 0.64, N = 3 SE +/- 2.16, N = 3 204.09 174.05 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 4.07, N = 3 SE +/- 1.65, N = 3 304.17 266.96 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 0.765 1.53 2.295 3.06 3.825 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.40 3.07 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 6.59 5.63 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.09, N = 3 15.52 13.31 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 6 12 18 24 30 SE +/- 0.35, N = 4 SE +/- 0.34, N = 3 27.59 25.87 1. (CC) gcc options: -O3 -march=native -fopenmp
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 500K 1000K 1500K 2000K 2500K SE +/- 1735.32, N = 3 SE +/- 18857.12, N = 3 2417894.89 2347780.43 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 40M 80M 120M 160M 200M SE +/- 1483247.98, N = 3 SE +/- 2171881.50, N = 3 169366836 166251925
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 700 1400 2100 2800 3500 SE +/- 38.00, N = 3 SE +/- 35.41, N = 3 3199 2752 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1000 2000 3000 4000 5000 SE +/- 18.45, N = 3 SE +/- 29.17, N = 3 4559 3897 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 9K 18K 27K 36K 45K SE +/- 632.78, N = 3 SE +/- 601.66, N = 3 41775 40962 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1495 2.299 3.4485 4.598 5.7475 SE +/- 0.060, N = 15 SE +/- 0.069, N = 15 4.781 5.109 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.34, N = 3 SE +/- 0.32, N = 3 33.57 36.85 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.060, N = 3 SE +/- 0.133, N = 15 8.455 8.722 1. (CXX) g++ options: -O3 -fPIC -lm
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.096, N = 3 SE +/- 0.097, N = 3 7.989 15.023 1. (CC) gcc options: -lm -lpthread -O3 -march=native
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.121, N = 3 SE +/- 0.149, N = 3 9.025 9.314 -xHost 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.07129, N = 15 SE +/- 0.07248, N = 15 6.02119 6.38637 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.20, N = 3 SE +/- 0.13, N = 3 29.51 31.84 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1473 2.2946 3.4419 4.5892 5.7365 SE +/- 0.01979, N = 3 SE +/- 0.02612, N = 3 2.72185 5.09923 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.37, N = 15 SE +/- 0.56, N = 15 11.14 12.96 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 20 40 60 80 100 SE +/- 1.96, N = 15 SE +/- 2.07, N = 12 77.76 80.10 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 33.87 36.05 1. (CC) gcc options: -lm -O3 -march=native
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.136, N = 3 SE +/- 0.142, N = 3 8.964 10.535 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe 1. (CC) gcc options: -O3 -march=native -lncurses -lm
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.05, N = 3 SE +/- 0.22, N = 4 13.80 14.79 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.40, N = 3 189.88 213.92 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 2 4 6 8 10 SE +/- 0.116, N = 14 SE +/- 0.089, N = 4 6.706 7.077 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 90 180 270 360 450 SE +/- 0.12, N = 3 SE +/- 0.17, N = 3 389.39 412.19 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 120 240 360 480 600 SE +/- 0.47, N = 3 SE +/- 6.01, N = 3 546.49 534.86 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50K 100K 150K 200K 250K SE +/- 494.94, N = 3 SE +/- 610.57, N = 3 210560.41 209532.45 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 400M 800M 1200M 1600M 2000M SE +/- 6519798.91, N = 3 SE +/- 5919459.43, N = 3 1930066667 1723700000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 800M 1600M 2400M 3200M 4000M SE +/- 7338104.51, N = 3 SE +/- 19736514.38, N = 3 3627366667 3232100000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 12898621.80, N = 3 SE +/- 9990161.83, N = 3 4197466667 3422600000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 13561014.38, N = 3 SE +/- 13945289.93, N = 3 4166766667 3167466667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 147.68, N = 3 SE +/- 143.23, N = 3 38237.47 44139.00 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20K 40K 60K 80K 100K SE +/- 708.19, N = 15 SE +/- 1019.88, N = 15 60281.31 83257.94 1. (CXX) g++ options: -O3 -march=native -fopenmp
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.137, N = 4 SE +/- 0.038, N = 3 9.315 10.084 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.16, N = 3 SE +/- 0.01, N = 3 12.70 13.33 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.11, N = 3 16.12 16.19 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 61.14 64.42 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Lion GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1300 2600 3900 5200 6500 SE +/- 15.30, N = 3 5386 5884 1. (CXX) g++ options: -O3 -march=native
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Church Facade GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 1500 3000 4500 6000 7500 SE +/- 7.69, N = 3 SE +/- 10.17, N = 3 6620 7162 1. (CXX) g++ options: -O3 -march=native
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 11 22 33 44 55 SE +/- 0.23, N = 3 SE +/- 0.22, N = 3 20.84 46.79 -lgomp -lpthread - MIN: 19.62 / MAX: 212.02 MIN: 45.73 / MAX: 62.24 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 11.51 11.89 -lgomp -lpthread - MIN: 10.88 / MAX: 45.76 MIN: 11.74 / MAX: 12.68 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.12, N = 3 10.02 11.20 MIN: 9.87 / MAX: 10.62 -lgomp -lpthread - MIN: 10.61 / MAX: 17.17 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 11.23 11.75 -lgomp -lpthread - MIN: 10.84 / MAX: 55.99 MIN: 11.6 / MAX: 12.28 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 11.05 11.44 -lgomp -lpthread - MIN: 10.47 / MAX: 19.28 MIN: 11.33 / MAX: 11.98 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 1.31, N = 3 14.14 19.94 -lgomp -lpthread - MIN: 13.35 / MAX: 34.49 MIN: 18.2 / MAX: 40.6 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 4.82 6.77 MIN: 4.7 / MAX: 6.74 -lgomp -lpthread - MIN: 6.37 / MAX: 51.55 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 14 28 42 56 70 SE +/- 0.02, N = 3 SE +/- 0.15, N = 3 21.06 63.82 -lgomp -lpthread - MIN: 20.2 / MAX: 92.44 MIN: 63.17 / MAX: 76.37 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 40 80 120 160 200 SE +/- 1.90, N = 3 SE +/- 1.68, N = 3 28.97 202.30 -lgomp -lpthread - MIN: 25.85 / MAX: 52.73 MIN: 195.35 / MAX: 219.18 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 20 40 60 80 100 SE +/- 0.41, N = 3 SE +/- 0.31, N = 3 13.25 75.12 MIN: 74.2 / MAX: 92.38 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 16 32 48 64 80 SE +/- 0.61, N = 3 SE +/- 0.19, N = 3 9.07 73.53 -lgomp -lpthread - MIN: 7.84 / MAX: 27.74 MIN: 72.7 / MAX: 85.89 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 20 40 60 80 100 SE +/- 0.74, N = 3 SE +/- 0.73, N = 3 26.42 108.49 -lgomp -lpthread - MIN: 24.26 / MAX: 43.04 MIN: 106.38 / MAX: 119 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.42, N = 3 23.34 95.00 -lgomp -lpthread - MIN: 22.23 / MAX: 75.49 MIN: 92.76 / MAX: 106.32 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd GCC 9.3 Intel oneAPI DPC++ Compiler 2021.3 30 60 90 120 150 SE +/- 0.27, N = 3 SE +/- 0.14, N = 3 22.50 113.60 -lgomp -lpthread - MIN: 21.54 / MAX: 62.81 MIN: 112.78 / MAX: 128.04 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.41, N = 3 14.59 35.85 MIN: 14.36 / MAX: 21.91 -lgomp -lpthread - MIN: 34.62 / MAX: 57.79 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.54, N = 3 SE +/- 0.16, N = 3 78.54 78.57 1. (CC) gcc options: -O3 -march=native
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200K 400K 600K 800K 1000K SE +/- 3114.75, N = 3 SE +/- 1618.92, N = 3 813674.4 728618.4
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 300K 600K 900K 1200K 1500K SE +/- 2230.96, N = 3 SE +/- 7744.78, N = 3 1217485.3 1162331.3
Phoronix Test Suite v10.8.4