Xeon Platinum 8380 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108113-IB-XEONPLATI03&rdt&grw .
Xeon Platinum 8380 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenCL Compiler File-System Screen Resolution Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 3841GB Micron_9300_MTFDHAL3T8TDP + 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 20.04 5.14.0-rc1-folio (x86_64) 20210715 GNOME Shell 3.36.4 X Server 1.20.9 OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 2.1 LINUX Intel oneAPI DPC++/C++ Compiler 2021.3.0 (2021.3.0.20210619) + ICC ext4 1920x1080 GCC 9.3.0 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Python Details - Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Compiler Details - GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Xeon Platinum 8380 toybrot: TBB toybrot: C++ Tasks toybrot: C++ Threads cryptopp: Keyed Algorithms cryptopp: Unkeyed Algorithms compress-lz4: 3 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 9 - Decompression Speed botan: KASUMI botan: KASUMI - Decrypt botan: AES-256 botan: AES-256 - Decrypt botan: Twofish botan: Twofish - Decrypt botan: Blowfish botan: Blowfish - Decrypt botan: CAST-256 botan: CAST-256 - Decrypt botan: ChaCha20Poly1305 botan: ChaCha20Poly1305 - Decrypt basis: UASTC Level 0 basis: UASTC Level 2 basis: UASTC Level 3 encode-mp3: WAV To MP3 draco: Lion draco: Church Facade webp2: Quality 95, Compression Effort 7 webp2: Quality 100, Compression Effort 5 webp2: Quality 100, Lossless Compression webp: Quality 100, Lossless webp: Quality 100, Highest Compression webp: Quality 100, Lossless, Highest Compression synthmark: VoiceMark_100 xmrig: Monero - 1M xmrig: Wownero - 1M quantlib: mafft: Multiple Sequence Alignment - LSU RNA ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m mt-dgemm: Sustained Floating-Point Rate coremark: CoreMark Size 666 - Iterations Per Second aircrack-ng: john-the-ripper: Blowfish john-the-ripper: MD5 compress-zstd: 19 - Compression Speed compress-zstd: 19 - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed compress-zstd: 19, Long Mode - Decompression Speed asmfish: 1024 Hash Memory, 26 Depth kvazaar: Bosphorus 4K - Medium kvazaar: Bosphorus 4K - Very Fast kvazaar: Bosphorus 4K - Ultra Fast libgav1: Chimera 1080p libgav1: Summer Nature 4K libgav1: Summer Nature 1080p libgav1: Chimera 1080p 10-bit aobench: 2048 x 2048 - Total Time tungsten: Hair tungsten: Water Caustic tungsten: Non-Exponential tungsten: Volumetric Caustic vpxenc: Speed 0 - Bosphorus 4K vpxenc: Speed 5 - Bosphorus 4K tachyon: Total Time x265: Bosphorus 4K c-ray: Total Time - 4K, 16 Rays Per Pixel svt-hevc: 1 - Bosphorus 1080p svt-hevc: 7 - Bosphorus 1080p svt-hevc: 10 - Bosphorus 1080p povray: Trace Time avifenc: 10 avifenc: 6, Lossless avifenc: 10, Lossless yafaray: Total Time For Sample Scene financebench: Repo OpenMP financebench: Bonds OpenMP liquid-dsp: 32 - 256 - 57 liquid-dsp: 64 - 256 - 57 liquid-dsp: 128 - 256 - 57 liquid-dsp: 160 - 256 - 57 influxdb: 4 - 10000 - 2,5000,1 - 10000 influxdb: 64 - 10000 - 2,5000,1 - 10000 sqlite-speedtest: Timed Time - Size 1,000 gnupg: 2.7GB Sample File Encryption pjsip: INVITE pjsip: OPTIONS, Stateful pjsip: OPTIONS, Stateless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 6562 7766 6805 608.426435 380.438710 44.36 7192.7 43.38 7185.3 76.037 76.309 5614.282 5677.238 286.013 280.691 320.928 332.678 115.287 114.855 865.867 866.953 10.084 13.327 16.188 10.535 5884 7162 189.884 7.077 389.388 20.923 7.363 43.856 534.860 26057.3 42072.8 2659.0 17.077 46.79 11.89 10.02 11.75 11.44 19.94 4.82 63.82 202.30 75.12 73.53 108.49 95.00 113.60 14.59 25.865503 2347780.427451 210560.412 1441 184846 80.9 2551.8 232.7 3204.4 44.3 2656.7 166251925 7.40 15.38 28.14 38.50 21.07 46.06 15.67 36.050 6.02119 29.5066 2.72185 11.14004 3.40 6.59 14.7890 15.52 15.023 32.50 204.09 304.17 9.025 4.781 33.568 8.455 80.095 38237.468750 60281.306771 1930066667 3627366667 4197466667 4166766667 813674.4 1217485.3 64.422 78.540 3199 4559 40962 4620 5305 4740 598.416866 381.484647 46.81 7423.6 45.55 7393.8 72.394 72.106 4979.243 4979.655 307.571 304.849 364.907 361.431 113.050 114.332 679.571 685.526 9.315 12.703 16.115 8.964 5386 6620 213.922 6.706 412.185 19.046 8.207 40.729 546.485 26441.1 42035.5 2422.5 16.131 20.84 11.51 11.20 11.23 11.05 14.14 6.77 21.06 28.97 13.25 9.07 26.42 23.34 22.50 35.85 27.593344 2417894.893134 209532.448 116824 10158000 83.5 2656.1 288.6 3299.4 47.0 2749.0 169366836 7.31 14.50 27.19 33.865 6.38637 31.8414 5.09923 12.9638 3.07 5.63 13.8028 13.31 7.989 31.32 174.05 266.96 9.314 5.109 36.849 8.722 77.757 44139.001302 83257.941146 1723700000 3232100000 3422600000 3167466667 728618.4 1162331.3 61.142 78.565 2752 3897 41775 OpenBenchmarking.org
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1400 2800 4200 5600 7000 SE +/- 89.55, N = 14 SE +/- 56.86, N = 5 6562 4620 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1700 3400 5100 6800 8500 SE +/- 125.31, N = 3 SE +/- 60.78, N = 6 7766 5305 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1500 3000 4500 6000 7500 SE +/- 111.86, N = 3 SE +/- 70.34, N = 4 6805 4740 -lm -lgcc -lgcc_s -lc 1. (CXX) g++ options: -O3 -march=native -lpthread
Crypto++ Test: Keyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Keyed Algorithms Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 130 260 390 520 650 SE +/- 0.23, N = 3 SE +/- 0.86, N = 3 608.43 598.42 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
Crypto++ Test: Unkeyed Algorithms OpenBenchmarking.org MiB/second, More Is Better Crypto++ 8.2 Test: Unkeyed Algorithms Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 0.19, N = 3 SE +/- 1.24, N = 3 380.44 381.48 1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 44.36 46.81 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 3 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1600 3200 4800 6400 8000 SE +/- 28.83, N = 3 SE +/- 39.65, N = 3 7192.7 7423.6 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 10 20 30 40 50 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 43.38 45.55 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.3 Compression Level: 9 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1600 3200 4800 6400 8000 SE +/- 7.48, N = 3 SE +/- 0.72, N = 3 7185.3 7393.8 1. (CC) gcc options: -O3
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.81, N = 3 SE +/- 0.80, N = 3 76.04 72.39 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 76.31 72.11 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 55.64, N = 3 SE +/- 3.24, N = 3 5614.28 4979.24 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1200 2400 3600 4800 6000 SE +/- 3.33, N = 3 SE +/- 3.16, N = 3 5677.24 4979.66 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 286.01 307.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 0.12, N = 3 SE +/- 0.20, N = 3 280.69 304.85 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 3.06, N = 3 SE +/- 3.94, N = 3 320.93 364.91 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 80 160 240 320 400 SE +/- 1.63, N = 3 SE +/- 0.21, N = 3 332.68 361.43 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.23, N = 3 SE +/- 1.11, N = 3 115.29 113.05 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 114.86 114.33 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 1.09, N = 3 SE +/- 4.19, N = 3 865.87 679.57 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200 400 600 800 1000 SE +/- 3.68, N = 3 SE +/- 4.35, N = 3 866.95 685.53 1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.038, N = 3 SE +/- 0.137, N = 4 10.084 9.315 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.16, N = 3 13.33 12.70 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.14, N = 3 16.19 16.12 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
LAME MP3 Encoding WAV To MP3 OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.142, N = 3 SE +/- 0.136, N = 3 10.535 8.964 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe 1. (CC) gcc options: -O3 -march=native -lncurses -lm
Google Draco Model: Lion OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Lion Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1300 2600 3900 5200 6500 SE +/- 15.30, N = 3 5884 5386 1. (CXX) g++ options: -O3 -march=native
Google Draco Model: Church Facade OpenBenchmarking.org ms, Fewer Is Better Google Draco 1.4.1 Model: Church Facade Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1500 3000 4500 6000 7500 SE +/- 10.17, N = 3 SE +/- 7.69, N = 3 7162 6620 1. (CXX) g++ options: -O3 -march=native
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.40, N = 3 189.88 213.92 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.089, N = 4 SE +/- 0.116, N = 14 7.077 6.706 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 90 180 270 360 450 SE +/- 0.12, N = 3 SE +/- 0.17, N = 3 389.39 412.19 1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 5 10 15 20 25 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 20.92 19.05 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.120, N = 3 SE +/- 0.089, N = 3 7.363 8.207 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org Encode Time - Seconds, Fewer Is Better WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 10 20 30 40 50 SE +/- 0.22, N = 3 SE +/- 0.05, N = 3 43.86 40.73 1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 120 240 360 480 600 SE +/- 6.01, N = 3 SE +/- 0.47, N = 3 534.86 546.49 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 6K 12K 18K 24K 30K SE +/- 270.05, N = 8 SE +/- 169.85, N = 3 26057.3 26441.1 -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 156.73, N = 3 SE +/- 145.44, N = 3 42072.8 42035.5 -funroll-loops -static-libgcc -static-libstdc++ 1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
QuantLib OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.21 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 4.35, N = 3 SE +/- 6.01, N = 3 2659.0 2422.5 1. (CXX) g++ options: -O3 -march=native -rdynamic
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.17, N = 3 SE +/- 0.33, N = 12 17.08 16.13 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.22, N = 3 SE +/- 0.23, N = 3 46.79 20.84 MIN: 45.73 / MAX: 62.24 -lgomp -lpthread - MIN: 19.62 / MAX: 212.02 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 11.89 11.51 MIN: 11.74 / MAX: 12.68 -lgomp -lpthread - MIN: 10.88 / MAX: 45.76 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.12, N = 3 10.02 11.20 MIN: 9.87 / MAX: 10.62 -lgomp -lpthread - MIN: 10.61 / MAX: 17.17 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 11.75 11.23 MIN: 11.6 / MAX: 12.28 -lgomp -lpthread - MIN: 10.84 / MAX: 55.99 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 11.44 11.05 MIN: 11.33 / MAX: 11.98 -lgomp -lpthread - MIN: 10.47 / MAX: 19.28 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 5 10 15 20 25 SE +/- 1.31, N = 3 SE +/- 0.04, N = 3 19.94 14.14 MIN: 18.2 / MAX: 40.6 -lgomp -lpthread - MIN: 13.35 / MAX: 34.49 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 4.82 6.77 MIN: 4.7 / MAX: 6.74 -lgomp -lpthread - MIN: 6.37 / MAX: 51.55 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.02, N = 3 63.82 21.06 MIN: 63.17 / MAX: 76.37 -lgomp -lpthread - MIN: 20.2 / MAX: 92.44 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40 80 120 160 200 SE +/- 1.68, N = 3 SE +/- 1.90, N = 3 202.30 28.97 MIN: 195.35 / MAX: 219.18 -lgomp -lpthread - MIN: 25.85 / MAX: 52.73 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.31, N = 3 SE +/- 0.41, N = 3 75.12 13.25 MIN: 74.2 / MAX: 92.38 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 16 32 48 64 80 SE +/- 0.19, N = 3 SE +/- 0.61, N = 3 73.53 9.07 MIN: 72.7 / MAX: 85.89 -lgomp -lpthread - MIN: 7.84 / MAX: 27.74 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.73, N = 3 SE +/- 0.74, N = 3 108.49 26.42 MIN: 106.38 / MAX: 119 -lgomp -lpthread - MIN: 24.26 / MAX: 43.04 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.42, N = 3 SE +/- 0.18, N = 3 95.00 23.34 MIN: 92.76 / MAX: 106.32 -lgomp -lpthread - MIN: 22.23 / MAX: 75.49 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30 60 90 120 150 SE +/- 0.14, N = 3 SE +/- 0.27, N = 3 113.60 22.50 MIN: 112.78 / MAX: 128.04 -lgomp -lpthread - MIN: 21.54 / MAX: 62.81 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.05, N = 3 SE +/- 0.41, N = 3 14.59 35.85 MIN: 14.36 / MAX: 21.91 -lgomp -lpthread - MIN: 34.62 / MAX: 57.79 1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 6 12 18 24 30 SE +/- 0.34, N = 3 SE +/- 0.35, N = 4 25.87 27.59 1. (CC) gcc options: -O3 -march=native -fopenmp
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 500K 1000K 1500K 2000K 2500K SE +/- 18857.12, N = 3 SE +/- 1735.32, N = 3 2347780.43 2417894.89 1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 50K 100K 150K 200K 250K SE +/- 494.94, N = 3 SE +/- 610.57, N = 3 210560.41 209532.45 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 30K 60K 90K 120K 150K SE +/- 236.74, N = 3 1441 116824 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2M 4M 6M 8M 10M SE +/- 153.78, N = 3 SE +/- 36473.73, N = 3 184846 10158000 -fopenmp 1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.49, N = 3 SE +/- 0.61, N = 3 80.9 83.5 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 1.00, N = 3 SE +/- 15.68, N = 3 2551.8 2656.1 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 60 120 180 240 300 SE +/- 1.95, N = 3 SE +/- 2.75, N = 3 232.7 288.6 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 8, Long Mode - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 700 1400 2100 2800 3500 SE +/- 17.52, N = 3 SE +/- 12.55, N = 3 3204.4 3299.4 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 11 22 33 44 55 SE +/- 0.67, N = 3 SE +/- 0.56, N = 15 44.3 47.0 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 600 1200 1800 2400 3000 SE +/- 1.35, N = 3 SE +/- 2.00, N = 15 2656.7 2749.0 1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40M 80M 120M 160M 200M SE +/- 2171881.50, N = 3 SE +/- 1483247.98, N = 3 166251925 169366836
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Medium Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 7.40 7.31 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 15.38 14.50 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.31, N = 3 SE +/- 0.11, N = 3 28.14 27.19 -lpthread 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
libgav1 Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p Intel oneAPI DPC++ Compiler 2021.3 9 18 27 36 45 SE +/- 0.19, N = 3 38.50 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 4K Intel oneAPI DPC++ Compiler 2021.3 5 10 15 20 25 SE +/- 0.12, N = 3 21.07 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Summer Nature 1080p Intel oneAPI DPC++ Compiler 2021.3 10 20 30 40 50 SE +/- 0.17, N = 3 46.06 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
libgav1 Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better libgav1 0.16.3 Video Input: Chimera 1080p 10-bit Intel oneAPI DPC++ Compiler 2021.3 4 8 12 16 20 SE +/- 0.03, N = 3 15.67 1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
AOBench Size: 2048 x 2048 - Total Time OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.14, N = 3 SE +/- 0.03, N = 3 36.05 33.87 1. (CC) gcc options: -lm -O3 -march=native
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.07129, N = 15 SE +/- 0.07248, N = 15 6.02119 6.38637 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 7 14 21 28 35 SE +/- 0.20, N = 3 SE +/- 0.13, N = 3 29.51 31.84 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1473 2.2946 3.4419 4.5892 5.7365 SE +/- 0.01979, N = 3 SE +/- 0.02612, N = 3 2.72185 5.09923 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.37, N = 15 SE +/- 0.56, N = 15 11.14 12.96 -fstrict-aliasing 1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 0.765 1.53 2.295 3.06 3.825 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 3.40 3.07 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 6.59 5.63 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.22, N = 4 SE +/- 0.05, N = 3 14.79 13.80 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.21, N = 3 SE +/- 0.09, N = 3 15.52 13.31 1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 4 8 12 16 20 SE +/- 0.097, N = 3 SE +/- 0.096, N = 3 15.023 7.989 1. (CC) gcc options: -lm -lpthread -O3 -march=native
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.08, N = 3 SE +/- 0.26, N = 3 32.50 31.32 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 40 80 120 160 200 SE +/- 0.64, N = 3 SE +/- 2.16, N = 3 204.09 174.05 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 70 140 210 280 350 SE +/- 4.07, N = 3 SE +/- 1.65, N = 3 304.17 266.96 1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 3 6 9 12 15 SE +/- 0.121, N = 3 SE +/- 0.149, N = 3 9.025 9.314 -xHost 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1.1495 2.299 3.4485 4.598 5.7475 SE +/- 0.060, N = 15 SE +/- 0.069, N = 15 4.781 5.109 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 8 16 24 32 40 SE +/- 0.34, N = 3 SE +/- 0.32, N = 3 33.57 36.85 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 2 4 6 8 10 SE +/- 0.060, N = 3 SE +/- 0.133, N = 15 8.455 8.722 1. (CXX) g++ options: -O3 -fPIC -lm
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 2.07, N = 12 SE +/- 1.96, N = 15 80.10 77.76 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
FinanceBench Benchmark: Repo OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Repo OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 147.68, N = 3 SE +/- 143.23, N = 3 38237.47 44139.00 1. (CXX) g++ options: -O3 -march=native -fopenmp
FinanceBench Benchmark: Bonds OpenMP OpenBenchmarking.org ms, Fewer Is Better FinanceBench 2016-07-25 Benchmark: Bonds OpenMP Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20K 40K 60K 80K 100K SE +/- 708.19, N = 15 SE +/- 1019.88, N = 15 60281.31 83257.94 1. (CXX) g++ options: -O3 -march=native -fopenmp
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 32 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 400M 800M 1200M 1600M 2000M SE +/- 6519798.91, N = 3 SE +/- 5919459.43, N = 3 1930066667 1723700000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 800M 1600M 2400M 3200M 4000M SE +/- 7338104.51, N = 3 SE +/- 19736514.38, N = 3 3627366667 3232100000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 12898621.80, N = 3 SE +/- 9990161.83, N = 3 4197466667 3422600000 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 900M 1800M 2700M 3600M 4500M SE +/- 13561014.38, N = 3 SE +/- 13945289.93, N = 3 4166766667 3167466667 1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
InfluxDB Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 200K 400K 600K 800K 1000K SE +/- 3114.75, N = 3 SE +/- 1618.92, N = 3 813674.4 728618.4
InfluxDB Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 OpenBenchmarking.org val/sec, More Is Better InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 300K 600K 900K 1200K 1500K SE +/- 2230.96, N = 3 SE +/- 7744.78, N = 3 1217485.3 1162331.3
SQLite Speedtest Timed Time - Size 1,000 OpenBenchmarking.org Seconds, Fewer Is Better SQLite Speedtest 3.30 Timed Time - Size 1,000 Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 14 28 42 56 70 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 64.42 61.14 1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
GnuPG 2.7GB Sample File Encryption OpenBenchmarking.org Seconds, Fewer Is Better GnuPG 2.2.27 2.7GB Sample File Encryption Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 20 40 60 80 100 SE +/- 0.54, N = 3 SE +/- 0.16, N = 3 78.54 78.57 1. (CC) gcc options: -O3 -march=native
PJSIP Method: INVITE OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: INVITE Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 700 1400 2100 2800 3500 SE +/- 38.00, N = 3 SE +/- 35.41, N = 3 3199 2752 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
PJSIP Method: OPTIONS, Stateful OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateful Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 1000 2000 3000 4000 5000 SE +/- 18.45, N = 3 SE +/- 29.17, N = 3 4559 3897 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
PJSIP Method: OPTIONS, Stateless OpenBenchmarking.org Responses Per Second, More Is Better PJSIP 2.11 Method: OPTIONS, Stateless Intel oneAPI DPC++ Compiler 2021.3 GCC 9.3 9K 18K 27K 36K 45K SE +/- 601.66, N = 3 SE +/- 632.78, N = 3 40962 41775 1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
Phoronix Test Suite v10.8.4