Xeon Platinum 8380
2 x Intel Xeon Platinum 8380 testing with an Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108113-IB-XEONPLATI03&grr .
Xeon Platinum 8380
Result identifiers: Intel oneAPI DPC++ Compiler 2021.3, GCC 9.3
  Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
  Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
  Chipset: Intel Device 0998
  Memory: 504GB
  Disk: 3841GB Micron_9300_MTFDHAL3T8TDP + 7682GB INTEL SSDPF2KX076TZ
  Graphics: ASPEED
  Monitor: VE228
  Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
  OS: Ubuntu 20.04
  Kernel: 5.14.0-rc1-folio (x86_64) 20210715
  Desktop: GNOME Shell 3.36.4
  Display Server: X Server 1.20.9
  OpenCL: OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 2.1 LINUX
  Compiler (Intel oneAPI DPC++ Compiler 2021.3): Intel oneAPI DPC++/C++ Compiler 2021.3.0 (2021.3.0.20210619) + ICC
  Compiler (GCC 9.3): GCC 9.3.0
  File-System: ext4
  Screen Resolution: 1920x1080
Kernel Details - Transparent Huge Pages: madvise
Environment Details - CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270
Python Details - Python 2.7.18 + Python 3.8.5
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Compiler Details - GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Xeon Platinum 8380 results overview
Test | Intel oneAPI DPC++ Compiler 2021.3 | GCC 9.3
libgav1: Chimera 1080p 10-bit | 15.67 | -
financebench: Bonds OpenMP | 60281.306771 | 83257.941146
webp2: Quality 100, Lossless Compression | 389.388 | 412.185
cryptopp: Keyed Algorithms | 608.426435 | 598.416866
yafaray: Total Time For Sample Scene | 80.095 | 77.757
libgav1: Chimera 1080p | 38.50 | -
webp2: Quality 95, Compression Effort 7 | 189.884 | 213.922
ncnn: CPU - regnety_400m | 14.59 | 35.85
ncnn: CPU - squeezenet_ssd | 113.60 | 22.50
ncnn: CPU - yolov4-tiny | 95.00 | 23.34
ncnn: CPU - resnet50 | 108.49 | 26.42
ncnn: CPU - alexnet | 73.53 | 9.07
ncnn: CPU - resnet18 | 75.12 | 13.25
ncnn: CPU - vgg16 | 202.30 | 28.97
ncnn: CPU - googlenet | 63.82 | 21.06
ncnn: CPU - blazeface | 4.82 | 6.77
ncnn: CPU - efficientnet-b0 | 19.94 | 14.14
ncnn: CPU - mnasnet | 11.44 | 11.05
ncnn: CPU - shufflenet-v2 | 11.75 | 11.23
ncnn: CPU-v3-v3 - mobilenet-v3 | 10.02 | 11.20
ncnn: CPU-v2-v2 - mobilenet-v2 | 11.89 | 11.51
ncnn: CPU - mobilenet | 46.79 | 20.84
vpxenc: Speed 0 - Bosphorus 4K | 3.40 | 3.07
libgav1: Summer Nature 4K | 21.07 | -
asmfish: 1024 Hash Memory, 26 Depth | 166251925 | 169366836
influxdb: 4 - 10000 - 2,5000,1 - 10000 | 813674.4 | 728618.4
compress-zstd: 19, Long Mode - Decompression Speed | 2656.7 | 2749.0
compress-zstd: 19, Long Mode - Compression Speed | 44.3 | 47.0
vpxenc: Speed 5 - Bosphorus 4K | 6.59 | 5.63
influxdb: 64 - 10000 - 2,5000,1 - 10000 | 1217485.3 | 1162331.3
kvazaar: Bosphorus 4K - Medium | 7.40 | 7.31
gnupg: 2.7GB Sample File Encryption | 78.540 | 78.565
libgav1: Summer Nature 1080p | 46.06 | -
xmrig: Monero - 1M | 26057.3 | 26441.1
cryptopp: Unkeyed Algorithms | 380.438710 | 381.484647
compress-lz4: 9 - Decompression Speed | 7185.3 | 7393.8
compress-lz4: 9 - Compression Speed | 43.38 | 45.55
compress-lz4: 3 - Decompression Speed | 7192.7 | 7423.6
compress-lz4: 3 - Compression Speed | 44.36 | 46.81
pjsip: INVITE | 3199 | 2752
pjsip: OPTIONS, Stateful | 4559 | 3897
sqlite-speedtest: Timed Time - Size 1,000 | 64.422 | 61.142
tungsten: Volumetric Caustic | 11.14004 | 12.9638
john-the-ripper: MD5 | 184846 | 10158000
pjsip: OPTIONS, Stateless | 40962 | 41775
financebench: Repo OpenMP | 38237.468750 | 44139.001302
compress-zstd: 19 - Decompression Speed | 2551.8 | 2656.1
compress-zstd: 19 - Compression Speed | 80.9 | 83.5
webp: Quality 100, Lossless, Highest Compression | 43.856 | 40.729
x265: Bosphorus 4K | 15.52 | 13.31
tungsten: Water Caustic | 29.5066 | 31.8414
mafft: Multiple Sequence Alignment - LSU RNA | 17.077 | 16.131
kvazaar: Bosphorus 4K - Very Fast | 15.38 | 14.50
compress-zstd: 8, Long Mode - Decompression Speed | 3204.4 | 3299.4
compress-zstd: 8, Long Mode - Compression Speed | 232.7 | 288.6
avifenc: 6, Lossless | 33.568 | 36.849
aobench: 2048 x 2048 - Total Time | 36.050 | 33.865
botan: AES-256 - Decrypt | 5677.238 | 4979.655
botan: AES-256 | 5614.282 | 4979.243
tungsten: Hair | 6.02119 | 6.38637
quantlib: | 2659.0 | 2422.5
coremark: CoreMark Size 666 - Iterations Per Second | 2347780.427451 | 2417894.893134
botan: ChaCha20Poly1305 - Decrypt | 866.953 | 685.526
botan: ChaCha20Poly1305 | 865.867 | 679.571
botan: Blowfish - Decrypt | 332.678 | 361.431
botan: Blowfish | 320.928 | 364.907
botan: Twofish - Decrypt | 280.691 | 304.849
botan: Twofish | 286.013 | 307.571
john-the-ripper: Blowfish | 1441 | 116824
aircrack-ng: | 210560.412 | 209532.448
botan: CAST-256 - Decrypt | 114.855 | 114.332
botan: CAST-256 | 115.287 | 113.050
botan: KASUMI - Decrypt | 76.309 | 72.106
botan: KASUMI | 76.037 | 72.394
synthmark: VoiceMark_100 | 534.860 | 546.485
xmrig: Wownero - 1M | 42072.8 | 42035.5
avifenc: 10, Lossless | 8.455 | 8.722
avifenc: 10 | 4.781 | 5.109
kvazaar: Bosphorus 4K - Ultra Fast | 28.14 | 27.19
webp2: Quality 100, Compression Effort 5 | 7.077 | 6.706
svt-hevc: 1 - Bosphorus 1080p | 32.50 | 31.32
webp: Quality 100, Lossless | 20.923 | 19.046
liquid-dsp: 160 - 256 - 57 | 4166766667 | 3167466667
liquid-dsp: 128 - 256 - 57 | 4197466667 | 3422600000
liquid-dsp: 64 - 256 - 57 | 3627366667 | 3232100000
liquid-dsp: 32 - 256 - 57 | 1930066667 | 1723700000
toybrot: TBB | 6562 | 4620
tachyon: Total Time | 14.7890 | 13.8028
basis: UASTC Level 3 | 16.188 | 16.115
basis: UASTC Level 2 | 13.327 | 12.703
c-ray: Total Time - 4K, 16 Rays Per Pixel | 15.023 | 7.989
povray: Trace Time | 9.025 | 9.314
basis: UASTC Level 0 | 10.084 | 9.315
mt-dgemm: Sustained Floating-Point Rate | 25.865503 | 27.593344
encode-mp3: WAV To MP3 | 10.535 | 8.964
toybrot: C++ Tasks | 7766 | 5305
draco: Church Facade | 7162 | 6620
webp: Quality 100, Highest Compression | 7.363 | 8.207
draco: Lion | 5884 | 5386
toybrot: C++ Threads | 6805 | 4740
svt-hevc: 7 - Bosphorus 1080p | 204.09 | 174.05
svt-hevc: 10 - Bosphorus 1080p | 304.17 | 266.96
tungsten: Non-Exponential | 2.72185 | 5.09923
libgav1 0.16.3, Video Input: Chimera 1080p 10-bit (FPS, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 15.67 (SE +/- 0.03, N = 3)
  1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
FinanceBench 2016-07-25, Benchmark: Bonds OpenMP (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 60281.31 (SE +/- 708.19, N = 15)
  GCC 9.3: 83257.94 (SE +/- 1019.88, N = 15)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
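Each two-compiler result in this export can be read as a relative difference and checked against the reported standard errors. The following is a minimal sketch using the FinanceBench Bonds OpenMP figures above. The helper names `relative_change` and `gap_in_standard_errors` are hypothetical, and combining the two SE values in quadrature assumes the runs are independent, which the export does not state.

```python
import math


def relative_change(baseline, other):
    """Fractional change of `other` relative to `baseline`."""
    return (other - baseline) / baseline


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Distance between two means in units of their combined standard error.

    Assumption: the two sets of runs are independent, so the errors add in quadrature.
    """
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# FinanceBench, Benchmark: Bonds OpenMP (ms, fewer is better), values from this export.
oneapi_ms, oneapi_se = 60281.31, 708.19
gcc_ms, gcc_se = 83257.94, 1019.88

change = relative_change(oneapi_ms, gcc_ms)
gap = gap_in_standard_errors(oneapi_ms, oneapi_se, gcc_ms, gcc_se)

print(f"GCC 9.3 run time relative to oneAPI: {change:+.1%}")
print(f"Gap in combined standard errors: {gap:.1f}")
```

For "More Is Better" metrics the same ratio applies, but a positive change then favors the second configuration rather than the first.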
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 389.39 (SE +/- 0.12, N = 3)
  GCC 9.3: 412.19 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
Crypto++ 8.2, Test: Keyed Algorithms (MiB/second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 608.43 (SE +/- 0.23, N = 3)
  GCC 9.3: 598.42 (SE +/- 0.86, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
YafaRay 3.5.1, Total Time For Sample Scene (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 80.10 (SE +/- 2.07, N = 12)
  GCC 9.3: 77.76 (SE +/- 1.96, N = 15)
  1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
libgav1 0.16.3, Video Input: Chimera 1080p (FPS, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 38.50 (SE +/- 0.19, N = 3)
  1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
WebP2 Image Encode 20210126, Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 189.88 (SE +/- 0.33, N = 3)
  GCC 9.3: 213.92 (SE +/- 0.40, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
NCNN 20210720, Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 14.59 (SE +/- 0.05, N = 3; MIN: 14.36 / MAX: 21.91)
  GCC 9.3: 35.85 (SE +/- 0.41, N = 3; MIN: 34.62 / MAX: 57.79; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 113.60 (SE +/- 0.14, N = 3; MIN: 112.78 / MAX: 128.04)
  GCC 9.3: 22.50 (SE +/- 0.27, N = 3; MIN: 21.54 / MAX: 62.81; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 95.00 (SE +/- 0.42, N = 3; MIN: 92.76 / MAX: 106.32)
  GCC 9.3: 23.34 (SE +/- 0.18, N = 3; MIN: 22.23 / MAX: 75.49; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 108.49 (SE +/- 0.73, N = 3; MIN: 106.38 / MAX: 119)
  GCC 9.3: 26.42 (SE +/- 0.74, N = 3; MIN: 24.26 / MAX: 43.04; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: alexnet (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 73.53 (SE +/- 0.19, N = 3; MIN: 72.7 / MAX: 85.89)
  GCC 9.3: 9.07 (SE +/- 0.61, N = 3; MIN: 7.84 / MAX: 27.74; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: resnet18 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 75.12 (SE +/- 0.31, N = 3; MIN: 74.2 / MAX: 92.38)
  GCC 9.3: 13.25 (SE +/- 0.41, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: vgg16 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 202.30 (SE +/- 1.68, N = 3; MIN: 195.35 / MAX: 219.18)
  GCC 9.3: 28.97 (SE +/- 1.90, N = 3; MIN: 25.85 / MAX: 52.73; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: googlenet (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 63.82 (SE +/- 0.15, N = 3; MIN: 63.17 / MAX: 76.37)
  GCC 9.3: 21.06 (SE +/- 0.02, N = 3; MIN: 20.2 / MAX: 92.44; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: blazeface (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 4.82 (SE +/- 0.01, N = 3; MIN: 4.7 / MAX: 6.74)
  GCC 9.3: 6.77 (SE +/- 0.06, N = 3; MIN: 6.37 / MAX: 51.55; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 19.94 (SE +/- 1.31, N = 3; MIN: 18.2 / MAX: 40.6)
  GCC 9.3: 14.14 (SE +/- 0.04, N = 3; MIN: 13.35 / MAX: 34.49; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: mnasnet (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 11.44 (SE +/- 0.00, N = 3; MIN: 11.33 / MAX: 11.98)
  GCC 9.3: 11.05 (SE +/- 0.01, N = 3; MIN: 10.47 / MAX: 19.28; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 11.75 (SE +/- 0.01, N = 3; MIN: 11.6 / MAX: 12.28)
  GCC 9.3: 11.23 (SE +/- 0.06, N = 3; MIN: 10.84 / MAX: 55.99; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 10.02 (SE +/- 0.00, N = 3; MIN: 9.87 / MAX: 10.62)
  GCC 9.3: 11.20 (SE +/- 0.12, N = 3; MIN: 10.61 / MAX: 17.17; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 11.89 (SE +/- 0.01, N = 3; MIN: 11.74 / MAX: 12.68)
  GCC 9.3: 11.51 (SE +/- 0.05, N = 3; MIN: 10.88 / MAX: 45.76; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
NCNN 20210720, Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 46.79 (SE +/- 0.22, N = 3; MIN: 45.73 / MAX: 62.24)
  GCC 9.3: 20.84 (SE +/- 0.23, N = 3; MIN: 19.62 / MAX: 212.02; additional options: -lgomp -lpthread)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -pthread
VP9 libvpx Encoding 1.10.0, Speed: Speed 0 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 3.40 (SE +/- 0.03, N = 3)
  GCC 9.3: 3.07 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
libgav1 0.16.3, Video Input: Summer Nature 4K (FPS, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 21.07 (SE +/- 0.12, N = 3)
  1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 166251925 (SE +/- 2171881.50, N = 3)
  GCC 9.3: 169366836 (SE +/- 1483247.98, N = 3)
InfluxDB 1.8.2, Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 813674.4 (SE +/- 3114.75, N = 3)
  GCC 9.3: 728618.4 (SE +/- 1618.92, N = 3)
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 2656.7 (SE +/- 1.35, N = 3)
  GCC 9.3: 2749.0 (SE +/- 2.00, N = 15)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 44.3 (SE +/- 0.67, N = 3)
  GCC 9.3: 47.0 (SE +/- 0.56, N = 15)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
VP9 libvpx Encoding 1.10.0, Speed: Speed 5 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 6.59 (SE +/- 0.03, N = 3)
  GCC 9.3: 5.63 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -m64 -lm -lpthread -O3 -march=native -fPIC -U_FORTIFY_SOURCE -std=gnu++11
InfluxDB 1.8.2, Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 (val/sec, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 1217485.3 (SE +/- 2230.96, N = 3)
  GCC 9.3: 1162331.3 (SE +/- 7744.78, N = 3)
Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 7.40 (SE +/- 0.00, N = 3)
  GCC 9.3: 7.31 (SE +/- 0.01, N = 3)
  Additional per-result options: -lpthread
  1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
GnuPG 2.2.27, 2.7GB Sample File Encryption (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 78.54 (SE +/- 0.54, N = 3)
  GCC 9.3: 78.57 (SE +/- 0.16, N = 3)
  1. (CC) gcc options: -O3 -march=native
libgav1 0.16.3, Video Input: Summer Nature 1080p (FPS, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 46.06 (SE +/- 0.17, N = 3)
  1. (CXX) g++ options: -O3 -march=native -lpthread -lrt
Xmrig 6.12.1, Variant: Monero - Hash Count: 1M (H/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 26057.3 (SE +/- 270.05, N = 8)
  GCC 9.3: 26441.1 (SE +/- 169.85, N = 3)
  Additional per-result options: -funroll-loops -static-libgcc -static-libstdc++
  1. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Crypto++ 8.2, Test: Unkeyed Algorithms (MiB/second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 380.44 (SE +/- 0.19, N = 3)
  GCC 9.3: 381.48 (SE +/- 1.24, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
LZ4 Compression 1.9.3, Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 7185.3 (SE +/- 7.48, N = 3)
  GCC 9.3: 7393.8 (SE +/- 0.72, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 43.38 (SE +/- 0.11, N = 3)
  GCC 9.3: 45.55 (SE +/- 0.19, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 7192.7 (SE +/- 28.83, N = 3)
  GCC 9.3: 7423.6 (SE +/- 39.65, N = 3)
  1. (CC) gcc options: -O3
LZ4 Compression 1.9.3, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 44.36 (SE +/- 0.26, N = 3)
  GCC 9.3: 46.81 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -O3
PJSIP 2.11, Method: INVITE (Responses Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 3199 (SE +/- 38.00, N = 3)
  GCC 9.3: 2752 (SE +/- 35.41, N = 3)
  1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
PJSIP 2.11, Method: OPTIONS, Stateful (Responses Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 4559 (SE +/- 18.45, N = 3)
  GCC 9.3: 3897 (SE +/- 29.17, N = 3)
  1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
SQLite Speedtest 3.30, Timed Time - Size 1,000 (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 64.42 (SE +/- 0.15, N = 3)
  GCC 9.3: 61.14 (SE +/- 0.15, N = 3)
  1. (CC) gcc options: -O3 -march=native -ldl -lz -lpthread
Tungsten Renderer 0.2.2, Scene: Volumetric Caustic (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 11.14 (SE +/- 0.37, N = 15)
  GCC 9.3: 12.96 (SE +/- 0.56, N = 15)
  Additional per-result options: -fstrict-aliasing
  1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
John The Ripper 1.9.0-jumbo-1, Test: MD5 (Real C/S, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 184846 (SE +/- 153.78, N = 3)
  GCC 9.3: 10158000 (SE +/- 36473.73, N = 3)
  Additional per-result options: -fopenmp
  1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
PJSIP 2.11, Method: OPTIONS, Stateless (Responses Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 40962 (SE +/- 601.66, N = 3)
  GCC 9.3: 41775 (SE +/- 632.78, N = 3)
  1. (CC) gcc options: -lSDL2 -lavformat -lavcodec -lswscale -lavutil -lstdc++ -lssl -lcrypto -luuid -lm -lrt -lpthread -lasound -O3 -march=native
FinanceBench 2016-07-25, Benchmark: Repo OpenMP (ms, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 38237.47 (SE +/- 147.68, N = 3)
  GCC 9.3: 44139.00 (SE +/- 143.23, N = 3)
  1. (CXX) g++ options: -O3 -march=native -fopenmp
Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 2551.8 (SE +/- 1.00, N = 3)
  GCC 9.3: 2656.1 (SE +/- 15.68, N = 3)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 80.9 (SE +/- 0.49, N = 3)
  GCC 9.3: 83.5 (SE +/- 0.61, N = 3)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 43.86 (SE +/- 0.22, N = 3)
  GCC 9.3: 40.73 (SE +/- 0.05, N = 3)
  1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
x265 3.4, Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 15.52 (SE +/- 0.21, N = 3)
  GCC 9.3: 13.31 (SE +/- 0.09, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma
Tungsten Renderer 0.2.2, Scene: Water Caustic (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 29.51 (SE +/- 0.20, N = 3)
  GCC 9.3: 31.84 (SE +/- 0.13, N = 3)
  Additional per-result options: -fstrict-aliasing
  1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Timed MAFFT Alignment 7.471, Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 17.08 (SE +/- 0.17, N = 3)
  GCC 9.3: 16.13 (SE +/- 0.33, N = 12)
  1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 15.38 (SE +/- 0.07, N = 3)
  GCC 9.3: 14.50 (SE +/- 0.03, N = 3)
  Additional per-result options: -lpthread
  1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 3204.4 (SE +/- 17.52, N = 3)
  GCC 9.3: 3299.4 (SE +/- 12.55, N = 3)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
Zstd Compression 1.5.0, Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 232.7 (SE +/- 1.95, N = 3)
  GCC 9.3: 288.6 (SE +/- 2.75, N = 3)
  1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma
libavif avifenc 0.9.0, Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 33.57 (SE +/- 0.34, N = 3)
  GCC 9.3: 36.85 (SE +/- 0.32, N = 3)
  1. (CXX) g++ options: -O3 -fPIC -lm
AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 36.05 (SE +/- 0.14, N = 3)
  GCC 9.3: 33.87 (SE +/- 0.03, N = 3)
  1. (CC) gcc options: -lm -O3 -march=native
Botan 2.17.3, Test: AES-256 - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 5677.24 (SE +/- 3.33, N = 3)
  GCC 9.3: 4979.66 (SE +/- 3.16, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: AES-256 (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 5614.28 (SE +/- 55.64, N = 3)
  GCC 9.3: 4979.24 (SE +/- 3.24, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Tungsten Renderer 0.2.2, Scene: Hair (Seconds, Fewer Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 6.02119 (SE +/- 0.07129, N = 15)
  GCC 9.3: 6.38637 (SE +/- 0.07248, N = 15)
  Additional per-result options: -fstrict-aliasing
  1. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
QuantLib 1.21 (MFLOPS, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 2659.0 (SE +/- 4.35, N = 3)
  GCC 9.3: 2422.5 (SE +/- 6.01, N = 3)
  1. (CXX) g++ options: -O3 -march=native -rdynamic
Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 2347780.43 (SE +/- 18857.12, N = 3)
  GCC 9.3: 2417894.89 (SE +/- 1735.32, N = 3)
  1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt
Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 866.95 (SE +/- 3.68, N = 3)
  GCC 9.3: 685.53 (SE +/- 4.35, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: ChaCha20Poly1305 (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 865.87 (SE +/- 1.09, N = 3)
  GCC 9.3: 679.57 (SE +/- 4.19, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Blowfish - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 332.68 (SE +/- 1.63, N = 3)
  GCC 9.3: 361.43 (SE +/- 0.21, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Blowfish (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 320.93 (SE +/- 3.06, N = 3)
  GCC 9.3: 364.91 (SE +/- 3.94, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Twofish - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 280.69 (SE +/- 0.12, N = 3)
  GCC 9.3: 304.85 (SE +/- 0.20, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Twofish (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 286.01 (SE +/- 0.10, N = 3)
  GCC 9.3: 307.57 (SE +/- 0.15, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
John The Ripper 1.9.0-jumbo-1, Test: Blowfish (Real C/S, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 1441
  GCC 9.3: 116824
  SE +/- 236.74, N = 3 (only one SE entry is listed for this test)
  Additional per-result options: -fopenmp
  1. (CC) gcc options: -m64 -lssl -lcrypto -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Aircrack-ng 1.5.2 (k/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 210560.41 (SE +/- 494.94, N = 3)
  GCC 9.3: 209532.45 (SE +/- 610.57, N = 3)
  1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Botan 2.17.3, Test: CAST-256 - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 114.86 (SE +/- 0.02, N = 3)
  GCC 9.3: 114.33 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: CAST-256 (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 115.29 (SE +/- 0.23, N = 3)
  GCC 9.3: 113.05 (SE +/- 1.11, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: KASUMI - Decrypt (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 76.31 (SE +/- 0.04, N = 3)
  GCC 9.3: 72.11 (SE +/- 0.02, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: KASUMI (MiB/s, More Is Better)
  Intel oneAPI DPC++ Compiler 2021.3: 76.04 (SE +/- 0.81, N = 3)
  GCC 9.3: 72.39 (SE +/- 0.80, N = 3)
  1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Google SynthMark 20201109, Test: VoiceMark_100 (Voices, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 534.86 / 546.49; SE +/- 6.01, N = 3 / SE +/- 0.47, N = 3. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
Xmrig 6.12.1, Variant: Wownero - Hash Count: 1M (H/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 42072.8 / 42035.5; SE +/- 156.73, N = 3 / SE +/- 145.44, N = 3. Additional flags listed: -funroll-loops -static-libgcc -static-libstdc++. (CXX) g++ options: -O3 -march=native -fexceptions -fno-rtti -maes -Ofast -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
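Each result reports a standard error (SE) over N runs, so a small gap between the two values may be within run-to-run noise. The sketch below expresses the gap in the Xmrig result above as a multiple of the combined standard error. It is a rough heuristic rather than a formal significance test, the function name is hypothetical, and it assumes the two runs are independent.

```python
import math


def gap_in_se_units(a: float, se_a: float, b: float, se_b: float) -> float:
    """Return |a - b| divided by the combined standard error.

    The combined SE is sqrt(se_a^2 + se_b^2), which assumes the two
    measurements are independent (an assumption, not stated in the export).
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(a - b) / combined_se


# Values copied from the Xmrig Wownero result (H/s, More Is Better).
XMRIG_GAP = gap_in_se_units(42072.8, 156.73, 42035.5, 145.44)

if __name__ == "__main__":
    # A value well below 1 suggests the gap is smaller than the noise.
    print(f"Xmrig gap: {XMRIG_GAP:.2f} combined standard errors")
```

A ratio well below 1, as here, suggests the two Xmrig values cannot be told apart at this sample size; a ratio of several units points to a difference that is unlikely to be noise.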
libavif avifenc 0.9.0, Encoder Speed: 10, Lossless (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 8.455 / 8.722; SE +/- 0.060, N = 3 / SE +/- 0.133, N = 15. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc 0.9.0, Encoder Speed: 10 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 4.781 / 5.109; SE +/- 0.060, N = 15 / SE +/- 0.069, N = 15. (CXX) g++ options: -O3 -fPIC -lm
Kvazaar 2.0, Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 28.14 / 27.19; SE +/- 0.31, N = 3 / SE +/- 0.11, N = 3. Additional flags listed: -lpthread. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O3 -march=native -lm -lrt
WebP2 Image Encode 20210126, Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 7.077 / 6.706; SE +/- 0.089, N = 4 / SE +/- 0.116, N = 14. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux
SVT-HEVC 1.5.0, Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 32.50 / 31.32; SE +/- 0.08, N = 3 / SE +/- 0.26, N = 3. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless (Encode Time - Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 20.92 / 19.05; SE +/- 0.17, N = 3 / SE +/- 0.17, N = 3. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
Liquid-DSP 2021.01.31, Threads: 160 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 4166766667 / 3167466667; SE +/- 13561014.38, N = 3 / SE +/- 13945289.93, N = 3. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31, Threads: 128 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 4197466667 / 3422600000; SE +/- 12898621.80, N = 3 / SE +/- 9990161.83, N = 3. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31, Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 3627366667 / 3232100000; SE +/- 7338104.51, N = 3 / SE +/- 19736514.38, N = 3. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
Liquid-DSP 2021.01.31, Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 1930066667 / 1723700000; SE +/- 6519798.91, N = 3 / SE +/- 5919459.43, N = 3. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid
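The four Liquid-DSP results above differ only in thread count, so the gap between the two values can be compared across 32, 64, 128 and 160 threads. The sketch below computes how far the first value is ahead of the second, in percent, for each thread count. Pairing the first value with Intel oneAPI DPC++ Compiler 2021.3 and the second with GCC 9.3 is an assumption based on the legend order, and the function names are hypothetical.

```python
# Liquid-DSP samples/s values copied from the four results above
# (Buffer Length: 256, Filter Length: 57, More Is Better).
# Assumption: the first value of each pair belongs to the first legend
# entry and the second value to the second legend entry.
LIQUID_DSP = {
    32: (1930066667, 1723700000),
    64: (3627366667, 3232100000),
    128: (4197466667, 3422600000),
    160: (4166766667, 3167466667),
}


def percent_lead(first: float, second: float) -> float:
    """Return how far `first` is ahead of `second`, in percent."""
    return (first / second - 1.0) * 100.0


def summarize(results: dict) -> dict:
    """Map each thread count to the lead of the first value, rounded to 0.1%."""
    return {
        threads: round(percent_lead(first, second), 1)
        for threads, (first, second) in results.items()
    }


if __name__ == "__main__":
    for threads, lead in summarize(LIQUID_DSP).items():
        print(f"{threads:>3} threads: first value ahead by {lead}%")
```

Because this benchmark is "More Is Better", a positive lead favors the first value; for "Fewer Is Better" results the ratio would need to be inverted before comparing.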
toyBrot Fractal Generator 2020-11-18, Implementation: TBB (ms, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 6562 / 4620; SE +/- 89.55, N = 14 / SE +/- 56.86, N = 5. Additional flags listed: -lm -lgcc -lgcc_s -lc. (CXX) g++ options: -O3 -march=native -lpthread
Tachyon 0.99b6, Total Time (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 14.79 / 13.80; SE +/- 0.22, N = 4 / SE +/- 0.05, N = 3. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
Basis Universal 1.13, Settings: UASTC Level 3 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 16.19 / 16.12; SE +/- 0.11, N = 3 / SE +/- 0.14, N = 3. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Basis Universal 1.13, Settings: UASTC Level 2 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 13.33 / 12.70; SE +/- 0.01, N = 3 / SE +/- 0.16, N = 3. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 15.023 / 7.989; SE +/- 0.097, N = 3 / SE +/- 0.096, N = 3. (CC) gcc options: -lm -lpthread -O3 -march=native
POV-Ray 3.7.0.7, Trace Time (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 9.025 / 9.314; SE +/- 0.121, N = 3 / SE +/- 0.149, N = 3. Additional flags listed: -xHost. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Basis Universal 1.13, Settings: UASTC Level 0 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 10.084 / 9.315; SE +/- 0.038, N = 3 / SE +/- 0.137, N = 4. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 25.87 / 27.59; SE +/- 0.34, N = 3 / SE +/- 0.35, N = 4. (CC) gcc options: -O3 -march=native -fopenmp
LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 10.535 / 8.964; SE +/- 0.142, N = 3 / SE +/- 0.136, N = 3. Additional flags listed: -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe. (CC) gcc options: -O3 -march=native -lncurses -lm
toyBrot Fractal Generator 2020-11-18, Implementation: C++ Tasks (ms, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 7766 / 5305; SE +/- 125.31, N = 3 / SE +/- 60.78, N = 6. Additional flags listed: -lm -lgcc -lgcc_s -lc. (CXX) g++ options: -O3 -march=native -lpthread
Google Draco 1.4.1, Model: Church Facade (ms, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 7162 / 6620; SE +/- 10.17, N = 3 / SE +/- 7.69, N = 3. (CXX) g++ options: -O3 -march=native
WebP Image Encode 1.1, Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 7.363 / 8.207; SE +/- 0.120, N = 3 / SE +/- 0.089, N = 3. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg
Google Draco 1.4.1, Model: Lion (ms, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 5884 / 5386; SE +/- 15.30, N = 3 (only one SE reported). (CXX) g++ options: -O3 -march=native
toyBrot Fractal Generator 2020-11-18, Implementation: C++ Threads (ms, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 6805 / 4740; SE +/- 111.86, N = 3 / SE +/- 70.34, N = 4. Additional flags listed: -lm -lgcc -lgcc_s -lc. (CXX) g++ options: -O3 -march=native -lpthread
SVT-HEVC 1.5.0, Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 204.09 / 174.05; SE +/- 0.64, N = 3 / SE +/- 2.16, N = 3. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
SVT-HEVC 1.5.0, Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 304.17 / 266.96; SE +/- 4.07, N = 3 / SE +/- 1.65, N = 3. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
Tungsten Renderer 0.2.2, Scene: Non-Exponential (Seconds, Fewer Is Better). Intel oneAPI DPC++ Compiler 2021.3 / GCC 9.3: 2.72185 / 5.09923; SE +/- 0.01979, N = 3 / SE +/- 0.02612, N = 3. Additional flags listed: -fstrict-aliasing. (CXX) g++ options: -O3 -march=native -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Phoronix Test Suite v10.8.4