Intel oneAPI DPC++ Compiler

Intel Core i9-11900K testing with an ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Ubuntu 21.04 via the Phoronix Test Suite.

Intel oneAPI DPC++ 2021.3:

  Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Device 43ef, Memory: 32GB, Disk: 2000GB Corsair Force MP600, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Device 43c8, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Device 2725

  OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.0.1 (LLVM 11.0.1), OpenCL: OpenCL 2.1 LINUX, Vulkan: 1.2.145, Compiler: Intel oneAPI DPC++/C++ Compiler 2021.3.0 (2021.3.0.20210619) + ICC, File-System: ext4, Screen Resolution: 3840x2160

GCC 10.3:

  Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads), Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS), Chipset: Intel Device 43ef, Memory: 32GB, Disk: 2000GB Corsair Force MP600, Graphics: AMD Radeon VII 16GB (1801/1000MHz), Audio: Intel Device 43c8, Monitor: ASUS MG28U, Network: 2 x Intel I225-V + Intel Device 2725

  OS: Ubuntu 21.04, Kernel: 5.11.0-25-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 21.0.1 (LLVM 11.0.1), Vulkan: 1.2.145, Compiler: GCC 10.3.0, File-System: ext4, Screen Resolution: 3840x2160

QuantLib 1.21
MFLOPS > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 4038.9
GCC 10.3 .................. 3518.3

Crypto++ 8.2
Test: Keyed Algorithms
MiB/second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 965.32
GCC 10.3 .................. 940.63

Crypto++ 8.2
Test: Unkeyed Algorithms
MiB/second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 518.19
GCC 10.3 .................. 504.90

Timed MAFFT Alignment 7.471
Multiple Sequence Alignment - LSU RNA
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 8.150
GCC 10.3 .................. 7.641

WebP Image Encode 1.1
Encode Settings: Quality 100, Lossless
Encode Time - Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 13.80
GCC 10.3 .................. 12.57

WebP Image Encode 1.1
Encode Settings: Quality 100, Highest Compression
Encode Time - Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 4.590
GCC 10.3 .................. 5.193

WebP Image Encode 1.1
Encode Settings: Quality 100, Lossless, Highest Compression
Encode Time - Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 29.82
GCC 10.3 .................. 26.76

Xmrig 6.12.1
Variant: Monero - Hash Count: 1M
H/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 2203.5
GCC 10.3 .................. 2276.8

Xmrig 6.12.1
Variant: Wownero - Hash Count: 1M
H/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 3759.4
GCC 10.3 .................. 4020.1

LZ4 Compression 1.9.3
Compression Level: 3 - Compression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 71.58
GCC 10.3 .................. 73.88

LZ4 Compression 1.9.3
Compression Level: 3 - Decompression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 11183.8
GCC 10.3 .................. 11293.0

LZ4 Compression 1.9.3
Compression Level: 9 - Compression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 69.25
GCC 10.3 .................. 72.42

LZ4 Compression 1.9.3
Compression Level: 9 - Decompression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 11183.8
GCC 10.3 .................. 11289.4

Zstd Compression 1.5.0
Compression Level: 19 - Compression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 35.1
GCC 10.3 .................. 35.1

Zstd Compression 1.5.0
Compression Level: 19 - Decompression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 4496.8
GCC 10.3 .................. 4281.3

Zstd Compression 1.5.0
Compression Level: 8, Long Mode - Compression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 293.4
GCC 10.3 .................. 314.2

Zstd Compression 1.5.0
Compression Level: 8, Long Mode - Decompression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 5504.8
GCC 10.3 .................. 5258.9

Zstd Compression 1.5.0
Compression Level: 19, Long Mode - Compression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 32.4
GCC 10.3 .................. 33.0

Zstd Compression 1.5.0
Compression Level: 19, Long Mode - Decompression Speed
MB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 4556.1
GCC 10.3 .................. 4386.0

Botan 2.17.3
Test: KASUMI
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 121.68
GCC 10.3 .................. 113.47

Botan 2.17.3
Test: KASUMI - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 120.13
GCC 10.3 .................. 112.13

Botan 2.17.3
Test: AES-256
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 8897.22
GCC 10.3 .................. 8178.06

Botan 2.17.3
Test: AES-256 - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 8849.00
GCC 10.3 .................. 8242.75

Botan 2.17.3
Test: Twofish
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 444.30
GCC 10.3 .................. 493.78

Botan 2.17.3
Test: Twofish - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 434.17
GCC 10.3 .................. 491.01

Botan 2.17.3
Test: Blowfish
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 514.22
GCC 10.3 .................. 604.90

Botan 2.17.3
Test: Blowfish - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 540.48
GCC 10.3 .................. 598.89

Botan 2.17.3
Test: CAST-256
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 180.53
GCC 10.3 .................. 177.26

Botan 2.17.3
Test: CAST-256 - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 181.16
GCC 10.3 .................. 177.30

Botan 2.17.3
Test: ChaCha20Poly1305
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 1378.37
GCC 10.3 .................. 972.90

Botan 2.17.3
Test: ChaCha20Poly1305 - Decrypt
MiB/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 1373.80
GCC 10.3 .................. 962.27

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Medium
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 7.07
GCC 10.3 .................. 6.55

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Very Fast
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 20.11
GCC 10.3 .................. 18.14

Kvazaar 2.0
Video Input: Bosphorus 4K - Video Preset: Ultra Fast
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 35.09
GCC 10.3 .................. 32.96

SVT-HEVC 1.5.0
Tuning: 1 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 9.68
GCC 10.3 .................. 9.37

SVT-HEVC 1.5.0
Tuning: 7 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 144.12
GCC 10.3 .................. 138.54

SVT-HEVC 1.5.0
Tuning: 10 - Input: Bosphorus 1080p
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 283.06
GCC 10.3 .................. 272.36

x265 3.4
Video Input: Bosphorus 4K
Frames Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 16.07
GCC 10.3 .................. 15.77

ACES DGEMM 1.0
Sustained Floating-Point Rate
GFLOP/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 4.271913
GCC 10.3 .................. 3.797649

Coremark 1.0
CoreMark Size 666 - Iterations Per Second
Iterations/Sec > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 419476.15
GCC 10.3 .................. 438737.03

asmFish 2018-07-23
1024 Hash Memory, 26 Depth
Nodes/second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 31278710
GCC 10.3 .................. 31755304

PJSIP 2.11
Method: INVITE
Responses Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 5113
GCC 10.3 .................. 5208

PJSIP 2.11
Method: OPTIONS, Stateful
Responses Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 9522
GCC 10.3 .................. 9613

PJSIP 2.11
Method: OPTIONS, Stateless
Responses Per Second > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 55669
GCC 10.3 .................. 55235

libavif avifenc 0.9.0
Encoder Speed: 10
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 2.607
GCC 10.3 .................. 2.692

libavif avifenc 0.9.0
Encoder Speed: 6, Lossless
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 46.61
GCC 10.3 .................. 51.37

libavif avifenc 0.9.0
Encoder Speed: 10, Lossless
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 4.579
GCC 10.3 .................. 4.761

C-Ray 1.1
Total Time - 4K, 16 Rays Per Pixel
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 82.76
GCC 10.3 .................. 46.88

POV-Ray 3.7.0.7
Trace Time
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 35.51
GCC 10.3 .................. 38.43

YafaRay 3.5.1
Total Time For Sample Scene
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 120.04
GCC 10.3 .................. 121.00

AOBench
Size: 2048 x 2048 - Total Time
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 22.99
GCC 10.3 .................. 21.69

WebP2 Image Encode 20210126
Encode Settings: Quality 95, Compression Effort 7
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 293.87
GCC 10.3 .................. 345.78

WebP2 Image Encode 20210126
Encode Settings: Quality 100, Compression Effort 5
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 9.292
GCC 10.3 .................. 9.381

WebP2 Image Encode 20210126
Encode Settings: Quality 100, Lossless Compression
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 587.80
GCC 10.3 .................. 617.02

Google SynthMark 20201109
Test: VoiceMark_100
Voices > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 924.61

Aircrack-ng 1.5.2
k/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 36884.52
GCC 10.3 .................. 36691.07

Liquid-DSP 2021.01.31
Threads: 4 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 402603333
GCC 10.3 .................. 350700000

Liquid-DSP 2021.01.31
Threads: 8 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 750953333
GCC 10.3 .................. 655346667

Liquid-DSP 2021.01.31
Threads: 16 - Buffer Length: 256 - Filter Length: 57
samples/s > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 872933333
GCC 10.3 .................. 698133333

FinanceBench 2016-07-25
Benchmark: Repo OpenMP
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 27148.78
GCC 10.3 .................. 27297.13

FinanceBench 2016-07-25
Benchmark: Bonds OpenMP
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 41862.91
GCC 10.3 .................. 41581.31

SQLite Speedtest 3.30
Timed Time - Size 1,000
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 41.99
GCC 10.3 .................. 41.15

Google Draco 1.4.1
Model: Lion
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 3915
GCC 10.3 .................. 3569

Google Draco 1.4.1
Model: Church Facade
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 5533
GCC 10.3 .................. 5161

NCNN 20210720
Target: CPU - Model: mobilenet
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 31.01
GCC 10.3 .................. 13.26

NCNN 20210720
Target: CPU-v2-v2 - Model: mobilenet-v2
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 7.74
GCC 10.3 .................. 3.66

NCNN 20210720
Target: CPU-v3-v3 - Model: mobilenet-v3
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 6.28
GCC 10.3 .................. 2.92

NCNN 20210720
Target: CPU - Model: shufflenet-v2
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 7.09
GCC 10.3 .................. 2.61

NCNN 20210720
Target: CPU - Model: mnasnet
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 6.93
GCC 10.3 .................. 2.68

NCNN 20210720
Target: CPU - Model: efficientnet-b0
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 11.84
GCC 10.3 .................. 4.75

NCNN 20210720
Target: CPU - Model: blazeface
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 2.35
GCC 10.3 .................. 1.18

NCNN 20210720
Target: CPU - Model: googlenet
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 39.95
GCC 10.3 .................. 11.17

NCNN 20210720
Target: CPU - Model: vgg16
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 117.71
GCC 10.3 .................. 54.66

NCNN 20210720
Target: CPU - Model: resnet18
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 44.88
GCC 10.3 .................. 12.22

NCNN 20210720
Target: CPU - Model: alexnet
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 43.71
GCC 10.3 .................. 10.31

NCNN 20210720
Target: CPU - Model: resnet50
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 65.73
GCC 10.3 .................. 20.45

NCNN 20210720
Target: CPU - Model: yolov4-tiny
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 59.91
GCC 10.3 .................. 20.38

NCNN 20210720
Target: CPU - Model: squeezenet_ssd
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 72.11
GCC 10.3 .................. 16.39

NCNN 20210720
Target: CPU - Model: regnety_400m
ms < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 8.90
GCC 10.3 .................. 6.43

GnuPG 2.2.27
2.7GB Sample File Encryption
Seconds < Lower Is Better
Intel oneAPI DPC++ 2021.3 . 48.51
GCC 10.3 .................. 48.79

InfluxDB 1.8.2
Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
val/sec > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 2191345.5
GCC 10.3 .................. 2201616.9

InfluxDB 1.8.2
Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000
val/sec > Higher Is Better
Intel oneAPI DPC++ 2021.3 . 2228673.5
GCC 10.3 .................. 2239830.5
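
The results above mix "Higher Is Better" and "Lower Is Better" metrics, so raw values cannot be compared in one direction. The following is a minimal sketch, not part of the Phoronix Test Suite output, of one way to express each pair as a percentage advantage for the Intel oneAPI DPC++ build. The function name `relative_gain` and the choice of three sample tests are assumptions made for illustration; the numbers are copied from the results above.

```python
def relative_gain(intel: float, gcc: float, higher_is_better: bool) -> float:
    """Percentage by which the Intel build leads the GCC build.

    Positive: Intel oneAPI DPC++ is ahead. Negative: GCC is ahead.
    (Hypothetical helper, not a Phoronix Test Suite function.)
    """
    if higher_is_better:
        ratio = intel / gcc   # throughput: larger value is faster
    else:
        ratio = gcc / intel   # timings: smaller value is faster, so invert
    return (ratio - 1.0) * 100.0


# (test name, Intel result, GCC result, higher_is_better), values from the results above
samples = [
    ("QuantLib 1.21 (MFLOPS)", 4038.9, 3518.3, True),
    ("C-Ray 1.1 (Seconds)", 82.76, 46.88, False),
    ("POV-Ray 3.7.0.7 (Seconds)", 35.51, 38.43, False),
]

for name, intel, gcc, higher in samples:
    print(f"{name}: {relative_gain(intel, gcc, higher):+.1f}%")
```

Inverting the ratio for timing metrics keeps the sign convention the same for every test, which makes it possible to sort or average the results without tracking the direction of each metric separately.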