opencl-subset-IrisPro580-20211109-2

Intel UHD Graphics 750 (RKL, Gen12, GT1, 32 EUs) testing with a Dell XPS 8940 (0K3CM7 motherboard, 2.3.0 BIOS) and Intel Core i7-11700K on Ubuntu 20.04.3 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2111121-TJ-2111095TJ59
System Details

Intel Iris Pro 580
  Processor: Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads)
  Motherboard: Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS)
  Chipset: Intel Xeon E3-1200 v5/E3-1500
  Memory: 32GB
  Disk: Samsung SSD 950 PRO 256GB
  Graphics: Intel Iris Pro 580 SKL GT4 3GB (950MHz)
  Audio: Realtek ALC233
  Monitor: DELL P2415Q
  Network: Intel I219-LM + Intel 8260
  OS: Ubuntu 20.04
  Kernel: 5.14.0-1007-oem (x86_64)
  Desktop: GNOME Shell 3.36.9
  Display Server: X Server 1.20.11
  OpenGL: 4.6 Mesa 21.0.3
  OpenCL: OpenCL 2.1
  Vulkan: 1.2.145
  Compiler: GCC 10.3.0
  File-System: ext4
  Screen Resolution: 3840x2160

Intel UHD 750 (only the fields the export lists for this system)
  Processor: Intel Core i7-11700K @ 5.00GHz (8 Cores / 16 Threads)
  Motherboard: Dell 0K3CM7 (2.3.0 BIOS)
  Chipset: Intel Comet Lake PCH
  Memory: 16GB
  Disk: SK hynix BC511 NVMe 256GB + 4001GB Seagate ST4000DM004-2CV1 + 1000GB CT1000MX500SSD1
  Graphics: Intel RKL GT1 3GB (1300MHz)
  Audio: Realtek ALC3861
  Network: Realtek Device 2600 + Intel Wi-Fi 6 AX201
  OpenCL: OpenCL 3.0

Kernel Details
  Transparent Huge Pages: madvise

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  Intel Iris Pro 580: Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1
  Intel UHD 750: Scaling Governor: intel_pstate performance - CPU Microcode: 0x40 - Thermald 1.9.1

Security Details

Intel Iris Pro 580
  itlb_multihit: KVM: Mitigation of VMX disabled
  l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable
  mds: Mitigation of Clear buffers; SMT vulnerable
  meltdown: Mitigation of PTI
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
  srbds: Mitigation of Microcode
  tsx_async_abort: Mitigation of TSX disabled

Intel UHD 750
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
Results Overview

Test                                              Intel Iris Pro 580   Intel UHD 750
shoc: OpenCL - Triad                              17.1419              15.7544
shoc: OpenCL - FFT SP                             35.1111              77.8145
shoc: OpenCL - MD5 Hash                           0.9402               0.7041
shoc: OpenCL - Max SP Flops                       3767.70              177085
shoc: OpenCL - Bus Speed Download                 27.2748              40.0972
shoc: OpenCL - Bus Speed Readback                 26.8171              41.7889
shoc: OpenCL - Texture Read Bandwidth             127.707              82.2610
darktable: Boat - OpenCL                          54.753               14.472
darktable: Masskrug - OpenCL                      10.243               4.806
darktable: Server Rack - OpenCL                   0.627                0.193
darktable: Server Room - OpenCL                   6.485                3.893
smallpt-gpu: GPU - 3840 x 2160 - Caustic          1636497361           1636747769
smallpt-gpu: GPU - 3840 x 2160 - Cornell          1636497485           1636747889
smallpt-gpu: GPU - 3840 x 2160 - Caustic3         1636497613           1636748012
clpeak: Kernel Latency                            42.30                18.61
clpeak: Single-Precision Float                    1068.05              661.21
clpeak: Double-Precision Double                   267.29               (no result)
clpeak: Global Memory Bandwidth                   25.77                37.30
clpeak: Transfer Bandwidth enqueueReadBuffer      10.86                14.24
clpeak: Transfer Bandwidth enqueueWriteBuffer     29.34                29.03
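The overview mixes "More Is Better" and "Fewer Is Better" tests, so raw ratios between the two columns are easy to misread. The following is a minimal sketch of a direction-aware comparison in Python. The helper names (`speedup`, `summarize`) and the choice of five sample rows are illustrative assumptions, not part of the Phoronix Test Suite; the numbers and the better-direction labels are taken from the result entries in this export.

```python
# Sample rows copied from the overview; "higher" marks More Is Better tests,
# "lower" marks Fewer Is Better tests (as stated in the individual results).
RESULTS = [
    # (test, iris_pro_580, uhd_750, better)
    ("shoc: Triad (GB/s)", 17.1419, 15.7544, "higher"),
    ("shoc: FFT SP (GFLOPS)", 35.1111, 77.8145, "higher"),
    ("darktable: Boat (s)", 54.753, 14.472, "lower"),
    ("clpeak: Kernel Latency (us)", 42.30, 18.61, "lower"),
    ("clpeak: Single-Precision Float (GFLOPS)", 1068.05, 661.21, "higher"),
]


def speedup(iris, uhd, better):
    """Ratio normalized so that a value above 1 means the UHD 750 did better."""
    return uhd / iris if better == "higher" else iris / uhd


def summarize(results):
    """Map each test name to its normalized ratio, rounded to two decimals."""
    return {name: round(speedup(a, b, better), 2) for name, a, b, better in results}


if __name__ == "__main__":
    for name, ratio in summarize(RESULTS).items():
        print(f"{name}: UHD 750 / Iris Pro 580 (normalized) = {ratio}")
```

Normalizing this way lets time-based results such as Darktable be read on the same scale as throughput results such as SHOC.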
SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL. More Is Better for every benchmark. Results are shown as value (SE, N).

Benchmark                Unit      Intel Iris Pro 580               Intel UHD 750
Triad                    GB/s      17.14 (SE +/- 0.06, N = 3)       15.75 (SE +/- 0.02, N = 3)
FFT SP                   GFLOPS    35.11 (SE +/- 0.03, N = 3)       77.81 (SE +/- 0.04, N = 3)
MD5 Hash                 GHash/s   0.9402 (SE +/- 0.0003, N = 3)    0.7041 (SE +/- 0.0000, N = 3)
Max SP Flops             GFLOPS    3767.70 (SE +/- 22.88, N = 3)    177085.00 (SE +/- 235.89, N = 3)
Bus Speed Download       GB/s      27.27 (SE +/- 0.13, N = 3)       40.10 (SE +/- 0.65, N = 15)
Bus Speed Readback       GB/s      26.82 (SE +/- 0.15, N = 3)       41.79 (SE +/- 0.46, N = 15)
Texture Read Bandwidth   GB/s      127.71 (SE +/- 0.18, N = 3)      82.26 (SE +/- 0.01, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Darktable 3.0.1
Acceleration: OpenCL. Seconds, Fewer Is Better. Results are shown as value (SE, N).

Test          Intel Iris Pro 580              Intel UHD 750
Boat          54.75 (SE +/- 0.15, N = 3)      14.47 (SE +/- 0.02, N = 3)
Masskrug      10.243 (SE +/- 0.013, N = 3)    4.806 (SE +/- 0.007, N = 3)
Server Rack   0.627 (SE +/- 0.007, N = 4)     0.193 (SE +/- 0.001, N = 3)
Server Room   6.485 (SE +/- 0.026, N = 3)     3.893 (SE +/- 0.005, N = 3)
SmallPT GPU 1.6pts1
OpenCL Device: GPU - Resolution: 3840 x 2160. Samples/sec, More Is Better. Results are shown as value (SE, N).

Scene      Intel Iris Pro 580                 Intel UHD 750
Caustic    1636497361 (SE +/- 24.25, N = 3)   1636747769 (SE +/- 23.38, N = 3)
Cornell    1636497485 (SE +/- 21.94, N = 3)   1636747889 (SE +/- 21.07, N = 3)
Caustic3   1636497613 (SE +/- 24.25, N = 3)   1636748012 (SE +/- 23.09, N = 3)

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
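The SmallPT GPU values are ten-digit integers that differ by only about a hundred counts between scenes, which is the pattern one would expect from Unix epoch timestamps and not from measured sample rates. The export itself does not say this, so treat it as an observation to verify, not a finding. A small Python sketch of the check, assuming the values are seconds since 1970-01-01 UTC (the helper name `as_utc` is illustrative):

```python
from datetime import datetime, timezone

# Reported "Samples/sec" values from the SmallPT GPU results in this export.
SMALLPT_VALUES = {
    "Intel Iris Pro 580": [1636497361, 1636497485, 1636497613],
    "Intel UHD 750": [1636747769, 1636747889, 1636748012],
}


def as_utc(value):
    """Interpret an integer as seconds since the Unix epoch (assumption) in UTC."""
    return datetime.fromtimestamp(value, tz=timezone.utc)


if __name__ == "__main__":
    for system, values in SMALLPT_VALUES.items():
        for value in values:
            print(f"{system}: {value} -> {as_utc(value).isoformat()}")
```

Under that assumption the Iris Pro 580 values decode to 2021-11-09 and the UHD 750 values to 2021-11-12, dates that line up with the `20211109` in the result file name and the `2111121` prefix of the OpenBenchmarking.org result ID. If that reading is right, the SmallPT GPU rows should not be used for performance comparison.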
clpeak
Results are shown as value (SE, N).

OpenCL Test                             Unit     Direction         Intel Iris Pro 580              Intel UHD 750
Kernel Latency                          us       Fewer Is Better   42.30 (SE +/- 1.99, N = 15)     18.61 (SE +/- 0.26, N = 3)
Single-Precision Float                  GFLOPS   More Is Better    1068.05 (SE +/- 2.30, N = 3)    661.21 (SE +/- 0.11, N = 3)
Double-Precision Double                 GFLOPS   More Is Better    267.29 (SE +/- 0.23, N = 3)     (no result)
Global Memory Bandwidth                 GBPS     More Is Better    25.77 (SE +/- 0.08, N = 3)      37.30 (SE +/- 0.10, N = 3)
Transfer Bandwidth enqueueReadBuffer    GBPS     More Is Better    10.86 (SE +/- 0.03, N = 3)      14.24 (SE +/- 0.02, N = 3)
Transfer Bandwidth enqueueWriteBuffer   GBPS     More Is Better    29.34 (SE +/- 0.06, N = 3)      29.03 (SE +/- 0.03, N = 3)

1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4