opencl-subset-IrisPro580-20211109-2 Intel Iris 540 (SKL, Gen9, GT3e) testing with an Intel NUC (Swift Canyon, 6i5SYB, 0073 BIOS) and Intel Core i5-6260U on Ubnutu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111113-TJ-2111095TJ17&grs .
opencl-subset-IrisPro580-20211109-2 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel Iris Pro 580 Intel Iris 540 Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads) Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB Samsung SSD 950 PRO 256GB Intel Iris Pro 580 SKL GT4 3GB (950MHz) Realtek ALC233 DELL P2415Q Intel I219-LM + Intel 8260 Ubuntu 20.04 5.14.0-1007-oem (x86_64) GNOME Shell 3.36.9 X Server 1.20.11 4.6 Mesa 21.0.3 OpenCL 2.1 1.2.145 GCC 10.3.0 ext4 3840x2160 Intel Core i5-6260U @ 2.90GHz (2 Cores / 4 Threads) Intel NUC6i5SYB (SYSKLi35.86A.0073.2020.0909.1625 BIOS) 8GB 256GB INTEL SSDPEKKW256G8 Intel Iris 540 (950MHz) Realtek ALC283 S2719DGF Intel I219-V + Intel 8260 Ubuntu 21.04 5.11.0-40-generic (x86_64) GNOME Shell 3.38.4 X Server + Wayland OpenCL 3.0 GCC 10.3.0 + Clang 12.0.0-3ubuntu1~21.04.2 2560x1440 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - Intel Iris Pro 580: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Iris 540: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Intel Iris Pro 580: Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1 - Intel Iris 540: Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 2.4.3 Security Details - Intel Iris Pro 580: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled - Intel Iris 540: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected
opencl-subset-IrisPro580-20211109-2 shoc: OpenCL - FFT SP shoc: OpenCL - Max SP Flops clpeak: Single-Precision Float shoc: OpenCL - MD5 Hash clpeak: Double-Precision Double shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Triad shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueWriteBuffer smallpt-gpu: GPU - 3840 x 2160 - Caustic smallpt-gpu: GPU - 3840 x 2160 - Cornell smallpt-gpu: GPU - 3840 x 2160 - Caustic3 darktable: Server Room - OpenCL darktable: Server Rack - OpenCL darktable: Masskrug - OpenCL darktable: Boat - OpenCL darktable: Server Room - OpenCL darktable: Server Rack - OpenCL darktable: Masskrug - OpenCL darktable: Boat - OpenCL clpeak: Kernel Latency juliagpu: GPU - $VIDEO_WIDTH x $VIDEO_HEIGHT Intel Iris Pro 580 Intel Iris 540 35.1111 3767.70 1068.05 0.9402 267.29 127.707 17.1419 27.2748 26.8171 10.86 25.77 29.34 1636497361 1636497485 1636497613 6.485 0.627 10.243 54.753 42.30 74.4383 2050.00 636.35 0.5648 160.58 77.8864 12.1785 22.3346 22.2675 12.13 24.73 30.13 1636643154 1636643276 1636643404 8.209 0.898 16.233 24.100 39.88 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP Intel Iris Pro 580 Intel Iris 540 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.62, N = 3 35.11 74.44 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops Intel Iris Pro 580 Intel Iris 540 800 1600 2400 3200 4000 SE +/- 22.88, N = 3 SE +/- 6.43, N = 3 3767.70 2050.00 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel Iris Pro 580 Intel Iris 540 200 400 600 800 1000 SE +/- 2.30, N = 3 SE +/- 0.26, N = 3 1068.05 636.35 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash Intel Iris Pro 580 Intel Iris 540 0.2115 0.423 0.6345 0.846 1.0575 SE +/- 0.0003, N = 3 SE +/- 0.0002, N = 3 0.9402 0.5648 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Intel Iris Pro 580 Intel Iris 540 60 120 180 240 300 SE +/- 0.23, N = 3 SE +/- 0.04, N = 3 267.29 160.58 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth Intel Iris Pro 580 Intel Iris 540 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.25, N = 3 127.71 77.89 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel Iris Pro 580 Intel Iris 540 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 17.14 12.18 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel Iris Pro 580 Intel Iris 540 6 12 18 24 30 SE +/- 0.13, N = 3 SE +/- 0.25, N = 3 27.27 22.33 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel Iris Pro 580 Intel Iris 540 6 12 18 24 30 SE +/- 0.15, N = 3 SE +/- 0.25, N = 3 26.82 22.27 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel Iris Pro 580 Intel Iris 540 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 10.86 12.13 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel Iris Pro 580 Intel Iris 540 6 12 18 24 30 SE +/- 0.08, N = 3 SE +/- 0.23, N = 3 25.77 24.73 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel Iris Pro 580 Intel Iris 540 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.17, N = 3 29.34 30.13 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic Intel Iris Pro 580 Intel Iris 540 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 SE +/- 24.54, N = 3 1636497361 1636643154 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell Intel Iris Pro 580 Intel Iris 540 400M 800M 1200M 1600M 2000M SE +/- 21.94, N = 3 SE +/- 20.78, N = 3 1636497485 1636643276 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 Intel Iris Pro 580 Intel Iris 540 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 SE +/- 24.54, N = 3 1636497613 1636643404 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Room - Acceleration: OpenCL Intel Iris 540 2 4 6 8 10 SE +/- 0.048, N = 3 8.209
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Server Rack - Acceleration: OpenCL Intel Iris 540 0.2021 0.4042 0.6063 0.8084 1.0105 SE +/- 0.008, N = 3 0.898
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Masskrug - Acceleration: OpenCL Intel Iris 540 4 8 12 16 20 SE +/- 0.22, N = 3 16.23
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.4.1 Test: Boat - Acceleration: OpenCL Intel Iris 540 6 12 18 24 30 SE +/- 0.28, N = 4 24.10
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Room - Acceleration: OpenCL Intel Iris Pro 580 2 4 6 8 10 SE +/- 0.026, N = 3 6.485
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Rack - Acceleration: OpenCL Intel Iris Pro 580 0.1411 0.2822 0.4233 0.5644 0.7055 SE +/- 0.007, N = 4 0.627
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Masskrug - Acceleration: OpenCL Intel Iris Pro 580 3 6 9 12 15 SE +/- 0.01, N = 3 10.24
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Boat - Acceleration: OpenCL Intel Iris Pro 580 12 24 36 48 60 SE +/- 0.15, N = 3 54.75
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel Iris Pro 580 Intel Iris 540 10 20 30 40 50 SE +/- 1.99, N = 15 SE +/- 0.51, N = 3 42.30 39.88 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4