opencl-subset-IrisPro580-20211109-2 Intel Iris Pro 580 (SKL, Gen9, GT4) testing with an Intel NUC (Skull Canyon, 6i7KYB, 0073 BIOS) and Intel Core i7-6770HQ on Ubuntu 20.04.3 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111095-TJ-OPENCLSUB26&grw .
opencl-subset-IrisPro580-20211109-2 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel Iris Pro 580 Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads) Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB Samsung SSD 950 PRO 256GB Intel Iris Pro 580 SKL GT4 3GB (950MHz) Realtek ALC233 DELL P2415Q Intel I219-LM + Intel 8260 Ubuntu 20.04 5.14.0-1007-oem (x86_64) GNOME Shell 3.36.9 X Server 1.20.11 4.6 Mesa 21.0.3 OpenCL 2.1 1.2.145 GCC 10.3.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1 - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
opencl-subset-IrisPro580-20211109-2 darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth clpeak: Kernel Latency clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer smallpt-gpu: GPU - 3840 x 2160 - Caustic smallpt-gpu: GPU - 3840 x 2160 - Cornell smallpt-gpu: GPU - 3840 x 2160 - Caustic3 Intel Iris Pro 580 54.753 10.243 0.627 6.485 17.1419 35.1111 0.9402 3767.70 27.2748 26.8171 127.707 42.30 1068.05 267.29 25.77 10.86 29.34 1636497361 1636497485 1636497613 OpenBenchmarking.org
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Boat - Acceleration: OpenCL Intel Iris Pro 580 12 24 36 48 60 SE +/- 0.15, N = 3 54.75
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Masskrug - Acceleration: OpenCL Intel Iris Pro 580 3 6 9 12 15 SE +/- 0.01, N = 3 10.24
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Rack - Acceleration: OpenCL Intel Iris Pro 580 0.1411 0.2822 0.4233 0.5644 0.7055 SE +/- 0.007, N = 4 0.627
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Room - Acceleration: OpenCL Intel Iris Pro 580 2 4 6 8 10 SE +/- 0.026, N = 3 6.485
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel Iris Pro 580 4 8 12 16 20 SE +/- 0.06, N = 3 17.14 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP Intel Iris Pro 580 8 16 24 32 40 SE +/- 0.03, N = 3 35.11 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash Intel Iris Pro 580 0.2115 0.423 0.6345 0.846 1.0575 SE +/- 0.0003, N = 3 0.9402 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops Intel Iris Pro 580 800 1600 2400 3200 4000 SE +/- 22.88, N = 3 3767.70 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.13, N = 3 27.27 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.15, N = 3 26.82 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth Intel Iris Pro 580 30 60 90 120 150 SE +/- 0.18, N = 3 127.71 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel Iris Pro 580 10 20 30 40 50 SE +/- 1.99, N = 15 42.30 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel Iris Pro 580 200 400 600 800 1000 SE +/- 2.30, N = 3 1068.05 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Intel Iris Pro 580 60 120 180 240 300 SE +/- 0.23, N = 3 267.29 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.08, N = 3 25.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel Iris Pro 580 3 6 9 12 15 SE +/- 0.03, N = 3 10.86 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel Iris Pro 580 7 14 21 28 35 SE +/- 0.06, N = 3 29.34 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 1636497361 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 21.94, N = 3 1636497485 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 1636497613 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Phoronix Test Suite v10.8.4