opencl-subset-IrisPro580-20211109-2 Intel Iris Pro 580 (SKL, Gen9, GT4) testing with an Intel NUC (Skull Canyon, 6i7KYB, 0073 BIOS) and Intel Core i7-6770HQ on Ubuntu 20.04.3 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111095-TJ-OPENCLSUB26 .
opencl-subset-IrisPro580-20211109-2 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel Iris Pro 580 Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads) Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB Samsung SSD 950 PRO 256GB Intel Iris Pro 580 SKL GT4 3GB (950MHz) Realtek ALC233 DELL P2415Q Intel I219-LM + Intel 8260 Ubuntu 20.04 5.14.0-1007-oem (x86_64) GNOME Shell 3.36.9 X Server 1.20.11 4.6 Mesa 21.0.3 OpenCL 2.1 1.2.145 GCC 10.3.0 ext4 3840x2160 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1 - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
opencl-subset-IrisPro580-20211109-2 shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL smallpt-gpu: GPU - 3840 x 2160 - Caustic smallpt-gpu: GPU - 3840 x 2160 - Cornell smallpt-gpu: GPU - 3840 x 2160 - Caustic3 clpeak: Kernel Latency clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer Intel Iris Pro 580 17.1419 35.1111 0.9402 3767.70 27.2748 26.8171 127.707 54.753 10.243 0.627 6.485 1636497361 1636497485 1636497613 42.30 1068.05 267.29 25.77 10.86 29.34 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel Iris Pro 580 4 8 12 16 20 SE +/- 0.06, N = 3 17.14 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP Intel Iris Pro 580 8 16 24 32 40 SE +/- 0.03, N = 3 35.11 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash Intel Iris Pro 580 0.2115 0.423 0.6345 0.846 1.0575 SE +/- 0.0003, N = 3 0.9402 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops Intel Iris Pro 580 800 1600 2400 3200 4000 SE +/- 22.88, N = 3 3767.70 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.13, N = 3 27.27 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.15, N = 3 26.82 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth Intel Iris Pro 580 30 60 90 120 150 SE +/- 0.18, N = 3 127.71 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Boat - Acceleration: OpenCL Intel Iris Pro 580 12 24 36 48 60 SE +/- 0.15, N = 3 54.75
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Masskrug - Acceleration: OpenCL Intel Iris Pro 580 3 6 9 12 15 SE +/- 0.01, N = 3 10.24
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Rack - Acceleration: OpenCL Intel Iris Pro 580 0.1411 0.2822 0.4233 0.5644 0.7055 SE +/- 0.007, N = 4 0.627
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Room - Acceleration: OpenCL Intel Iris Pro 580 2 4 6 8 10 SE +/- 0.026, N = 3 6.485
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 1636497361 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 21.94, N = 3 1636497485 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 Intel Iris Pro 580 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 1636497613 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel Iris Pro 580 10 20 30 40 50 SE +/- 1.99, N = 15 42.30 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel Iris Pro 580 200 400 600 800 1000 SE +/- 2.30, N = 3 1068.05 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Intel Iris Pro 580 60 120 180 240 300 SE +/- 0.23, N = 3 267.29 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel Iris Pro 580 6 12 18 24 30 SE +/- 0.08, N = 3 25.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel Iris Pro 580 3 6 9 12 15 SE +/- 0.03, N = 3 10.86 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel Iris Pro 580 7 14 21 28 35 SE +/- 0.06, N = 3 29.34 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4