opencl-subset-IrisPro580-20211109-2

Dell NVIDIA GeForce GTX 1660 Super 6 GB (90.16.48.00.38 VBIOS) testing with a Dell G5 (model 5000, 0M6C7G motherboard, 1.4.0 BIOS) and Intel Core i5-10400F on Ubuntu 21.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111128-TJ-2111121TJ51&sro&grs .
System Details

The export lists a value only where it differs from the preceding system, so fields omitted below appear to be unchanged from the system before.

Intel Iris Pro 580
    Processor: Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads)
    Motherboard: Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS)
    Chipset: Intel Xeon E3-1200 v5/E3-1500
    Memory: 32GB
    Disk: Samsung SSD 950 PRO 256GB
    Graphics: Intel Iris Pro 580 SKL GT4 3GB (950MHz)
    Audio: Realtek ALC233
    Monitor: DELL P2415Q
    Network: Intel I219-LM + Intel 8260
    OS: Ubuntu 20.04
    Kernel: 5.14.0-1007-oem (x86_64)
    Desktop: GNOME Shell 3.36.9
    Display Server: X Server 1.20.11
    OpenGL: 4.6 Mesa 21.0.3
    OpenCL: OpenCL 2.1
    Vulkan: 1.2.145
    Compiler: GCC 10.3.0
    File-System: ext4
    Screen Resolution: 3840x2160

Intel UHD 750
    Processor: Intel Core i7-11700K @ 5.00GHz (8 Cores / 16 Threads)
    Motherboard: Dell 0K3CM7 (2.3.0 BIOS)
    Chipset: Intel Comet Lake PCH
    Memory: 16GB
    Disk: SK hynix BC511 NVMe 256GB + 4001GB Seagate ST4000DM004-2CV1 + 1000GB CT1000MX500SSD1
    Graphics: Intel RKL GT1 3GB (1300MHz)
    Audio: Realtek ALC3861
    Network: Realtek Device 2600 + Intel Wi-Fi 6 AX201
    OpenCL: OpenCL 3.0

Nvidia GTX 1660 Super
    Processor: Intel Core i5-10400F @ 4.30GHz (6 Cores / 12 Threads)
    Motherboard: Dell 0M6C7G (1.4.0 BIOS)
    Memory: 32GB
    Disk: SK hynix BC511 NVMe 512GB + 2000GB Seagate ST2000DM008-2FR1 + 500GB Western Digital WDS500G2B0A + 2000GB Seagate ST2000LM007-1R81
    Graphics: NVIDIA GeForce GTX 1660 SUPER 6GB
    Monitor: VX2257
    Network: Realtek Device 2600 + Intel Comet Lake PCH CNVi WiFi
    OS: Ubuntu 21.10
    Kernel: 5.13.0-21-generic (x86_64)
    Desktop: GNOME Shell 40.5
    Display Server: X Server 1.20.13
    Display Driver: NVIDIA 470.82.00
    OpenGL: 4.6.0
    OpenCL: OpenCL 3.0 CUDA 11.4.153
    Vulkan: 1.2.175
    Compiler: GCC 9.4.0 + Clang 13.0.0-2 + CUDA 11.3
    Screen Resolution: 1920x1080

Kernel Details
    Transparent Huge Pages: madvise

Compiler Details
    Intel Iris Pro 580 and Intel UHD 750 (identical flags): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
    Nvidia GTX 1660 Super: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-PEIxyV/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
    Intel Iris Pro 580: Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1
    Intel UHD 750: Scaling Governor: intel_pstate performance - CPU Microcode: 0x40 - Thermald 1.9.1
    Nvidia GTX 1660 Super: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xea - Thermald 2.4.6

Security Details
    Intel Iris Pro 580: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled
    Intel UHD 750: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
    Nvidia GTX 1660 Super: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Environment Details
    Nvidia GTX 1660 Super: CUDA_CPPFLAGS=-gencode=arch=compute_75,code=sm_75

Graphics Details
    Nvidia GTX 1660 Super: BAR1 / Visible vRAM Size: 256 MiB

OpenCL Details
    Nvidia GTX 1660 Super: GPU Compute Cores: 1408
Results Overview (a dash marks a test with no result for that system)

Test                                           Intel Iris Pro 580  Intel UHD 750  Nvidia GTX 1660 Super
clpeak: Single-Precision Float                 1068.05             661.21         4333.70
shoc: OpenCL - Max SP Flops                    3767.70             177085         -
darktable: Boat - OpenCL                       54.753              14.472         -
darktable: Server Rack - OpenCL                0.627               0.193          -
clpeak: Transfer Bandwidth enqueueWriteBuffer  29.34               29.03          10.96
shoc: OpenCL - FFT SP                          35.1111             77.8145        -
darktable: Masskrug - OpenCL                   10.243              4.806          -
clpeak: Global Memory Bandwidth                25.77               37.30          276.63
darktable: Server Room - OpenCL                6.485               3.893          -
clpeak: Double-Precision Double                267.29              -              167.30
shoc: OpenCL - Bus Speed Readback              26.8171             41.7889        -
shoc: OpenCL - Texture Read Bandwidth          127.707             82.2610        -
shoc: OpenCL - MD5 Hash                        0.9402              0.7041         -
clpeak: Transfer Bandwidth enqueueReadBuffer   10.86               14.24          11.00
shoc: OpenCL - Triad                           17.1419             15.7544        -
smallpt-gpu: GPU - 3840 x 2160 - Caustic3      1636497613          1636748012     1636762083
smallpt-gpu: GPU - 3840 x 2160 - Cornell       1636497485          1636747889     1636761947
smallpt-gpu: GPU - 3840 x 2160 - Caustic       1636497361          1636747769     1636761813
clpeak: Integer Compute INT                    -                   -              4385.08
darktable: Server Room - OpenCL                -                   -              1.224
darktable: Server Rack - OpenCL                -                   -              0.166
darktable: Masskrug - OpenCL                   -                   -              4.373
darktable: Boat - OpenCL                       -                   -              2.905
luxmark: GPU - Luxball HDR                     -                   -              16832
luxmark: GPU - Microphone                      -                   -              11216
luxmark: GPU - Hotel                           -                   -              3744
clpeak: Kernel Latency                         42.30               18.61          3.54
shoc: OpenCL - Bus Speed Download              27.2748             40.0972        -
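The overview reports raw values only. A minimal sketch of how relative performance could be derived from them is given here; it covers three clpeak rows where all three systems have a result. The helper name `relative_to` and the choice of the Iris Pro 580 as baseline are illustrative assumptions, not part of the export.

```python
# Values copied from the results overview (three clpeak rows only).
results = {
    "Single-Precision Float (GFLOPS)": {
        "Iris Pro 580": 1068.05, "UHD 750": 661.21, "GTX 1660 Super": 4333.70,
    },
    "Global Memory Bandwidth (GBPS)": {
        "Iris Pro 580": 25.77, "UHD 750": 37.30, "GTX 1660 Super": 276.63,
    },
    "Kernel Latency (us)": {
        "Iris Pro 580": 42.30, "UHD 750": 18.61, "GTX 1660 Super": 3.54,
    },
}

# Tests where a smaller number is the better result.
LOWER_IS_BETTER = {"Kernel Latency (us)"}


def relative_to(baseline, rows):
    """Return per-test ratios where a value above 1.0 means 'better than baseline'."""
    out = {}
    for test, values in rows.items():
        base = values[baseline]
        out[test] = {
            name: (base / v if test in LOWER_IS_BETTER else v / base)
            for name, v in values.items()
        }
    return out


ratios = relative_to("Iris Pro 580", results)
for test, values in ratios.items():
    print(test, {name: round(r, 2) for name, r in values.items()})
```

Inverting the ratio for "Fewer Is Better" tests keeps every number readable the same way: above 1.0 is faster than the baseline.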
clpeak - OpenCL Test: Single-Precision Float
    GFLOPS, More Is Better
    Intel Iris Pro 580: 1068.05 (SE +/- 2.30, N = 3)
    Intel UHD 750: 661.21 (SE +/- 0.11, N = 3)
    Nvidia GTX 1660 Super: 4333.70 (SE +/- 24.74, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Max SP Flops
    GFLOPS, More Is Better
    Intel Iris Pro 580: 3767.70 (SE +/- 22.88, N = 3)
    Intel UHD 750: 177085.00 (SE +/- 235.89, N = 3)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
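The SHOC Max SP Flops figures are far higher than the clpeak single-precision figures for the same two GPUs. A rough plausibility check is sketched below. The execution-unit counts (72 and 32) and the figure of 16 FP32 operations per EU per clock are assumptions that do not appear in the export; only the clocks (950MHz, 1300MHz) and the measured values come from it. Under those assumptions the clpeak results sit just under the estimated peak and the SHOC results sit well above it, so the SHOC numbers may not reflect sustained arithmetic throughput.

```python
# ASSUMPTION: EU counts are not in the export; they are commonly cited
# figures for these GPUs and should be verified independently.
ASSUMED_EUS = {"Intel Iris Pro 580": 72, "Intel UHD 750": 32}
# ASSUMPTION: 16 FP32 operations per EU per clock (fused multiply-add counted as two).
ASSUMED_FLOPS_PER_EU_PER_CLOCK = 16

# From the export: graphics clocks in the system details, in GHz.
CLOCK_GHZ = {"Intel Iris Pro 580": 0.950, "Intel UHD 750": 1.300}

# From the export: measured GFLOPS.
CLPEAK_SP = {"Intel Iris Pro 580": 1068.05, "Intel UHD 750": 661.21}
SHOC_MAX_SP = {"Intel Iris Pro 580": 3767.70, "Intel UHD 750": 177085.00}


def estimated_peak_gflops(device):
    """Estimated FP32 peak under the assumptions stated above."""
    return ASSUMED_EUS[device] * ASSUMED_FLOPS_PER_EU_PER_CLOCK * CLOCK_GHZ[device]


for device in ASSUMED_EUS:
    peak = estimated_peak_gflops(device)
    print(
        f"{device}: estimated peak {peak:.1f} GFLOPS, "
        f"clpeak at {CLPEAK_SP[device] / peak:.2f}x, "
        f"SHOC at {SHOC_MAX_SP[device] / peak:.2f}x"
    )
```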
Darktable 3.0.1 - Test: Boat - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Intel Iris Pro 580: 54.75 (SE +/- 0.15, N = 3)
    Intel UHD 750: 14.47 (SE +/- 0.02, N = 3)
Darktable 3.0.1 - Test: Server Rack - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Intel Iris Pro 580: 0.627 (SE +/- 0.007, N = 4)
    Intel UHD 750: 0.193 (SE +/- 0.001, N = 3)
clpeak - OpenCL Test: Transfer Bandwidth enqueueWriteBuffer
    GBPS, More Is Better
    Intel Iris Pro 580: 29.34 (SE +/- 0.06, N = 3)
    Intel UHD 750: 29.03 (SE +/- 0.03, N = 3)
    Nvidia GTX 1660 Super: 10.96 (SE +/- 0.05, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP
    GFLOPS, More Is Better
    Intel Iris Pro 580: 35.11 (SE +/- 0.03, N = 3)
    Intel UHD 750: 77.81 (SE +/- 0.04, N = 3)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Darktable 3.0.1 - Test: Masskrug - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Intel Iris Pro 580: 10.243 (SE +/- 0.013, N = 3)
    Intel UHD 750: 4.806 (SE +/- 0.007, N = 3)
clpeak - OpenCL Test: Global Memory Bandwidth
    GBPS, More Is Better
    Intel Iris Pro 580: 25.77 (SE +/- 0.08, N = 3)
    Intel UHD 750: 37.30 (SE +/- 0.10, N = 3)
    Nvidia GTX 1660 Super: 276.63 (SE +/- 0.34, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable 3.0.1 - Test: Server Room - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Intel Iris Pro 580: 6.485 (SE +/- 0.026, N = 3)
    Intel UHD 750: 3.893 (SE +/- 0.005, N = 3)
clpeak - OpenCL Test: Double-Precision Double
    GFLOPS, More Is Better
    Intel Iris Pro 580: 267.29 (SE +/- 0.23, N = 3)
    Nvidia GTX 1660 Super: 167.30 (SE +/- 0.08, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback
    GB/s, More Is Better
    Intel Iris Pro 580: 26.82 (SE +/- 0.15, N = 3)
    Intel UHD 750: 41.79 (SE +/- 0.46, N = 15)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Texture Read Bandwidth
    GB/s, More Is Better
    Intel Iris Pro 580: 127.71 (SE +/- 0.18, N = 3)
    Intel UHD 750: 82.26 (SE +/- 0.01, N = 3)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: MD5 Hash
    GHash/s, More Is Better
    Intel Iris Pro 580: 0.9402 (SE +/- 0.0003, N = 3)
    Intel UHD 750: 0.7041 (SE +/- 0.0000, N = 3)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
clpeak - OpenCL Test: Transfer Bandwidth enqueueReadBuffer
    GBPS, More Is Better
    Intel Iris Pro 580: 10.86 (SE +/- 0.03, N = 3)
    Intel UHD 750: 14.24 (SE +/- 0.02, N = 3)
    Nvidia GTX 1660 Super: 11.00 (SE +/- 0.03, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad
    GB/s, More Is Better
    Intel Iris Pro 580: 17.14 (SE +/- 0.06, N = 3)
    Intel UHD 750: 15.75 (SE +/- 0.02, N = 3)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3
    Samples/sec, More Is Better
    Intel Iris Pro 580: 1636497613 (SE +/- 24.25, N = 3)
    Intel UHD 750: 1636748012 (SE +/- 23.09, N = 3)
    Nvidia GTX 1660 Super: 1636762083 (SE +/- 25.12, N = 3)
    (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell
    Samples/sec, More Is Better
    Intel Iris Pro 580: 1636497485 (SE +/- 21.94, N = 3)
    Intel UHD 750: 1636747889 (SE +/- 21.07, N = 3)
    Nvidia GTX 1660 Super: 1636761947 (SE +/- 24.25, N = 3)
    (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU 1.6pts1 - OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic
    Samples/sec, More Is Better
    Intel Iris Pro 580: 1636497361 (SE +/- 24.25, N = 3)
    Intel UHD 750: 1636747769 (SE +/- 23.38, N = 3)
    Nvidia GTX 1660 Super: 1636761813 (SE +/- 25.12, N = 3)
    (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
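The SmallPT GPU values are nearly identical across three very different GPUs, and within each system the three scenes differ by only about two minutes' worth of seconds. They also fall in the range of Unix epoch times for November 2021, which matches the date in the result title. This suggests, though the export does not confirm it, that a timestamp was recorded in place of a sample rate. The sketch below simply decodes the reported numbers as epoch seconds; the helper name `as_utc` is illustrative.

```python
from datetime import datetime, timezone

# Values copied from the three SmallPT GPU results
# (order per system: Caustic3, Cornell, Caustic).
smallpt_values = {
    "Intel Iris Pro 580": [1636497613, 1636497485, 1636497361],
    "Intel UHD 750": [1636748012, 1636747889, 1636747769],
    "Nvidia GTX 1660 Super": [1636762083, 1636761947, 1636761813],
}


def as_utc(seconds):
    """Interpret a number as seconds since the Unix epoch, in UTC."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


for system, values in smallpt_values.items():
    for value in values:
        print(f"{system}: {value} -> {as_utc(value).isoformat()}")
```

If the timestamp reading is right, these three tests carry no performance information and are best excluded from any comparison.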
clpeak - OpenCL Test: Integer Compute INT
    GIOPS, More Is Better
    Nvidia GTX 1660 Super: 4385.08 (SE +/- 46.62, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable 3.6.0 - Test: Server Room - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Nvidia GTX 1660 Super: 1.224 (SE +/- 0.006, N = 3)
Darktable 3.6.0 - Test: Server Rack - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Nvidia GTX 1660 Super: 0.166 (SE +/- 0.000, N = 3)
Darktable 3.6.0 - Test: Masskrug - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Nvidia GTX 1660 Super: 4.373 (SE +/- 0.010, N = 3)
Darktable 3.6.0 - Test: Boat - Acceleration: OpenCL
    Seconds, Fewer Is Better
    Nvidia GTX 1660 Super: 2.905 (SE +/- 0.004, N = 3)
LuxMark 3.1 - OpenCL Device: GPU - Scene: Luxball HDR
    Score, More Is Better
    Nvidia GTX 1660 Super: 16832 (SE +/- 62.34, N = 3)
LuxMark 3.1 - OpenCL Device: GPU - Scene: Microphone
    Score, More Is Better
    Nvidia GTX 1660 Super: 11216 (SE +/- 3.38, N = 3)
LuxMark 3.1 - OpenCL Device: GPU - Scene: Hotel
    Score, More Is Better
    Nvidia GTX 1660 Super: 3744 (SE +/- 1.76, N = 3)
clpeak - OpenCL Test: Kernel Latency
    us, Fewer Is Better
    Intel Iris Pro 580: 42.30 (SE +/- 1.99, N = 15)
    Intel UHD 750: 18.61 (SE +/- 0.26, N = 3)
    Nvidia GTX 1660 Super: 3.54 (SE +/- 0.03, N = 3)
    (CXX) g++ options: -O3 -rdynamic -lOpenCL
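Each result carries a standard error (SE) and a run count (N). A minimal sketch of how those two numbers could be turned into a relative error and an implied sample standard deviation follows. It assumes the usual definition SE = s / sqrt(N), which the export does not spell out, and the function names are illustrative.

```python
import math

# (mean in microseconds, SE, N) copied from the Kernel Latency result.
kernel_latency = {
    "Intel Iris Pro 580": (42.30, 1.99, 15),
    "Intel UHD 750": (18.61, 0.26, 3),
    "Nvidia GTX 1660 Super": (3.54, 0.03, 3),
}


def relative_se_percent(mean, se):
    """Standard error as a percentage of the mean."""
    return 100.0 * se / mean


def stddev_from_se(se, n):
    """Implied sample standard deviation, ASSUMING SE = s / sqrt(N)."""
    return se * math.sqrt(n)


for system, (mean, se, n) in kernel_latency.items():
    print(
        f"{system}: relative SE {relative_se_percent(mean, se):.2f}%, "
        f"implied stddev {stddev_from_se(se, n):.2f} us over {n} runs"
    )
```

On these numbers the Iris Pro 580 latency is the noisiest of the three, with a relative SE near 4.7% despite 15 runs, so small differences against it deserve less weight.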
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download
    GB/s, More Is Better
    Intel Iris Pro 580: 27.27 (SE +/- 0.13, N = 3)
    Intel UHD 750: 40.10 (SE +/- 0.65, N = 15)
    (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Phoronix Test Suite v10.8.4