opencl-subset-IrisPro580-20211109-2 Dell NVIDIA GeForce GTX 1660 Super 6 GB (90.16.48.00.38 VBIOS) testing with a Dell G5 (model 5000, 0M6C7G motherboard, 1.4.0 BIOS) and Intel Core i5-10400F on Ubuntu 21.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2111128-TJ-2111121TJ51 .
opencl-subset-IrisPro580-20211109-2 Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Display Driver Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super Intel Core i7-6770HQ @ 3.50GHz (4 Cores / 8 Threads) Intel NUC6i7KYB (KYSKLi70.86A.0073.2021.0722.1021 BIOS) Intel Xeon E3-1200 v5/E3-1500 32GB Samsung SSD 950 PRO 256GB Intel Iris Pro 580 SKL GT4 3GB (950MHz) Realtek ALC233 DELL P2415Q Intel I219-LM + Intel 8260 Ubuntu 20.04 5.14.0-1007-oem (x86_64) GNOME Shell 3.36.9 X Server 1.20.11 4.6 Mesa 21.0.3 OpenCL 2.1 1.2.145 GCC 10.3.0 ext4 3840x2160 Intel Core i7-11700K @ 5.00GHz (8 Cores / 16 Threads) Dell 0K3CM7 (2.3.0 BIOS) Intel Comet Lake PCH 16GB SK hynix BC511 NVMe 256GB + 4001GB Seagate ST4000DM004-2CV1 + 1000GB CT1000MX500SSD1 Intel RKL GT1 3GB (1300MHz) Realtek ALC3861 Realtek Device 2600 + Intel Wi-Fi 6 AX201 OpenCL 3.0 Intel Core i5-10400F @ 4.30GHz (6 Cores / 12 Threads) Dell 0M6C7G (1.4.0 BIOS) 32GB SK hynix BC511 NVMe 512GB + 2000GB Seagate ST2000DM008-2FR1 + 500GB Western Digital WDS500G2B0A + 2000GB Seagate ST2000LM007-1R81 NVIDIA GeForce GTX 1660 SUPER 6GB VX2257 Realtek Device 2600 + Intel Comet Lake PCH CNVi WiFi Ubuntu 21.10 5.13.0-21-generic (x86_64) GNOME Shell 40.5 X Server 1.20.13 NVIDIA 470.82.00 4.6.0 OpenCL 3.0 CUDA 11.4.153 1.2.175 GCC 9.4.0 + Clang 13.0.0-2 + CUDA 11.3 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - Intel Iris Pro 580: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel UHD 750: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-S4I5Pr/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Nvidia GTX 1660 Super: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-PEIxyV/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Intel Iris Pro 580: Scaling Governor: intel_pstate performance - CPU Microcode: 0xea - Thermald 1.9.1 - Intel UHD 750: Scaling Governor: intel_pstate performance - CPU Microcode: 0x40 - Thermald 1.9.1 - Nvidia GTX 1660 Super: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xea - Thermald 2.4.6 Security Details - Intel Iris Pro 580: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of TSX disabled - Intel UHD 750: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - Nvidia GTX 1660 Super: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Environment Details - Nvidia GTX 1660 Super: CUDA_CPPFLAGS=-gencode=arch=compute_75,code=sm_75 Graphics Details - Nvidia GTX 1660 Super: BAR1 / Visible vRAM Size: 256 MiB OpenCL Details - Nvidia GTX 1660 Super: GPU Compute Cores: 1408
opencl-subset-IrisPro580-20211109-2 shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL smallpt-gpu: GPU - 3840 x 2160 - Caustic smallpt-gpu: GPU - 3840 x 2160 - Cornell smallpt-gpu: GPU - 3840 x 2160 - Caustic3 luxmark: GPU - Hotel luxmark: GPU - Microphone luxmark: GPU - Luxball HDR clpeak: Kernel Latency clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL clpeak: Integer Compute INT Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 17.1419 35.1111 0.9402 3767.70 27.2748 26.8171 127.707 54.753 10.243 0.627 6.485 1636497361 1636497485 1636497613 42.30 1068.05 267.29 25.77 10.86 29.34 15.7544 77.8145 0.7041 177085 40.0972 41.7889 82.2610 14.472 4.806 0.193 3.893 1636747769 1636747889 1636748012 18.61 661.21 37.30 14.24 29.03 1636761813 1636761947 1636762083 3744 11216 16832 3.54 4333.70 167.30 276.63 11.00 10.96 2.905 4.373 0.166 1.224 4385.08 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel Iris Pro 580 Intel UHD 750 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 17.14 15.75 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP Intel Iris Pro 580 Intel UHD 750 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 35.11 77.81 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash Intel Iris Pro 580 Intel UHD 750 0.2115 0.423 0.6345 0.846 1.0575 SE +/- 0.0003, N = 3 SE +/- 0.0000, N = 3 0.9402 0.7041 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops Intel Iris Pro 580 Intel UHD 750 40K 80K 120K 160K 200K SE +/- 22.88, N = 3 SE +/- 235.89, N = 3 3767.70 177085.00 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel Iris Pro 580 Intel UHD 750 9 18 27 36 45 SE +/- 0.13, N = 3 SE +/- 0.65, N = 15 27.27 40.10 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel Iris Pro 580 Intel UHD 750 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.46, N = 15 26.82 41.79 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth Intel Iris Pro 580 Intel UHD 750 30 60 90 120 150 SE +/- 0.18, N = 3 SE +/- 0.01, N = 3 127.71 82.26 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Boat - Acceleration: OpenCL Intel Iris Pro 580 Intel UHD 750 12 24 36 48 60 SE +/- 0.15, N = 3 SE +/- 0.02, N = 3 54.75 14.47
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Masskrug - Acceleration: OpenCL Intel Iris Pro 580 Intel UHD 750 3 6 9 12 15 SE +/- 0.013, N = 3 SE +/- 0.007, N = 3 10.243 4.806
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Rack - Acceleration: OpenCL Intel Iris Pro 580 Intel UHD 750 0.1411 0.2822 0.4233 0.5644 0.7055 SE +/- 0.007, N = 4 SE +/- 0.001, N = 3 0.627 0.193
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.0.1 Test: Server Room - Acceleration: OpenCL Intel Iris Pro 580 Intel UHD 750 2 4 6 8 10 SE +/- 0.026, N = 3 SE +/- 0.005, N = 3 6.485 3.893
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 SE +/- 23.38, N = 3 SE +/- 25.12, N = 3 1636497361 1636747769 1636761813 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Cornell Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 400M 800M 1200M 1600M 2000M SE +/- 21.94, N = 3 SE +/- 21.07, N = 3 SE +/- 24.25, N = 3 1636497485 1636747889 1636761947 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Resolution: 3840 x 2160 - Scene: Caustic3 Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 400M 800M 1200M 1600M 2000M SE +/- 24.25, N = 3 SE +/- 23.09, N = 3 SE +/- 25.12, N = 3 1636497613 1636748012 1636762083 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
LuxMark OpenCL Device: GPU - Scene: Hotel OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel Nvidia GTX 1660 Super 800 1600 2400 3200 4000 SE +/- 1.76, N = 3 3744
LuxMark OpenCL Device: GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone Nvidia GTX 1660 Super 2K 4K 6K 8K 10K SE +/- 3.38, N = 3 11216
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR Nvidia GTX 1660 Super 4K 8K 12K 16K 20K SE +/- 62.34, N = 3 16832
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 10 20 30 40 50 SE +/- 1.99, N = 15 SE +/- 0.26, N = 3 SE +/- 0.03, N = 3 42.30 18.61 3.54 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 900 1800 2700 3600 4500 SE +/- 2.30, N = 3 SE +/- 0.11, N = 3 SE +/- 24.74, N = 3 1068.05 661.21 4333.70 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double Intel Iris Pro 580 Nvidia GTX 1660 Super 60 120 180 240 300 SE +/- 0.23, N = 3 SE +/- 0.08, N = 3 267.29 167.30 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 60 120 180 240 300 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 SE +/- 0.34, N = 3 25.77 37.30 276.63 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 10.86 14.24 11.00 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel Iris Pro 580 Intel UHD 750 Nvidia GTX 1660 Super 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 29.34 29.03 10.96 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.0 Test: Boat - Acceleration: OpenCL Nvidia GTX 1660 Super 0.6536 1.3072 1.9608 2.6144 3.268 SE +/- 0.004, N = 3 2.905
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.0 Test: Masskrug - Acceleration: OpenCL Nvidia GTX 1660 Super 0.9839 1.9678 2.9517 3.9356 4.9195 SE +/- 0.010, N = 3 4.373
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.0 Test: Server Rack - Acceleration: OpenCL Nvidia GTX 1660 Super 0.0374 0.0748 0.1122 0.1496 0.187 SE +/- 0.000, N = 3 0.166
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 3.6.0 Test: Server Room - Acceleration: OpenCL Nvidia GTX 1660 Super 0.2754 0.5508 0.8262 1.1016 1.377 SE +/- 0.006, N = 3 1.224
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT Nvidia GTX 1660 Super 900 1800 2700 3600 4500 SE +/- 46.62, N = 3 4385.08 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4