CL-VK-COMPUTE Intel Core i3-3217U testing with a Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) and Intel HD 4000 IVB GT2 2GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2209011-EIRI-CLVKCOM97&grr .
CL-VK-COMPUTE Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel HD Graphics 4000 IVB GT2 Intel Core i3-3217U (2 Cores / 4 Threads) Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) Intel 3rd Gen Core DRAM 16GB 256GB SAMSUNG SSD PM85 Intel HD 4000 IVB GT2 2GB (1050MHz) Intel 7 /C216 32S305 Intel 82579V + Intel 7260 EndeavourOS rolling 5.19.4-lqx1-2-lqx (x86_64) GNOME Shell 42.4 X Server 1.21.1.4 4.2 Mesa 22.1.7 OpenCL 2.0 beignet 1.4 (git-419c0417) 1.3.211 GCC 12.2.0 + Clang 14.0.6 + LLVM 14.0.6 ext4 1360x768 OpenBenchmarking.org - Transparent Huge Pages: madvise - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - CPU Microcode: 0x21 - SNA - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
CL-VK-COMPUTE vkpeak: int32-vec4 vkpeak: int32-scalar vkpeak: fp64-vec4 vkpeak: fp64-scalar vkpeak: fp32-vec4 vkpeak: fp32-scalar cl-mem: Write darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Room - OpenCL clpeak: Single-Precision Float xsbench-cl: waifu2x-ncnn: 2x - 3 - No clpeak: Global Memory Bandwidth cl-mem: Copy cl-mem: Read clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer darktable: Server Rack - OpenCL clpeak: Kernel Latency waifu2x-ncnn: 2x - 3 - Yes vkresample: 2x - Double Intel HD Graphics 4000 IVB GT2 31.67 30.53 60.04 60.07 215.06 115.08 84.5 54.379 38.907 29.316 181.30 5022519 23.122 18.77 85.4 85.4 5.88 9.55 4.862 41.67 7.013 OpenBenchmarking.org
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-vec4 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 31.67
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-scalar Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 30.53
vkpeak fp64-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-vec4 Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.04
vkpeak fp64-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-scalar Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.07
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-vec4 Intel HD Graphics 4000 IVB GT2 50 100 150 200 250 SE +/- 0.01, N = 3 215.06
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-scalar Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.01, N = 3 115.08
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 1.95, N = 12 84.5 1. (CC) gcc options: -O2 -flto -lOpenCL
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Boat - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.14, N = 3 54.38
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Masskrug - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 9 18 27 36 45 SE +/- 0.11, N = 3 38.91
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Room - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.06, N = 3 29.32
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel HD Graphics 4000 IVB GT2 40 80 120 160 200 SE +/- 0.10, N = 3 181.30 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Xsbench OpenCL OpenBenchmarking.org Lookups/s, More Is Better Xsbench OpenCL 2017-07-06 Intel HD Graphics 4000 IVB GT2 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 17886.24, N = 3 5022519 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm -lOpenCL
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.02, N = 3 23.12
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel HD Graphics 4000 IVB GT2 5 10 15 20 25 SE +/- 0.03, N = 3 18.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel HD Graphics 4000 IVB GT2 1.323 2.646 3.969 5.292 6.615 SE +/- 0.01, N = 3 5.88 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.01, N = 3 9.55 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Rack - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 1.094 2.188 3.282 4.376 5.47 SE +/- 0.034, N = 3 4.862
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel HD Graphics 4000 IVB GT2 10 20 30 40 50 SE +/- 0.11, N = 3 41.67 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.003, N = 3 7.013
Phoronix Test Suite v10.8.4