CL-VK-COMPUTE Intel Core i3-3217U testing with a Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) and Intel HD 4000 IVB GT2 2GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2209011-EIRI-CLVKCOM97&grt .
CL-VK-COMPUTE Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel HD Graphics 4000 IVB GT2 Intel Core i3-3217U (2 Cores / 4 Threads) Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) Intel 3rd Gen Core DRAM 16GB 256GB SAMSUNG SSD PM85 Intel HD 4000 IVB GT2 2GB (1050MHz) Intel 7 /C216 32S305 Intel 82579V + Intel 7260 EndeavourOS rolling 5.19.4-lqx1-2-lqx (x86_64) GNOME Shell 42.4 X Server 1.21.1.4 4.2 Mesa 22.1.7 OpenCL 2.0 beignet 1.4 (git-419c0417) 1.3.211 GCC 12.2.0 + Clang 14.0.6 + LLVM 14.0.6 ext4 1360x768 OpenBenchmarking.org - Transparent Huge Pages: madvise - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - CPU Microcode: 0x21 - SNA - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
CL-VK-COMPUTE cl-mem: Copy cl-mem: Read cl-mem: Write clpeak: Kernel Latency clpeak: Single-Precision Float clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL vkpeak: fp32-scalar vkpeak: fp32-vec4 vkpeak: fp64-scalar vkpeak: fp64-vec4 vkpeak: int32-scalar vkpeak: int32-vec4 waifu2x-ncnn: 2x - 3 - No waifu2x-ncnn: 2x - 3 - Yes xsbench-cl: Intel HD Graphics 4000 IVB GT2 85.4 85.4 84.5 41.67 181.30 18.77 5.88 9.55 54.379 38.907 4.862 29.316 115.08 215.06 60.07 60.04 30.53 31.67 23.122 7.013 5022519 OpenBenchmarking.org
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 1.95, N = 12 84.5 1. (CC) gcc options: -O2 -flto -lOpenCL
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency Intel HD Graphics 4000 IVB GT2 10 20 30 40 50 SE +/- 0.11, N = 3 41.67 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float Intel HD Graphics 4000 IVB GT2 40 80 120 160 200 SE +/- 0.10, N = 3 181.30 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth Intel HD Graphics 4000 IVB GT2 5 10 15 20 25 SE +/- 0.03, N = 3 18.77 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer Intel HD Graphics 4000 IVB GT2 1.323 2.646 3.969 5.292 6.615 SE +/- 0.01, N = 3 5.88 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.01, N = 3 9.55 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Boat - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.14, N = 3 54.38
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Masskrug - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 9 18 27 36 45 SE +/- 0.11, N = 3 38.91
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Rack - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 1.094 2.188 3.282 4.376 5.47 SE +/- 0.034, N = 3 4.862
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Room - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.06, N = 3 29.32
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-scalar Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.01, N = 3 115.08
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-vec4 Intel HD Graphics 4000 IVB GT2 50 100 150 200 250 SE +/- 0.01, N = 3 215.06
vkpeak fp64-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-scalar Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.07
vkpeak fp64-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-vec4 Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.04
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-scalar Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 30.53
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-vec4 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 31.67
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.02, N = 3 23.12
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.003, N = 3 7.013
Xsbench OpenCL OpenBenchmarking.org Lookups/s, More Is Better Xsbench OpenCL 2017-07-06 Intel HD Graphics 4000 IVB GT2 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 17886.24, N = 3 5022519 1. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm -lOpenCL
Phoronix Test Suite v10.8.4