CL-VK-COMPUTE-FULL Intel Core i3-3217U testing with a Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) and Intel HD 4000 IVB GT2 2GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2209033-EIRI-CLVKCOM88&grs .
CL-VK-COMPUTE-FULL Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel HD Graphics 4000 IVB GT2 Intel Core i3-3217U (2 Cores / 4 Threads) Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) Intel 3rd Gen Core DRAM 16GB 256GB SAMSUNG SSD PM85 Intel HD 4000 IVB GT2 2GB (1050MHz) Intel 7 /C216 32S305 Intel 82579V + Intel 7260 EndeavourOS rolling 5.19.4-lqx1-2-lqx (x86_64) GNOME Shell 42.4 X Server 1.21.1.4 4.2 Mesa 22.1.7 OpenCL 2.0 beignet 1.4 (git-419c0417) 1.3.211 GCC 12.2.0 + Clang 14.0.6 + LLVM 14.0.6 ext4 1360x768 OpenBenchmarking.org - Transparent Huge Pages: madvise - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - CPU Microcode: 0x21 - SNA - Python 3.10.6 - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
CL-VK-COMPUTE-FULL blender: BMW27 - OpenCL ncnn: Vulkan GPU - FastestDet ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet darktable: Server Room - OpenCL darktable: Server Rack - OpenCL darktable: Masskrug - OpenCL darktable: Boat - OpenCL rodinia: OpenCL Heartwall parboil: OpenCL BFS cl-mem: Read cl-mem: Copy shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download shoc: OpenCL - Triad waifu2x-ncnn: 2x - 3 - Yes waifu2x-ncnn: 2x - 3 - No realsr-ncnn: 4x - Yes realsr-ncnn: 4x - No vkpeak: int32-vec4 vkpeak: int32-scalar vkpeak: fp64-vec4 vkpeak: fp64-scalar vkpeak: fp32-vec4 vkpeak: fp32-scalar cl-mem: Write vkfft: Intel HD Graphics 4000 IVB GT2 2478.08 15.25 4755.64 21.21 55.09 110.34 124.28 66.85 52.35 310.65 42.41 9.05 27.99 16.19 12.98 15.70 15.47 58.38 29.204 4.843 38.966 54.306 26.667 3.814913 85.4 85.4 7.3228 4.9164 3.1191 7.075 23.153 14.589 10.082 31.67 30.52 60.04 60.07 215.05 115.08 87.0 OpenBenchmarking.org
Blender Blend File: BMW27 - Compute: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: BMW27 - Compute: OpenCL Intel HD Graphics 4000 IVB GT2 500 1000 1500 2000 2500 SE +/- 7.38, N = 3 2478.08
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: FastestDet Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.19, N = 3 15.25 MIN: 10.62 / MAX: 26.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: vision_transformer Intel HD Graphics 4000 IVB GT2 1000 2000 3000 4000 5000 SE +/- 1.63, N = 3 4755.64 MIN: 4707.69 / MAX: 4845.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: regnety_400m Intel HD Graphics 4000 IVB GT2 5 10 15 20 25 SE +/- 0.14, N = 3 21.21 MIN: 20.68 / MAX: 24.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: squeezenet_ssd Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.27, N = 3 55.09 MIN: 53.63 / MAX: 58.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: yolov4-tiny Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.33, N = 3 110.34 MIN: 107.17 / MAX: 130.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: resnet50 Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.09, N = 3 124.28 MIN: 123.54 / MAX: 126.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: alexnet Intel HD Graphics 4000 IVB GT2 15 30 45 60 75 SE +/- 0.03, N = 3 66.85 MIN: 66.15 / MAX: 68.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: resnet18 Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.11, N = 3 52.35 MIN: 51.8 / MAX: 53.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: vgg16 Intel HD Graphics 4000 IVB GT2 70 140 210 280 350 SE +/- 1.38, N = 3 310.65 MIN: 308.77 / MAX: 315.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: googlenet Intel HD Graphics 4000 IVB GT2 10 20 30 40 50 SE +/- 0.10, N = 3 42.41 MIN: 42.1 / MAX: 44.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: blazeface Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.02, N = 3 9.05 MIN: 8.64 / MAX: 12.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: efficientnet-b0 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.14, N = 3 27.99 MIN: 27.44 / MAX: 30.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: mnasnet Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.01, N = 3 16.19 MIN: 16.08 / MAX: 17.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: shufflenet-v2 Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.05, N = 3 12.98 MIN: 12.81 / MAX: 14.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.03, N = 3 15.70 MIN: 15.57 / MAX: 16.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.01, N = 3 15.47 MIN: 15.36 / MAX: 15.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: mobilenet Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.02, N = 3 58.38 MIN: 55.51 / MAX: 66.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Room - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.05, N = 3 29.20
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Rack - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 1.0897 2.1794 3.2691 4.3588 5.4485 SE +/- 0.013, N = 3 4.843
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Masskrug - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 9 18 27 36 45 SE +/- 0.04, N = 3 38.97
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Boat - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.03, N = 3 54.31
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Heartwall Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.02, N = 3 26.67 1. (CXX) g++ options: -O2 -lOpenCL
Parboil Test: OpenCL BFS OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL BFS Intel HD Graphics 4000 IVB GT2 0.8584 1.7168 2.5752 3.4336 4.292 SE +/- 0.005920, N = 3 3.814913 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.0371, N = 3 7.3228 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel HD Graphics 4000 IVB GT2 1.1062 2.2124 3.3186 4.4248 5.531 SE +/- 0.0729, N = 15 4.9164 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel HD Graphics 4000 IVB GT2 0.7018 1.4036 2.1054 2.8072 3.509 SE +/- 0.0069, N = 3 3.1191 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.040, N = 3 7.075
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.09, N = 3 23.15
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.18, N = 3 14.59
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.08, N = 8 10.08
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-vec4 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 31.67
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-scalar Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 30.52
vkpeak fp64-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-vec4 Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.04
vkpeak fp64-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-scalar Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.07
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-vec4 Intel HD Graphics 4000 IVB GT2 50 100 150 200 250 SE +/- 0.00, N = 3 215.05
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-scalar Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.00, N = 3 115.08
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 1.72, N = 15 87.0 1. (CC) gcc options: -O2 -flto -lOpenCL
Phoronix Test Suite v10.8.4