CL-VK-COMPUTE-FULL Intel Core i3-3217U testing with a Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) and Intel HD 4000 IVB GT2 2GB on EndeavourOS rolling via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2209033-EIRI-CLVKCOM88 .
CL-VK-COMPUTE-FULL Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel HD Graphics 4000 IVB GT2 Intel Core i3-3217U (2 Cores / 4 Threads) Intel D33217GKE (GKPPT10H.86A.0040.2013.0325.1514 BIOS) Intel 3rd Gen Core DRAM 16GB 256GB SAMSUNG SSD PM85 Intel HD 4000 IVB GT2 2GB (1050MHz) Intel 7 /C216 32S305 Intel 82579V + Intel 7260 EndeavourOS rolling 5.19.4-lqx1-2-lqx (x86_64) GNOME Shell 42.4 X Server 1.21.1.4 4.2 Mesa 22.1.7 OpenCL 2.0 beignet 1.4 (git-419c0417) 1.3.211 GCC 12.2.0 + Clang 14.0.6 + LLVM 14.0.6 ext4 1360x768 OpenBenchmarking.org - Transparent Huge Pages: madvise - --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - CPU Microcode: 0x21 - SNA - Python 3.10.6 - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
CL-VK-COMPUTE-FULL vkpeak: fp32-scalar vkpeak: fp32-vec4 vkpeak: fp64-scalar vkpeak: fp64-vec4 vkpeak: int32-scalar vkpeak: int32-vec4 realsr-ncnn: 4x - No realsr-ncnn: 4x - Yes waifu2x-ncnn: 2x - 3 - No waifu2x-ncnn: 2x - 3 - Yes shoc: OpenCL - Triad shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback cl-mem: Copy cl-mem: Read cl-mem: Write parboil: OpenCL BFS rodinia: OpenCL Heartwall darktable: Boat - OpenCL darktable: Masskrug - OpenCL darktable: Server Rack - OpenCL darktable: Server Room - OpenCL ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - FastestDet blender: BMW27 - OpenCL Intel HD Graphics 4000 IVB GT2 115.08 215.05 60.07 60.04 30.52 31.67 10.082 14.589 23.153 7.075 3.1191 4.9164 7.3228 85.4 85.4 87.0 3.814913 26.667 54.306 38.966 4.843 29.204 58.38 15.47 15.70 12.98 16.19 27.99 9.05 42.41 310.65 52.35 66.85 124.28 110.34 55.09 21.21 4755.64 15.25 2478.08 OpenBenchmarking.org
vkpeak fp32-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-scalar Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.00, N = 3 115.08
vkpeak fp32-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp32-vec4 Intel HD Graphics 4000 IVB GT2 50 100 150 200 250 SE +/- 0.00, N = 3 215.05
vkpeak fp64-scalar OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-scalar Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.07
vkpeak fp64-vec4 OpenBenchmarking.org GFLOPS, More Is Better vkpeak 20210424 fp64-vec4 Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.00, N = 3 60.04
vkpeak int32-scalar OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-scalar Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 30.52
vkpeak int32-vec4 OpenBenchmarking.org GIOPS, More Is Better vkpeak 20210424 int32-vec4 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.00, N = 3 31.67
RealSR-NCNN Scale: 4x - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: No Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.08, N = 8 10.08
RealSR-NCNN Scale: 4x - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better RealSR-NCNN 20200818 Scale: 4x - TAA: Yes Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.18, N = 3 14.59
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: No OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: No Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.09, N = 3 23.15
Waifu2x-NCNN Vulkan Scale: 2x - Denoise: 3 - TAA: Yes OpenBenchmarking.org Seconds, Fewer Is Better Waifu2x-NCNN Vulkan 20200818 Scale: 2x - Denoise: 3 - TAA: Yes Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.040, N = 3 7.075
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel HD Graphics 4000 IVB GT2 0.7018 1.4036 2.1054 2.8072 3.509 SE +/- 0.0069, N = 3 3.1191 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel HD Graphics 4000 IVB GT2 1.1062 2.2124 3.3186 4.4248 5.531 SE +/- 0.0729, N = 15 4.9164 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel HD Graphics 4000 IVB GT2 2 4 6 8 10 SE +/- 0.0371, N = 3 7.3228 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.30, N = 3 85.4 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 1.72, N = 15 87.0 1. (CC) gcc options: -O2 -flto -lOpenCL
Parboil Test: OpenCL BFS OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenCL BFS Intel HD Graphics 4000 IVB GT2 0.8584 1.7168 2.5752 3.4336 4.292 SE +/- 0.005920, N = 3 3.814913 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Rodinia Test: OpenCL Heartwall OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenCL Heartwall Intel HD Graphics 4000 IVB GT2 6 12 18 24 30 SE +/- 0.02, N = 3 26.67 1. (CXX) g++ options: -O2 -lOpenCL
Darktable Test: Boat - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Boat - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.03, N = 3 54.31
Darktable Test: Masskrug - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Masskrug - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 9 18 27 36 45 SE +/- 0.04, N = 3 38.97
Darktable Test: Server Rack - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Rack - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 1.0897 2.1794 3.2691 4.3588 5.4485 SE +/- 0.013, N = 3 4.843
Darktable Test: Server Room - Acceleration: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.0.0 Test: Server Room - Acceleration: OpenCL Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.05, N = 3 29.20
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: mobilenet Intel HD Graphics 4000 IVB GT2 13 26 39 52 65 SE +/- 0.02, N = 3 58.38 MIN: 55.51 / MAX: 66.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.01, N = 3 15.47 MIN: 15.36 / MAX: 15.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.03, N = 3 15.70 MIN: 15.57 / MAX: 16.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: shufflenet-v2 Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.05, N = 3 12.98 MIN: 12.81 / MAX: 14.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: mnasnet Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.01, N = 3 16.19 MIN: 16.08 / MAX: 17.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: efficientnet-b0 Intel HD Graphics 4000 IVB GT2 7 14 21 28 35 SE +/- 0.14, N = 3 27.99 MIN: 27.44 / MAX: 30.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: blazeface Intel HD Graphics 4000 IVB GT2 3 6 9 12 15 SE +/- 0.02, N = 3 9.05 MIN: 8.64 / MAX: 12.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: googlenet Intel HD Graphics 4000 IVB GT2 10 20 30 40 50 SE +/- 0.10, N = 3 42.41 MIN: 42.1 / MAX: 44.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: vgg16 Intel HD Graphics 4000 IVB GT2 70 140 210 280 350 SE +/- 1.38, N = 3 310.65 MIN: 308.77 / MAX: 315.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: resnet18 Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.11, N = 3 52.35 MIN: 51.8 / MAX: 53.3 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: alexnet Intel HD Graphics 4000 IVB GT2 15 30 45 60 75 SE +/- 0.03, N = 3 66.85 MIN: 66.15 / MAX: 68.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: resnet50 Intel HD Graphics 4000 IVB GT2 30 60 90 120 150 SE +/- 0.09, N = 3 124.28 MIN: 123.54 / MAX: 126.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: yolov4-tiny Intel HD Graphics 4000 IVB GT2 20 40 60 80 100 SE +/- 0.33, N = 3 110.34 MIN: 107.17 / MAX: 130.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: squeezenet_ssd Intel HD Graphics 4000 IVB GT2 12 24 36 48 60 SE +/- 0.27, N = 3 55.09 MIN: 53.63 / MAX: 58.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: regnety_400m Intel HD Graphics 4000 IVB GT2 5 10 15 20 25 SE +/- 0.14, N = 3 21.21 MIN: 20.68 / MAX: 24.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: vision_transformer Intel HD Graphics 4000 IVB GT2 1000 2000 3000 4000 5000 SE +/- 1.63, N = 3 4755.64 MIN: 4707.69 / MAX: 4845.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: Vulkan GPU - Model: FastestDet Intel HD Graphics 4000 IVB GT2 4 8 12 16 20 SE +/- 0.19, N = 3 15.25 MIN: 10.62 / MAX: 26.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Blender Blend File: BMW27 - Compute: OpenCL OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.2 Blend File: BMW27 - Compute: OpenCL Intel HD Graphics 4000 IVB GT2 500 1000 1500 2000 2500 SE +/- 7.38, N = 3 2478.08
Phoronix Test Suite v10.8.4