Intel OpenCL 2.0 Beignet 1.3 benchmarking with Intel CPUs. Tests by Michael Larabel for a future article on Phoronix.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 1701313-RI-BEIGNET1211 OpenCL 2.0 Intel Beignet CPU Celeron Kabylake Comparison - Phoronix Test Suite OpenCL 2.0 Intel Beignet CPU Celeron Kabylake Comparison Intel OpenCL 2.0 Beignet 1.3 benchmarking with Intel CPUs. Tests by Michael Larabel for a future article on Phoronix.
HTML result view exported from: https://openbenchmarking.org/result/1701313-RI-BEIGNET1211&grr&sro .
OpenCL 2.0 Intel Beignet CPU Celeron Kabylake Comparison Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Core i5 6500 Core i5 6600K Pentium G4400 Core i3 7100 Core i5 7600K Core i7 7700K Pentium G4600 Celeron G3930 Intel Core i5-6500 @ 3.20GHz (4 Cores) Gigabyte Z170M-D3H-CF Intel Skylake 8192MB 250GB Samsung SSD 850 Intel HD 530 (Skylake GT2) 3072MB (1050MHz) Realtek ALC892 DELL P2415Q Intel Connection Clear Linux 4.9.5-302.native (x86_64) Xfce 4.12 X Server 1.19.1 modesetting 1.19.1 4.5 Mesa 17.0.0-devel OpenCL 2.0 beignet 1.3 1.0.37 GCC 6.3.0 + Clang 3.9.1 + LLVM 3.9.1 ext4 1920x1080 Intel Core i5-6600K @ 3.50GHz (4 Cores) MSI Z170A GAMING PRO (MS-7984) v1.0 15360MB 256GB TS256GSSD370S Intel HD 530 (Skylake GT2) 3072MB (1150MHz) Realtek ALC1150 Intel Pentium G4400 @ 3.30GHz (2 Cores) MSI B150M MORTAR (MS-7972) v2.0 8192MB 120GB Samsung SSD 850 Intel HD 510 (Skylake GT1) 3072MB (1000MHz) Realtek ALC892 Realtek RTL8111/8168/8411 Intel Core i3-7100 @ 3.90GHz (4 Cores) ASUS PRIME Z270-P Intel Device 590f 16384MB Samsung SSD 950 PRO 256GB Intel Kabylake GT2 3072MB (1100MHz) Realtek ALC887-VD Intel Core i5-7600K @ 3.80GHz (4 Cores) Intel Device 591f Intel Kabylake GT2 3072MB (1150MHz) Intel Core i7-7700K @ 4.20GHz (8 Cores) Intel Pentium G4600 @ 3.60GHz (4 Cores) MSI Z270-A PRO (MS-7A71) v1.0 Intel Device 590f 15360MB Intel Kabylake GT2 3072MB (1100MHz) Realtek ALC892 Intel Celeron G3930 @ 2.90GHz (2 Cores) Intel Kabylake GT1 3072MB (1050MHz) OpenBenchmarking.org Compiler Details - --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell Processor Details - Scaling Governor: acpi-cpufreq performance
OpenCL 2.0 Intel Beignet CPU Celeron Kabylake Comparison mandelgpu: GPU mandelbulbgpu: GPU juliagpu: GPU cl-mem: Write cl-mem: Read shoc: OpenCL - Texture Read Bandwidth shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download shoc: OpenCL - Max SP Flops shoc: OpenCL - MD5 Hash shoc: OpenCL - FFT SP shoc: OpenCL - Triad Core i5 6500 Core i5 6600K Pentium G4400 Core i3 7100 Core i5 7600K Core i7 7700K Pentium G4600 Celeron G3930 7886706.37 6852443.00 31757837.23 40.48 45.88 49.86 29.57 21.51 292.46 0.28 9.32 7.43 9149327.20 7914468.33 36351871.53 46.02 39.92 57.60 33.96 25.60 340.71 0.32 10.89 8.27 3613230.53 16211893.10 42.30 40.32 31.98 24.35 21.07 138.13 6.24 8031118.63 6991142.00 31987657.03 47.97 44.12 51.52 32.08 24.86 297.39 0.28 9.67 6.61 9180692.03 7893608.83 36333556.83 42.92 39.72 58.11 35.12 26.18 340.69 0.32 10.90 8.37 9186739.20 7953468.13 37085079.77 39.13 41.07 57.89 38.52 28.39 340.71 0.32 10.88 12.23 8029783.93 7023912.23 32892655.40 44.32 39.98 51.56 26.35 22.39 297.39 0.28 9.66 7.41 3822342.23 17060871.17 46.35 41.07 33.76 19.10 17.97 146.02 6.24 OpenBenchmarking.org
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 2M 4M 6M 8M 10M SE +/- 29.87, N = 3 SE +/- 369.28, N = 3 SE +/- 473.77, N = 3 SE +/- 8446.97, N = 3 SE +/- 414.49, N = 3 SE +/- 7061.44, N = 3 SE +/- 1228.27, N = 3 SE +/- 5671.17, N = 3 3822342.23 8031118.63 7886706.37 9149327.20 9180692.03 9186739.20 3613230.53 8029783.93 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4600 2M 4M 6M 8M 10M SE +/- 467.88, N = 3 SE +/- 16180.63, N = 3 SE +/- 864.90, N = 3 SE +/- 16049.95, N = 3 SE +/- 19261.23, N = 3 SE +/- 15808.74, N = 3 6991142.00 6852443.00 7914468.33 7893608.83 7953468.13 7023912.23 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 8M 16M 24M 32M 40M SE +/- 721.79, N = 3 SE +/- 30806.75, N = 3 SE +/- 7819.95, N = 3 SE +/- 8517.71, N = 3 SE +/- 375450.50, N = 3 SE +/- 469338.36, N = 3 SE +/- 4354.56, N = 3 SE +/- 346084.01, N = 3 17060871.17 31987657.03 31757837.23 36351871.53 36333556.83 37085079.77 16211893.10 32892655.40 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 11 22 33 44 55 SE +/- 2.80, N = 6 SE +/- 3.35, N = 6 SE +/- 2.50, N = 6 SE +/- 1.25, N = 6 SE +/- 2.03, N = 6 SE +/- 0.77, N = 3 SE +/- 2.02, N = 6 SE +/- 3.56, N = 6 46.35 47.97 40.48 46.02 42.92 39.13 42.30 44.32 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 10 20 30 40 50 SE +/- 2.40, N = 6 SE +/- 1.44, N = 6 SE +/- 2.54, N = 6 SE +/- 1.86, N = 6 SE +/- 1.38, N = 6 SE +/- 2.40, N = 6 SE +/- 2.78, N = 6 SE +/- 2.47, N = 6 41.07 44.12 45.88 39.92 39.72 41.07 40.32 39.98 1. (CC) gcc options: -O2 -flto -lOpenCL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 13 26 39 52 65 SE +/- 0.42, N = 3 SE +/- 0.00, N = 3 SE +/- 0.35, N = 3 SE +/- 0.52, N = 3 SE +/- 0.00, N = 3 SE +/- 0.22, N = 3 SE +/- 0.40, N = 3 SE +/- 0.03, N = 3 33.76 51.52 49.86 57.60 58.11 57.89 31.98 51.56 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.28, N = 3 SE +/- 0.35, N = 3 SE +/- 0.36, N = 3 SE +/- 0.32, N = 3 SE +/- 0.25, N = 3 SE +/- 0.38, N = 3 19.10 32.08 29.57 33.96 35.12 38.52 24.35 26.35 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 7 14 21 28 35 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 SE +/- 0.19, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 17.97 24.86 21.51 25.60 26.18 28.39 21.07 22.39 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 70 140 210 280 350 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.17, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 146.02 297.39 292.46 340.71 340.69 340.71 138.13 297.39 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4600 0.072 0.144 0.216 0.288 0.36 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.28 0.28 0.32 0.32 0.32 0.28 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4600 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 9.67 9.32 10.89 10.90 10.88 9.66 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad Celeron G3930 Core i3 7100 Core i5 6500 Core i5 6600K Core i5 7600K Core i7 7700K Pentium G4400 Pentium G4600 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.19, N = 3 SE +/- 0.01, N = 3 SE +/- 0.12, N = 4 6.24 6.61 7.43 8.27 8.37 12.23 6.24 7.41 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
Phoronix Test Suite v10.8.4