OpenCL 2.0 Intel Beignet CPU Pentium G4600 Kabylake Comparison

Intel OpenCL 2.0 Beignet 1.3 benchmarking with Intel CPUs. Tests by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1701302-RI-BEIGNET0463&obr_sor=y&obr_rro=y&obr_hgv=Pentium+G4600.

OpenCL 2.0 Intel Beignet CPU Pentium G4600 Kabylake ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Pentium G4600Intel Core i5-6500 @ 3.20GHz (4 Cores)Gigabyte Z170M-D3H-CFIntel Skylake8192MB250GB Samsung SSD 850Intel HD 530 (Skylake GT2) 3072MB (1050MHz)Realtek ALC892DELL P2415QIntel ConnectionClear Linux4.9.5-302.native (x86_64)Xfce 4.12X Server 1.19.1modesetting 1.19.14.5 Mesa 17.0.0-develOpenCL 2.0 beignet 1.31.0.37GCC 6.3.0 + Clang 3.9.1 + LLVM 3.9.1ext41920x1080Intel Core i5-6600K @ 3.50GHz (4 Cores)MSI Z170A GAMING PRO (MS-7984) v1.015360MB256GB TS256GSSD370SIntel HD 530 (Skylake GT2) 3072MB (1150MHz)Realtek ALC1150Intel Pentium G4400 @ 3.30GHz (2 Cores)MSI B150M MORTAR (MS-7972) v2.08192MB120GB Samsung SSD 850Intel HD 510 (Skylake GT1) 3072MB (1000MHz)Realtek ALC892Realtek RTL8111/8168/8411Intel Core i3-7100 @ 3.90GHz (4 Cores)ASUS PRIME Z270-PIntel Device 590f16384MBSamsung SSD 950 PRO 256GBIntel Kabylake GT2 3072MB (1100MHz)Realtek ALC887-VDIntel Core i5-7600K @ 3.80GHz (4 Cores)Intel Device 591fIntel Kabylake GT2 3072MB (1150MHz)Intel Core i7-7700K @ 4.20GHz (8 Cores)Intel Xeon E3-1235L v5 @ 2.00GHz (4 Cores)ASRockRack C236M WSIntel Skylake8192MB120GB OCZ TRION150Intel HD 530 (Skylake GT2) 3072MB (1000MHz)Realtek ALC1150DELL S2409WIntel ConnectionIntel Xeon E3-1245 v5 @ 3.50GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.032768MB120GB Samsung SSD 850Intel HD P530 (Skylake GT2) 3072MB (1150MHz)DELL P2415QIntel Pentium G4600 @ 3.60GHz (4 Cores)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 590f15360MBSamsung SSD 950 PRO 256GBIntel Kabylake GT2 3072MB (1100MHz)Realtek ALC892Realtek RTL8111/8168/8411OpenBenchmarking.orgCompiler Details- --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell Processor Details- Scaling Governor: acpi-cpufreq performance

OpenCL 2.0 Intel Beignet CPU Pentium G4600 Kabylake Comparisonshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Readcl-mem: Writejuliagpu: GPUmandelbulbgpu: GPUmandelgpu: GPUCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Pentium G46007.439.320.28292.4621.5129.5749.8645.8840.4831757837.236852443.007886706.378.2710.890.32340.7125.6033.9657.6039.9246.0236351871.537914468.339149327.206.24138.1321.0724.3531.9840.3242.3016211893.103613230.536.619.670.28297.3924.8632.0851.5244.1247.9731987657.036991142.008031118.638.3710.900.32340.6926.1835.1258.1139.7242.9236333556.837893608.839180692.0312.2310.880.32340.7128.3938.5257.8941.0739.1337085079.777953468.139186739.209.678.960.26280.0719.2325.6047.6039.8544.1529888422.606421850.377788605.9710.4910.330.30324.3624.1732.6555.5242.1846.6834988792.077523189.008686154.407.419.660.28297.3922.3926.3551.5639.9844.3232892655.407023912.238029783.93OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadPentium G4400Core i3 7100Pentium G4600Core i5 6500Core i5 6600KCore i5 7600KXeon E3-1235L v5Xeon E3-1245 v5Core i7 7700K3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 4SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 6SE +/- 0.08, N = 3SE +/- 0.19, N = 36.246.617.417.438.278.379.6710.4912.231. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPXeon E3-1235L v5Core i5 6500Pentium G4600Core i3 7100Xeon E3-1245 v5Core i7 7700KCore i5 6600KCore i5 7600K3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38.969.329.669.6710.3310.8810.8910.901. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashXeon E3-1235L v5Core i5 6500Core i3 7100Pentium G4600Xeon E3-1245 v5Core i5 6600KCore i5 7600KCore i7 7700K0.0720.1440.2160.2880.36SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.260.280.280.280.300.320.320.321. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsPentium G4400Xeon E3-1235L v5Core i5 6500Core i3 7100Pentium G4600Xeon E3-1245 v5Core i5 7600KCore i5 6600KCore i7 7700K70140210280350SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.17, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3138.13280.07292.46297.39297.39324.36340.69340.71340.711. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadXeon E3-1235L v5Pentium G4400Core i5 6500Pentium G4600Xeon E3-1245 v5Core i3 7100Core i5 6600KCore i5 7600KCore i7 7700K714212835SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 319.2321.0721.5122.3924.1724.8625.6026.1828.391. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackPentium G4400Xeon E3-1235L v5Pentium G4600Core i5 6500Core i3 7100Xeon E3-1245 v5Core i5 6600KCore i5 7600KCore i7 7700K918273645SE +/- 0.25, N = 3SE +/- 0.22, N = 3SE +/- 0.38, N = 3SE +/- 0.28, N = 3SE +/- 0.29, N = 3SE +/- 0.46, N = 3SE +/- 0.35, N = 3SE +/- 0.36, N = 3SE +/- 0.32, N = 324.3525.6026.3529.5732.0832.6533.9635.1238.521. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthPentium G4400Xeon E3-1235L v5Core i5 6500Core i3 7100Pentium G4600Xeon E3-1245 v5Core i5 6600KCore i7 7700KCore i5 7600K1326395265SE +/- 0.40, N = 3SE +/- 0.40, N = 3SE +/- 0.35, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.52, N = 3SE +/- 0.22, N = 3SE +/- 0.00, N = 331.9847.6049.8651.5251.5655.5257.6057.8958.111. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadCore i5 7600KXeon E3-1235L v5Core i5 6600KPentium G4600Pentium G4400Core i7 7700KXeon E3-1245 v5Core i3 7100Core i5 65001020304050SE +/- 1.38, N = 6SE +/- 1.86, N = 6SE +/- 1.86, N = 6SE +/- 2.47, N = 6SE +/- 2.78, N = 6SE +/- 2.40, N = 6SE +/- 2.19, N = 6SE +/- 1.44, N = 6SE +/- 2.54, N = 639.7239.8539.9239.9840.3241.0742.1844.1245.881. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteCore i7 7700KCore i5 6500Pentium G4400Core i5 7600KXeon E3-1235L v5Pentium G4600Core i5 6600KXeon E3-1245 v5Core i3 71001122334455SE +/- 0.77, N = 3SE +/- 2.50, N = 6SE +/- 2.02, N = 6SE +/- 2.03, N = 6SE +/- 3.36, N = 6SE +/- 3.56, N = 6SE +/- 1.25, N = 6SE +/- 0.73, N = 4SE +/- 3.35, N = 639.1340.4842.3042.9244.1544.3246.0246.6847.971. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUPentium G4400Xeon E3-1235L v5Core i5 6500Core i3 7100Pentium G4600Xeon E3-1245 v5Core i5 7600KCore i5 6600KCore i7 7700K8M16M24M32M40MSE +/- 4354.56, N = 3SE +/- 2225.50, N = 3SE +/- 7819.95, N = 3SE +/- 30806.75, N = 3SE +/- 346084.01, N = 3SE +/- 430217.30, N = 3SE +/- 375450.50, N = 3SE +/- 8517.71, N = 3SE +/- 469338.36, N = 316211893.1029888422.6031757837.2331987657.0332892655.4034988792.0736333556.8336351871.5337085079.771. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUXeon E3-1235L v5Core i5 6500Core i3 7100Pentium G4600Xeon E3-1245 v5Core i5 7600KCore i5 6600KCore i7 7700K2M4M6M8M10MSE +/- 21481.69, N = 3SE +/- 16180.63, N = 3SE +/- 467.88, N = 3SE +/- 15808.74, N = 3SE +/- 1640.04, N = 3SE +/- 16049.95, N = 3SE +/- 864.90, N = 3SE +/- 19261.23, N = 36421850.376852443.006991142.007023912.237523189.007893608.837914468.337953468.131. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUPentium G4400Xeon E3-1235L v5Core i5 6500Pentium G4600Core i3 7100Xeon E3-1245 v5Core i5 6600KCore i5 7600KCore i7 7700K2M4M6M8M10MSE +/- 1228.27, N = 3SE +/- 10634.78, N = 3SE +/- 473.77, N = 3SE +/- 5671.17, N = 3SE +/- 369.28, N = 3SE +/- 6393.20, N = 3SE +/- 8446.97, N = 3SE +/- 414.49, N = 3SE +/- 7061.44, N = 33613230.537788605.977886706.378029783.938031118.638686154.409149327.209180692.039186739.201. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4