OpenCL Intel Beignet - Saturday Intel Core i5-6600K testing with a MSI Z170A GAMING PRO (MS-7984) v1.0 and Intel HD 530 (Skylake GT2) 3072MB on Clear Linux 13000 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1701280-RI-1701281RI71&rdt&grt&export=pdf .
OpenCL Intel Beignet - Saturday Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K Intel Core i5-7600K @ 3.80GHz (4 Cores) ASUS PRIME Z270-P Intel Device 591f 16384MB Samsung SSD 950 PRO 256GB Intel Kabylake GT2 3072MB (1150MHz) Realtek ALC887-VD DELL P2415Q Realtek RTL8111/8168/8411 Clear Linux 12950 4.9.5-302.native (x86_64) Xfce 4.12 X Server 1.19.1 modesetting 1.19.1 4.5 Mesa 17.0.0-devel OpenCL 2.0 beignet 1.3 1.0.37 GCC 6.3.0 + Clang 3.9.1 + LLVM 3.9.1 ext4 1920x1080 Intel Core i3-7100 @ 3.90GHz (4 Cores) Intel Device 590f Intel Kabylake GT2 3072MB (1100MHz) Intel Core i7-7700K @ 4.20GHz (8 Cores) Intel Device 591f Intel Kabylake GT2 3072MB (1150MHz) Clear Linux 12960 Intel Pentium G4400 @ 3.30GHz (2 Cores) MSI B150M MORTAR (MS-7972) v2.0 Intel Skylake 8192MB 120GB Samsung SSD 850 Intel HD 510 (Skylake GT1) 3072MB (1000MHz) Realtek ALC892 Intel Xeon E3-1235L v5 @ 2.00GHz (4 Cores) ASRockRack C236M WS 120GB OCZ TRION150 Intel HD 530 (Skylake GT2) 3072MB (1000MHz) Realtek ALC1150 DELL S2409W Intel Connection Intel Xeon E3-1245 v5 @ 3.50GHz (8 Cores) MSI C236A WORKSTATION (MS-7998) v1.0 32768MB 120GB Samsung SSD 850 Intel HD P530 (Skylake GT2) 3072MB (1150MHz) DELL P2415Q Intel Core i5-6600K @ 3.50GHz (4 Cores) MSI Z170A GAMING PRO (MS-7984) v1.0 15360MB 256GB TS256GSSD370S Intel HD 530 (Skylake GT2) 3072MB (1150MHz) Clear Linux 13000 OpenBenchmarking.org Compiler Details - --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell Processor Details - Scaling Governor: acpi-cpufreq performance
OpenCL Intel Beignet - Saturday cl-mem: Read cl-mem: Write juliagpu: GPU mandelbulbgpu: GPU mandelgpu: GPU shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth smallpt-gpu: GPU - Complex smallpt-gpu: GPU - Cornell smallpt-gpu: GPU - Caustic3 Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 39.72 42.92 36333556.83 7893608.83 9180692.03 8.37 10.90 0.32 340.69 26.18 35.12 58.11 1485461759 1485462437 1485462559 44.12 47.97 31987657.03 6991142.00 8031118.63 6.61 9.67 0.28 297.39 24.86 32.08 51.52 1485491582 1485492354 1485492483 41.07 39.13 37085079.77 7953468.13 9186739.20 12.23 10.88 0.32 340.71 28.39 38.52 57.89 1485532534 1485533211 1485533333 40.32 42.30 16211893.10 3613230.53 6.24 138.13 21.07 24.35 31.98 39.85 44.15 29888422.60 6421850.37 7788605.97 9.67 8.96 0.26 280.07 19.23 25.60 47.60 42.18 46.68 34988792.07 7523189.00 8686154.40 10.49 10.33 0.30 324.36 24.17 32.65 55.52 1485574320 1485575033 1485575156 39.92 46.02 36351871.53 7914468.33 9149327.20 8.27 10.89 0.32 340.71 25.60 33.96 57.60 OpenBenchmarking.org
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 10 20 30 40 50 SE +/- 1.38, N = 6 SE +/- 1.44, N = 6 SE +/- 2.40, N = 6 SE +/- 2.78, N = 6 SE +/- 1.86, N = 6 SE +/- 2.19, N = 6 SE +/- 1.86, N = 6 39.72 44.12 41.07 40.32 39.85 42.18 39.92 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 11 22 33 44 55 SE +/- 2.03, N = 6 SE +/- 3.35, N = 6 SE +/- 0.77, N = 3 SE +/- 2.02, N = 6 SE +/- 3.36, N = 6 SE +/- 0.73, N = 4 SE +/- 1.25, N = 6 42.92 47.97 39.13 42.30 44.15 46.68 46.02 1. (CC) gcc options: -O2 -flto -lOpenCL
JuliaGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better JuliaGPU 1.2pts1 OpenCL Device: GPU Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 8M 16M 24M 32M 40M SE +/- 375450.50, N = 3 SE +/- 30806.75, N = 3 SE +/- 469338.36, N = 3 SE +/- 4354.56, N = 3 SE +/- 2225.50, N = 3 SE +/- 430217.30, N = 3 SE +/- 8517.71, N = 3 36333556.83 31987657.03 37085079.77 16211893.10 29888422.60 34988792.07 36351871.53 1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
MandelbulbGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelbulbGPU 1.0pts1 OpenCL Device: GPU Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 2M 4M 6M 8M 10M SE +/- 16049.95, N = 3 SE +/- 467.88, N = 3 SE +/- 19261.23, N = 3 SE +/- 21481.69, N = 3 SE +/- 1640.04, N = 3 SE +/- 864.90, N = 3 7893608.83 6991142.00 7953468.13 6421850.37 7523189.00 7914468.33 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
MandelGPU OpenCL Device: GPU OpenBenchmarking.org Samples/sec, More Is Better MandelGPU 1.3pts1 OpenCL Device: GPU Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 2M 4M 6M 8M 10M SE +/- 414.49, N = 3 SE +/- 369.28, N = 3 SE +/- 7061.44, N = 3 SE +/- 1228.27, N = 3 SE +/- 10634.78, N = 3 SE +/- 6393.20, N = 3 SE +/- 8446.97, N = 3 9180692.03 8031118.63 9186739.20 3613230.53 7788605.97 8686154.40 9149327.20 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.19, N = 3 SE +/- 0.01, N = 3 SE +/- 0.14, N = 6 SE +/- 0.08, N = 3 SE +/- 0.03, N = 3 8.37 6.61 12.23 6.24 9.67 10.49 8.27 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 10.90 9.67 10.88 8.96 10.33 10.89 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 0.072 0.144 0.216 0.288 0.36 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.32 0.28 0.32 0.26 0.30 0.32 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 70 140 210 280 350 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 SE +/- 0.08, N = 3 SE +/- 0.00, N = 3 340.69 297.39 340.71 138.13 280.07 324.36 340.71 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 7 14 21 28 35 SE +/- 0.19, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.07, N = 3 SE +/- 0.04, N = 3 SE +/- 0.14, N = 3 SE +/- 0.06, N = 3 26.18 24.86 28.39 21.07 19.23 24.17 25.60 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 9 18 27 36 45 SE +/- 0.36, N = 3 SE +/- 0.29, N = 3 SE +/- 0.32, N = 3 SE +/- 0.25, N = 3 SE +/- 0.22, N = 3 SE +/- 0.46, N = 3 SE +/- 0.35, N = 3 35.12 32.08 38.52 24.35 25.60 32.65 33.96 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth Core i5 7600K Core i3 7100 Core i7 7700K Pentium G4400 Xeon E3-1235L v5 Xeon E3-1245 v5 Core i5 6600K 13 26 39 52 65 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.22, N = 3 SE +/- 0.40, N = 3 SE +/- 0.40, N = 3 SE +/- 0.06, N = 3 SE +/- 0.52, N = 3 58.11 51.52 57.89 31.98 47.60 55.52 57.60 1. (CXX) g++ options: -O2 -pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpicxx -lmpi
SmallPT GPU OpenCL Device: GPU - Scene: Complex OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Complex Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1245 v5 300M 600M 900M 1200M 1500M SE +/- 335.15, N = 3 SE +/- 382.78, N = 3 SE +/- 335.15, N = 3 SE +/- 353.05, N = 3 1485461759 1485491582 1485532534 1485574320 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Scene: Cornell OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Cornell Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1245 v5 300M 600M 900M 1200M 1500M SE +/- 26.27, N = 3 SE +/- 29.16, N = 3 SE +/- 25.98, N = 3 SE +/- 27.42, N = 3 1485462437 1485492354 1485533211 1485575033 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
SmallPT GPU OpenCL Device: GPU - Scene: Caustic3 OpenBenchmarking.org Samples/sec, More Is Better SmallPT GPU 1.6pts1 OpenCL Device: GPU - Scene: Caustic3 Core i5 7600K Core i3 7100 Core i7 7700K Xeon E3-1245 v5 300M 600M 900M 1200M 1500M SE +/- 20.21, N = 3 SE +/- 20.78, N = 3 SE +/- 20.21, N = 3 SE +/- 19.92, N = 3 1485462559 1485492483 1485533333 1485575156 1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
Phoronix Test Suite v10.8.5