OpenCL 2.0 Intel Beignet CPU Comparison

Intel OpenCL benchmark on Eurocom Q6

HTML result view exported from: https://openbenchmarking.org/result/1806219-AR-1701295RI30&grt.

OpenCL 2.0 Intel Beignet CPU ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750HIntel Core i5-6500 @ 3.20GHz (4 Cores)Gigabyte Z170M-D3H-CFIntel Skylake8192MB250GB Samsung SSD 850Intel HD 530 (Skylake GT2) 3072MB (1050MHz)Realtek ALC892DELL P2415QIntel ConnectionClear Linux4.9.5-302.native (x86_64)Xfce 4.12X Server 1.19.1modesetting 1.19.14.5 Mesa 17.0.0-develOpenCL 2.0 beignet 1.31.0.37GCC 6.3.0 + Clang 3.9.1 + LLVM 3.9.1ext41920x1080Intel Core i5-6600K @ 3.50GHz (4 Cores)MSI Z170A GAMING PRO (MS-7984) v1.015360MB256GB TS256GSSD370SIntel HD 530 (Skylake GT2) 3072MB (1150MHz)Realtek ALC1150Intel Pentium G4400 @ 3.30GHz (2 Cores)MSI B150M MORTAR (MS-7972) v2.08192MB120GB Samsung SSD 850Intel HD 510 (Skylake GT1) 3072MB (1000MHz)Realtek ALC892Realtek RTL8111/8168/8411Intel Core i3-7100 @ 3.90GHz (4 Cores)ASUS PRIME Z270-PIntel Device 590f16384MBSamsung SSD 950 PRO 256GBIntel Kabylake GT2 3072MB (1100MHz)Realtek ALC887-VDIntel Core i5-7600K @ 3.80GHz (4 Cores)Intel Device 591fIntel Kabylake GT2 3072MB (1150MHz)Intel Core i7-7700K @ 4.20GHz (8 Cores)Intel Xeon E3-1235L v5 @ 2.00GHz (4 Cores)ASRockRack C236M WSIntel Skylake8192MB120GB OCZ TRION150Intel HD 530 (Skylake GT2) 3072MB (1000MHz)Realtek ALC1150DELL S2409WIntel ConnectionIntel Xeon E3-1245 v5 @ 3.50GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.032768MB120GB Samsung SSD 850Intel HD P530 (Skylake GT2) 3072MB (1150MHz)DELL P2415QIntel Core i7-8750H @ 4.10GHz (6 Cores / 12 Threads)Eurocom Q6 (7.005 BIOS)Intel Cannon Lake PCH Shared SRAM2050GB Crucial_CT2050MX + 1000GB Samsung SSD 960 EVO 1TBNVIDIA GeForce GTX 1070 with Max-Q Design 8192MB (1101/4006MHz)Realtek ALC1220Realtek RTL8111/8168/8411 + Intel Wireless-AC 9260Ubuntu 18.044.17.2 (x86_64)GNOME Shell 3.28.1X Server 1.19.6NVIDIA 396.24.024.6.0OpenCL 1.2 CUDA 9.2.127 + OpenCL 2.1GCC 7.3.0 + Clang 4.0.1-10 + LLVM 4.0.1 + CUDA 9.2OpenBenchmarking.orgCompiler Details- Core i5 6500: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Core i5 6600K: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Pentium G4400: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Core i3 7100: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Core i5 7600K: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Core i7 7700K: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Xeon E3-1235L v5: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Xeon E3-1245 v5: --build=x86_64-generic-linux --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --enable-__cxa_atexit --enable-bootstrap --enable-clocale=gnu --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libmpx --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell- Intel Core i7 8750H: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Core i5 6500: Scaling Governor: acpi-cpufreq performance- Core i5 6600K: Scaling Governor: acpi-cpufreq performance- Pentium G4400: Scaling Governor: acpi-cpufreq performance- Core i3 7100: Scaling Governor: acpi-cpufreq performance- Core i5 7600K: Scaling Governor: acpi-cpufreq performance- Core i7 7700K: Scaling Governor: acpi-cpufreq performance- Xeon E3-1235L v5: Scaling Governor: acpi-cpufreq performance- Xeon E3-1245 v5: Scaling Governor: acpi-cpufreq performance- Intel Core i7 8750H: Scaling Governor: intel_pstate performanceKernel Details- Intel Core i7 8750H: drm.debug=0xeOpenCL Details- Intel Core i7 8750H: GPU Compute Cores: 2048Security Details- Intel Core i7 8750H: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW Protection

OpenCL 2.0 Intel Beignet CPU Comparisoncl-mem: Readcl-mem: Writejuliagpu: GPUmandelbulbgpu: GPUmandelgpu: GPUshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthsmallpt-gpu: GPU - Complexsmallpt-gpu: GPU - Cornellsmallpt-gpu: GPU - Caustic3Core i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H45.8840.4831757837.236852443.007886706.377.439.320.28292.4621.5129.5749.8614856501081485650894148565102139.9246.0236351871.537914468.339149327.208.2710.890.32340.7125.6033.9657.6040.3242.3016211893.103613230.536.24138.1321.0724.3531.9844.1247.9731987657.036991142.008031118.636.619.670.28297.3924.8632.0851.5214854915821485492354148549248339.7242.9236333556.837893608.839180692.038.3710.900.32340.6926.1835.1258.1114854617591485462437148546255941.0739.1337085079.777953468.139186739.2012.2310.880.32340.7128.3938.5257.8914855325341485533211148553333339.8544.1529888422.606421850.377788605.979.678.960.26280.0719.2325.6047.6042.1846.6834988792.077523189.008686154.4010.4910.330.30324.3624.1732.6555.52148557432014855750331485575156206.77204.80132480810.1074293202.03126728903.2011.29526.168.686226.8212.4812.03428.00152956243315295625491529562675OpenBenchmarking.org

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H50100150200250SE +/- 2.54, N = 6SE +/- 1.86, N = 6SE +/- 2.78, N = 6SE +/- 1.44, N = 6SE +/- 1.38, N = 6SE +/- 2.40, N = 6SE +/- 1.86, N = 6SE +/- 2.19, N = 6SE +/- 0.09, N = 345.8839.9240.3244.1239.7241.0739.8542.18206.771. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H4080120160200SE +/- 2.50, N = 6SE +/- 1.25, N = 6SE +/- 2.02, N = 6SE +/- 3.35, N = 6SE +/- 2.03, N = 6SE +/- 0.77, N = 3SE +/- 3.36, N = 6SE +/- 0.73, N = 4SE +/- 0.06, N = 340.4846.0242.3047.9742.9239.1344.1546.68204.801. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H30M60M90M120M150MSE +/- 7819.95, N = 3SE +/- 8517.71, N = 3SE +/- 4354.56, N = 3SE +/- 30806.75, N = 3SE +/- 375450.50, N = 3SE +/- 469338.36, N = 3SE +/- 2225.50, N = 3SE +/- 430217.30, N = 3SE +/- 192107.40, N = 331757837.2336351871.5316211893.1031987657.0336333556.8337085079.7729888422.6034988792.07132480810.101. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUCore i5 6500Core i5 6600KCore i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H16M32M48M64M80MSE +/- 16180.63, N = 3SE +/- 864.90, N = 3SE +/- 467.88, N = 3SE +/- 16049.95, N = 3SE +/- 19261.23, N = 3SE +/- 21481.69, N = 3SE +/- 1640.04, N = 3SE +/- 439288.05, N = 36852443.007914468.336991142.007893608.837953468.136421850.377523189.0074293202.031. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPUCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H30M60M90M120M150MSE +/- 473.77, N = 3SE +/- 8446.97, N = 3SE +/- 1228.27, N = 3SE +/- 369.28, N = 3SE +/- 414.49, N = 3SE +/- 7061.44, N = 3SE +/- 10634.78, N = 3SE +/- 6393.20, N = 3SE +/- 163827.23, N = 37886706.379149327.203613230.538031118.639180692.039186739.207788605.978686154.40126728903.201. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H3691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 6SE +/- 0.08, N = 3SE +/- 0.01, N = 37.438.276.246.618.3712.239.6710.4911.29-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPCore i5 6500Core i5 6600KCore i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H110220330440550SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 4.72, N = 39.3210.899.6710.9010.888.9610.33526.16-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashCore i5 6500Core i5 6600KCore i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 30.280.320.280.320.320.260.308.68-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H13002600390052006500SE +/- 0.17, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 3.57, N = 3292.46340.71138.13297.39340.69340.71280.07324.366226.82-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H714212835SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.19, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.14, N = 3SE +/- 0.00, N = 321.5125.6021.0724.8626.1828.3919.2324.1712.48-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H918273645SE +/- 0.28, N = 3SE +/- 0.35, N = 3SE +/- 0.25, N = 3SE +/- 0.29, N = 3SE +/- 0.36, N = 3SE +/- 0.32, N = 3SE +/- 0.22, N = 3SE +/- 0.46, N = 3SE +/- 0.00, N = 329.5733.9624.3532.0835.1238.5225.6032.6512.03-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthCore i5 6500Core i5 6600KPentium G4400Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1235L v5Xeon E3-1245 v5Intel Core i7 8750H90180270360450SE +/- 0.35, N = 3SE +/- 0.52, N = 3SE +/- 0.40, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.22, N = 3SE +/- 0.40, N = 3SE +/- 0.06, N = 3SE +/- 1.11, N = 349.8657.6031.9851.5258.1157.8947.6055.52428.00-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-pipe -fexceptions -fstack-protector -malign-data=abi -ftree-vectorize -fopt-info-vec -m64 -march=westmere -mtune=haswell -O3 -mtune=intel -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -lmpicxx -lmpi-std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SmallPT GPU

OpenCL Device: GPU - Scene: Complex

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: ComplexCore i5 6500Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1245 v5Intel Core i7 8750H300M600M900M1200M1500MSE +/- 389.71, N = 3SE +/- 382.78, N = 3SE +/- 335.15, N = 3SE +/- 335.15, N = 3SE +/- 353.05, N = 3SE +/- 17.90, N = 31485650108148549158214854617591485532534148557432015295624331. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Cornell

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: CornellCore i5 6500Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1245 v5Intel Core i7 8750H300M600M900M1200M1500MSE +/- 30.02, N = 3SE +/- 29.16, N = 3SE +/- 26.27, N = 3SE +/- 25.98, N = 3SE +/- 27.42, N = 3SE +/- 22.23, N = 31485650894148549235414854624371485533211148557503315295625491. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SmallPT GPU

OpenCL Device: GPU - Scene: Caustic3

OpenBenchmarking.orgSamples/sec, More Is BetterSmallPT GPU 1.6pts1OpenCL Device: GPU - Scene: Caustic3Core i5 6500Core i3 7100Core i5 7600KCore i7 7700KXeon E3-1245 v5Intel Core i7 8750H300M600M900M1200M1500MSE +/- 19.63, N = 3SE +/- 20.78, N = 3SE +/- 20.21, N = 3SE +/- 20.21, N = 3SE +/- 19.92, N = 3SE +/- 23.09, N = 31485651021148549248314854625591485533333148557515615295626751. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.5