Intel OpenCL Neo vs. Beignet Linux Open-Source OpenCL

Intel OpenCL NEO vs. Beignet OpenCL open-source compute testing on Intel Coffee Lake for a future article on Phoronix.com.

HTML result view exported from: https://openbenchmarking.org/result/1803069-FO-OPENCLNEO41&grw&sor.

System Details

Both configurations (Beignet 1.4 Git and Intel NEO OpenCL 2.1) ran on the same system; only the OpenCL entry differs.

  Processor:          Intel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)
  Motherboard:        ASUS PRIME Z370-A (0607 BIOS)
  Chipset:            Intel Device 3ec2
  Memory:             15360MB
  Disk:               525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GB
  Graphics:           Intel UHD 630 3072MB (1200MHz)
  Audio:              Realtek ALC1220
  Monitor:            DELL P2415Q
  Network:            Intel Connection
  OS:                 Ubuntu 16.04
  Kernel:             4.15.7-041507-generic (x86_64)
  Desktop:            Unity 7.4.5
  Display Driver:     modesetting 1.19.5
  OpenGL:             4.5 Mesa 18.1.0-devel- padoka PPA
  OpenCL:             OpenCL 1.2 beignet 1.4 (git-7e181af) [Beignet 1.4 Git]
                      OpenCL 2.1 [Intel NEO OpenCL 2.1]
  Vulkan:             1.0.61
  Compiler:           GCC 5.4.0 20160609 + LLVM 5.0.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Compiler Details: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details: Scaling Governor: intel_pstate powersave

Security Details: KPTI + __user pointer sanitization + Full generic retpoline Protection

Result Overview

Test                                    Unit         Beignet 1.4 Git  Intel NEO OpenCL 2.1
darktable: Boat - OpenCL                Seconds      21.83            21.56
darktable: Masskrug - OpenCL            Seconds      0.35             0.34
darktable: Server Room - OpenCL         Seconds      0.33             0.33
shoc: OpenCL - Triad                    GB/s         14.73            -
shoc: OpenCL - FFT SP                   GFLOPS       13.62            -
shoc: OpenCL - MD5 Hash                 GHash/s      0.37             -
shoc: OpenCL - Max SP Flops             GFLOPS       466.58           -
shoc: OpenCL - Bus Speed Download       GB/s         19.84            24.65
shoc: OpenCL - Bus Speed Readback       GB/s         38.78            25.77
shoc: OpenCL - Texture Read Bandwidth   GB/s         60.89            -
blender: BMW27 - OpenCL                 Seconds      462.11           424.63
blender: Classroom - OpenCL             Seconds      681.69           676.47
blender: Fishy Cat - OpenCL             Seconds      1004.47          995.62
blender: Barbershop - OpenCL            Seconds      1357.42          1360.23
blender: Pabellon Barcelona - OpenCL    Seconds      1047.57          926.47
cl-mem: Copy                            GB/s         55.23            -
cl-mem: Read                            GB/s         50.42            -
cl-mem: Write                           GB/s         56.65            -
viennacl: OpenCL LU Factorization       GFLOPS       17.58            -
juliagpu: GPU                           Samples/sec  38087166.60      -
luxmark: GPU - Hotel                    Score        554              -
luxmark: GPU - Microphone               Score        56               -
luxmark: GPU - Luxball HDR              Score        2728             -
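Ten of the tests have a result for both drivers, and the raw numbers mix "fewer is better" timings with "more is better" throughput figures. The following is a minimal Python sketch, not part of the original export, that turns those ten pairs into a single signed percentage. The names RESULTS, neo_advantage_percent, and summarize are hypothetical, and the definition of "advantage" (a ratio of rates, inverted for timings) is one reasonable convention among several.

```python
# Results copied from the overview: (Beignet 1.4 Git, Intel NEO OpenCL 2.1, higher_is_better).
# The direction flag follows the "More Is Better" / "Fewer Is Better" chart labels.
RESULTS = {
    "darktable: Boat": (21.83, 21.56, False),
    "darktable: Masskrug": (0.35, 0.34, False),
    "darktable: Server Room": (0.33, 0.33, False),
    "shoc: Bus Speed Download": (19.84, 24.65, True),
    "shoc: Bus Speed Readback": (38.78, 25.77, True),
    "blender: BMW27": (462.11, 424.63, False),
    "blender: Classroom": (681.69, 676.47, False),
    "blender: Fishy Cat": (1004.47, 995.62, False),
    "blender: Barbershop": (1357.42, 1360.23, False),
    "blender: Pabellon Barcelona": (1047.57, 926.47, False),
}


def neo_advantage_percent(beignet, neo, higher_is_better):
    """Percent by which NEO outperforms Beignet (negative means NEO is behind)."""
    if higher_is_better:
        # Throughput: compare the rates directly.
        return (neo / beignet - 1.0) * 100.0
    # Timing: a shorter time is a higher rate, so invert the ratio.
    return (beignet / neo - 1.0) * 100.0


def summarize(results=RESULTS):
    """Map each test name to the NEO advantage, rounded to one decimal place."""
    return {
        name: round(neo_advantage_percent(b, n, hib), 1)
        for name, (b, n, hib) in results.items()
    }


if __name__ == "__main__":
    for name, pct in summarize().items():
        print(f"{name:30s} {pct:+6.1f}%")
```

A positive value means the Intel NEO driver was ahead on that test under this convention; differences of a percent or less should be read against the standard errors reported for each test.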

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 2.0.3 (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 21.56 (SE +/- 0.03, N = 3)
  Beignet 1.4 Git:      21.83 (SE +/- 0.03, N = 3)

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 2.0.3 (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 0.34 (SE +/- 0.00, N = 3)
  Beignet 1.4 Git:      0.35 (SE +/- 0.00, N = 3)

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 2.0.3 (Seconds, Fewer Is Better)
  Beignet 1.4 Git:      0.33 (SE +/- 0.00, N = 3)
  Intel NEO OpenCL 2.1: 0.33 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)
  Beignet 1.4 Git: 14.73 (SE +/- 0.18, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, More Is Better)
  Beignet 1.4 Git: 13.62 (SE +/- 0.00, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GHash/s, More Is Better)
  Beignet 1.4 Git: 0.37 (SE +/- 0.00, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, More Is Better)
  Beignet 1.4 Git: 466.58 (SE +/- 0.02, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)
  Intel NEO OpenCL 2.1: 24.65 (SE +/- 0.09, N = 3)
  Beignet 1.4 Git:      19.84 (SE +/- 0.37, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)
  Beignet 1.4 Git:      38.78 (SE +/- 0.06, N = 3)
  Intel NEO OpenCL 2.1: 25.77 (SE +/- 0.11, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, More Is Better)
  Beignet 1.4 Git: 60.89 (SE +/- 0.03, N = 3)
  (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Blender

Blend File: BMW27 - Compute: OpenCL

Blender 2.79a (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 424.63
  Beignet 1.4 Git:      462.11

Blender

Blend File: Classroom - Compute: OpenCL

Blender 2.79a (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 676.47
  Beignet 1.4 Git:      681.69

Blender

Blend File: Fishy Cat - Compute: OpenCL

Blender 2.79a (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 995.62
  Beignet 1.4 Git:      1004.47

Blender

Blend File: Barbershop - Compute: OpenCL

Blender 2.79a (Seconds, Fewer Is Better)
  Beignet 1.4 Git:      1357.42
  Intel NEO OpenCL 2.1: 1360.23

Blender

Blend File: Pabellon Barcelona - Compute: OpenCL

Blender 2.79a (Seconds, Fewer Is Better)
  Intel NEO OpenCL 2.1: 926.47
  Beignet 1.4 Git:      1047.57
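The five Blender scenes differ widely in render time, so a plain average of seconds would be dominated by Barbershop. One common way to summarize such results is the geometric mean of the per-scene time ratios. The Python sketch below is an illustration added here, not something the export itself computes; BLENDER_SECONDS, geometric_mean, and blender_speedup are hypothetical names.

```python
import math

# Blender 2.79a render times in seconds, copied from the Blender results above:
# scene -> (Beignet 1.4 Git, Intel NEO OpenCL 2.1).
BLENDER_SECONDS = {
    "BMW27": (462.11, 424.63),
    "Classroom": (681.69, 676.47),
    "Fishy Cat": (1004.47, 995.62),
    "Barbershop": (1357.42, 1360.23),
    "Pabellon Barcelona": (1047.57, 926.47),
}


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs at least one value, all positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def blender_speedup(times=BLENDER_SECONDS):
    """Geometric mean of (Beignet time / NEO time); above 1 means NEO rendered faster overall."""
    return geometric_mean(b / n for b, n in times.values())


if __name__ == "__main__":
    print(f"NEO vs. Beignet, Blender geometric-mean speedup: {blender_speedup():.3f}x")
```

Because the Blender results carry no standard error in this export, a ratio close to 1 for an individual scene should not be read as a meaningful win for either driver.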

cl-mem

Benchmark: Copy

cl-mem 2017-01-13 (GB/s, More Is Better)
  Beignet 1.4 Git: 55.23 (SE +/- 4.55, N = 6)
  (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

cl-mem 2017-01-13 (GB/s, More Is Better)
  Beignet 1.4 Git: 50.42 (SE +/- 3.37, N = 6)
  (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

cl-mem 2017-01-13 (GB/s, More Is Better)
  Beignet 1.4 Git: 56.65 (SE +/- 4.52, N = 6)
  (CC) gcc options: -O2 -flto -lOpenCL
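The three cl-mem results are the only ones in this export with N = 6 and standard errors of several GB/s, which suggests noticeably noisier runs than the other tests. The Python sketch below expresses each standard error as a percentage of its mean and backs out an implied run-to-run standard deviation. It assumes the textbook definition SE = standard deviation / sqrt(N); whether the Phoronix Test Suite uses exactly that formula is an assumption here, and CL_MEM and both function names are hypothetical.

```python
import math

# cl-mem results for Beignet 1.4 Git, copied from above: benchmark -> (mean GB/s, SE, N).
CL_MEM = {
    "Copy": (55.23, 4.55, 6),
    "Read": (50.42, 3.37, 6),
    "Write": (56.65, 4.52, 6),
}


def relative_standard_error(mean, se):
    """Standard error as a percentage of the mean."""
    return se / mean * 100.0


def implied_std_dev(se, n):
    """Run-to-run standard deviation, ASSUMING SE = std_dev / sqrt(N)."""
    return se * math.sqrt(n)


if __name__ == "__main__":
    for name, (mean, se, n) in CL_MEM.items():
        rse = relative_standard_error(mean, se)
        sd = implied_std_dev(se, n)
        print(f"cl-mem {name:5s} relative SE {rse:4.1f}%  implied std dev {sd:5.2f} GB/s")
```

Under that assumption, a relative standard error of several percent means small differences in these bandwidth figures would be hard to distinguish from run-to-run variation.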

ViennaCL

OpenCL LU Factorization

ViennaCL 1.4.2 (GFLOPS, More Is Better)
  Beignet 1.4 Git: 17.58 (SE +/- 0.15, N = 3)
  (CXX) g++ options: -rdynamic -lOpenCL

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1 (Samples/sec, More Is Better)
  Beignet 1.4 Git: 38087166.60 (SE +/- 41486.36, N = 3)
  (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.1 (Score, More Is Better)
  Beignet 1.4 Git: 554 (SE +/- 1.67, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.1 (Score, More Is Better)
  Beignet 1.4 Git: 56

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.1 (Score, More Is Better)
  Beignet 1.4 Git: 2728 (SE +/- 5.00, N = 3)


Phoronix Test Suite v10.8.5