Coffeelake Beignet vs. OpenCL NEO Intel

Intel Core i7-8700K testing of OpenCL Linux drivers by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1905107-HV-COFFEELAK00
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

CPU Massive 5 Tests
Creator Workloads 2 Tests
HPC - High Performance Computing 4 Tests
Machine Learning 2 Tests
Multi-Core 3 Tests
NVIDIA GPU Compute 6 Tests
OpenCL 8 Tests
OpenMPI Tests 2 Tests
Server CPU Tests 2 Tests
Common Workstation Benchmarks 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Beignet Git
May 10 2019
  3 Hours, 24 Minutes
Intel OpenCL NEO
May 10 2019
  1 Hour, 57 Minutes
Invert Hiding All Results Option
  2 Hours, 41 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Coffeelake Beignet vs. OpenCL NEO IntelProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionBeignet GitIntel OpenCL NEOIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS TUF Z370-PLUS GAMING (1802 BIOS)Intel 8th Gen Core16384MB128GB THNSN5128GPU7 TOSHIBAinteldrmfb (1200MHz)Realtek ALC887-VDDELL P2415QIntel I219-VUbuntu 18.044.18.0-18-generic (x86_64)GNOME Shell 3.28.3X Server 1.20.1modesetting 1.20.14.5 Mesa 18.2.8OpenCL 2.0 beignet 1.4 (git-fc5f430c)GCC 7.4.0 + Clang 6.0.0-1ubuntu2 + LLVM 6.0.0ext43840x2160Intel UHD 630 3GB (1200MHz)OpenCL 2.1OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersavePython Details- Python 2.7.15rc1 + Python 3.6.7Security Details- KPTI + __user pointer sanitization + Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + SSB disabled via prctl and seccomp + PTE Inversion

Beignet Git vs. Intel OpenCL NEO ComparisonPhoronix Test SuiteBaseline+932.8%+932.8%+1865.6%+1865.6%+2798.4%+2798.4%+3731.2%+3731.2%751.3%3731.3%284%216.1%149.2%102.5%51.2%44.1%13.6%10.5%6%5.4%4.4%S.P.FGPU - HotelOpenCL HeartwallOpenCL - Max SP FlopsO.L.FOpenCLOpenCL - TriadOpenCL - Bus Speed DownloadOpenCL - Bus Speed Readback33.5%OpenCL LBMBMW27 - OpenCL11.6%OpenCL - FFT SPKernel Latency9.8%OpenCL BFS8.3%G.M.BServer Rack - OpenCL5.6%OpenCL - MD5 HashOpenCL - T.R.B4.6%Fishy Cat - OpenCLclpeakLuxMarkRodiniaSHOC Scalable HeterOgeneous ComputingViennaCLLeelaChessZeroSHOC Scalable HeterOgeneous ComputingSHOC Scalable HeterOgeneous ComputingSHOC Scalable HeterOgeneous ComputingParboilBlenderSHOC Scalable HeterOgeneous ComputingclpeakParboilclpeakDarktableSHOC Scalable HeterOgeneous ComputingSHOC Scalable HeterOgeneous ComputingBlenderBeignet GitIntel OpenCL NEO

Coffeelake Beignet vs. OpenCL NEO Intellczero: OpenCLluxmark: GPU - Hotelblender: Barbershop - OpenCLblender: Fishy Cat - OpenCLblender: Pabellon Barcelona - OpenCLblender: Classroom - OpenCLblender: BMW27 - OpenCLclpeak: Single-Precision Floatclpeak: Global Memory Bandwidthshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthparboil: OpenCL LBMrodinia: OpenCL Heartwallshoc: OpenCL - MD5 Hashxsbench-cl: darktable: Boat - OpenCLviennacl: OpenCL LU Factorizationcomd-cl: Average Atom Update Rateclpeak: Transfer Bandwidth enqueueReadBuffershoc: OpenCL - Triaddarktable: Masskrug - OpenCLclpeak: Transfer Bandwidth enqueueWriteBufferdarktable: Server Room - OpenCLshoc: OpenCL - FFT SPclpeak: Kernel Latencyparboil: OpenCL BFSshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloaddarktable: Server Rack - OpenCLBeignet GitIntel OpenCL NEO52.21161355101492967641353.8831.4146759.7478.5635.170.371145550313.878.522.5815.429.206.4840.824.7515.9821.700.9639.0920.480.18105.736131372971926678461458.6833.28147657.0969.159.160.3913.8721.232.5815.3413.916.4841.314.7817.6523.831.0429.2729.510.19OpenBenchmarking.org

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLBeignet GitIntel OpenCL NEO20406080100SE +/- 3.79, N = 6SE +/- 0.42, N = 352.21105.731. (CXX) g++ options: -lpthread -lOpenCL -lz
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.20.1Backend: OpenCLBeignet GitIntel OpenCL NEO20406080100Min: 42.05 / Avg: 52.21 / Max: 68.3Min: 105.03 / Avg: 105.73 / Max: 106.481. (CXX) g++ options: -lpthread -lOpenCL -lz

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelBeignet GitIntel OpenCL NEO130260390520650SE +/- 13.09, N = 10SE +/- 6.08, N = 316613
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelBeignet GitIntel OpenCL NEO110220330440550Min: 2 / Avg: 16.2 / Max: 134Min: 603 / Avg: 613 / Max: 624

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: OpenCLBeignet GitIntel OpenCL NEO3006009001200150013551372

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Fishy Cat - Compute: OpenCLBeignet GitIntel OpenCL NEO20040060080010001014971

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Pabellon Barcelona - Compute: OpenCLBeignet GitIntel OpenCL NEO2004006008001000929926

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Classroom - Compute: OpenCLBeignet GitIntel OpenCL NEO150300450600750676678

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: BMW27 - Compute: OpenCLBeignet GitIntel OpenCL NEO100200300400500413461

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatBeignet GitIntel OpenCL NEO100200300400500SE +/- 5.07, N = 12SE +/- 0.07, N = 353.88458.681. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatBeignet GitIntel OpenCL NEO80160240320400Min: 30.23 / Avg: 53.88 / Max: 100.38Min: 458.6 / Avg: 458.68 / Max: 458.811. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthBeignet GitIntel OpenCL NEO816243240SE +/- 2.22, N = 12SE +/- 0.09, N = 331.4133.281. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthBeignet GitIntel OpenCL NEO714212835Min: 7.05 / Avg: 31.41 / Max: 35.41Min: 33.16 / Avg: 33.28 / Max: 33.451. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsBeignet GitIntel OpenCL NEO30060090012001500SE +/- 0.01, N = 3SE +/- 16.73, N = 1546714761. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsBeignet GitIntel OpenCL NEO30060090012001500Min: 467.21 / Avg: 467.22 / Max: 467.23Min: 1367.24 / Avg: 1475.63 / Max: 1629.061. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthBeignet GitIntel OpenCL NEO1326395265SE +/- 0.30, N = 3SE +/- 0.00, N = 359.7457.091. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthBeignet GitIntel OpenCL NEO1224364860Min: 59.21 / Avg: 59.74 / Max: 60.25Min: 57.09 / Avg: 57.09 / Max: 57.11. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBMBeignet GitIntel OpenCL NEO20406080100SE +/- 0.03, N = 3SE +/- 0.41, N = 378.5669.151. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBMBeignet GitIntel OpenCL NEO1530456075Min: 78.5 / Avg: 78.56 / Max: 78.6Min: 68.33 / Avg: 69.15 / Max: 69.561. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallBeignet GitIntel OpenCL NEO816243240SE +/- 2.44, N = 12SE +/- 0.04, N = 335.179.161. (CXX) g++ options: -O2 -lOpenCL
OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL HeartwallBeignet GitIntel OpenCL NEO816243240Min: 12.52 / Avg: 35.17 / Max: 42.99Min: 9.09 / Avg: 9.16 / Max: 9.21. (CXX) g++ options: -O2 -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashBeignet GitIntel OpenCL NEO0.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.391. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashBeignet GitIntel OpenCL NEO12345Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.39 / Avg: 0.39 / Max: 0.391. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Xsbench OpenCL

Xsbench benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLookups/s, More Is BetterXsbench OpenCL 2017-07-06Beignet Git2M4M6M8M10MSE +/- 2043.29, N = 3114555031. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLBeignet GitIntel OpenCL NEO48121620SE +/- 0.03, N = 3SE +/- 0.05, N = 313.8713.87
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLBeignet GitIntel OpenCL NEO48121620Min: 13.8 / Avg: 13.87 / Max: 13.9Min: 13.77 / Avg: 13.87 / Max: 13.93

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationBeignet GitIntel OpenCL NEO510152025SE +/- 0.42, N = 12SE +/- 0.03, N = 38.5221.231. (CXX) g++ options: -rdynamic -lOpenCL
OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationBeignet GitIntel OpenCL NEO510152025Min: 7.43 / Avg: 8.52 / Max: 12.56Min: 21.19 / Avg: 21.23 / Max: 21.281. (CXX) g++ options: -rdynamic -lOpenCL

CoMD OpenCL

CoMD benchmark in OpenCL via GPUOpen. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus/atom/task, More Is BetterCoMD OpenCL 2017-07-06Average Atom Update RateBeignet GitIntel OpenCL NEO0.58051.1611.74152.3222.9025SE +/- 0.01, N = 3SE +/- 0.00, N = 32.582.581. (CC) gcc options: -std=c99 -O5 -lm -lOpenCL
OpenBenchmarking.orgus/atom/task, More Is BetterCoMD OpenCL 2017-07-06Average Atom Update RateBeignet GitIntel OpenCL NEO246810Min: 2.57 / Avg: 2.58 / Max: 2.59Min: 2.57 / Avg: 2.58 / Max: 2.581. (CC) gcc options: -std=c99 -O5 -lm -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferBeignet GitIntel OpenCL NEO48121620SE +/- 0.02, N = 3SE +/- 0.19, N = 515.4215.341. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferBeignet GitIntel OpenCL NEO48121620Min: 15.39 / Avg: 15.42 / Max: 15.46Min: 14.62 / Avg: 15.34 / Max: 15.641. (CXX) g++ options: -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadBeignet GitIntel OpenCL NEO48121620SE +/- 0.42, N = 15SE +/- 0.06, N = 39.2013.911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadBeignet GitIntel OpenCL NEO48121620Min: 6.95 / Avg: 9.2 / Max: 10.75Min: 13.79 / Avg: 13.91 / Max: 13.971. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLBeignet GitIntel OpenCL NEO246810SE +/- 0.01, N = 3SE +/- 0.01, N = 36.486.48
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLBeignet GitIntel OpenCL NEO3691215Min: 6.46 / Avg: 6.48 / Max: 6.5Min: 6.45 / Avg: 6.48 / Max: 6.5

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferBeignet GitIntel OpenCL NEO918273645SE +/- 0.15, N = 3SE +/- 0.09, N = 340.8241.311. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferBeignet GitIntel OpenCL NEO918273645Min: 40.51 / Avg: 40.82 / Max: 40.98Min: 41.17 / Avg: 41.31 / Max: 41.471. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLBeignet GitIntel OpenCL NEO1.07552.1513.22654.3025.3775SE +/- 0.01, N = 3SE +/- 0.01, N = 34.754.78
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLBeignet GitIntel OpenCL NEO246810Min: 4.75 / Avg: 4.75 / Max: 4.77Min: 4.76 / Avg: 4.78 / Max: 4.79

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPBeignet GitIntel OpenCL NEO48121620SE +/- 0.00, N = 3SE +/- 0.03, N = 315.9817.651. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPBeignet GitIntel OpenCL NEO48121620Min: 15.98 / Avg: 15.98 / Max: 15.98Min: 17.59 / Avg: 17.65 / Max: 17.681. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyBeignet GitIntel OpenCL NEO612182430SE +/- 0.26, N = 3SE +/- 0.10, N = 321.7023.831. (CXX) g++ options: -O3 -rdynamic -lOpenCL
OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyBeignet GitIntel OpenCL NEO612182430Min: 21.19 / Avg: 21.7 / Max: 21.98Min: 23.66 / Avg: 23.83 / Max: 24.021. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFSBeignet GitIntel OpenCL NEO0.2340.4680.7020.9361.17SE +/- 0.00, N = 3SE +/- 0.01, N = 30.961.041. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFSBeignet GitIntel OpenCL NEO246810Min: 0.96 / Avg: 0.96 / Max: 0.97Min: 1.02 / Avg: 1.04 / Max: 1.061. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackBeignet GitIntel OpenCL NEO918273645SE +/- 0.47, N = 6SE +/- 0.43, N = 439.0929.271. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackBeignet GitIntel OpenCL NEO816243240Min: 37.77 / Avg: 39.09 / Max: 40.39Min: 28.07 / Avg: 29.27 / Max: 30.111. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadBeignet GitIntel OpenCL NEO714212835SE +/- 0.09, N = 3SE +/- 0.15, N = 320.4829.511. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadBeignet GitIntel OpenCL NEO714212835Min: 20.34 / Avg: 20.48 / Max: 20.66Min: 29.31 / Avg: 29.51 / Max: 29.811. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLBeignet GitIntel OpenCL NEO0.04280.08560.12840.17120.214SE +/- 0.00, N = 3SE +/- 0.00, N = 30.180.19
OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLBeignet GitIntel OpenCL NEO12345Min: 0.18 / Avg: 0.18 / Max: 0.19Min: 0.19 / Avg: 0.19 / Max: 0.19