Intel OpenCL Linux Skylake Beignet Compute

Intel Xeon E3-1246 v3 testing with a TYAN S5535-HE and eVGA NVIDIA GeForce GTX 980 Ti 6144MB on Scientific 7.2 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1606129-HA-1602221GA56

Result Identifier / Date Run
  Intel Core i5-6600K: February 21 2016
  Roxen-980TI: June 12 2016

Intel OpenCL Linux Skylake Beignet Compute

Intel Core i5-6600K
  Processor: Intel Core i5-6600K @ 3.90GHz (4 Cores)
  Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0
  Chipset: Intel Sky Lake
  Memory: 15360MB
  Disk: 256GB TS256GSSD370S
  Graphics: Intel Sky Lake (1150MHz)
  Audio: Realtek ALC1150
  Monitor: DELL P2415Q
  Network: Intel Connection
  OS: Ubuntu 15.10
  Kernel: 4.5.0-999-generic (x86_64) 20160218
  Desktop: Unity
  Display Server: X Server 1.17.2
  Display Driver: intel 2.99.917
  OpenGL: 3.3 Mesa 11.2.0-devel (padoka PPA)
  OpenCL: OpenCL 1.2 beignet 1.2
  Compiler: GCC 5.2.1 20151010 + Clang 3.7.1-1ubuntu3~gd~w + LLVM 3.7.1
  File-System: ext4
  Screen Resolution: 3840x2160

Roxen-980TI
  Processor: Intel Xeon E3-1246 v3 @ 3.90GHz (8 Cores)
  Motherboard: TYAN S5535-HE
  Chipset: Intel Xeon E3-1200 v3 DRAM
  Memory: 16384MB
  Disk: 160GB INTEL SSDSA2M160
  Graphics: eVGA NVIDIA GeForce GTX 980 Ti 6144MB (1101/3505MHz)
  Audio: Realtek ALC892
  Network: Intel I210 Gigabit Connection
  OS: Scientific 7.2
  Kernel: 3.10.0-327.18.2.el7.x86_64 (x86_64)
  Desktop: GNOME Shell 3.14.4
  Display Driver: NVIDIA 361.45.11
  OpenGL: 4.4.0
  Compiler: GCC 4.8.5 20150623 + CUDA 7.5
  File-System: xfs
  Screen Resolution: 1920x1080

Compiler Details
- Intel Core i5-6600K: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- Roxen-980TI: --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=generic

Processor Details
- Scaling Governor: intel_pstate powersave

OpenCL Details
- Roxen-980TI: GPU Compute Cores: 2816

System Details
- Roxen-980TI: GPU Compute Cores: 2816. SELinux: Enabled.

Intel Core i5-6600K vs. Roxen-980TI Comparison (Phoronix Test Suite)

Percentage difference from baseline, per test:
  LuxMark, GPU - Hotel: 777.8%
  MandelbulbGPU, GPU: 762.4%
  clpeak, Global Memory Bandwidth: 739.2%
  LuxMark, GPU - Luxball HDR: 584.5%
  SHOC Scalable HeterOgeneous Computing, OpenCL - Texture Read Bandwidth: 572.5%
  clpeak, Kernel Latency: 496.2%
  JuliaGPU, GPU: 333.9%
  SmallPT GPU, GPU - Caustic3: 303354947400%
  SHOC Scalable HeterOgeneous Computing, OpenCL - Max SP Flops: 2914.1%
  SHOC Scalable HeterOgeneous Computing, OpenCL - MD5 Hash: 2325.7%
  SmallPT GPU, GPU - Cornell: 214132887547.1%
  SHOC Scalable HeterOgeneous Computing, OpenCL - Bus Speed Readback: 171%
  clpeak, Transfer Bandwidth enqueueWriteBuffer: 146.2%
  clpeak, Transfer Bandwidth enqueueReadBuffer: 108.5%
  SHOC Scalable HeterOgeneous Computing, OpenCL - Bus Speed Download: 103.2%
  SHOC Scalable HeterOgeneous Computing, OpenCL - FFT SP: 1818.9%
  clpeak, Single-Precision Float: 1490.8%
  MandelGPU, GPU: 1021.8%
  SHOC Scalable HeterOgeneous Computing, OpenCL - Triad: 9.5%

Intel OpenCL Linux Skylake Beignet Compute

Test                                           Intel Core i5-6600K  Roxen-980TI
luxmark: GPU - Hotel                           316                  2774
luxmark: GPU - Luxball HDR                     2407                 16475
shoc: OpenCL - Triad                           13.11                11.97
shoc: OpenCL - FFT SP                          11.35                217.79
shoc: OpenCL - MD5 Hash                        0.35                 8.49
shoc: OpenCL - Max SP Flops                    229.23               6909.23
shoc: OpenCL - Bus Speed Download              25.81                12.70
shoc: OpenCL - Bus Speed Readback              34.61                12.77
shoc: OpenCL - Texture Read Bandwidth          57.52                386.84
juliagpu: GPU                                  31024743.43          134626226.47
smallpt-gpu: GPU - Cornell                     1456103636           0.68
smallpt-gpu: GPU - Caustic3                    1456103748           0.48
clpeak: Kernel Latency                         23.73                3.98
clpeak: Single-Precision Float                 389.38               6194.20
clpeak: Global Memory Bandwidth                31.56                264.86
clpeak: Transfer Bandwidth enqueueReadBuffer   14.05                6.74
clpeak: Transfer Bandwidth enqueueWriteBuffer  27.01                10.97
mandelbulbgpu: GPU                             8529787.50           73560642.00
mandelgpu: GPU                                 12412737.87          139246349.93
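
The percentages in the comparison chart appear to be the lead of the better result over the other one, i.e. (better / worse - 1) x 100. The following is a minimal sketch of that arithmetic, not the Phoronix Test Suite's own code: the `compare` helper and its `higher_is_better` parameter are hypothetical names introduced here, and the input values are copied from the result table above.

```python
def compare(i5, roxen, higher_is_better=True):
    """Return (leading run, percent lead of the better result over the other).

    Hypothetical helper for illustration; not part of the Phoronix Test Suite.
    """
    if higher_is_better:
        roxen_leads = roxen > i5
    else:
        roxen_leads = roxen < i5
    # The ratio of the larger to the smaller value is the same in both cases;
    # only the interpretation of which run leads depends on the direction.
    ratio = max(i5, roxen) / min(i5, roxen)
    leader = "Roxen-980TI" if roxen_leads else "Intel Core i5-6600K"
    return leader, round((ratio - 1.0) * 100.0, 1)


# LuxMark Hotel (score, more is better)
print(compare(316, 2774))
# clpeak Kernel Latency (us, fewer is better)
print(compare(23.73, 3.98, higher_is_better=False))
# SHOC Triad (GB/s, more is better)
print(compare(13.11, 11.97))
```

Under this assumption the three calls give 777.8%, 496.2% and 9.5%, which are the figures the chart shows for those tests.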

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.0, OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)
  Intel Core i5-6600K: 316 (SE +/- 0.00, N = 3; Min: 316 / Avg: 316 / Max: 316)
  Roxen-980TI: 2774 (SE +/- 23.80, N = 3; Min: 2727 / Avg: 2774 / Max: 2804)

LuxMark 3.0, OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
  Intel Core i5-6600K: 2407 (SE +/- 6.33, N = 3; Min: 2394 / Avg: 2406.67 / Max: 2413)
  Roxen-980TI: 16475 (SE +/- 43.51, N = 3; Min: 16422 / Avg: 16474.67 / Max: 16561)
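
The SE figures can be cross-checked against the reported Min / Avg / Max. With N = 3 the middle run is fixed by the other three numbers (3 x Avg - Min - Max). The sketch below assumes that SE means the sample standard deviation divided by sqrt(N); that definition is an assumption here, though it does reproduce the 23.80 reported for Roxen-980TI in the Hotel scene. The function names are hypothetical.

```python
import math
import statistics


def recover_samples(minimum, average, maximum):
    """For N = 3 runs, rebuild the three samples from min, mean and max."""
    middle = 3 * average - minimum - maximum
    return [minimum, middle, maximum]


def standard_error(samples):
    """Sample standard deviation divided by sqrt(N) (assumed definition of SE)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# LuxMark Hotel, Roxen-980TI: Min 2727 / Avg 2774 / Max 2804
hotel_roxen = recover_samples(2727, 2774, 2804)
hotel_se = standard_error(hotel_roxen)
print(hotel_roxen, round(hotel_se, 2))
```

Because the published averages are rounded to two decimals, the recovered middle sample is only approximate for results such as Luxball HDR (Avg: 2406.67).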

SHOC Scalable HeterOgeneous Computing

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)
  Intel Core i5-6600K: 13.11 (SE +/- 0.25, N = 3; Min: 12.63 / Avg: 13.11 / Max: 13.47)
  Roxen-980TI: 11.97 (SE +/- 0.01, N = 3; Min: 11.96 / Avg: 11.97 / Max: 11.99)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)
  Intel Core i5-6600K: 11.35 (SE +/- 0.02, N = 3; Min: 11.32 / Avg: 11.35 / Max: 11.39)
  Roxen-980TI: 217.79 (SE +/- 0.17, N = 3; Min: 217.62 / Avg: 217.79 / Max: 218.13)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)
  Intel Core i5-6600K: 0.35 (SE +/- 0.00, N = 3; Min: 0.35 / Avg: 0.35 / Max: 0.35)
  Roxen-980TI: 8.49 (SE +/- 0.00, N = 3; Min: 8.48 / Avg: 8.49 / Max: 8.49)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
  Intel Core i5-6600K: 229.23 (SE +/- 0.01, N = 3; Min: 229.22 / Avg: 229.23 / Max: 229.24)
  Roxen-980TI: 6909.23 (SE +/- 12.83, N = 3; Min: 6896.37 / Avg: 6909.23 / Max: 6934.88)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Download (GB/s, More Is Better)
  Intel Core i5-6600K: 25.81 (SE +/- 0.23, N = 3; Min: 25.4 / Avg: 25.81 / Max: 26.21)
  Roxen-980TI: 12.70 (SE +/- 0.00, N = 3; Min: 12.69 / Avg: 12.7 / Max: 12.7)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, More Is Better)
  Intel Core i5-6600K: 34.61 (SE +/- 0.03, N = 3; Min: 34.58 / Avg: 34.61 / Max: 34.68)
  Roxen-980TI: 12.77 (SE +/- 0.00, N = 3; Min: 12.77 / Avg: 12.77 / Max: 12.77)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)
  Intel Core i5-6600K: 57.52 (SE +/- 0.35, N = 3; Min: 56.82 / Avg: 57.52 / Max: 57.88)
  Roxen-980TI: 386.84 (SE +/- 1.15, N = 3; Min: 385.66 / Avg: 386.84 / Max: 389.14)

1. (CXX) g++ options (all SHOC tests): -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

JuliaGPU

JuliaGPU 1.2pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
  Intel Core i5-6600K: 31024743.43 (SE +/- 362390.04, N = 3; Min: 30519855.8 / Avg: 31024743.43 / Max: 31727516)
  Roxen-980TI: 134626226.47 (SE +/- 748204.06, N = 3; Min: 133860741.3 / Avg: 134626226.47 / Max: 136122500.5)

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

SmallPT GPU

SmallPT GPU is an OpenCL benchmark, run here with various PTS changes relative to upstream; multiple rendering scenes are available. Learn more via the OpenBenchmarking.org test page.

SmallPT GPU 1.6pts1, OpenCL Device: GPU - Scene: Cornell (Samples/sec, More Is Better)
  Intel Core i5-6600K: 1456103636.00 (SE +/- 19.92, N = 3; Min: 1456103601 / Avg: 1456103635.67 / Max: 1456103670)
  Roxen-980TI: 0.68 (SE +/- 0.12, N = 6; Min: 0.3 / Avg: 0.68 / Max: 1)

SmallPT GPU 1.6pts1, OpenCL Device: GPU - Scene: Caustic3 (Samples/sec, More Is Better)
  Intel Core i5-6600K: 1456103748.00 (SE +/- 20.78, N = 3; Min: 1456103712 / Avg: 1456103748 / Max: 1456103784)
  Roxen-980TI: 0.48 (SE +/- 0.13, N = 6; Min: 0.2 / Avg: 0.48 / Max: 1)

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Kernel Latency (us, Fewer Is Better)
  Intel Core i5-6600K: 23.73 (SE +/- 0.11, N = 3; Min: 23.53 / Avg: 23.73 / Max: 23.89)
  Roxen-980TI: 3.98 (SE +/- 0.03, N = 3; Min: 3.93 / Avg: 3.98 / Max: 4.03)

clpeak, OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
  Intel Core i5-6600K: 389.38 (SE +/- 0.23, N = 3; Min: 389.03 / Avg: 389.38 / Max: 389.8)
  Roxen-980TI: 6194.20 (SE +/- 14.15, N = 3; Min: 6167.5 / Avg: 6194.2 / Max: 6215.69)

clpeak, OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
  Intel Core i5-6600K: 31.56 (SE +/- 0.29, N = 3; Min: 31.1 / Avg: 31.56 / Max: 32.09)
  Roxen-980TI: 264.86 (SE +/- 0.41, N = 3; Min: 264.45 / Avg: 264.86 / Max: 265.68)

clpeak, OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, More Is Better)
  Intel Core i5-6600K: 14.05 (SE +/- 0.16, N = 3; Min: 13.8 / Avg: 14.05 / Max: 14.34)
  Roxen-980TI: 6.74 (SE +/- 0.02, N = 3; Min: 6.71 / Avg: 6.74 / Max: 6.76)

clpeak, OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
  Intel Core i5-6600K: 27.01 (SE +/- 0.01, N = 3; Min: 27 / Avg: 27.01 / Max: 27.02)
  Roxen-980TI: 10.97 (SE +/- 0.01, N = 3; Min: 10.96 / Avg: 10.97 / Max: 10.98)

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

MandelbulbGPU 1.0pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
  Intel Core i5-6600K: 8529787.50 (SE +/- 4141.79, N = 3; Min: 8521528.9 / Avg: 8529787.5 / Max: 8534473.4)
  Roxen-980TI: 73560642.00 (SE +/- 212640.59, N = 3; Min: 73136824.1 / Avg: 73560642 / Max: 73803077.3)

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelGPU

MandelGPU is an OpenCL benchmark; this test runs the OpenCL float4 rendering kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.
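
To illustrate what an iteration cap of 4096 means, here is a minimal CPU-side sketch of the standard escape-time Mandelbrot iteration. It is not the benchmark's OpenCL float4 kernel, which is not shown in this result file; the function name and the escape radius of 2 are assumptions taken from the textbook formulation.

```python
MAX_ITERATIONS = 4096  # cap quoted in the test description


def escape_iterations(cx, cy, max_iterations=MAX_ITERATIONS):
    """Count iterations of z = z*z + c (starting at z = 0) before |z| exceeds 2.

    Returns max_iterations when the point never escapes within the cap.
    """
    x = y = 0.0
    for i in range(max_iterations):
        if x * x + y * y > 4.0:  # |z|^2 > 2^2
            return i
        x, y = x * x - y * y + cx, 2.0 * x * y + cy
    return max_iterations


print(escape_iterations(0.0, 0.0))  # the origin never escapes, so this hits the cap
print(escape_iterations(2.0, 2.0))  # a point far outside the set escapes almost at once
```

Points inside the set cost the full 4096 iterations each, which is why the cap largely determines how much work a frame requires.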

MandelGPU 1.3pts1, OpenCL Device: GPU (Samples/sec, More Is Better)
  Intel Core i5-6600K: 12412737.87 (SE +/- 3785.85, N = 3; Min: 12408269.1 / Avg: 12412737.87 / Max: 12420265.7)
  Roxen-980TI: 139246349.93 (SE +/- 125318.81, N = 3; Min: 139034392.2 / Avg: 139246349.93 / Max: 139468172.9)

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL