OpenCL October 2016

NVIDIA binary blob and AMDGPU-PRO Radeon tests with different OpenCL workloads.

HTML result view exported from: https://openbenchmarking.org/result/1610096-LO-OPENCLOCT49&obr_sor=y&obr_rro=y.

System configuration (as listed for the first run, Radeon RX 460; later runs list only the fields that differ)

  Processor:          Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
  Motherboard:        MSI C236A WORKSTATION (MS-7998) v1.0
  Chipset:            Intel Sky Lake
  Memory:             16384MB
  Disk:               256GB INTEL SSDPEKKW256G7
  Graphics:           AMD Radeon RX 460 2048MB
  Audio:              Realtek ALC1150
  Network:            Intel Connection
  OS:                 Ubuntu 16.04
  Kernel:             4.4.0-38-generic (x86_64)
  Desktop:            Unity 7.4.0
  Display Server:     X Server 1.18.4
  OpenGL:             4.5.13448
  OpenCL:             OpenCL 2.0 AMD-APP (2117.7)
  Vulkan:             1.0.8
  Compiler:           GCC 5.4.0 20160609 + LLVM 3.8.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Fields that differ per run

  Radeon RX 480:       Graphics: AMD Radeon RX 480 8192MB
  Radeon R9 Fury:      Graphics: Sapphire AMD Radeon R9 Fury 4096MB
  GeForce GTX 780 Ti:  Graphics: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz); Display Driver: NVIDIA 370.28; OpenGL: 4.5.0
  GeForce GTX 950:     Graphics: eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)
  GeForce GTX 960:     Graphics: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
  GeForce GTX 970:     Graphics: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
  GeForce GTX 980 Ti:  Graphics: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
  GeForce GTX 1060:    Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)
  GeForce GTX 1070:    Graphics: NVIDIA GeForce GTX 1070 8192MB (1502/4006MHz)
  GeForce GTX 1080:    Graphics: NVIDIA GeForce GTX 1080 8192MB (1619/5005MHz)

Compiler Details
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  Scaling Governor: intel_pstate performance

Graphics Details
  Radeon RX 460, Radeon RX 480, Radeon R9 Fury: GLAMOR

OpenCL Details
  GeForce GTX 780 Ti:  GPU Compute Cores: 2880
  GeForce GTX 950:     GPU Compute Cores: 768
  GeForce GTX 960:     GPU Compute Cores: 1024
  GeForce GTX 970:     GPU Compute Cores: 1664
  GeForce GTX 980 Ti:  GPU Compute Cores: 2816
  GeForce GTX 1060:    GPU Compute Cores: 1280
  GeForce GTX 1070:    GPU Compute Cores: 1920
  GeForce GTX 1080:    GPU Compute Cores: 2560

Results overview (more is better in every column; "-" means no result was recorded)

  Columns:
    Triad     shoc: OpenCL - Triad (GB/s)
    FFT SP    shoc: OpenCL - FFT SP (GFLOPS)
    MD5       shoc: OpenCL - MD5 Hash (GHash/s)
    Max SP    shoc: OpenCL - Max SP Flops (GFLOPS)
    Download  shoc: OpenCL - Bus Speed Download (GB/s)
    Readback  shoc: OpenCL - Bus Speed Readback (GB/s)
    Texture   shoc: OpenCL - Texture Read Bandwidth (GB/s)
    Mic       luxmark: GPU - Microphone (Score)
    Luxball   luxmark: GPU - Luxball HDR (Score)

  GPU                  Triad   FFT SP    MD5    Max SP  Download  Readback  Texture    Mic  Luxball
  Radeon RX 460         3.09    29.93   2.42   2152.88      6.84      7.15    77.45   2934     5595
  Radeon RX 480         6.58    25.72   5.23   5804.77     12.96     12.95   161.85   7793    14174
  Radeon R9 Fury        5.95    24.60   5.78   7116.37     12.94     12.27   219.66   8715    19468
  GeForce GTX 780 Ti   11.53   157.09   3.92   4989.32     12.48     13.22   286.76      -        -
  GeForce GTX 950      10.97   119.93   2.67   2216.79     12.52     13.21   256.45   3003     5951
  GeForce GTX 960      10.78   128.17   3.82   2941.37     12.45     13.20   279.57   3054     6147
  GeForce GTX 970      11.43   214.68   5.41   4332.40     12.50     13.21   293.92   5587    10918
  GeForce GTX 980 Ti   11.84   205.41   7.69   6177.12     12.49     13.21   351.56   7989    15072
  GeForce GTX 1060     11.45   224.15   5.58   4810.04     12.51     13.23   393.53   5206    11620
  GeForce GTX 1070     11.76   307.95   8.29   7108.07     12.52     13.22   450.60   7220    16206
  GeForce GTX 1080     11.83   343.26  11.66   9380.30     12.47     13.24   524.30   6287    12774

OpenBenchmarking.org
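To make the overview figures easier to compare, they can be normalized against one card. The sketch below is only an illustration: the Max SP Flops numbers are copied from the overview above, while the helper name `relative_to` and the choice of the Radeon RX 460 as baseline are assumptions made for this example, not part of the Phoronix Test Suite.

```python
# Max SP Flops results (GFLOPS) copied from the results overview.
max_sp_flops = {
    "Radeon RX 460": 2152.88,
    "Radeon RX 480": 5804.77,
    "Radeon R9 Fury": 7116.37,
    "GeForce GTX 780 Ti": 4989.32,
    "GeForce GTX 950": 2216.79,
    "GeForce GTX 960": 2941.37,
    "GeForce GTX 970": 4332.40,
    "GeForce GTX 980 Ti": 6177.12,
    "GeForce GTX 1060": 4810.04,
    "GeForce GTX 1070": 7108.07,
    "GeForce GTX 1080": 9380.30,
}


def relative_to(results, baseline):
    """Return each result divided by the baseline card's result.

    Hypothetical helper for illustration only.
    """
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


# Print the cards from slowest to fastest relative to the chosen baseline.
ratios = relative_to(max_sp_flops, "Radeon RX 460")
for name, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{name:<20} {ratio:5.2f}x")
```

The same helper works for any other column of the overview as long as higher values are better, which is the case for every test in this result file.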

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better

  GPU                   GB/s  SE +/-   N
  Radeon RX 460         3.09    0.05   3
  Radeon R9 Fury        5.95    1.27   6
  Radeon RX 480         6.58    1.20   6
  GeForce GTX 960      10.78    0.01   3
  GeForce GTX 950      10.97    0.01   3
  GeForce GTX 970      11.43    0.01   3
  GeForce GTX 1060     11.45    0.07   3
  GeForce GTX 780 Ti   11.53    0.01   3
  GeForce GTX 1070     11.76    0.01   3
  GeForce GTX 1080     11.83    0.01   3
  GeForce GTX 980 Ti   11.84    0.01   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
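Each result above is reported with a standard error and a run count, for example "SE +/- 0.05, N = 3". As a rough sketch of what such a figure means, the snippet below computes the standard error of the mean as the sample standard deviation divided by the square root of N. This is the textbook definition; whether the Phoronix Test Suite uses exactly this formula is an assumption here, and the three run values are hypothetical, not taken from this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Textbook definition; assumed, not confirmed, to match the test suite.
    """
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate the standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run Triad results in GB/s (illustrative values only).
runs = [3.04, 3.09, 3.14]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"mean = {mean:.2f} GB/s, SE +/- {se:.2f}, N = {len(runs)}")
```

A large standard error relative to the mean, as for the Radeon R9 Fury and Radeon RX 480 in this test, indicates that the individual runs varied considerably, and those two results were averaged over six runs rather than three.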

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GFLOPS, More Is Better

  GPU                  GFLOPS  SE +/-   N
  Radeon R9 Fury        24.60    0.94   6
  Radeon RX 480         25.72    0.52   6
  Radeon RX 460         29.93    3.84   6
  GeForce GTX 950      119.93    0.55   3
  GeForce GTX 960      128.17    1.09   3
  GeForce GTX 780 Ti   157.09    0.47   3
  GeForce GTX 980 Ti   205.41    0.50   3
  GeForce GTX 970      214.68    1.68   3
  GeForce GTX 1060     224.15    0.95   3
  GeForce GTX 1070     307.95    3.11   3
  GeForce GTX 1080     343.26    1.62   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GHash/s, More Is Better

  GPU                  GHash/s  SE +/-   N
  Radeon RX 460           2.42    0.01   3
  GeForce GTX 950         2.67    0.00   3
  GeForce GTX 960         3.82    0.00   3
  GeForce GTX 780 Ti      3.92    0.00   3
  Radeon RX 480           5.23    0.03   3
  GeForce GTX 970         5.41    0.00   3
  GeForce GTX 1060        5.58    0.00   3
  Radeon R9 Fury          5.78    0.02   3
  GeForce GTX 980 Ti      7.69    0.00   3
  GeForce GTX 1070        8.29    0.00   3
  GeForce GTX 1080       11.66    0.00   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GFLOPS, More Is Better

  GPU                   GFLOPS  SE +/-   N
  Radeon RX 460        2152.88    2.31   3
  GeForce GTX 950      2216.79    6.65   3
  GeForce GTX 960      2941.37    1.19   3
  GeForce GTX 970      4332.40    2.38   3
  GeForce GTX 1060     4810.04    6.65   3
  GeForce GTX 780 Ti   4989.32   15.86   3
  Radeon RX 480        5804.77    0.33   3
  GeForce GTX 980 Ti   6177.12   18.69   3
  GeForce GTX 1070     7108.07   49.40   3
  Radeon R9 Fury       7116.37    6.08   3
  GeForce GTX 1080     9380.30   23.12   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better

  GPU                   GB/s  SE +/-   N
  Radeon RX 460         6.84    0.00   3
  GeForce GTX 960      12.45    0.00   3
  GeForce GTX 1080     12.47    0.00   3
  GeForce GTX 780 Ti   12.48    0.01   3
  GeForce GTX 980 Ti   12.49    0.01   3
  GeForce GTX 970      12.50    0.00   3
  GeForce GTX 1060     12.51    0.00   3
  GeForce GTX 950      12.52    0.01   3
  GeForce GTX 1070     12.52    0.00   3
  Radeon R9 Fury       12.94    0.01   3
  Radeon RX 480        12.96    0.00   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better

  GPU                   GB/s  SE +/-   N
  Radeon RX 460         7.15    0.00   3
  Radeon R9 Fury       12.27    0.15   3
  Radeon RX 480        12.95    0.24   3
  GeForce GTX 960      13.20    0.00   3
  GeForce GTX 950      13.21    0.00   3
  GeForce GTX 970      13.21    0.00   3
  GeForce GTX 980 Ti   13.21    0.00   3
  GeForce GTX 780 Ti   13.22    0.00   3
  GeForce GTX 1070     13.22    0.00   3
  GeForce GTX 1060     13.23    0.01   3
  GeForce GTX 1080     13.24    0.01   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better

  GPU                    GB/s  SE +/-   N
  Radeon RX 460         77.45    0.65   3
  Radeon RX 480        161.85    0.30   3
  Radeon R9 Fury       219.66    1.46   3
  GeForce GTX 950      256.45    0.11   3
  GeForce GTX 960      279.57    0.70   3
  GeForce GTX 780 Ti   286.76    0.16   3
  GeForce GTX 970      293.92    0.71   3
  GeForce GTX 980 Ti   351.56    0.16   3
  GeForce GTX 1060     393.53    0.83   3
  GeForce GTX 1070     450.60    0.32   3
  GeForce GTX 1080     524.30    1.97   3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.org
LuxMark 3.0
Score, More Is Better

  GPU                  Score  SE +/-   N
  Radeon RX 460         2934   14.68   3
  GeForce GTX 950       3003    0.33   3
  GeForce GTX 960       3054    0.58   3
  GeForce GTX 1060      5206    5.00   3
  GeForce GTX 970       5587   24.27   3
  GeForce GTX 1080      6287    7.69   3
  GeForce GTX 1070      7220    1.86   3
  Radeon RX 480         7793    2.08   3
  GeForce GTX 980 Ti    7989    8.54   3
  Radeon R9 Fury        8715   13.64   3

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.org
LuxMark 3.0
Score, More Is Better

  GPU                  Score  SE +/-   N
  Radeon RX 460         5595   16.56   3
  GeForce GTX 950       5951   20.50   3
  GeForce GTX 960       6147    1.20   3
  GeForce GTX 970      10918   35.71   3
  GeForce GTX 1060     11620    1.20   3
  GeForce GTX 1080     12774   26.67   3
  Radeon RX 480        14174   38.84   3
  GeForce GTX 980 Ti   15072   35.02   3
  GeForce GTX 1070     16206    1.15   3
  Radeon R9 Fury       19468   88.05   3


Phoronix Test Suite v10.8.4