OpenCL October 2016

NVIDIA binary blob and AMDGPU-PRO Radeon tests with different OpenCL workloads.

HTML result view exported from: https://openbenchmarking.org/result/1610096-LO-OPENCLOCT49&obr_sor=y&obr_rro=y&grs.

OpenCL October 2016: system configuration

Common to all runs:

  Processor:          Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
  Motherboard:        MSI C236A WORKSTATION (MS-7998) v1.0
  Chipset:            Intel Sky Lake
  Memory:             16384MB
  Disk:               256GB INTEL SSDPEKKW256G7
  Audio:              Realtek ALC1150
  Network:            Intel Connection
  OS:                 Ubuntu 16.04
  Kernel:             4.4.0-38-generic (x86_64)
  Desktop:            Unity 7.4.0
  Display Server:     X Server 1.18.4
  Compiler:           GCC 5.4.0 20160609 + LLVM 3.8.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Graphics, per run:

  Radeon RX 460       AMD Radeon RX 460 2048MB
  Radeon RX 480       AMD Radeon RX 480 8192MB
  Radeon R9 Fury      Sapphire AMD Radeon R9 Fury 4096MB
  GeForce GTX 780 Ti  NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
  GeForce GTX 950     eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)
  GeForce GTX 960     eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
  GeForce GTX 970     eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
  GeForce GTX 980 Ti  NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
  GeForce GTX 1060    NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)
  GeForce GTX 1070    NVIDIA GeForce GTX 1070 8192MB (1502/4006MHz)
  GeForce GTX 1080    NVIDIA GeForce GTX 1080 8192MB (1619/5005MHz)

Driver stack as reported:

  Radeon runs:   OpenGL 4.5.13448, OpenCL 2.0 AMD-APP (2117.7), Vulkan 1.0.8
  GeForce runs:  Display Driver NVIDIA 370.28, OpenGL 4.5.0

Compiler Details
- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- Scaling Governor: intel_pstate performance

Graphics Details
- Radeon RX 460, Radeon RX 480, Radeon R9 Fury: GLAMOR

OpenCL Details
- GeForce GTX 780 Ti: GPU Compute Cores: 2880
- GeForce GTX 950: GPU Compute Cores: 768
- GeForce GTX 960: GPU Compute Cores: 1024
- GeForce GTX 970: GPU Compute Cores: 1664
- GeForce GTX 980 Ti: GPU Compute Cores: 2816
- GeForce GTX 1060: GPU Compute Cores: 1280
- GeForce GTX 1070: GPU Compute Cores: 1920
- GeForce GTX 1080: GPU Compute Cores: 2560

OpenCL October 2016: results overview

Columns:
  TexBW   shoc: OpenCL - Texture Read Bandwidth
  MD5     shoc: OpenCL - MD5 Hash
  Flops   shoc: OpenCL - Max SP Flops
  Triad   shoc: OpenCL - Triad
  LuxHDR  luxmark: GPU - Luxball HDR
  LuxMic  luxmark: GPU - Microphone
  FFT     shoc: OpenCL - FFT SP
  BusDL   shoc: OpenCL - Bus Speed Download
  BusRB   shoc: OpenCL - Bus Speed Readback

Card                   TexBW     MD5     Flops   Triad  LuxHDR  LuxMic      FFT   BusDL   BusRB
Radeon RX 460          77.45    2.42   2152.88    3.09    5595    2934    29.93    6.84    7.15
Radeon RX 480         161.85    5.23   5804.77    6.58   14174    7793    25.72   12.96   12.95
Radeon R9 Fury        219.66    5.78   7116.37    5.95   19468    8715    24.60   12.94   12.27
GeForce GTX 780 Ti    286.76    3.92   4989.32   11.53       -       -   157.09   12.48   13.22
GeForce GTX 950       256.45    2.67   2216.79   10.97    5951    3003   119.93   12.52   13.21
GeForce GTX 960       279.57    3.82   2941.37   10.78    6147    3054   128.17   12.45   13.20
GeForce GTX 970       293.92    5.41   4332.40   11.43   10918    5587   214.68   12.50   13.21
GeForce GTX 980 Ti    351.56    7.69   6177.12   11.84   15072    7989   205.41   12.49   13.21
GeForce GTX 1060      393.53    5.58   4810.04   11.45   11620    5206   224.15   12.51   13.23
GeForce GTX 1070      450.60    8.29   7108.07   11.76   16206    7220   307.95   12.52   13.22
GeForce GTX 1080      524.30   11.66   9380.30   11.83   12774    6287   343.26   12.47   13.24

The GeForce GTX 780 Ti has no LuxMark results in this file.

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GB/s, more is better.

Radeon RX 460          77.45   SE +/- 0.65, N = 3
Radeon RX 480         161.85   SE +/- 0.30, N = 3
Radeon R9 Fury        219.66   SE +/- 1.46, N = 3
GeForce GTX 950       256.45   SE +/- 0.11, N = 3
GeForce GTX 960       279.57   SE +/- 0.70, N = 3
GeForce GTX 780 Ti    286.76   SE +/- 0.16, N = 3
GeForce GTX 970       293.92   SE +/- 0.71, N = 3
GeForce GTX 980 Ti    351.56   SE +/- 0.16, N = 3
GeForce GTX 1060      393.53   SE +/- 0.83, N = 3
GeForce GTX 1070      450.60   SE +/- 0.32, N = 3
GeForce GTX 1080      524.30   SE +/- 1.97, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
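Each result in this file is reported with a standard error ("SE +/- x, N = n") over n runs. As a minimal sketch of how such a figure is commonly derived, the snippet below computes the standard error of the mean as the sample standard deviation divided by the square root of N. The function name and the per-run values are hypothetical, and whether the Phoronix Test Suite uses exactly this formula (sample rather than population deviation) is an assumption, not something stated in the result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample stdev / sqrt(N).

    Assumption: the sample (N - 1) standard deviation is used.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run throughput values in GB/s; these are NOT from the result file,
# which only publishes the mean and the standard error.
runs = [76.5, 77.2, 78.7]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} GB/s, SE +/- {se:.2f}, N = {len(runs)}")
```

A small SE relative to the mean, as in most rows of this file, suggests the runs were tightly clustered; the rows with N = 6 are the ones where more runs were recorded.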

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GHash/s, more is better.

Radeon RX 460           2.42   SE +/- 0.01, N = 3
GeForce GTX 950         2.67   SE +/- 0.00, N = 3
GeForce GTX 960         3.82   SE +/- 0.00, N = 3
GeForce GTX 780 Ti      3.92   SE +/- 0.00, N = 3
Radeon RX 480           5.23   SE +/- 0.03, N = 3
GeForce GTX 970         5.41   SE +/- 0.00, N = 3
GeForce GTX 1060        5.58   SE +/- 0.00, N = 3
Radeon R9 Fury          5.78   SE +/- 0.02, N = 3
GeForce GTX 980 Ti      7.69   SE +/- 0.00, N = 3
GeForce GTX 1070        8.29   SE +/- 0.00, N = 3
GeForce GTX 1080       11.66   SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GFLOPS, more is better.

Radeon RX 460        2152.88   SE +/- 2.31, N = 3
GeForce GTX 950      2216.79   SE +/- 6.65, N = 3
GeForce GTX 960      2941.37   SE +/- 1.19, N = 3
GeForce GTX 970      4332.40   SE +/- 2.38, N = 3
GeForce GTX 1060     4810.04   SE +/- 6.65, N = 3
GeForce GTX 780 Ti   4989.32   SE +/- 15.86, N = 3
Radeon RX 480        5804.77   SE +/- 0.33, N = 3
GeForce GTX 980 Ti   6177.12   SE +/- 18.69, N = 3
GeForce GTX 1070     7108.07   SE +/- 49.40, N = 3
Radeon R9 Fury       7116.37   SE +/- 6.08, N = 3
GeForce GTX 1080     9380.30   SE +/- 23.12, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GB/s, more is better.

Radeon RX 460           3.09   SE +/- 0.05, N = 3
Radeon R9 Fury          5.95   SE +/- 1.27, N = 6
Radeon RX 480           6.58   SE +/- 1.20, N = 6
GeForce GTX 960        10.78   SE +/- 0.01, N = 3
GeForce GTX 950        10.97   SE +/- 0.01, N = 3
GeForce GTX 970        11.43   SE +/- 0.01, N = 3
GeForce GTX 1060       11.45   SE +/- 0.07, N = 3
GeForce GTX 780 Ti     11.53   SE +/- 0.01, N = 3
GeForce GTX 1070       11.76   SE +/- 0.01, N = 3
GeForce GTX 1080       11.83   SE +/- 0.01, N = 3
GeForce GTX 980 Ti     11.84   SE +/- 0.01, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Unit: Score, more is better.

Radeon RX 460           5595   SE +/- 16.56, N = 3
GeForce GTX 950         5951   SE +/- 20.50, N = 3
GeForce GTX 960         6147   SE +/- 1.20, N = 3
GeForce GTX 970        10918   SE +/- 35.71, N = 3
GeForce GTX 1060       11620   SE +/- 1.20, N = 3
GeForce GTX 1080       12774   SE +/- 26.67, N = 3
Radeon RX 480          14174   SE +/- 38.84, N = 3
GeForce GTX 980 Ti     15072   SE +/- 35.02, N = 3
GeForce GTX 1070       16206   SE +/- 1.15, N = 3
Radeon R9 Fury         19468   SE +/- 88.05, N = 3

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Unit: Score, more is better.

Radeon RX 460           2934   SE +/- 14.68, N = 3
GeForce GTX 950         3003   SE +/- 0.33, N = 3
GeForce GTX 960         3054   SE +/- 0.58, N = 3
GeForce GTX 1060        5206   SE +/- 5.00, N = 3
GeForce GTX 970         5587   SE +/- 24.27, N = 3
GeForce GTX 1080        6287   SE +/- 7.69, N = 3
GeForce GTX 1070        7220   SE +/- 1.86, N = 3
Radeon RX 480           7793   SE +/- 2.08, N = 3
GeForce GTX 980 Ti      7989   SE +/- 8.54, N = 3
Radeon R9 Fury          8715   SE +/- 13.64, N = 3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GFLOPS, more is better.

Radeon R9 Fury         24.60   SE +/- 0.94, N = 6
Radeon RX 480          25.72   SE +/- 0.52, N = 6
Radeon RX 460          29.93   SE +/- 3.84, N = 6
GeForce GTX 950       119.93   SE +/- 0.55, N = 3
GeForce GTX 960       128.17   SE +/- 1.09, N = 3
GeForce GTX 780 Ti    157.09   SE +/- 0.47, N = 3
GeForce GTX 980 Ti    205.41   SE +/- 0.50, N = 3
GeForce GTX 970       214.68   SE +/- 1.68, N = 3
GeForce GTX 1060      224.15   SE +/- 0.95, N = 3
GeForce GTX 1070      307.95   SE +/- 3.11, N = 3
GeForce GTX 1080      343.26   SE +/- 1.62, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GB/s, more is better.

Radeon RX 460           6.84   SE +/- 0.00, N = 3
GeForce GTX 960        12.45   SE +/- 0.00, N = 3
GeForce GTX 1080       12.47   SE +/- 0.00, N = 3
GeForce GTX 780 Ti     12.48   SE +/- 0.01, N = 3
GeForce GTX 980 Ti     12.49   SE +/- 0.01, N = 3
GeForce GTX 970        12.50   SE +/- 0.00, N = 3
GeForce GTX 1060       12.51   SE +/- 0.00, N = 3
GeForce GTX 950        12.52   SE +/- 0.01, N = 3
GeForce GTX 1070       12.52   SE +/- 0.00, N = 3
Radeon R9 Fury         12.94   SE +/- 0.01, N = 3
Radeon RX 480          12.96   SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10. Unit: GB/s, more is better.

Radeon RX 460           7.15   SE +/- 0.00, N = 3
Radeon R9 Fury         12.27   SE +/- 0.15, N = 3
Radeon RX 480          12.95   SE +/- 0.24, N = 3
GeForce GTX 960        13.20   SE +/- 0.00, N = 3
GeForce GTX 950        13.21   SE +/- 0.00, N = 3
GeForce GTX 970        13.21   SE +/- 0.00, N = 3
GeForce GTX 980 Ti     13.21   SE +/- 0.00, N = 3
GeForce GTX 780 Ti     13.22   SE +/- 0.00, N = 3
GeForce GTX 1070       13.22   SE +/- 0.00, N = 3
GeForce GTX 1060       13.23   SE +/- 0.01, N = 3
GeForce GTX 1080       13.24   SE +/- 0.01, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt


Phoronix Test Suite v10.8.4