OpenCL October 2016

NVIDIA binary blob and AMDGPU-PRO Radeon tests with different OpenCL workloads.

HTML result view exported from: https://openbenchmarking.org/result/1610096-LO-OPENCLOCT49&sro&grs.

System table. The export lists every field for the Radeon RX 460 run and, for the other runs, only the fields shown below.

Radeon RX 460
  Processor:          Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
  Motherboard:        MSI C236A WORKSTATION (MS-7998) v1.0
  Chipset:            Intel Sky Lake
  Memory:             16384MB
  Disk:               256GB INTEL SSDPEKKW256G7
  Graphics:           AMD Radeon RX 460 2048MB
  Audio:              Realtek ALC1150
  Network:            Intel Connection
  OS:                 Ubuntu 16.04
  Kernel:             4.4.0-38-generic (x86_64)
  Desktop:            Unity 7.4.0
  Display Server:     X Server 1.18.4
  OpenGL:             4.5.13448
  OpenCL:             OpenCL 2.0 AMD-APP (2117.7)
  Vulkan:             1.0.8
  Compiler:           GCC 5.4.0 20160609 + LLVM 3.8.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Radeon RX 480
  Graphics:           AMD Radeon RX 480 8192MB

Radeon R9 Fury
  Graphics:           Sapphire AMD Radeon R9 Fury 4096MB

GeForce GTX 780 Ti
  Graphics:           NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
  Display Driver:     NVIDIA 370.28
  OpenGL:             4.5.0

GeForce GTX 950
  Graphics:           eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)

GeForce GTX 960
  Graphics:           eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)

GeForce GTX 970
  Graphics:           eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)

GeForce GTX 980 Ti
  Graphics:           NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)

GeForce GTX 1060
  Graphics:           NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)

GeForce GTX 1070
  Graphics:           NVIDIA GeForce GTX 1070 8192MB (1502/4006MHz)

GeForce GTX 1080
  Graphics:           NVIDIA GeForce GTX 1080 8192MB (1619/5005MHz)

Compiler Details
- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- Scaling Governor: intel_pstate performance

Graphics Details
- Radeon RX 460, Radeon RX 480, Radeon R9 Fury: GLAMOR

OpenCL Details
- GeForce GTX 780 Ti: GPU Compute Cores: 2880
- GeForce GTX 950: GPU Compute Cores: 768
- GeForce GTX 960: GPU Compute Cores: 1024
- GeForce GTX 970: GPU Compute Cores: 1664
- GeForce GTX 980 Ti: GPU Compute Cores: 2816
- GeForce GTX 1060: GPU Compute Cores: 1280
- GeForce GTX 1070: GPU Compute Cores: 1920
- GeForce GTX 1080: GPU Compute Cores: 2560

Result overview (shoc tests use the OpenCL target, luxmark tests use the GPU device; "-" means no result is listed for that card)

Card                Texture    MD5 Hash   Max SP     Triad      Luxball    Microphone FFT SP     Bus Speed  Bus Speed
                    Read BW               Flops                 HDR                              Download   Readback
                    (GB/s)     (GHash/s)  (GFLOPS)   (GB/s)     (score)    (score)    (GFLOPS)   (GB/s)     (GB/s)
Radeon RX 460       77.45      2.42       2152.88    3.09       5595       2934       29.93      6.84       7.15
Radeon RX 480       161.85     5.23       5804.77    6.58       14174      7793       25.72      12.96      12.95
Radeon R9 Fury      219.66     5.78       7116.37    5.95       19468      8715       24.60      12.94      12.27
GeForce GTX 780 Ti  286.76     3.92       4989.32    11.53      -          -          157.09     12.48      13.22
GeForce GTX 950     256.45     2.67       2216.79    10.97      5951       3003       119.93     12.52      13.21
GeForce GTX 960     279.57     3.82       2941.37    10.78      6147       3054       128.17     12.45      13.20
GeForce GTX 970     293.92     5.41       4332.40    11.43      10918      5587       214.68     12.50      13.21
GeForce GTX 980 Ti  351.56     7.69       6177.12    11.84      15072      7989       205.41     12.49      13.21
GeForce GTX 1060    393.53     5.58       4810.04    11.45      11620      5206       224.15     12.51      13.23
GeForce GTX 1070    450.60     8.29       7108.07    11.76      16206      7220       307.95     12.52      13.22
GeForce GTX 1080    524.30     11.66      9380.30    11.83      12774      6287       343.26     12.47      13.24
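The overview figures are easier to compare once they are normalised against one card. The following is a minimal sketch of that arithmetic for the Texture Read Bandwidth column. The values are copied from the overview above; the helper name `relative_to` and the choice of the Radeon RX 480 as baseline are illustrative assumptions and not part of the result file.

```python
# Texture Read Bandwidth results in GB/s, copied from the overview above.
texture_read_gbps = {
    "Radeon RX 460": 77.45,
    "Radeon RX 480": 161.85,
    "Radeon R9 Fury": 219.66,
    "GeForce GTX 780 Ti": 286.76,
    "GeForce GTX 950": 256.45,
    "GeForce GTX 960": 279.57,
    "GeForce GTX 970": 293.92,
    "GeForce GTX 980 Ti": 351.56,
    "GeForce GTX 1060": 393.53,
    "GeForce GTX 1070": 450.60,
    "GeForce GTX 1080": 524.30,
}


def relative_to(results, baseline):
    """Return each card's result divided by the baseline card's result.

    Hypothetical helper for illustration; valid for "more is better" metrics.
    """
    base = results[baseline]
    return {card: value / base for card, value in results.items()}


if __name__ == "__main__":
    ratios = relative_to(texture_read_gbps, "Radeon RX 480")
    # Print the fastest card first.
    for card, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{card:<20} {ratio:5.2f}x")
```

The same helper can be applied to any other column of the overview by swapping in that column's values, since every test in this result file is reported as "more is better".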

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    393.53    0.83    3
GeForce GTX 1070    450.60    0.32    3
GeForce GTX 1080    524.30    1.97    3
GeForce GTX 780 Ti  286.76    0.16    3
GeForce GTX 950     256.45    0.11    3
GeForce GTX 960     279.57    0.70    3
GeForce GTX 970     293.92    0.71    3
GeForce GTX 980 Ti  351.56    0.16    3
Radeon R9 Fury      219.66    1.46    3
Radeon RX 460       77.45     0.65    3
Radeon RX 480       161.85    0.30    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
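Each result is reported with "SE +/-" and "N", that is, a standard error over N runs. The export does not state which formula was used, so the sketch below assumes the conventional definition: sample standard deviation divided by the square root of N. The run values are hypothetical, because the per-run samples are not included in this result file.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumes the conventional definition; the result file does not
    document the exact formula behind its "SE +/-" figures.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


if __name__ == "__main__":
    # Hypothetical per-run values (GB/s), NOT taken from the result file.
    runs = [100.0, 102.0, 104.0]
    mean = statistics.mean(runs)
    se = standard_error(runs)
    print(f"{mean:.2f} GB/s, SE +/- {se:.2f}, N = {len(runs)}")
```

Under this definition, a small SE relative to the mean indicates that the runs agreed closely with each other.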

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10. GHash/s, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    5.58      0.00    3
GeForce GTX 1070    8.29      0.00    3
GeForce GTX 1080    11.66     0.00    3
GeForce GTX 780 Ti  3.92      0.00    3
GeForce GTX 950     2.67      0.00    3
GeForce GTX 960     3.82      0.00    3
GeForce GTX 970     5.41      0.00    3
GeForce GTX 980 Ti  7.69      0.00    3
Radeon R9 Fury      5.78      0.02    3
Radeon RX 460       2.42      0.01    3
Radeon RX 480       5.23      0.03    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10. GFLOPS, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    4810.04   6.65    3
GeForce GTX 1070    7108.07   49.40   3
GeForce GTX 1080    9380.30   23.12   3
GeForce GTX 780 Ti  4989.32   15.86   3
GeForce GTX 950     2216.79   6.65    3
GeForce GTX 960     2941.37   1.19    3
GeForce GTX 970     4332.40   2.38    3
GeForce GTX 980 Ti  6177.12   18.69   3
Radeon R9 Fury      7116.37   6.08    3
Radeon RX 460       2152.88   2.31    3
Radeon RX 480       5804.77   0.33    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    11.45     0.07    3
GeForce GTX 1070    11.76     0.01    3
GeForce GTX 1080    11.83     0.01    3
GeForce GTX 780 Ti  11.53     0.01    3
GeForce GTX 950     10.97     0.01    3
GeForce GTX 960     10.78     0.01    3
GeForce GTX 970     11.43     0.01    3
GeForce GTX 980 Ti  11.84     0.01    3
Radeon R9 Fury      5.95      1.27    6
Radeon RX 460       3.09      0.05    3
Radeon RX 480       6.58      1.20    6

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Score, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    11620     1.20    3
GeForce GTX 1070    16206     1.15    3
GeForce GTX 1080    12774     26.67   3
GeForce GTX 950     5951      20.50   3
GeForce GTX 960     6147      1.20    3
GeForce GTX 970     10918     35.71   3
GeForce GTX 980 Ti  15072     35.02   3
Radeon R9 Fury      19468     88.05   3
Radeon RX 460       5595      16.56   3
Radeon RX 480       14174     38.84   3

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Score, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    5206      5.00    3
GeForce GTX 1070    7220      1.86    3
GeForce GTX 1080    6287      7.69    3
GeForce GTX 950     3003      0.33    3
GeForce GTX 960     3054      0.58    3
GeForce GTX 970     5587      24.27   3
GeForce GTX 980 Ti  7989      8.54    3
Radeon R9 Fury      8715      13.64   3
Radeon RX 460       2934      14.68   3
Radeon RX 480       7793      2.08    3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10. GFLOPS, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    224.15    0.95    3
GeForce GTX 1070    307.95    3.11    3
GeForce GTX 1080    343.26    1.62    3
GeForce GTX 780 Ti  157.09    0.47    3
GeForce GTX 950     119.93    0.55    3
GeForce GTX 960     128.17    1.09    3
GeForce GTX 970     214.68    1.68    3
GeForce GTX 980 Ti  205.41    0.50    3
Radeon R9 Fury      24.60     0.94    6
Radeon RX 460       29.93     3.84    6
Radeon RX 480       25.72     0.52    6

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    12.51     0.00    3
GeForce GTX 1070    12.52     0.00    3
GeForce GTX 1080    12.47     0.00    3
GeForce GTX 780 Ti  12.48     0.01    3
GeForce GTX 950     12.52     0.01    3
GeForce GTX 960     12.45     0.00    3
GeForce GTX 970     12.50     0.00    3
GeForce GTX 980 Ti  12.49     0.01    3
Radeon R9 Fury      12.94     0.01    3
Radeon RX 460       6.84      0.00    3
Radeon RX 480       12.96     0.00    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, more is better.

Card                Result    SE +/-  N
GeForce GTX 1060    13.23     0.01    3
GeForce GTX 1070    13.22     0.00    3
GeForce GTX 1080    13.24     0.01    3
GeForce GTX 780 Ti  13.22     0.00    3
GeForce GTX 950     13.21     0.00    3
GeForce GTX 960     13.20     0.00    3
GeForce GTX 970     13.21     0.00    3
GeForce GTX 980 Ti  13.21     0.00    3
Radeon R9 Fury      12.27     0.15    3
Radeon RX 460       7.15      0.00    3
Radeon RX 480       12.95     0.24    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt


Phoronix Test Suite v10.8.5