OpenCL NVIDIA AMD Ubuntu 16.04 Compute

NVIDIA GeForce and AMD Radeon OpenCL GPGPU compute performance benchmarks on Ubuntu 16.04 64-bit. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1603238-GA-OPENCLNVI91&grw.

OpenCL NVIDIA AMD Ubuntu 16.04 Compute: System Information

Common to all test runs:
  Processor:          Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)
  Motherboard:        MSI C236A WORKSTATION (MS-7998) v1.0
  Chipset:            Intel Sky Lake
  Memory:             16384MB
  Disk:               120GB Samsung SSD 850
  Audio:              Realtek ALC1150
  Monitor:            DELL P2415Q
  Network:            Intel Connection
  OS:                 Ubuntu 16.04
  Kernel:             4.4.0-13-generic (x86_64)
  Desktop:            Unity 7.4.0
  Display Server:     X Server 1.18.2
  Vulkan:             1.0.5
  Compiler:           GCC 5.3.1 20160311
  File-System:        ext4
  Screen Resolution:  3840x2160

Graphics, by test run:
  Radeon R9 285:        XFX AMD Radeon R9 200
  Radeon R9 Fury:       Sapphire AMD Radeon R9 Fury 4096MB
  Radeon R9 290:        XFX AMD Radeon R9 200 4096MB
  GeForce GTX 950:      eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)
  GeForce GTX 960:      eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
  GeForce GTX 970:      eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
  GeForce GTX 980:      NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
  GeForce GTX 980 Ti:   NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
  GeForce GTX 780 Ti:   NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
  GeForce GTX TITAN X:  NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)

Driver stack, by vendor:
  Radeon R9 285, Radeon R9 Fury, Radeon R9 290:
    Display Driver:  amdgpu 0.0.2
    OpenGL:          4.5.13830
    OpenCL:          OpenCL 2.0 AMD-APP (2036.3)
  GeForce GTX 950, 960, 970, 980, 980 Ti, 780 Ti, TITAN X:
    Display Driver:  NVIDIA 364.12
    OpenGL:          4.5.0
    OpenCL:          OpenCL 1.2 CUDA 8.0.0

Compiler Details
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  Scaling Governor: intel_pstate performance

Graphics Details
  Radeon R9 285, Radeon R9 Fury, Radeon R9 290: GLAMOR

OpenCL Details / System Details (GPU Compute Cores)
  GeForce GTX 950:      768
  GeForce GTX 960:      1024
  GeForce GTX 970:      1664
  GeForce GTX 980:      2048
  GeForce GTX 980 Ti:   2816
  GeForce GTX 780 Ti:   2880
  GeForce GTX TITAN X:  3072

OpenCL NVIDIA AMD Ubuntu 16.04 Compute: Result Overview

shoc: OpenCL (Triad, Bus Speed Download, Bus Speed Readback and Texture Read Bandwidth in GB/s; FFT SP and Max SP Flops in GFLOPS; MD5 Hash in GHash/s)

GPU                   Triad   FFT SP   MD5 Hash   Max SP Flops   Bus Speed Download   Bus Speed Readback   Texture Read Bandwidth
Radeon R9 285         10.86   17.63    3.74       3276.90        12.79                13.65                170.03
Radeon R9 Fury        4.01    27.31    5.71       7131.04        12.70                13.03                220.93
Radeon R9 290         6.39    73.13    5.61       4783.86        7.48                 6.12                 255.78
GeForce GTX 950       10.93   119.82   2.69       2217.26        12.50                13.21                240.75
GeForce GTX 960       10.81   131.35   3.85       2944.52        12.50                13.21                270.81
GeForce GTX 970       11.43   209.69   5.44       4335.38        12.51                13.21                284.94
GeForce GTX 980       11.57   209.94   6.48       4991.32        12.52                13.22                329.47
GeForce GTX 980 Ti    11.78   199.12   7.77       6177.72        12.50                13.21                350.05
GeForce GTX 780 Ti    11.56   153.11   3.78       5028.50        12.53                13.24                286.80
GeForce GTX TITAN X   11.84   202.03   8.46       6925.52        12.53                13.22                351.49

luxmark: GPU (Score)

GPU                   Hotel   Microphone   Luxball HDR
Radeon R9 285         890     3426         7517
Radeon R9 Fury        1733    6361         14326
Radeon R9 290         1512    5350         12021
GeForce GTX 950       1117    3104         5972
GeForce GTX 960       1265    3103         6277
GeForce GTX 970       1943    5824         10981
GeForce GTX 980       2093    6010         12121
GeForce GTX 980 Ti    2480    8050         15310
GeForce GTX 780 Ti    1265    4557         9647
GeForce GTX TITAN X   2614    8212         15585
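One way to condense part of the overview into a single ranking is a geometric mean per GPU. The sketch below does this for the three LuxMark scenes only, using the scores listed above. The choice of a geometric mean and the names `LUXMARK_SCORES`, `geometric_mean_score` and `rank_gpus` are illustrative assumptions; none of them come from the exported result.

```python
import math

# LuxMark 3.0 scores (Hotel, Microphone, Luxball HDR) copied from the overview.
LUXMARK_SCORES = {
    "Radeon R9 285": (890, 3426, 7517),
    "Radeon R9 Fury": (1733, 6361, 14326),
    "Radeon R9 290": (1512, 5350, 12021),
    "GeForce GTX 950": (1117, 3104, 5972),
    "GeForce GTX 960": (1265, 3103, 6277),
    "GeForce GTX 970": (1943, 5824, 10981),
    "GeForce GTX 980": (2093, 6010, 12121),
    "GeForce GTX 980 Ti": (2480, 8050, 15310),
    "GeForce GTX 780 Ti": (1265, 4557, 9647),
    "GeForce GTX TITAN X": (2614, 8212, 15585),
}


def geometric_mean_score(scores):
    """Geometric mean of a sequence of positive scores."""
    return math.exp(sum(math.log(s) for s in scores) / len(scores))


def rank_gpus(table):
    """GPU names sorted from highest to lowest geometric-mean score."""
    return sorted(table, key=lambda gpu: geometric_mean_score(table[gpu]), reverse=True)


# Print the ranking with each GPU's geometric-mean LuxMark score.
for gpu in rank_gpus(LUXMARK_SCORES):
    print(f"{gpu:22s} {geometric_mean_score(LUXMARK_SCORES[gpu]):10.1f}")
```

A geometric mean is used here so that the scene with the largest absolute scores (Luxball HDR) does not dominate the ranking; an arithmetic mean would weight it most heavily.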

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, More Is Better.

GPU                   GB/s    SE +/-   N
Radeon R9 285         10.86   0.18     3
Radeon R9 Fury        4.01    0.02     3
Radeon R9 290         6.39    0.46     6
GeForce GTX 980 Ti    11.78   0.01     3
GeForce GTX 950       10.93   0.01     3
GeForce GTX 960       10.81   0.02     3
GeForce GTX 970       11.43   0.01     3
GeForce GTX 980       11.57   0.02     3
GeForce GTX 780 Ti    11.56   0.01     3
GeForce GTX TITAN X   11.84   0.02     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
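The "SE +/-" and "N" figures reported with each result are the standard error and the number of runs. The following is a minimal sketch of how such a figure can be computed, assuming the conventional definition (sample standard deviation divided by the square root of N); the export does not state the exact formula used by the Phoronix Test Suite. The per-run values are hypothetical, since the export lists only averages, and `standard_error` is an illustrative name.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("at least two runs are needed")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run Triad results in GB/s (not taken from the export).
runs = [10.6, 10.8, 11.2]
print(
    f"mean = {statistics.mean(runs):.2f} GB/s, "
    f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}"
)
```

Under this definition, a small SE relative to the mean indicates that the individual runs agreed closely with one another.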

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10. GFLOPS, More Is Better.

GPU                   GFLOPS   SE +/-   N
Radeon R9 285         17.63    0.32     3
Radeon R9 Fury        27.31    2.32     6
Radeon R9 290         73.13    0.27     3
GeForce GTX 980 Ti    199.12   0.47     3
GeForce GTX 950       119.82   0.46     3
GeForce GTX 960       131.35   0.27     3
GeForce GTX 970       209.69   0.24     3
GeForce GTX 980       209.94   0.86     3
GeForce GTX 780 Ti    153.11   2.66     4
GeForce GTX TITAN X   202.03   0.20     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10. GHash/s, More Is Better.

GPU                   GHash/s   SE +/-   N
Radeon R9 285         3.74      0.02     3
Radeon R9 Fury        5.71      0.06     3
Radeon R9 290         5.61      0.00     3
GeForce GTX 980 Ti    7.77      0.00     3
GeForce GTX 950       2.69      0.00     3
GeForce GTX 960       3.85      0.00     3
GeForce GTX 970       5.44      0.00     3
GeForce GTX 980       6.48      0.00     3
GeForce GTX 780 Ti    3.78      0.00     3
GeForce GTX TITAN X   8.46      0.00     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10. GFLOPS, More Is Better.

GPU                   GFLOPS    SE +/-   N
Radeon R9 285         3276.90   0.26     3
Radeon R9 Fury        7131.04   0.20     3
Radeon R9 290         4783.86   0.42     3
GeForce GTX 980 Ti    6177.72   19.11    3
GeForce GTX 950       2217.26   6.33     3
GeForce GTX 960       2944.52   0.41     3
GeForce GTX 970       4335.38   0.54     3
GeForce GTX 980       4991.32   24.68    3
GeForce GTX 780 Ti    5028.50   10.75    3
GeForce GTX TITAN X   6925.52   47.81    3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt
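For context, the measured Max SP Flops figures for the NVIDIA cards can be set against a rough theoretical peak. The sketch below rests on two assumptions that the export does not confirm: each compute core retires 2 single-precision FLOPs per clock (one fused multiply-add), and the first number in each card's "(core/memory MHz)" pair in the system information is the core clock. Compute core counts and measured results are copied from this document; `peak_sp_gflops` and `NVIDIA_CARDS` are illustrative names.

```python
def peak_sp_gflops(compute_cores, core_clock_mhz, flops_per_core_per_clock=2):
    """Rough peak in GFLOPS: cores * clock (MHz) * FLOPs per clock / 1000.

    The default of 2 FLOPs per core per clock (one FMA) is an assumption.
    """
    return compute_cores * core_clock_mhz * flops_per_core_per_clock / 1000.0


# (compute cores, listed clock in MHz assumed to be the core clock, measured GFLOPS)
NVIDIA_CARDS = {
    "GeForce GTX 950": (768, 1202, 2217.26),
    "GeForce GTX 960": (1024, 1277, 2944.52),
    "GeForce GTX 970": (1664, 1163, 4335.38),
    "GeForce GTX 980": (2048, 1126, 4991.32),
    "GeForce GTX 980 Ti": (2816, 999, 6177.72),
    "GeForce GTX 780 Ti": (2880, 875, 5028.50),
    "GeForce GTX TITAN X": (3072, 1001, 6925.52),
}

# Compare each measured result with the estimate at the listed clock.
for name, (cores, clock_mhz, measured) in NVIDIA_CARDS.items():
    peak = peak_sp_gflops(cores, clock_mhz)
    print(f"{name:22s} peak ~{peak:8.1f} GFLOPS, measured/peak = {measured / peak:.2f}")
```

A ratio above 1.0 does not mean a card exceeded its hardware limit; it would suggest the GPU ran above the listed clock during the test, which this estimate does not account for.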

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, More Is Better.

GPU                   GB/s    SE +/-   N
Radeon R9 285         12.79   0.01     3
Radeon R9 Fury        12.70   0.02     3
Radeon R9 290         7.48    0.39     6
GeForce GTX 980 Ti    12.50   0.01     3
GeForce GTX 950       12.50   0.01     3
GeForce GTX 960       12.50   0.01     3
GeForce GTX 970       12.51   0.00     3
GeForce GTX 980       12.52   0.00     3
GeForce GTX 780 Ti    12.53   0.00     3
GeForce GTX TITAN X   12.53   0.00     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, More Is Better.

GPU                   GB/s    SE +/-   N
Radeon R9 285         13.65   0.17     3
Radeon R9 Fury        13.03   0.34     6
Radeon R9 290         6.12    0.46     6
GeForce GTX 980 Ti    13.21   0.00     3
GeForce GTX 950       13.21   0.00     3
GeForce GTX 960       13.21   0.00     3
GeForce GTX 970       13.21   0.00     3
GeForce GTX 980       13.22   0.00     3
GeForce GTX 780 Ti    13.24   0.00     3
GeForce GTX TITAN X   13.22   0.00     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10. GB/s, More Is Better.

GPU                   GB/s     SE +/-   N
Radeon R9 285         170.03   1.33     3
Radeon R9 Fury        220.93   1.87     3
Radeon R9 290         255.78   0.83     3
GeForce GTX 980 Ti    350.05   1.14     3
GeForce GTX 950       240.75   1.02     3
GeForce GTX 960       270.81   0.71     3
GeForce GTX 970       284.94   0.50     3
GeForce GTX 980       329.47   1.07     3
GeForce GTX 780 Ti    286.80   0.10     3
GeForce GTX TITAN X   351.49   1.38     3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0. Score, More Is Better.

GPU                   Score   SE +/-   N
Radeon R9 285         890     3.33     3
Radeon R9 Fury        1733    2.85     3
Radeon R9 290         1512    0.33     3
GeForce GTX 980 Ti    2480    3.00     3
GeForce GTX 950       1117    0.58     3
GeForce GTX 960       1265    1.00     3
GeForce GTX 970       1943    2.33     3
GeForce GTX 980       2093    6.08     3
GeForce GTX 780 Ti    1265    1.53     3
GeForce GTX TITAN X   2614    7.33     3

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Score, More Is Better.

GPU                   Score   SE +/-   N
Radeon R9 285         3426    16.37    3
Radeon R9 Fury        6361    5.51     3
Radeon R9 290         5350    13.00    3
GeForce GTX 980 Ti    8050    38.50    3
GeForce GTX 950       3104    2.33     3
GeForce GTX 960       3103    0.58     3
GeForce GTX 970       5824    4.67     3
GeForce GTX 980       6010    2.67     3
GeForce GTX 780 Ti    4557    2.52     3
GeForce GTX TITAN X   8212    14.45    3

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Score, More Is Better.

GPU                   Score   SE +/-   N
Radeon R9 285         7517    14.50    3
Radeon R9 Fury        14326   20.67    3
Radeon R9 290         12021   17.44    3
GeForce GTX 980 Ti    15310   34.86    3
GeForce GTX 950       5972    13.50    3
GeForce GTX 960       6277    3.18     3
GeForce GTX 970       10981   22.15    3
GeForce GTX 980       12121   0.33     3
GeForce GTX 780 Ti    9647    34.84    3
GeForce GTX TITAN X   15585   3.71     3


Phoronix Test Suite v10.8.4