AMD NVIDIA Linux GPU OpenCL Compute

AMD Polaris and NVIDIA Pascal OpenCL benchmarks on Ubuntu 16.04 LTS Linux. Benchmarks by Michael Larabel for a future article on phoronix.com..

HTML result view exported from: https://openbenchmarking.org/result/1608190-KH-1608193LO70.

AMD NVIDIA Linux GPU OpenCL ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 FuryIntel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MB4 x 120GB TOSHIBA-TR150 + Samsung SSD 950 PRO 256GBeVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-34-generic (x86_64)Unity 7.4.0X Server 1.18.3NVIDIA 367.354.5.01.0.8GCC 5.4.0 20160609ext43840x2160eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1612/5005MHz)XFX AMD Radeon R9 200 2048MB4.5.13448XFX AMD Radeon R9 200 4096MBSapphire AMD Radeon R9 Fury 4096MBAMD Radeon RX 460 2048MBSapphire AMD Radeon RX 470 4096MBAMD Radeon RX 480 8192MBSapphire AMD Radeon R7 200 2048MBIntel Core i5-6600 @ 3.90GHz (4 Cores)ASRock Z170 Pro4SIntel Sky Lake /DRAM120GB GOODRAM C50 + 32GB ADATA SP800 + 250GB Crucial_CT250MX2 + 1000GB Seagate ST1000DM003-1SB1Sapphire AMD Radeon R9 Fury 4096MBRealtek ALC892IPS224Xfce 4.121920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- GeForce GTX 950: Scaling Governor: intel_pstate performance- GeForce GTX 960: Scaling Governor: intel_pstate performance- GeForce GTX 970: Scaling Governor: intel_pstate performance- GeForce GTX 980: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- GeForce GTX TITAN X Maxwell: Scaling Governor: intel_pstate performance- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- Radeon R9 285: Scaling Governor: intel_pstate performance- Radeon R9 290: Scaling Governor: intel_pstate performance- Radeon R9 Fury: Scaling Governor: intel_pstate performance- Radeon RX 460: Scaling Governor: intel_pstate performance- Radeon RX 470 OC: Scaling Governor: intel_pstate performance- Radeon RX 480: Scaling Governor: intel_pstate performance- Radeon R7 260X: Scaling Governor: intel_pstate performance- R9 Fury: Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X Maxwell: GPU Compute Cores: 3072- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560System Details- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X Maxwell: GPU Compute Cores: 3072.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.Graphics Details- Radeon R9 285, Radeon R9 290, Radeon R9 Fury, Radeon RX 460, Radeon RX 470 OC, Radeon RX 480, Radeon R7 260X, R9 Fury: GLAMOR

AMD NVIDIA Linux GPU OpenCL Computemixbench: Single Precisionmixbench: Double Precisionmixbench: Integerfinancebench: Black-Scholes OpenCLluxmark: GPU - Luxball HDRluxmark: GPU - Microphoneluxmark: GPU - Hotelshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Triadjuliagpu: GPUmandelbulbgpu: GPUGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 Fury2153.3870.30633.3326.7159883116112212.4513.202210.69244.03115.102.7010.8367614640.4737190054.672788.5092.97837.7017.7862693177129712.4813.202943.15270.35123.933.8610.7482726348.8044815450.034157.94137.331222.2214.08110215873184612.4813.144336.29284.31209.225.4611.36107959473.8358806852.004747.21159.021405.0210.66122216111198312.4913.175019.38331.43212.196.2411.45117722561.5063820883.305874.35197.781727.278.95152167981251512.4313.156173.34349.42203.377.7411.68132005802.9771117240.806585.34221.821924.277.85156668309249712.4313.226885.13354.01203.768.3911.69140161877.9776108478.634407.44151.941367.7811.21114595204211212.4713.214785.98369.71214.665.6311.36114733414.7762160246.306502.90226.752008.757.79158397350308312.4813.237080.77451.96294.608.3711.70143250390.9777948508.238556.64300.162688.916.03125136557293312.5313.199389.10521.92327.7811.8011.82164343285.1389708762.471764.93648.248504404412.9512.543275.62162.3416.703.778.424585.081575.631534469148.938.214776.98251.5745.455.526.016597.131358.2711.5019564879412.9412.967130.64221.9636.656.044.001718.13425.18545528766.847.142119.7676.6334.042.373.124891.211021.889.2112732703012.9612.375145.09156.2227.794.688.345516.271152.827.5514180779812.9612.685805.78162.7427.375.173.98841.28328.774649231211.529.272028.0997.9520.372.367.056724.13449.371428.807.72199309022OpenBenchmarking.org

Mixbench

Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Single PrecisionGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 Fury2K4K6K8K10KSE +/- 0.18, N = 3SE +/- 1.13, N = 3SE +/- 2.04, N = 3SE +/- 1.51, N = 3SE +/- 8.48, N = 3SE +/- 8.77, N = 3SE +/- 4.04, N = 3SE +/- 85.54, N = 3SE +/- 198.39, N = 3SE +/- 193.76, N = 3SE +/- 0.85, N = 3SE +/- 1.05, N = 3SE +/- 260.96, N = 3SE +/- 0.41, N = 3SE +/- 0.43, N = 3SE +/- 45.05, N = 3SE +/- 0.37, N = 32153.382788.504157.944747.215874.356585.344407.446502.908556.641764.934585.086597.131718.134891.215516.27841.286724.131. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Benchmark: Double Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Double PrecisionGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080R9 Fury100200300400500SE +/- 0.08, N = 3SE +/- 0.21, N = 3SE +/- 0.05, N = 3SE +/- 0.65, N = 3SE +/- 0.13, N = 3SE +/- 0.13, N = 3SE +/- 0.25, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 370.3092.97137.33159.02197.78221.82151.94226.75300.16449.371. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2016-06-06Benchmark: IntegerGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 Fury6001200180024003000SE +/- 0.60, N = 3SE +/- 0.54, N = 3SE +/- 1.90, N = 3SE +/- 2.17, N = 3SE +/- 4.84, N = 3SE +/- 2.12, N = 3SE +/- 0.55, N = 3SE +/- 27.29, N = 3SE +/- 2.27, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 44.24, N = 3SE +/- 1.34, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 37.53, N = 3SE +/- 0.06, N = 3633.33837.701222.221405.021727.271924.271367.782008.752688.91648.241575.631358.27425.181021.881152.82328.771428.801. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 FuryRadeon RX 470 OCRadeon RX 480R9 Fury612182430SE +/- 1.69, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 2.57, N = 3SE +/- 0.22, N = 3SE +/- 0.22, N = 3SE +/- 1.27, N = 626.7117.7814.0810.668.957.8511.217.796.0311.509.217.557.721. (CXX) g++ options: -O3 -lOpenCL

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 Fury4K8K12K16K20KSE +/- 14.86, N = 3SE +/- 4.33, N = 3SE +/- 26.69, N = 3SE +/- 71.17, N = 3SE +/- 143.18, N = 3SE +/- 44.68, N = 3SE +/- 4.26, N = 3SE +/- 73.73, N = 3SE +/- 12.12, N = 3SE +/- 33.31, N = 3SE +/- 16.65, N = 3SE +/- 79.45, N = 3SE +/- 14.50, N = 3SE +/- 63.01, N = 3SE +/- 71.34, N = 3SE +/- 11.78, N = 3SE +/- 93.11, N = 359886269110211222115216156661145915839125138504153441956454551273214180464919930

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X2040608010047.5847.7163.5465.3366.8964.1283.5594.9176.3448.6464.9287.8256.3468.1080.3246.20

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X50100150200250Min: 43.2 / Avg: 125.85 / Max: 131.1Min: 98.5 / Avg: 131.39 / Max: 133.5Min: 86.2 / Avg: 173.44 / Max: 177.1Min: 45.9 / Avg: 187.08 / Max: 194Min: 59.5 / Avg: 227.47 / Max: 238Min: 50.3 / Avg: 244.33 / Max: 254.9Min: 133.4 / Avg: 137.14 / Max: 138.9Min: 40.4 / Avg: 166.89 / Max: 173.7Min: 40 / Avg: 163.91 / Max: 170.1Min: 54.2 / Avg: 174.83 / Max: 179Min: 63.2 / Avg: 236.36 / Max: 246.2Min: 55.4 / Avg: 222.78 / Max: 234.6Min: 44.4 / Avg: 96.82 / Max: 98.7Min: 54.3 / Avg: 186.97 / Max: 199.1Min: 58.6 / Avg: 176.55 / Max: 184.2Min: 48.9 / Avg: 100.63 / Max: 103.2

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260XR9 Fury2K4K6K8K10KSE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 5.57, N = 3SE +/- 0.88, N = 3SE +/- 1.73, N = 3SE +/- 9.00, N = 3SE +/- 2.40, N = 3SE +/- 8.69, N = 3SE +/- 1.76, N = 3SE +/- 18.84, N = 3SE +/- 9.39, N = 3SE +/- 39.47, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.58, N = 3SE +/- 9.60, N = 331163177587361117981830952047350655740446914879428767030779823129022

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X112233445525.3025.8936.9735.5236.5735.0643.0748.0442.1924.6931.9342.6530.1539.4745.7423.92

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X4080120160200Min: 76 / Avg: 123.15 / Max: 126.8Min: 44 / Avg: 122.71 / Max: 126.1Min: 47.2 / Avg: 158.84 / Max: 165.2Min: 48.4 / Avg: 172.06 / Max: 181.7Min: 66.1 / Avg: 218.21 / Max: 223.9Min: 66.4 / Avg: 237.03 / Max: 242.6Min: 40 / Avg: 120.84 / Max: 124.1Min: 43.2 / Avg: 153.01 / Max: 155.8Min: 42.3 / Avg: 155.42 / Max: 159.2Min: 71.6 / Avg: 163.81 / Max: 169.2Min: 78.6 / Avg: 216.57 / Max: 227.2Min: 57.9 / Avg: 206.19 / Max: 213.3Min: 46.3 / Avg: 95.39 / Max: 97.6Min: 69.3 / Avg: 178.13 / Max: 187.5Min: 67.3 / Avg: 170.49 / Max: 176.8Min: 50.8 / Avg: 96.65 / Max: 99.3

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 10807001400210028003500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.33, N = 3SE +/- 4.00, N = 3SE +/- 37.43, N = 3SE +/- 3.61, N = 3SE +/- 1.20, N = 3SE +/- 0.58, N = 3SE +/- 2.00, N = 3112212971846198325152497211230832933

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080481216208.389.0110.4410.3410.7810.5214.6916.8115.41

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X50100150200250Min: 43.9 / Avg: 133.94 / Max: 140.1Min: 44.4 / Avg: 143.88 / Max: 146.8Min: 46.9 / Avg: 176.81 / Max: 181.4Min: 47.8 / Avg: 191.79 / Max: 199.2Min: 51.6 / Avg: 233.39 / Max: 243.6Min: 53.7 / Avg: 237.45 / Max: 253.9Min: 72.9 / Avg: 143.73 / Max: 147.8Min: 42.9 / Avg: 183.4 / Max: 188.3Min: 42.3 / Avg: 190.35 / Max: 194.9Min: 71.4 / Avg: 81.13 / Max: 95.3Min: 81.8 / Avg: 84.84 / Max: 87.8Min: 58.3 / Avg: 70.63 / Max: 77.6Min: 55.3 / Avg: 62.12 / Max: 67.3Min: 68.5 / Avg: 74.85 / Max: 88.6Min: 60.3 / Avg: 73.58 / Max: 77Min: 66.1 / Avg: 71.26 / Max: 79.3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X3691215SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.93, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.08, N = 312.4512.4812.4812.4912.4312.4312.4712.4812.5312.958.9312.946.8412.9612.9611.521. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.30, N = 3SE +/- 1.36, N = 3SE +/- 0.46, N = 3SE +/- 0.00, N = 3SE +/- 0.24, N = 3SE +/- 0.22, N = 3SE +/- 1.53, N = 313.2013.2013.1413.1713.1513.2213.2113.2313.1912.548.2112.967.1412.3712.689.271. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X2K4K6K8K10KSE +/- 0.17, N = 3SE +/- 0.74, N = 3SE +/- 0.64, N = 3SE +/- 2.08, N = 3SE +/- 16.67, N = 3SE +/- 4.90, N = 3SE +/- 13.64, N = 3SE +/- 17.78, N = 3SE +/- 18.40, N = 3SE +/- 0.30, N = 3SE +/- 0.49, N = 3SE +/- 0.31, N = 3SE +/- 9.80, N = 3SE +/- 0.30, N = 3SE +/- 0.27, N = 3SE +/- 0.38, N = 32210.692943.154336.295019.386173.346885.134785.987080.779389.103275.624776.987130.642119.765145.095805.782028.091. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X153045607520.8026.7032.9934.3934.5336.0547.0356.4467.3423.9831.5637.8926.9435.0241.5723.50

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X50100150200250Min: 100.1 / Avg: 106.29 / Max: 146Min: 102.5 / Avg: 110.22 / Max: 167.4Min: 46.9 / Avg: 131.44 / Max: 203.8Min: 132.6 / Avg: 145.97 / Max: 231.6Min: 51.2 / Avg: 178.77 / Max: 277.6Min: 56.7 / Avg: 190.99 / Max: 303.2Min: 39.8 / Avg: 101.76 / Max: 162.5Min: 42.1 / Avg: 125.46 / Max: 191Min: 41.6 / Avg: 139.42 / Max: 239.6Min: 85.3 / Avg: 136.57 / Max: 156.9Min: 64.8 / Avg: 151.38 / Max: 196.5Min: 58.1 / Avg: 188.21 / Max: 219Min: 45.4 / Avg: 78.7 / Max: 90.1Min: 67.1 / Avg: 146.91 / Max: 166.5Min: 58.1 / Avg: 139.65 / Max: 184.1Min: 50.1 / Avg: 86.31 / Max: 110.7

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X110220330440550SE +/- 0.50, N = 3SE +/- 0.25, N = 3SE +/- 0.51, N = 3SE +/- 0.23, N = 3SE +/- 0.27, N = 3SE +/- 2.25, N = 3SE +/- 1.18, N = 3SE +/- 0.97, N = 3SE +/- 0.40, N = 3SE +/- 1.03, N = 3SE +/- 1.94, N = 3SE +/- 1.10, N = 3SE +/- 0.57, N = 3SE +/- 0.37, N = 3SE +/- 0.23, N = 3SE +/- 0.15, N = 3244.03270.35284.31331.43349.42354.01369.71451.96521.92162.34251.57221.9676.63156.22162.7497.951. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X0.95851.9172.87553.8344.79252.432.422.272.282.011.953.743.764.261.401.821.991.021.481.641.24

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X4080120160200Min: 43.9 / Avg: 100.35 / Max: 118.2Min: 105.6 / Avg: 111.63 / Max: 118.8Min: 46.4 / Avg: 125.36 / Max: 145.2Min: 107.6 / Avg: 145.23 / Max: 159.1Min: 51 / Avg: 173.78 / Max: 208Min: 100.1 / Avg: 181.34 / Max: 220.2Min: 39.6 / Avg: 98.87 / Max: 114.5Min: 41.9 / Avg: 120.3 / Max: 141Min: 41.9 / Avg: 122.51 / Max: 141.9Min: 73.3 / Avg: 115.8 / Max: 147.4Min: 92.8 / Avg: 138.33 / Max: 188.2Min: 58.6 / Avg: 111.6 / Max: 141.3Min: 46.1 / Avg: 75.1 / Max: 82.4Min: 56.8 / Avg: 105.47 / Max: 125.8Min: 63.6 / Avg: 99.32 / Max: 121.2Min: 50 / Avg: 78.73 / Max: 88.9

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X70140210280350SE +/- 0.17, N = 3SE +/- 0.52, N = 3SE +/- 0.49, N = 3SE +/- 0.45, N = 3SE +/- 0.52, N = 3SE +/- 1.14, N = 3SE +/- 0.46, N = 3SE +/- 0.63, N = 3SE +/- 0.85, N = 3SE +/- 0.32, N = 3SE +/- 13.07, N = 3SE +/- 0.29, N = 3SE +/- 6.49, N = 3SE +/- 0.20, N = 3SE +/- 0.73, N = 3SE +/- 10.55, N = 3115.10123.93209.22212.19203.37203.76214.66294.60327.7816.7045.4536.6534.0427.7927.3720.371. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.25, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.33, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 32.703.865.466.247.748.395.638.3711.803.775.526.042.374.685.172.361. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X3691215SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.85, N = 3SE +/- 0.03, N = 3SE +/- 0.33, N = 310.8310.7411.3611.4511.6811.6911.3611.7011.828.426.014.003.128.343.987.051. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 108040M80M120M160M200MSE +/- 109829.25, N = 3SE +/- 462038.74, N = 3SE +/- 474968.42, N = 3SE +/- 182338.25, N = 3SE +/- 172847.35, N = 3SE +/- 140800.69, N = 3SE +/- 291564.71, N = 3SE +/- 70686.86, N = 3SE +/- 413616.13, N = 367614640.4782726348.80107959473.83117722561.50132005802.97140161877.97114733414.77143250390.97164343285.131. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec Per Watt, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080300K600K900K1200K1500K640150.26786780.08904351.51901570.45828315.02834214.131258904.561364506.031492898.88

JuliaGPU

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterJuliaGPU 1.2pts1System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X306090120150Min: 81.9 / Avg: 105.62 / Max: 108.9Min: 44.3 / Avg: 105.15 / Max: 112.5Min: 46.6 / Avg: 119.38 / Max: 129.1Min: 102.3 / Avg: 130.58 / Max: 137.8Min: 115.3 / Avg: 159.37 / Max: 171.9Min: 133.8 / Avg: 168.02 / Max: 180Min: 38.7 / Avg: 91.14 / Max: 102.2Min: 41 / Avg: 104.98 / Max: 118.9Min: 40.7 / Avg: 110.08 / Max: 129.2Min: 55.8 / Avg: 106.08 / Max: 124.3Min: 65.3 / Avg: 130.24 / Max: 152.5Min: 57.3 / Avg: 75.59 / Max: 98.1Min: 51.6 / Avg: 58.46 / Max: 60.5Min: 55.4 / Avg: 80.42 / Max: 84.3Min: 58.8 / Avg: 84.51 / Max: 90.3Min: 50.6 / Avg: 77.24 / Max: 81.5

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 108020M40M60M80M100MSE +/- 13927.02, N = 3SE +/- 60575.76, N = 3SE +/- 104185.29, N = 3SE +/- 2971.36, N = 3SE +/- 608054.60, N = 3SE +/- 202155.32, N = 3SE +/- 289951.05, N = 3SE +/- 473735.75, N = 3SE +/- 419322.49, N = 337190054.6744815450.0358806852.0063820883.3071117240.8076108478.6362160246.3077948508.2389708762.471. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec Per Watt, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080200K400K600K800K1000K386239.70452300.59563553.92563390.57463103.37446973.89711704.22749144.72802942.60

MandelbulbGPU

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterMandelbulbGPU 1.0pts1System Power Consumption MonitorGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X306090120150Min: 44.1 / Avg: 96.29 / Max: 105.4Min: 80.8 / Avg: 99.08 / Max: 106.9Min: 48.2 / Avg: 104.35 / Max: 122.8Min: 52.8 / Avg: 113.28 / Max: 132.7Min: 130.3 / Avg: 153.57 / Max: 166Min: 164 / Avg: 170.28 / Max: 175.1Min: 59.2 / Avg: 87.34 / Max: 99.4Min: 68.1 / Avg: 104.05 / Max: 117.1Min: 70.2 / Avg: 111.73 / Max: 125.8Min: 113.8 / Avg: 114.86 / Max: 116.5Min: 141.1 / Avg: 144.95 / Max: 147.4Min: 57.3 / Avg: 76.06 / Max: 83.7Min: 45.6 / Avg: 60.06 / Max: 62.8Min: 55 / Avg: 76.23 / Max: 83.5Min: 88 / Avg: 88.59 / Max: 89.4Min: 50 / Avg: 77.5 / Max: 83

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X MaxwellGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon R9 285Radeon R9 290Radeon R9 FuryRadeon RX 460Radeon RX 470 OCRadeon RX 480Radeon R7 260X50100150200250Min: 43.2 / Avg: 106.98 / Max: 146Min: 43.8 / Avg: 110.57 / Max: 167.4Min: 46.2 / Avg: 138 / Max: 213.1Min: 45.9 / Avg: 149.24 / Max: 231.6Min: 49.5 / Avg: 182.17 / Max: 277.6Min: 50.3 / Avg: 193.1 / Max: 303.2Min: 38.4 / Avg: 106.15 / Max: 162.5Min: 40.4 / Avg: 133.78 / Max: 202.8Min: 40 / Avg: 145.64 / Max: 243.4Min: 54.2 / Avg: 132.75 / Max: 179Min: 63.1 / Avg: 163.82 / Max: 246.2Min: 55.1 / Avg: 154.66 / Max: 234.6Min: 44.4 / Avg: 76.79 / Max: 98.7Min: 52.9 / Avg: 132.38 / Max: 199.1Min: 57.4 / Avg: 128.01 / Max: 184.2Min: 48.3 / Avg: 84.62 / Max: 110.7


Phoronix Test Suite v10.8.4