Radeon ROCm vs. NVIDIA OpenCL August 2017

Radeon ROCm and NVIDIA OpenCL Linux testing by Michael Larabel for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1708107-TY-OPENCLVEG85&grs&sor.

Radeon ROCm vs. NVIDIA OpenCL August 2017: System Details

Common to all systems
Processor: Intel Core i7-7740K @ 4.50GHz (8 Cores)
Motherboard: ASUS PRIME X299-A
Chipset: Intel Device 591f
Memory: 16384MB
Disk: 525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GB
Monitor: Acer B286HK
Network: Intel Connection
OS: Ubuntu 16.04
Desktop: Unity 7.4.0
Vulkan: 1.0.42
Compiler: GCC 5.4.0 20160609
File-System: ext4
Screen Resolution: 3840x2160

Radeon systems
Audio: Realtek Generic
Kernel: 4.11.0-kfd-compute-rocm-rel-1.6-127 (x86_64)
Display Driver: modesetting 1.19.3 (Radeon R9 285, R9 290, RX 580, R9 Fury); amdgpu 1.3.0 (Radeon RX 480, RX 560)
OpenGL: 4.5 Mesa 17.3.0-devel - padoka PPA (LLVM 6.0.0)
OpenCL: OpenCL 2.0 AMD-APP (2450.0) (listed from the Radeon R9 290 onward)

GeForce systems
Audio: Realtek ALC1220
Kernel: 4.13.0-999-generic (x86_64) 20170730
Display Driver: NVIDIA 384.59
OpenGL: 4.5.0
OpenCL: OpenCL 1.2 CUDA 9.0.130

Graphics
Radeon R9 285: XFX AMD TONGA 2048MB
Radeon R9 290: XFX AMD HAWAII 4096MB
Radeon RX 480: AMD POLARIS10 8192MB
Radeon RX 560: AMD POLARIS11 4096MB
Radeon RX 580: MSI AMD POLARIS10 8192MB
Radeon R9 Fury: Sapphire AMD FIJI 4096MB
GeForce GTX 780 Ti: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
GeForce GTX 960: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
GeForce GTX 970: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
GeForce GTX 980: NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
GeForce GTX 980 Ti: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
GeForce GTX 1050: Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)
GeForce GTX 1060: NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)
GeForce GTX 1070: NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)
GeForce GTX 1080: NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)
GeForce GTX 1080 Ti: NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)

Compiler Details
--build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
Scaling Governor: intel_pstate performance

OpenCL Details (GPU Compute Cores)
GeForce GTX 780 Ti: 2880
GeForce GTX 960: 1024
GeForce GTX 970: 1664
GeForce GTX 980: 2048
GeForce GTX 980 Ti: 2816
GeForce GTX 1050: 640
GeForce GTX 1060: 1280
GeForce GTX 1070: 1920
GeForce GTX 1080: 2560
GeForce GTX 1080 Ti: 3584

Radeon ROCm vs. NVIDIA OpenCL August 2017: Result Overview

GPU | mixbench: Integer | mixbench: Single Precision | clpeak: Double-Precision Double | mixbench: Double Precision | luxmark: GPU - Hotel | shoc: OpenCL - Max SP Flops | darktable: Boat - OpenCL | clpeak: Integer Compute INT | shoc: OpenCL - MD5 Hash | clpeak: Single-Precision Float | cl-mem: Write | clpeak: Global Memory Bandwidth | viennacl: OpenCL LU Factorization | shoc: OpenCL - Texture Read Bandwidth | clpeak: Transfer Bandwidth enqueueWriteBuffer | shoc: OpenCL - FFT SP | cl-mem: Copy | luxmark: GPU - Luxball HDR | cl-mem: Read | fahbench | shoc: OpenCL - Triad | darktable: Server Room - OpenCL | clpeak: Kernel Latency | darktable: Masskrug - OpenCL
Radeon R9 285 | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | -
Radeon R9 290 | - | - | 599.06 | - | 1013 | 4790.31 | 3.85 | 1595.41 | 6.11 | 4755.89 | 217.70 | 272.17 | 20.97 | 253.64 | 30.33 | - | 192.40 | 10400 | 123.00 | - | - | 0.15 | 6.67 | 0.17
Radeon RX 480 | 1140.24 | 5450.06 | 364.04 | 362.29 | 1070 | 5812.64 | 4.16 | 1162.88 | 7.33 | 5737.62 | 179.70 | 209.39 | 12.05 | 193.43 | 30.64 | 462.35 | 185.37 | 9885 | 157.87 | - | 9.61 | 0.13 | 5.86 | 0.14
Radeon RX 560 | 508.69 | 2465.62 | 164.50 | 164.00 | 509 | 2634.53 | 6.91 | 524.70 | 3.34 | 2588.66 | 80.90 | 89.25 | 12.08 | 115.81 | 30.38 | 209.76 | 81.03 | 4468 | 93.50 | - | 4.95 | 0.27 | 5.82 | 0.27
Radeon RX 580 | 1227.32 | 5863.73 | 391.77 | 389.97 | 1218 | 6260.48 | 4.08 | 1251.63 | 7.96 | 6175.41 | 180.70 | 208.96 | 11.98 | 207.97 | 30.38 | 498.10 | 183.47 | 10400 | 159.73 | - | 9.68 | 0.17 | 5.63 | 0.18
Radeon R9 Fury | 1386.50 | 6508.34 | 447.26 | 440.85 | 1367 | 7144.88 | 3.42 | 1429.32 | 9.09 | 7068.81 | 391.50 | 431.26 | 21.76 | 245.68 | 30.74 | 550.79 | 206.13 | 13212 | 123.30 | - | 10.17 | 0.13 | 5.94 | 0.13
GeForce GTX 780 Ti | 968.93 | 4263.71 | 246.21 | 245.11 | 1199 | 4944.72 | 15.24 | 961.13 | 4.66 | 3847.88 | 252.00 | 252.98 | 54.59 | 287.39 | 12.36 | 429.87 | 237.10 | 9516 | 271.67 | 72.77 | 12.35 | 0.25 | 5.56 | 0.27
GeForce GTX 960 | 835.01 | 2765.13 | 92.44 | 92.74 | 1130 | 2960.49 | 19.30 | 781.16 | 4.46 | 2429.10 | 70.80 | 81.12 | 47.64 | 277.11 | 12.46 | 209.15 | 70.60 | 6148 | 81.40 | 58.35 | 11.24 | 0.25 | 3.88 | 0.24
GeForce GTX 970 | 1220.80 | 4125.24 | 137.56 | 136.18 | 1649 | 4361.65 | 16.26 | 1128.64 | 6.54 | 3728.55 | 129.70 | 143.41 | 53.03 | 288.87 | 12.49 | 382.02 | 125.27 | 10704 | 143.55 | 85.45 | 11.91 | 0.22 | 4.05 | 0.21
GeForce GTX 980 | 1402.27 | 4700.91 | 159.68 | 159.86 | 1756 | 5051.89 | 15.23 | 1296.11 | 7.54 | 4288.05 | 152.30 | 164.10 | 54.96 | 332.11 | 12.48 | 447.62 | 142.60 | 11959 | 164.47 | 97.38 | 12.03 | 0.21 | 4.07 | 0.21
GeForce GTX 980 Ti | 1717.51 | 5563.69 | 195.67 | 194.72 | 2142 | 6208.65 | 4.19 | 1583.85 | 9.27 | 5292.12 | 238.40 | 263.32 | 56.91 | 351.45 | 12.21 | 693.44 | 216.37 | 14811 | 266.17 | 108.15 | 12.29 | 0.22 | 4.33 | 0.22
GeForce GTX 1050 | 614.52 | 2040.46 | 66.88 | 66.76 | 1022 | 2125.78 | 18.18 | 568.64 | 3.24 | 1937.00 | 85.73 | 92.43 | 41.80 | 274.91 | 6.44 | 204.36 | 87.50 | 6569 | 95 | 49.78 | 6.12 | 0.23 | 3.82 | 0.22
GeForce GTX 1060 | 1366.84 | 4389.15 | 150.41 | 152.16 | 1742 | 4829.99 | 4.66 | 1223.58 | 7.36 | 4157.55 | 138.70 | 146.23 | 54.03 | 382.21 | 12.37 | 322.16 | 137.70 | 11572 | 151.60 | 97.24 | 11.96 | 0.19 | 4.10 | 0.18
GeForce GTX 1070 | 2027.96 | 6402.60 | 225.45 | 223.87 | 2286 | 7125.88 | 3.78 | 1631.03 | 10.61 | 6359.20 | 191.43 | 196.36 | 58.95 | 454.47 | 12.61 | 470.12 | 186.63 | 16186 | 205.43 | 132.79 | 12.20 | 0.19 | 3.55 | 0.18
GeForce GTX 1080 | 2662.46 | 8493.48 | 295.22 | 295.34 | 2725 | 9446.74 | 3.67 | 2349.86 | 14.29 | 8249.62 | 213.13 | 218.15 | 61.26 | 520.13 | 12.59 | 597.69 | 206.53 | 12776 | 227.20 | 145.38 | 12.31 | 0.18 | 4.06 | 0.17
GeForce GTX 1080 Ti | 27.88 | 89.23 | 415.06 | 6.39 | 3532 | 13274.17 | 3.13 | 3200.30 | 19.74 | 11780.26 | 335.47 | 329.43 | 63.67 | 596.82 | 12.61 | 974.37 | 316.80 | 19662 | 338.07 | 186.74 | 12.51 | 0.18 | 3.55 | 0.17

Mixbench

Benchmark: Integer

Mixbench 2016-06-06 (GIOPS, more is better)

GeForce GTX 1080: 2662.46 (SE +/- 3.22, N = 3)
GeForce GTX 1070: 2027.96 (SE +/- 2.60, N = 3)
GeForce GTX 980 Ti: 1717.51 (SE +/- 1.33, N = 3)
GeForce GTX 980: 1402.27 (SE +/- 0.90, N = 3)
Radeon R9 Fury: 1386.50 (SE +/- 0.06, N = 3)
GeForce GTX 1060: 1366.84 (SE +/- 0.54, N = 3)
Radeon RX 580: 1227.32 (SE +/- 0.04, N = 3)
GeForce GTX 970: 1220.80 (SE +/- 0.50, N = 3)
Radeon RX 480: 1140.24 (SE +/- 0.04, N = 3)
GeForce GTX 780 Ti: 968.93 (SE +/- 0.88, N = 3)
GeForce GTX 960: 835.01 (SE +/- 2.57, N = 3)
GeForce GTX 1050: 614.52 (SE +/- 0.97, N = 3)
Radeon RX 560: 508.69 (SE +/- 0.12, N = 3)
GeForce GTX 1080 Ti: 27.88 (SE +/- 1.30, N = 3)

1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2
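Each result in this file is reported with a standard error ("SE +/- x, N = 3"). The sketch below shows the conventional definition, the sample standard deviation divided by the square root of the run count. Whether the Phoronix Test Suite uses exactly this estimator is an assumption, and the run values are hypothetical, since the per-run numbers are not part of this export.

```python
import math
import statistics


def standard_error(runs):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumption: the textbook estimator; the benchmark harness may compute
    its "SE +/-" figure slightly differently.
    """
    if len(runs) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(runs) / math.sqrt(len(runs))


# Hypothetical GIOPS readings from three runs (not taken from the result file).
hypothetical_runs = [2656.0, 2665.0, 2666.4]

mean_result = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)
print(f"{mean_result:.2f} (SE +/- {se:.2f}, N = {len(hypothetical_runs)})")
```

A small SE relative to the mean indicates the three runs agreed closely; a large one suggests run-to-run variance that should temper any ranking drawn from neighboring bars.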

Mixbench

Benchmark: Single Precision

Mixbench 2016-06-06 (GFLOPS, more is better)

GeForce GTX 1080: 8493.48 (SE +/- 48.08, N = 3)
Radeon R9 Fury: 6508.34 (SE +/- 1.46, N = 3)
GeForce GTX 1070: 6402.60 (SE +/- 54.41, N = 3)
Radeon RX 580: 5863.73 (SE +/- 4.20, N = 3)
GeForce GTX 980 Ti: 5563.69 (SE +/- 255.36, N = 3)
Radeon RX 480: 5450.06 (SE +/- 2.06, N = 3)
GeForce GTX 980: 4700.91 (SE +/- 5.25, N = 3)
GeForce GTX 1060: 4389.15 (SE +/- 2.95, N = 3)
GeForce GTX 780 Ti: 4263.71 (SE +/- 5.00, N = 3)
GeForce GTX 970: 4125.24 (SE +/- 2.66, N = 3)
GeForce GTX 960: 2765.13 (SE +/- 0.91, N = 3)
Radeon RX 560: 2465.62 (SE +/- 0.14, N = 3)
GeForce GTX 1050: 2040.46 (SE +/- 0.81, N = 3)
GeForce GTX 1080 Ti: 89.23 (SE +/- 3.63, N = 3)

1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

clpeak

OpenCL Test: Double-Precision Double

clpeak (GFLOPS, more is better)

Radeon R9 290: 599.06 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 447.26 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 415.06 (SE +/- 1.65, N = 3)
Radeon RX 580: 391.77 (SE +/- 0.07, N = 3)
Radeon RX 480: 364.04 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 295.22 (SE +/- 0.76, N = 3)
GeForce GTX 780 Ti: 246.21 (SE +/- 0.29, N = 3)
GeForce GTX 1070: 225.45 (SE +/- 1.12, N = 3)
GeForce GTX 980 Ti: 195.67 (SE +/- 0.19, N = 3)
Radeon RX 560: 164.50 (SE +/- 0.02, N = 3)
GeForce GTX 980: 159.68 (SE +/- 0.07, N = 3)
GeForce GTX 1060: 150.41 (SE +/- 0.30, N = 3)
GeForce GTX 970: 137.56 (SE +/- 0.02, N = 3)
GeForce GTX 960: 92.44 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 66.88 (SE +/- 0.08, N = 3)

clpeak

System Power Consumption Monitor

clpeak (Watts, fewer is better)

GPUs in chart order: GeForce GTX 1060, GeForce GTX 1050, Radeon RX 560, GeForce GTX 1070, GeForce GTX 780 Ti, GeForce GTX 960, GeForce GTX 970, Radeon RX 580, Radeon RX 480, GeForce GTX 980 Ti, GeForce GTX 1080, Radeon R9 Fury, Radeon R9 290, GeForce GTX 980, GeForce GTX 1080 Ti

Readings in chart order (13 listed for the 15 GPUs above):
Min: 44.6 / Avg: 55.5 / Max: 66.4
Min: 57.7 / Avg: 83.39 / Max: 110.6
Min: 50.7 / Avg: 88 / Max: 125.3
Min: 59.7 / Avg: 109 / Max: 158.3
Min: 53.3 / Avg: 128.25 / Max: 203.2
Min: 82.9 / Avg: 130.65 / Max: 178.4
Min: 62.1 / Avg: 135.37 / Max: 215.7
Min: 74.5 / Avg: 139.09 / Max: 198.1
Min: 63.5 / Avg: 152.7 / Max: 241.9
Min: 51.8 / Avg: 159 / Max: 266.2
Min: 91.2 / Avg: 171.7 / Max: 293.2
Min: 158.9 / Avg: 196.69 / Max: 311.4
Min: 254.1 / Avg: 254.4 / Max: 254.7

Mixbench

Benchmark: Double Precision

Mixbench 2016-06-06 (GFLOPS, more is better)

Radeon R9 Fury: 440.85 (SE +/- 0.01, N = 3)
Radeon RX 580: 389.97 (SE +/- 0.00, N = 3)
Radeon RX 480: 362.29 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 295.34 (SE +/- 0.92, N = 3)
GeForce GTX 780 Ti: 245.11 (SE +/- 0.15, N = 3)
GeForce GTX 1070: 223.87 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 194.72 (SE +/- 2.32, N = 3)
Radeon RX 560: 164.00 (SE +/- 0.00, N = 3)
GeForce GTX 980: 159.86 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 152.16 (SE +/- 0.12, N = 3)
GeForce GTX 970: 136.18 (SE +/- 0.71, N = 3)
GeForce GTX 960: 92.74 (SE +/- 0.25, N = 3)
GeForce GTX 1050: 66.76 (SE +/- 0.05, N = 3)
GeForce GTX 1080 Ti: 6.39 (SE +/- 0.04, N = 3)

1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0 (Score, more is better)

GeForce GTX 1080 Ti: 3532 (SE +/- 54.49, N = 3)
GeForce GTX 1080: 2725 (SE +/- 28.15, N = 3)
GeForce GTX 1070: 2286 (SE +/- 6.67, N = 3)
GeForce GTX 980 Ti: 2142 (SE +/- 36.99, N = 3)
GeForce GTX 980: 1756 (SE +/- 55.00, N = 3)
GeForce GTX 1060: 1742 (SE +/- 9.21, N = 3)
GeForce GTX 970: 1649 (SE +/- 36.71, N = 3)
Radeon R9 Fury: 1367 (SE +/- 3.84, N = 3)
Radeon RX 580: 1218 (SE +/- 3.33, N = 3)
GeForce GTX 780 Ti: 1199 (SE +/- 5.33, N = 3)
GeForce GTX 960: 1130 (SE +/- 7.64, N = 3)
Radeon RX 480: 1070 (SE +/- 0.67, N = 3)
GeForce GTX 1050: 1022 (SE +/- 1.00, N = 3)
Radeon R9 290: 1013 (SE +/- 0.67, N = 3)
Radeon RX 560: 509 (SE +/- 2.33, N = 3)

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2015-11-10 (Watts, fewer is better)

GeForce GTX 1050: Min: 65.4 / Avg: 91.36 / Max: 114.7
Radeon RX 560: Min: 50.6 / Avg: 92.86 / Max: 123.4
GeForce GTX 1060: Min: 73.6 / Avg: 118.38 / Max: 167.3
GeForce GTX 960: Min: 117.1 / Avg: 124.83 / Max: 188.6
GeForce GTX 1070: Min: 78.5 / Avg: 140.86 / Max: 210.3
GeForce GTX 970: Min: 57.8 / Avg: 149.11 / Max: 234.2
GeForce GTX 1080: Min: 127.9 / Avg: 156.13 / Max: 262.7
Radeon RX 480: Min: 66.7 / Avg: 159.05 / Max: 205.4
GeForce GTX 980: Min: 55.9 / Avg: 163.16 / Max: 251
Radeon RX 580: Min: 61.4 / Avg: 165.94 / Max: 235
GeForce GTX 980 Ti: Min: 151.8 / Avg: 198.76 / Max: 304.5
GeForce GTX 1080 Ti: Min: 52.6 / Avg: 203.91 / Max: 304.8
Radeon R9 290: Min: 109.8 / Avg: 204.67 / Max: 296.3
Radeon R9 Fury: Min: 121.6 / Avg: 215.47 / Max: 344.8
GeForce GTX 780 Ti: Min: 59.2 / Avg: 226.01 / Max: 336.8

LuxMark

System Power Consumption Monitor

LuxMark 3.0 (Watts, fewer is better)

Radeon RX 560: Min: 51.4 / Avg: 94.7 / Max: 102.8
GeForce GTX 1050: Min: 44.6 / Avg: 107.9 / Max: 115.3
GeForce GTX 1060: Min: 61.8 / Avg: 152.59 / Max: 156.6
GeForce GTX 960: Min: 68.3 / Avg: 154.46 / Max: 158.8
Radeon RX 480: Min: 103.4 / Avg: 161.24 / Max: 183.2
Radeon RX 580: Min: 62.8 / Avg: 167.57 / Max: 197.1
GeForce GTX 1070: Min: 52.1 / Avg: 183.39 / Max: 188.4
GeForce GTX 970: Min: 57.1 / Avg: 191.79 / Max: 197.1
Radeon R9 Fury: Min: 87.9 / Avg: 202.29 / Max: 248.6
GeForce GTX 1080: Min: 61.4 / Avg: 203.35 / Max: 209.7
GeForce GTX 980: Min: 56.4 / Avg: 203.63 / Max: 213.1
Radeon R9 290: Min: 164 / Avg: 223.01 / Max: 246.2
GeForce GTX 980 Ti: Min: 61 / Avg: 242.75 / Max: 254.4
GeForce GTX 1080 Ti: Min: 75.8 / Avg: 261.7 / Max: 278.5
GeForce GTX 780 Ti: Min: 79.8 / Avg: 279.87 / Max: 294.8
Radeon R9 285: no reading listed

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2015-11-10 (Watts, fewer is better)

Radeon RX 560: Min: 51.3 / Avg: 83.15 / Max: 94.2
GeForce GTX 1050: Min: 44.8 / Avg: 89.24 / Max: 101.5
GeForce GTX 1060: Min: 49.9 / Avg: 117.9 / Max: 137.8
GeForce GTX 960: Min: 96.7 / Avg: 124.57 / Max: 141.9
Radeon RX 580: Min: 62.8 / Avg: 129.51 / Max: 166.9
Radeon RX 480: Min: 69.5 / Avg: 134.82 / Max: 175
GeForce GTX 1070: Min: 50.7 / Avg: 138.56 / Max: 159.1
GeForce GTX 1080: Min: 52.2 / Avg: 141.33 / Max: 169.7
GeForce GTX 970: Min: 58 / Avg: 146.29 / Max: 167.8
GeForce GTX 980: Min: 55.7 / Avg: 159.95 / Max: 185.7
Radeon R9 Fury: Min: 167.4 / Avg: 178.62 / Max: 202
Radeon R9 290: Min: 111.2 / Avg: 180.14 / Max: 214.7
GeForce GTX 1080 Ti: Min: 66.8 / Avg: 190.32 / Max: 229.1
GeForce GTX 980 Ti: Min: 143.3 / Avg: 204.53 / Max: 231.7
GeForce GTX 780 Ti: Min: 94.2 / Avg: 240.33 / Max: 287.5

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, more is better)

GeForce GTX 1080 Ti: 13274.17 (SE +/- 65.79, N = 3)
GeForce GTX 1080: 9446.74 (SE +/- 38.82, N = 3)
Radeon R9 Fury: 7144.88 (SE +/- 0.03, N = 3)
GeForce GTX 1070: 7125.88 (SE +/- 32.37, N = 3)
Radeon RX 580: 6260.48 (SE +/- 1.35, N = 3)
GeForce GTX 980 Ti: 6208.65 (SE +/- 15.98, N = 3)
Radeon RX 480: 5812.64 (SE +/- 1.92, N = 3)
GeForce GTX 980: 5051.89 (SE +/- 3.16, N = 3)
GeForce GTX 780 Ti: 4944.72 (SE +/- 19.58, N = 3)
GeForce GTX 1060: 4829.99 (SE +/- 6.12, N = 3)
Radeon R9 290: 4790.31 (SE +/- 0.03, N = 3)
GeForce GTX 970: 4361.65 (SE +/- 1.82, N = 3)
GeForce GTX 960: 2960.49 (SE +/- 7.79, N = 3)
Radeon RX 560: 2634.53 (SE +/- 0.11, N = 3)
GeForce GTX 1050: 2125.78 (SE +/- 0.09, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

LuxMark

System Power Consumption Monitor

LuxMark 3.0 (Watts, fewer is better)

GPUs in chart order: Radeon RX 560, GeForce GTX 1050, GeForce GTX 960, GeForce GTX 1060, GeForce GTX 1080, Radeon RX 480, GeForce GTX 1070, Radeon RX 580, Radeon R9 Fury, Radeon R9 290, GeForce GTX 780 Ti, Radeon R9 285, GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1080 Ti

Readings in chart order (15 listed for the 16 GPUs above):
Min: 50.4 / Avg: 104.54 / Max: 107.3
Min: 62.1 / Avg: 109.06 / Max: 119.8
Min: 73.2 / Avg: 145.52 / Max: 149.6
Min: 50 / Avg: 151.57 / Max: 157.1
Min: 81.4 / Avg: 184.66 / Max: 189.3
Min: 71.5 / Avg: 186.43 / Max: 197.7
Min: 78.2 / Avg: 186.9 / Max: 196.9
Min: 61.7 / Avg: 198.14 / Max: 205.1
Min: 60.8 / Avg: 231.32 / Max: 244.7
Min: 108.1 / Avg: 253.37 / Max: 262.7
Min: 127.9 / Avg: 295.03 / Max: 310.7
Min: 88.4 / Avg: 191.17 / Max: 194.5
Min: 136.6 / Avg: 207.25 / Max: 212.8
Min: 120.1 / Avg: 253.78 / Max: 259
Min: 103.3 / Avg: 258.5 / Max: 265.6

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 2.0.3 (Seconds, fewer is better)

GeForce GTX 1080 Ti: 3.13 (SE +/- 0.01, N = 3)
Radeon R9 Fury: 3.42 (SE +/- 0.05, N = 3)
GeForce GTX 1080: 3.67 (SE +/- 0.01, N = 3)
GeForce GTX 1070: 3.78 (SE +/- 0.00, N = 3)
Radeon R9 290: 3.85 (SE +/- 0.01, N = 3)
Radeon RX 580: 4.08 (SE +/- 0.00, N = 3)
Radeon RX 480: 4.16 (SE +/- 0.02, N = 3)
GeForce GTX 980 Ti: 4.19 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 4.66 (SE +/- 0.01, N = 3)
Radeon RX 560: 6.91 (SE +/- 0.03, N = 3)
GeForce GTX 980: 15.23 (SE +/- 0.02, N = 3)
GeForce GTX 780 Ti: 15.24 (SE +/- 0.02, N = 3)
GeForce GTX 970: 16.26 (SE +/- 0.03, N = 3)
GeForce GTX 1050: 18.18 (SE +/- 0.02, N = 3)
GeForce GTX 960: 19.30 (SE +/- 0.01, N = 3)

clpeak

OpenCL Test: Integer Compute INT

clpeak (GIOPS, more is better)

GeForce GTX 1080 Ti: 3200.30 (SE +/- 117.22, N = 3)
GeForce GTX 1080: 2349.86 (SE +/- 29.30, N = 3)
GeForce GTX 1070: 1631.03 (SE +/- 19.23, N = 3)
Radeon R9 290: 1595.41 (SE +/- 0.05, N = 3)
GeForce GTX 980 Ti: 1583.85 (SE +/- 16.78, N = 3)
Radeon R9 Fury: 1429.32 (SE +/- 0.05, N = 3)
GeForce GTX 980: 1296.11 (SE +/- 0.62, N = 3)
Radeon RX 580: 1251.63 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 1223.58 (SE +/- 52.05, N = 3)
Radeon RX 480: 1162.88 (SE +/- 0.03, N = 3)
GeForce GTX 970: 1128.64 (SE +/- 22.00, N = 3)
GeForce GTX 780 Ti: 961.13 (SE +/- 17.95, N = 3)
GeForce GTX 960: 781.16 (SE +/- 6.09, N = 3)
GeForce GTX 1050: 568.64 (SE +/- 15.64, N = 3)
Radeon RX 560: 524.70 (SE +/- 0.03, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GHash/s, more is better)

GeForce GTX 1080 Ti: 19.74 (SE +/- 0.06, N = 3)
GeForce GTX 1080: 14.29 (SE +/- 0.03, N = 3)
GeForce GTX 1070: 10.61 (SE +/- 0.01, N = 3)
GeForce GTX 980 Ti: 9.27 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 9.09 (SE +/- 0.00, N = 3)
Radeon RX 580: 7.96 (SE +/- 0.00, N = 3)
GeForce GTX 980: 7.54 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 7.36 (SE +/- 0.01, N = 3)
Radeon RX 480: 7.33 (SE +/- 0.01, N = 3)
GeForce GTX 970: 6.54 (SE +/- 0.00, N = 3)
Radeon R9 290: 6.11 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 4.66 (SE +/- 0.01, N = 3)
GeForce GTX 960: 4.46 (SE +/- 0.00, N = 3)
Radeon RX 560: 3.34 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 3.24 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenCL Test: Single-Precision Float

clpeak (GFLOPS, more is better)

GeForce GTX 1080 Ti: 11780.26 (SE +/- 0.82, N = 3)
GeForce GTX 1080: 8249.62 (SE +/- 157.37, N = 3)
Radeon R9 Fury: 7068.81 (SE +/- 0.51, N = 3)
GeForce GTX 1070: 6359.20 (SE +/- 0.57, N = 3)
Radeon RX 580: 6175.41 (SE +/- 0.38, N = 3)
Radeon RX 480: 5737.62 (SE +/- 0.06, N = 3)
GeForce GTX 980 Ti: 5292.12 (SE +/- 13.41, N = 3)
Radeon R9 290: 4755.89 (SE +/- 0.30, N = 3)
GeForce GTX 980: 4288.05 (SE +/- 0.64, N = 3)
GeForce GTX 1060: 4157.55 (SE +/- 74.08, N = 3)
GeForce GTX 780 Ti: 3847.88 (SE +/- 0.62, N = 3)
GeForce GTX 970: 3728.55 (SE +/- 0.43, N = 3)
Radeon RX 560: 2588.66 (SE +/- 0.20, N = 3)
GeForce GTX 960: 2429.10 (SE +/- 70.89, N = 3)
GeForce GTX 1050: 1937.00 (SE +/- 0.29, N = 3)

cl-mem

System Power Consumption Monitor

cl-mem 2017-01-13 (Watts, fewer is better)

GeForce GTX 1050: Min: 44.6 / Avg: 91.7 / Max: 98.4
Radeon RX 560: Min: 61.8 / Avg: 94.24 / Max: 99.4
GeForce GTX 960: Min: 53 / Avg: 112.52 / Max: 126.6
GeForce GTX 1060: Min: 77.6 / Avg: 112.65 / Max: 130.4
GeForce GTX 1080: Min: 99.7 / Avg: 126.93 / Max: 164.8
Radeon RX 580: Min: 61.5 / Avg: 128.3 / Max: 177.9
GeForce GTX 1070: Min: 50.4 / Avg: 132.24 / Max: 154.3
GeForce GTX 980: Min: 102 / Avg: 140.4 / Max: 165.1
GeForce GTX 970: Min: 86.1 / Avg: 140.75 / Max: 165.4
Radeon RX 480: Min: 67.2 / Avg: 149.58 / Max: 172.7
Radeon R9 Fury: Min: 111.7 / Avg: 167.53 / Max: 194.9
GeForce GTX 980 Ti: Min: 63.5 / Avg: 169.7 / Max: 225.2
GeForce GTX 780 Ti: Min: 149 / Avg: 212.7 / Max: 266.1
GeForce GTX 1080 Ti: Min: 213.3 / Avg: 215.1 / Max: 216.9
Radeon R9 290: Min: 208.1 / Avg: 219.88 / Max: 227.1

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2015-11-10 (Watts, fewer is better)

GPUs in chart order: GeForce GTX 980 Ti, Radeon RX 580, GeForce GTX 980, GeForce GTX 970, Radeon RX 560, GeForce GTX 1050, GeForce GTX 1060, Radeon RX 480, GeForce GTX 780 Ti, GeForce GTX 960, Radeon R9 290, GeForce GTX 1070, Radeon R9 Fury, GeForce GTX 1080, GeForce GTX 1080 Ti

Readings in chart order (11 listed for the 15 GPUs above):
Min: 61.6 / Avg: 65.85 / Max: 70.1
Min: 104 / Avg: 105.7 / Max: 107.4
Min: 98.4 / Avg: 109.4 / Max: 120.4
Min: 114.5 / Avg: 114.7 / Max: 114.9
Min: 116.8 / Avg: 117.17 / Max: 117.4
Min: 82.3 / Avg: 132.4 / Max: 182.5
Min: 112.9 / Avg: 157.65 / Max: 202.4
Min: 130.1 / Avg: 172.7 / Max: 215.3
Min: 168.7 / Avg: 174.7 / Max: 180.7
Min: 109.8 / Avg: 199.5 / Max: 289.2
Min: 63.4 / Avg: 204.7 / Max: 346

cl-mem

Benchmark: Write

cl-mem 2017-01-13 (GB/s, more is better)

Radeon R9 Fury: 391.50 (SE +/- 4.06, N = 3)
GeForce GTX 1080 Ti: 335.47 (SE +/- 0.22, N = 3)
GeForce GTX 780 Ti: 252.00 (SE +/- 0.12, N = 3)
GeForce GTX 980 Ti: 238.40 (SE +/- 0.00, N = 3)
Radeon R9 290: 217.70 (SE +/- 1.08, N = 3)
GeForce GTX 1080: 213.13 (SE +/- 0.87, N = 3)
GeForce GTX 1070: 191.43 (SE +/- 0.09, N = 3)
Radeon RX 580: 180.70 (SE +/- 0.80, N = 3)
Radeon RX 480: 179.70 (SE +/- 0.10, N = 3)
GeForce GTX 980: 152.30 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 138.70 (SE +/- 0.31, N = 3)
GeForce GTX 970: 129.70 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 85.73 (SE +/- 0.03, N = 3)
Radeon RX 560: 80.90 (SE +/- 0.12, N = 3)
GeForce GTX 960: 70.80 (SE +/- 0.00, N = 2)

1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak (GBPS, more is better)

Radeon R9 Fury: 431.26 (SE +/- 0.90, N = 3)
GeForce GTX 1080 Ti: 329.43 (SE +/- 0.82, N = 3)
Radeon R9 290: 272.17 (SE +/- 0.02, N = 3)
GeForce GTX 980 Ti: 263.32 (SE +/- 0.10, N = 3)
GeForce GTX 780 Ti: 252.98 (SE +/- 10.22, N = 3)
GeForce GTX 1080: 218.15 (SE +/- 3.85, N = 3)
Radeon RX 480: 209.39 (SE +/- 0.03, N = 3)
Radeon RX 580: 208.96 (SE +/- 0.10, N = 3)
GeForce GTX 1070: 196.36 (SE +/- 0.15, N = 3)
GeForce GTX 980: 164.10 (SE +/- 0.05, N = 3)
GeForce GTX 1060: 146.23 (SE +/- 0.45, N = 3)
GeForce GTX 970: 143.41 (SE +/- 0.03, N = 3)
GeForce GTX 1050: 92.43 (SE +/- 0.18, N = 3)
Radeon RX 560: 89.25 (SE +/- 0.01, N = 3)
GeForce GTX 960: 81.12 (SE +/- 0.01, N = 3)

ViennaCL

OpenCL LU Factorization

ViennaCL 1.4.2 (GFLOPS, more is better)

GeForce GTX 1080 Ti: 63.67 (SE +/- 0.04, N = 3)
GeForce GTX 1080: 61.26 (SE +/- 0.14, N = 3)
GeForce GTX 1070: 58.95 (SE +/- 0.02, N = 3)
GeForce GTX 980 Ti: 56.91 (SE +/- 0.34, N = 3)
GeForce GTX 980: 54.96 (SE +/- 0.03, N = 3)
GeForce GTX 780 Ti: 54.59 (SE +/- 2.80, N = 3)
GeForce GTX 1060: 54.03 (SE +/- 0.37, N = 3)
GeForce GTX 970: 53.03 (SE +/- 0.12, N = 3)
GeForce GTX 960: 47.64 (SE +/- 0.00, N = 3)
GeForce GTX 1050: 41.80 (SE +/- 0.01, N = 3)
Radeon R9 Fury: 21.76 (SE +/- 0.75, N = 3)
Radeon R9 290: 20.97 (SE +/- 0.27, N = 3)
Radeon RX 560: 12.08 (SE +/- 0.12, N = 3)
Radeon RX 480: 12.05 (SE +/- 0.06, N = 3)
Radeon RX 580: 11.98 (SE +/- 0.05, N = 3)

1. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

System Power Consumption Monitor

cl-mem 2017-01-13 (Watts, fewer is better)

GeForce GTX 1050: Min: 51 / Avg: 88.33 / Max: 98.2
Radeon RX 560: Min: 50.7 / Avg: 89.22 / Max: 99.8
GeForce GTX 960: Min: 75.2 / Avg: 119.81 / Max: 126.4
GeForce GTX 1060: Min: 113.9 / Avg: 126.18 / Max: 129.8
GeForce GTX 1070: Min: 50.8 / Avg: 133.6 / Max: 156.7
Radeon RX 580: Min: 61.6 / Avg: 134.4 / Max: 178.4
GeForce GTX 980: Min: 55.9 / Avg: 137.83 / Max: 165.7
GeForce GTX 1080: Min: 120.4 / Avg: 154.88 / Max: 168.9
GeForce GTX 970: Min: 145.8 / Avg: 161.02 / Max: 165.8
Radeon R9 Fury: Min: 63.2 / Avg: 162.7 / Max: 196.7
GeForce GTX 1080 Ti: Min: 131.4 / Avg: 163.5 / Max: 225.5
Radeon RX 480: Min: 158.5 / Avg: 168.35 / Max: 173.8
Radeon R9 290: Min: 110.3 / Avg: 194.92 / Max: 227.7
GeForce GTX 980 Ti: Min: 164 / Avg: 209.78 / Max: 231
GeForce GTX 780 Ti: Min: 205.2 / Avg: 232.33 / Max: 267.5

cl-mem

System Power Consumption Monitor

cl-mem 2017-01-13 (Watts, fewer is better)

Radeon RX 560: Min: 51.1 / Avg: 91.53 / Max: 100.1
GeForce GTX 1050: Min: 97.2 / Avg: 97.62 / Max: 98.3
GeForce GTX 960: Min: 75.3 / Avg: 117.34 / Max: 125.9
GeForce GTX 1060: Min: 123.2 / Avg: 128.26 / Max: 130.1
GeForce GTX 1070: Min: 51.8 / Avg: 133.92 / Max: 156.5
Radeon RX 480: Min: 73.8 / Avg: 137.13 / Max: 171.7
GeForce GTX 970: Min: 57.9 / Avg: 143.07 / Max: 165.3
GeForce GTX 1080: Min: 99.9 / Avg: 143.28 / Max: 167.2
GeForce GTX 980: Min: 110.7 / Avg: 156.45 / Max: 166
Radeon R9 Fury: Min: 63.6 / Avg: 161.35 / Max: 231
Radeon RX 580: Min: 126.3 / Avg: 165.33 / Max: 179.6
Radeon R9 290: Min: 110.7 / Avg: 173.45 / Max: 211.5
GeForce GTX 780 Ti: Min: 59.1 / Avg: 193.83 / Max: 264
GeForce GTX 980 Ti: Min: 136.5 / Avg: 208.38 / Max: 235
GeForce GTX 1080 Ti: Min: 215.6 / Avg: 216.75 / Max: 217.9

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GB/s, more is better)

GeForce GTX 1080 Ti: 596.82 (SE +/- 0.23, N = 3)
GeForce GTX 1080: 520.13 (SE +/- 0.88, N = 3)
GeForce GTX 1070: 454.47 (SE +/- 0.27, N = 3)
GeForce GTX 1060: 382.21 (SE +/- 0.59, N = 3)
GeForce GTX 980 Ti: 351.45 (SE +/- 0.74, N = 3)
GeForce GTX 980: 332.11 (SE +/- 1.03, N = 3)
GeForce GTX 970: 288.87 (SE +/- 0.24, N = 3)
GeForce GTX 780 Ti: 287.39 (SE +/- 0.11, N = 3)
GeForce GTX 960: 277.11 (SE +/- 0.45, N = 3)
GeForce GTX 1050: 274.91 (SE +/- 1.17, N = 3)
Radeon R9 290: 253.64 (SE +/- 1.41, N = 3)
Radeon R9 Fury: 245.68 (SE +/- 0.89, N = 3)
Radeon RX 580: 207.97 (SE +/- 0.08, N = 3)
Radeon RX 480: 193.43 (SE +/- 0.23, N = 3)
Radeon RX 560: 115.81 (SE +/- 0.14, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

System Power Consumption Monitor

clpeak (Watts, fewer is better)

GeForce GTX 1050: Min: 44.7 / Avg: 85.97 / Max: 90.4
Radeon RX 560: Min: 51 / Avg: 87.81 / Max: 93.9
GeForce GTX 1060: Min: 49.1 / Avg: 105.58 / Max: 118
GeForce GTX 960: Min: 85.3 / Avg: 121.06 / Max: 124.6
GeForce GTX 970: Min: 61.8 / Avg: 134.1 / Max: 144.6
GeForce GTX 1070: Min: 133.6 / Avg: 138.18 / Max: 139.4
Radeon RX 580: Min: 95.5 / Avg: 138.34 / Max: 170
Radeon RX 480: Min: 73.8 / Avg: 140.82 / Max: 162.3
GeForce GTX 1080: Min: 52.7 / Avg: 144.57 / Max: 160.3
GeForce GTX 980: Min: 57 / Avg: 152.7 / Max: 162.3
Radeon R9 Fury: Min: 63.9 / Avg: 177.34 / Max: 225.5
Radeon R9 290: Min: 110.3 / Avg: 178.68 / Max: 199.3
GeForce GTX 1080 Ti: Min: 57.5 / Avg: 179.46 / Max: 201.8
GeForce GTX 980 Ti: Min: 139.5 / Avg: 189.78 / Max: 196.9
GeForce GTX 780 Ti: Min: 220.2 / Avg: 220.42 / Max: 221

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak (GBPS Per Watt, more is better)

Radeon RX 560: 0.39
Radeon RX 580: 0.32
Radeon RX 480: 0.31
Radeon R9 Fury: 0.29
Radeon R9 290: 0.18
GeForce GTX 1060: 0.13
GeForce GTX 970: 0.12
GeForce GTX 1080: 0.11
GeForce GTX 1070: 0.11
GeForce GTX 980: 0.11
GeForce GTX 1080 Ti: 0.08
GeForce GTX 1050: 0.08
GeForce GTX 980 Ti: 0.08

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak (GBPS, more is better)

Radeon R9 Fury: 30.74 (SE +/- 0.33, N = 3)
Radeon RX 480: 30.64 (SE +/- 0.33, N = 3)
Radeon RX 580: 30.38 (SE +/- 0.01, N = 3)
Radeon RX 560: 30.38 (SE +/- 0.00, N = 3)
Radeon R9 290: 30.33 (SE +/- 0.07, N = 3)
GeForce GTX 1080 Ti: 12.61 (SE +/- 0.02, N = 3)
GeForce GTX 1070: 12.61 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.59 (SE +/- 0.01, N = 3)
GeForce GTX 970: 12.49 (SE +/- 0.01, N = 3)
GeForce GTX 980: 12.48 (SE +/- 0.03, N = 3)
GeForce GTX 960: 12.46 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 12.37 (SE +/- 0.03, N = 3)
GeForce GTX 780 Ti: 12.36 (SE +/- 0.07, N = 3)
GeForce GTX 980 Ti: 12.21 (SE +/- 0.09, N = 3)
GeForce GTX 1050: 6.44 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10 (GFLOPS, more is better)

GeForce GTX 1080 Ti: 974.37 (SE +/- 5.16, N = 3)
GeForce GTX 980 Ti: 693.44 (SE +/- 21.12, N = 3)
GeForce GTX 1080: 597.69 (SE +/- 6.81, N = 3)
Radeon R9 Fury: 550.79 (SE +/- 0.06, N = 3)
Radeon RX 580: 498.10 (SE +/- 0.07, N = 3)
GeForce GTX 1070: 470.12 (SE +/- 8.94, N = 3)
Radeon RX 480: 462.35 (SE +/- 0.10, N = 3)
GeForce GTX 980: 447.62 (SE +/- 0.99, N = 3)
GeForce GTX 780 Ti: 429.87 (SE +/- 21.22, N = 3)
GeForce GTX 970: 382.02 (SE +/- 6.83, N = 3)
GeForce GTX 1060: 322.16 (SE +/- 7.21, N = 3)
Radeon RX 560: 209.76 (SE +/- 0.06, N = 3)
GeForce GTX 960: 209.15 (SE +/- 0.45, N = 3)
GeForce GTX 1050: 204.36 (SE +/- 12.49, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

Benchmark: Copy

cl-mem 2017-01-13 (GB/s, more is better)

GeForce GTX 1080 Ti: 316.80 (SE +/- 0.10, N = 3)
GeForce GTX 780 Ti: 237.10 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 216.37 (SE +/- 0.03, N = 3)
GeForce GTX 1080: 206.53 (SE +/- 0.84, N = 3)
Radeon R9 Fury: 206.13 (SE +/- 0.48, N = 3)
Radeon R9 290: 192.40 (SE +/- 0.64, N = 3)
GeForce GTX 1070: 186.63 (SE +/- 0.03, N = 3)
Radeon RX 480: 185.37 (SE +/- 0.03, N = 3)
Radeon RX 580: 183.47 (SE +/- 0.38, N = 3)
GeForce GTX 980: 142.60 (SE +/- 0.00, N = 3)
GeForce GTX 1060: 137.70 (SE +/- 0.40, N = 3)
GeForce GTX 970: 125.27 (SE +/- 0.03, N = 3)
GeForce GTX 1050: 87.50 (SE +/- 0.15, N = 3)
Radeon RX 560: 81.03 (SE +/- 0.03, N = 3)
GeForce GTX 960: 70.60 (SE +/- 0.00, N = 2)

1. (CC) gcc options: -O2 -flto -lOpenCL

clpeak

OpenCL Test: Double-Precision Double

clpeak (GFLOPS Per Watt, more is better)

Radeon R9 290: 3.35
Radeon RX 580: 2.83
Radeon RX 480: 2.59
Radeon R9 Fury: 2.52
GeForce GTX 1080 Ti: 2.31
GeForce GTX 1080: 2.04
Radeon RX 560: 1.87
GeForce GTX 1070: 1.63
GeForce GTX 1060: 1.42
GeForce GTX 780 Ti: 1.12
GeForce GTX 980: 1.05
GeForce GTX 980 Ti: 1.03
GeForce GTX 970: 1.03
GeForce GTX 1050: 0.78
GeForce GTX 960: 0.76
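The per-watt figures in this file appear consistent with dividing the raw benchmark result by the average system power recorded during that test. The sketch below checks that reading for three cards, using the clpeak double-precision GFLOPS and the "Avg" values from one of the clpeak power charts in this file. Pairing that particular power chart with this test is an assumption inferred from the numbers, not something the export states, and `per_watt` is a hypothetical helper name.

```python
def per_watt(result: float, avg_watts: float) -> float:
    """Benchmark result divided by average system power draw in Watts."""
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return result / avg_watts


# (clpeak double-precision GFLOPS, average system Watts), both quoted from
# this result file; matching the two charts to each other is an assumption.
samples = {
    "Radeon R9 290": (599.06, 178.68),
    "Radeon RX 580": (391.77, 138.34),
    "GeForce GTX 1080 Ti": (415.06, 179.46),
}

efficiency = {
    gpu: round(per_watt(gflops, watts), 2)
    for gpu, (gflops, watts) in samples.items()
}

for gpu, value in efficiency.items():
    print(f"{gpu}: {value} GFLOPS per Watt")
```

Because the power figure is whole-system draw rather than GPU-only draw, the ratio includes the CPU, motherboard, and drives, so it is best read as a relative ranking on this one test bed.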

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0 (Score, more is better)

GeForce GTX 1080 Ti: 19662
GeForce GTX 1070: 16186
GeForce GTX 980 Ti: 14811
Radeon R9 Fury: 13212
GeForce GTX 1080: 12776
GeForce GTX 980: 11959
GeForce GTX 1060: 11572
GeForce GTX 970: 10704
Radeon RX 580: 10400
Radeon R9 290: 10400
Radeon RX 480: 9885
GeForce GTX 780 Ti: 9516
GeForce GTX 1050: 6569
GeForce GTX 960: 6148
Radeon RX 560: 4468

Standard errors in chart order (14 listed for the 15 GPUs above, N = 3 each): 13.57, 3.84, 118.82, 66.33, 51.02, 26.67, 38.33, 3.33, 3.00, 3.67, 21.08, 15.33, 20.00, 8.69

cl-mem

Benchmark: Read

cl-mem 2017-01-13 (GB/s, more is better)

GeForce GTX 1080 Ti: 338.07
GeForce GTX 780 Ti: 271.67
GeForce GTX 980 Ti: 266.17
GeForce GTX 1080: 227.20
GeForce GTX 1070: 205.43
GeForce GTX 980: 164.47
Radeon RX 580: 159.73
Radeon RX 480: 157.87
GeForce GTX 1060: 151.60
GeForce GTX 970: 143.55
Radeon R9 Fury: 123.30
Radeon R9 290: 123.00
GeForce GTX 1050: 95.00
Radeon RX 560: 93.50
GeForce GTX 960: 81.40

Standard errors in chart order (14 listed for the 15 GPUs above): 0.22 (N = 3), 0.03 (N = 3), 0.03 (N = 3), 1.03 (N = 3), 0.03 (N = 3), 0.03 (N = 3), 0.41 (N = 3), 2.42 (N = 3), 0.21 (N = 3), 0.05 (N = 2), 0.12 (N = 3), 0.30 (N = 3), 0.00 (N = 3), 0.00 (N = 2)

1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

cl-mem 2017-01-13 (GB/s Per Watt, more is better)

Radeon R9 Fury: 2.41
GeForce GTX 1080 Ti: 2.05
GeForce GTX 1070: 1.43
GeForce GTX 1080: 1.38
Radeon RX 580: 1.34
GeForce GTX 980 Ti: 1.14
Radeon R9 290: 1.12
GeForce GTX 980: 1.11
GeForce GTX 1060: 1.10
GeForce GTX 780 Ti: 1.08
Radeon RX 480: 1.07
GeForce GTX 1050: 0.97
Radeon RX 560: 0.91
GeForce GTX 970: 0.81
GeForce GTX 960: 0.59

FAHBench

FAHBench 2.3.2 (Ns Per Day, more is better)

GeForce GTX 1080 Ti: 186.74 (SE +/- 0.26, N = 3)
GeForce GTX 1080: 145.38 (SE +/- 0.08, N = 3)
GeForce GTX 1070: 132.79 (SE +/- 0.17, N = 3)
GeForce GTX 980 Ti: 108.15 (SE +/- 0.09, N = 3)
GeForce GTX 980: 97.38 (SE +/- 0.05, N = 3)
GeForce GTX 1060: 97.24 (SE +/- 0.11, N = 3)
GeForce GTX 970: 85.45 (SE +/- 0.05, N = 3)
GeForce GTX 780 Ti: 72.77 (SE +/- 0.09, N = 3)
GeForce GTX 960: 58.35 (SE +/- 0.02, N = 3)
GeForce GTX 1050: 49.78 (SE +/- 0.01, N = 3)

clpeak

System Power Consumption Monitor

clpeak (Watts, fewer is better)

GPUs in chart order: GeForce GTX 960, GeForce GTX 1050, Radeon RX 560, GeForce GTX 1060, Radeon RX 580, Radeon RX 480, GeForce GTX 970, Radeon R9 Fury, GeForce GTX 1070, GeForce GTX 980, GeForce GTX 1080, GeForce GTX 1080 Ti, GeForce GTX 980 Ti, Radeon R9 290, GeForce GTX 780 Ti

Readings in chart order (14 listed for the 15 GPUs above):
Min: 65.2 / Avg: 76.8 / Max: 89.4
Min: 51.2 / Avg: 76.94 / Max: 84.1
Min: 75.5 / Avg: 92.25 / Max: 105.6
Min: 61.8 / Avg: 95.67 / Max: 107
Min: 76 / Avg: 99.48 / Max: 105.5
Min: 97.8 / Avg: 103.53 / Max: 108
Min: 63.2 / Avg: 106.3 / Max: 120.3
Min: 103.2 / Avg: 114.13 / Max: 119.3
Min: 90.6 / Avg: 117.27 / Max: 130.7
Min: 99 / Avg: 118.32 / Max: 126.1
Min: 149.6 / Avg: 155.28 / Max: 157.1
Min: 134.8 / Avg: 156.02 / Max: 164.4
Min: 146.4 / Avg: 167.81 / Max: 175
Min: 170.7 / Avg: 175.55 / Max: 180.4

ViennaCL

System Power Consumption Monitor

ViennaCL 1.4.2, System Power Consumption Monitor (Watts, fewer is better)

GPUs in chart order: Radeon RX 560, Radeon RX 580, GeForce GTX 1060, GeForce GTX 980, Radeon R9 Fury, GeForce GTX 1050, Radeon RX 480, GeForce GTX 780 Ti, GeForce GTX 960, GeForce GTX 1070, GeForce GTX 970, Radeon R9 290, GeForce GTX 980 Ti, GeForce GTX 1080 Ti, GeForce GTX 1080.

Readings in chart order. The export lists 14 readings for 15 GPUs, so they are not assigned to individual GPUs here.

  Min      Avg      Max
 50.6     72.03    82.1
 62.2     78.77    96.1
 68.9     80.05    91.2
 56       80.3    104.6
 63.1     86.03   126.9
 79.6     86.2     92.8
 72.7     93.3    104.2
 58.3    105.5    152.7
 85.8    108.05   130.3
 91.5    111.55   131.6
 98.7    125.25   151.8
109.8    132.4    158.7
134.4    142.1    149.8
130.6    146.5    162.4

cl-mem

Benchmark: Read

cl-mem 2017-01-13, Benchmark: Read (GB/s per Watt, more is better)

GPU                    GB/s per Watt
GeForce GTX 1080                1.79
GeForce GTX 980 Ti              1.57
GeForce GTX 1070                1.55
GeForce GTX 1060                1.35
GeForce GTX 780 Ti              1.28
Radeon RX 580                   1.24
GeForce GTX 980                 1.17
Radeon RX 480                   1.06
GeForce GTX 1050                1.04
GeForce GTX 970                 1.02
Radeon RX 560                   0.99
Radeon R9 Fury                  0.74
GeForce GTX 960                 0.72
Radeon R9 290                   0.56

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0, OpenCL Device: GPU - Scene: Hotel (score per Watt, more is better)

GPU                    Score per Watt
GeForce GTX 1080 Ti             13.50
GeForce GTX 1080                13.40
GeForce GTX 1070                12.47
GeForce GTX 1060                11.42
GeForce GTX 1050                 9.47
GeForce GTX 980 Ti               8.82
GeForce GTX 980                  8.62
GeForce GTX 970                  8.60
GeForce GTX 960                  7.32
Radeon RX 580                    7.27
Radeon R9 Fury                   6.76
Radeon RX 480                    6.64
Radeon RX 560                    5.37
Radeon R9 290                    4.54
GeForce GTX 780 Ti               4.28

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s per Watt, more is better)

GPU                    GB/s per Watt
GeForce GTX 1080                3.68
GeForce GTX 1070                3.28
GeForce GTX 1060                3.24
GeForce GTX 1080 Ti             3.14
GeForce GTX 1050                3.08
GeForce GTX 960                 2.22
GeForce GTX 980                 2.08
GeForce GTX 970                 1.97
GeForce GTX 980 Ti              1.72
Radeon RX 580                   1.61
Radeon RX 480                   1.43
Radeon R9 290                   1.41
Radeon RX 560                   1.39
Radeon R9 Fury                  1.38
GeForce GTX 780 Ti              1.20

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS per Watt, more is better)

GPU                    GFLOPS per Watt
GeForce GTX 1080 Ti              65.10
GeForce GTX 1080                 60.50
GeForce GTX 1070                 50.59
GeForce GTX 1060                 40.80
Radeon RX 580                    37.73
Radeon RX 480                    36.55
Radeon R9 Fury                   33.16
GeForce GTX 980 Ti               31.24
GeForce GTX 980                  30.96
GeForce GTX 970                  29.25
Radeon RX 560                    28.37
GeForce GTX 960                  23.72
Radeon R9 290                    23.40
GeForce GTX 1050                 23.27
GeForce GTX 780 Ti               21.88

FAHBench

FAHBench 2.3.2 (ns per day per Watt, more is better)

GPU                    ns/day per Watt
GeForce GTX 1080 Ti               1.03
GeForce GTX 1080                  0.98
GeForce GTX 1070                  0.95
GeForce GTX 1060                  0.82
GeForce GTX 980                   0.64
GeForce GTX 980 Ti                0.61
GeForce GTX 970                   0.60
GeForce GTX 1050                  0.55
GeForce GTX 960                   0.49
GeForce GTX 780 Ti                0.36
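The per-Watt charts can be cross-checked against the raw results. The sketch below assumes that a per-Watt figure is the raw result divided by the average system power drawn during that test; the export does not state this explicitly, so treat it as an assumption. Under it, dividing the FAHBench ns/day result by the ns/day-per-Watt figure gives the implied average power draw. Both sets of numbers are copied from the FAHBench charts in this export; the helper name `implied_average_watts` is hypothetical. Because the per-Watt values are rounded to two decimals, the implied wattages are approximate.

```python
# FAHBench 2.3.2 results (ns/day), copied from the results chart in this export.
fahbench_ns_per_day = {
    "GeForce GTX 1080 Ti": 186.74,
    "GeForce GTX 1080": 145.38,
    "GeForce GTX 1070": 132.79,
    "GeForce GTX 980 Ti": 108.15,
    "GeForce GTX 980": 97.38,
    "GeForce GTX 1060": 97.24,
    "GeForce GTX 970": 85.45,
    "GeForce GTX 780 Ti": 72.77,
    "GeForce GTX 960": 58.35,
    "GeForce GTX 1050": 49.78,
}

# FAHBench per-Watt figures (ns/day per Watt), copied from the per-Watt chart.
fahbench_per_watt = {
    "GeForce GTX 1080 Ti": 1.03,
    "GeForce GTX 1080": 0.98,
    "GeForce GTX 1070": 0.95,
    "GeForce GTX 1060": 0.82,
    "GeForce GTX 980": 0.64,
    "GeForce GTX 980 Ti": 0.61,
    "GeForce GTX 970": 0.60,
    "GeForce GTX 1050": 0.55,
    "GeForce GTX 960": 0.49,
    "GeForce GTX 780 Ti": 0.36,
}


def implied_average_watts(result, per_watt):
    """Assumes per_watt == result / average_watts, so average_watts == result / per_watt."""
    return result / per_watt


implied = {
    gpu: implied_average_watts(fahbench_ns_per_day[gpu], fahbench_per_watt[gpu])
    for gpu in fahbench_ns_per_day
}

# Print from lowest to highest implied average system power.
for gpu, watts in sorted(implied.items(), key=lambda item: item[1]):
    print(f"{gpu:<20} ~{watts:6.1f} W")
```

The implied figures are whole-system wattages, in line with the System Power Consumption Monitor charts, not GPU-only power.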

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0, OpenCL Device: GPU - Scene: Luxball HDR (score per Watt, more is better)

GPU                    Score per Watt
GeForce GTX 1070                86.60
GeForce GTX 1060                76.35
GeForce GTX 1080 Ti             76.06
GeForce GTX 1080                69.19
GeForce GTX 1050                60.24
GeForce GTX 980 Ti              58.36
GeForce GTX 980                 57.70
Radeon R9 Fury                  57.12
GeForce GTX 970                 55.99
Radeon RX 480                   53.02
Radeon RX 580                   52.49
Radeon RX 560                   42.74
GeForce GTX 960                 42.25
Radeon R9 290                   41.05
GeForce GTX 780 Ti              32.25

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Triad (GB/s, more is better; N = 3 for every result)

GPU                     GB/s   SE +/-
GeForce GTX 1080 Ti    12.51     0.00
GeForce GTX 780 Ti     12.35     0.00
GeForce GTX 1080       12.31     0.00
GeForce GTX 980 Ti     12.29     0.00
GeForce GTX 1070       12.20     0.00
GeForce GTX 980        12.03     0.00
GeForce GTX 1060       11.96     0.01
GeForce GTX 970        11.91     0.01
GeForce GTX 960        11.24     0.00
Radeon R9 Fury         10.17     0.22
Radeon RX 580           9.68     0.04
Radeon RX 480           9.61     0.04
GeForce GTX 1050        6.12     0.00
Radeon RX 560           4.95     0.01

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

cl-mem

Benchmark: Copy

cl-mem 2017-01-13, Benchmark: Copy (GB/s per Watt, more is better)

GPU                    GB/s per Watt
GeForce GTX 1080                1.44
GeForce GTX 1070                1.39
Radeon RX 480                   1.35
Radeon R9 Fury                  1.28
GeForce GTX 780 Ti              1.22
Radeon RX 580                   1.11
Radeon R9 290                   1.11
GeForce GTX 1060                1.07
GeForce GTX 980 Ti              1.04
GeForce GTX 980                 0.91
GeForce GTX 1050                0.90
Radeon RX 560                   0.89
GeForce GTX 970                 0.88
GeForce GTX 960                 0.60

Mixbench

Benchmark: Single Precision

Mixbench 2016-06-06, Benchmark: Single Precision (GFLOPS per Watt, more is better)

GPU                    GFLOPS per Watt
Radeon RX 580                    70.48
Radeon R9 Fury                   58.60
Radeon RX 480                    53.37
GeForce GTX 1060                 49.63
GeForce GTX 980                  45.63
GeForce GTX 780 Ti               31.19
Radeon RX 560                    30.69

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 2.0.3, Test: Server Room - Acceleration: OpenCL (seconds, fewer is better; N = 3 for every result)

GPU                    Seconds   SE +/-
Radeon RX 480             0.13     0.00
Radeon R9 Fury            0.13     0.00
Radeon R9 290             0.15     0.02
Radeon RX 580             0.17     0.00
GeForce GTX 1080          0.18     0.00
GeForce GTX 1080 Ti       0.18     0.00
GeForce GTX 1060          0.19     0.00
GeForce GTX 1070          0.19     0.00
GeForce GTX 980           0.21     0.00
GeForce GTX 970           0.22     0.00
GeForce GTX 980 Ti        0.22     0.00
GeForce GTX 1050          0.23     0.00
GeForce GTX 780 Ti        0.25     0.00
GeForce GTX 960           0.25     0.01
Radeon RX 560             0.27     0.01

clpeak

OpenCL Test: Kernel Latency

clpeak, OpenCL Test: Kernel Latency (us, fewer is better; N = 3 for every result)

GPU                      us   SE +/-
GeForce GTX 1070       3.55     0.01
GeForce GTX 1080 Ti    3.55     0.01
GeForce GTX 1050       3.82     0.01
GeForce GTX 960        3.88     0.01
GeForce GTX 970        4.05     0.05
GeForce GTX 1080       4.06     0.27
GeForce GTX 980        4.07     0.03
GeForce GTX 1060       4.10     0.36
GeForce GTX 980 Ti     4.33     0.14
GeForce GTX 780 Ti     5.56     0.04
Radeon RX 580          5.63     0.01
Radeon RX 560          5.82     0.01
Radeon RX 480          5.86     0.01
Radeon R9 Fury         5.94     0.01
Radeon R9 290          6.67     0.39

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

System Power Consumption Monitor, Phoronix Test Suite System Monitoring (Watts)

GPU                      Min      Avg      Max
Radeon RX 560           50.2    92.38    145.4
GeForce GTX 1050        44.2    92.91    132
GeForce GTX 1060        48.3   116.84    190.4
GeForce GTX 960         52.1   123.58    210.4
GeForce GTX 1070        47.8   138.1     211.9
Radeon RX 480           66.5   142.76    205.4
Radeon RX 580           61.1   144.94    235
GeForce GTX 970         55.7   145.83    234.2
GeForce GTX 1080        48.9   149.85    266.2
GeForce GTX 980         54.8   152.53    254.7
Radeon R9 290          108.1   170.72    311.4
Radeon R9 Fury          60.8   177.77    346
GeForce GTX 980 Ti      60.5   188.98    336.2
GeForce GTX 1080 Ti     51.8   191.22    330.8
GeForce GTX 780 Ti      57.6   202.67    336.8

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 2.0.3, Test: Masskrug - Acceleration: OpenCL (seconds, fewer is better; N = 3 for every result)

GPU                    Seconds   SE +/-
Radeon R9 Fury            0.13     0.00
Radeon RX 480             0.14     0.01
Radeon R9 290             0.17     0.01
GeForce GTX 1080          0.17     0.00
GeForce GTX 1080 Ti       0.17     0.00
Radeon RX 580             0.18     0.02
GeForce GTX 1060          0.18     0.00
GeForce GTX 1070          0.18     0.00
GeForce GTX 970           0.21     0.00
GeForce GTX 980           0.21     0.00
GeForce GTX 980 Ti        0.22     0.01
GeForce GTX 1050          0.22     0.00
GeForce GTX 960           0.24     0.00
Radeon RX 560             0.27     0.01
GeForce GTX 780 Ti        0.27     0.01


Phoronix Test Suite v10.8.5