Radeon ROCm vs. NVIDIA OpenCL August 2017

Radeon ROCm and NVIDIA OpenCL Linux testing by Michael Larabel for a future article on Phoronix.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1708103-TY-1708107TY43
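
For scripted use, the comparison command quoted above can be wrapped in a few lines of Python. This is only a sketch: the function names are made up for illustration, and the only things taken from the page are the `phoronix-test-suite benchmark` subcommand and the result identifier. It assumes the Phoronix Test Suite is installed and on the PATH.

```python
import shutil
import subprocess
from typing import List

# Result identifier quoted in the text above.
RESULT_ID = "1708103-TY-1708107TY43"


def build_command(result_id: str) -> List[str]:
    """Return the argv list for comparing a local system against a result file."""
    return ["phoronix-test-suite", "benchmark", result_id]


def run_comparison(result_id: str = RESULT_ID) -> int:
    """Run the comparison interactively; hypothetical helper, not part of PTS."""
    if shutil.which("phoronix-test-suite") is None:
        raise FileNotFoundError("phoronix-test-suite was not found on PATH")
    # The suite prompts on the terminal, so stdin/stdout are inherited.
    return subprocess.run(build_command(result_id), check=False).returncode


# Show the exact command without running it.
print(" ".join(build_command(RESULT_ID)))
```

Calling `run_comparison()` would start the benchmark run; it is left uncalled here because the suite downloads tests and asks questions interactively.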

Result identifiers and run dates:
  Radeon R9 285: August 08 2017
  Radeon R9 290: August 08 2017
  Radeon RX 480: August 08 2017
  Radeon RX 560: August 08 2017
  Radeon RX 580: August 08 2017
  Radeon R9 Fury: August 08 2017
  GeForce GTX 780 Ti: August 02 2017
  GeForce GTX 960: August 06 2017
  GeForce GTX 970: August 02 2017
  GeForce GTX 980: August 01 2017
  GeForce GTX 980 Ti: August 01 2017
  GeForce GTX 1050: August 01 2017
  GeForce GTX 1060: August 01 2017
  GeForce GTX 1070: August 01 2017
  GeForce GTX 1080: August 01 2017
  GeForce GTX 1080 Ti: August 02 2017
  Radeon Vega FE: August 10 2017


System details

Radeon R9 285
  Processor: Intel Core i7-7740K @ 4.50GHz (8 Cores)
  Motherboard: ASUS PRIME X299-A
  Chipset: Intel Device 591f
  Memory: 16384MB
  Disk: 525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GB
  Graphics: XFX AMD TONGA 2048MB
  Audio: Realtek Generic
  Monitor: Acer B286HK
  Network: Intel Connection
  OS: Ubuntu 16.04
  Kernel: 4.11.0-kfd-compute-rocm-rel-1.6-127 (x86_64)
  Desktop: Unity 7.4.0
  Display Driver: modesetting 1.19.3
  OpenGL: 4.5 Mesa 17.3.0-devel- padoka PPA (LLVM 6.0.0)
  Vulkan: 1.0.42
  Compiler: GCC 5.4.0 20160609
  File-System: ext4
  Screen Resolution: 3840x2160

The source table lists only these values for the remaining systems; other fields are not repeated.

Radeon R9 290
  Graphics: XFX AMD HAWAII 4096MB
  OpenCL: OpenCL 2.0 AMD-APP (2450.0)
Radeon RX 480
  Graphics: AMD POLARIS10 8192MB
  Display Driver: amdgpu 1.3.0
Radeon RX 560
  Graphics: AMD POLARIS11 4096MB
Radeon RX 580
  Graphics: MSI AMD POLARIS10 8192MB
  Display Driver: modesetting 1.19.3
Radeon R9 Fury
  Graphics: Sapphire AMD FIJI 4096MB
GeForce GTX 780 Ti
  Graphics: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)
  Audio: Realtek ALC1220
  Kernel: 4.13.0-999-generic (x86_64) 20170730
  Display Driver: NVIDIA 384.59
  OpenGL: 4.5.0
  OpenCL: OpenCL 1.2 CUDA 9.0.130
GeForce GTX 960
  Graphics: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)
GeForce GTX 970
  Graphics: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)
GeForce GTX 980
  Graphics: NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)
GeForce GTX 980 Ti
  Graphics: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
GeForce GTX 1050
  Graphics: Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)
GeForce GTX 1060
  Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)
GeForce GTX 1070
  Graphics: NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)
GeForce GTX 1080
  Graphics: NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)
GeForce GTX 1080 Ti
  Graphics: NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)
Radeon Vega FE
  Processor: AMD Ryzen 5 1600 Six-Core @ 3.20GHz (12 Cores)
  Motherboard: Gigabyte AB350M-D3H-CF
  Chipset: AMD Device 1450
  Disk: Samsung SSD 960 EVO 500GB
  Graphics: amdgpudrmfb
  Audio: AMD Device aaf8
  Monitor: HP 27er
  Network: Realtek RTL8111/8168/8411
  Kernel: 4.11.0-kfd-compute-rocm-rel-1.6-127 (x86_64)
  OpenCL: OpenCL 2.0 AMD-APP (2450.0)
  Screen Resolution: 1920x1080

Compiler Details
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  All systems except Radeon Vega FE: Scaling Governor: intel_pstate performance
  Radeon Vega FE: Scaling Governor: acpi-cpufreq ondemand

OpenCL Details
  GeForce GTX 780 Ti: GPU Compute Cores: 2880
  GeForce GTX 960: GPU Compute Cores: 1024
  GeForce GTX 970: GPU Compute Cores: 1664
  GeForce GTX 980: GPU Compute Cores: 2048
  GeForce GTX 980 Ti: GPU Compute Cores: 2816
  GeForce GTX 1050: GPU Compute Cores: 640
  GeForce GTX 1060: GPU Compute Cores: 1280
  GeForce GTX 1070: GPU Compute Cores: 1920
  GeForce GTX 1080: GPU Compute Cores: 2560
  GeForce GTX 1080 Ti: GPU Compute Cores: 3584

System Details
  The same GPU Compute Cores values as listed under OpenCL Details.

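Result overview

A dash marks a test with no value in the source table. The Radeon R9 285 has no result values in the source table.

Radeon results:
Test | R9 290 | RX 480 | RX 560 | RX 580 | R9 Fury | Vega FE
cl-mem: Read | 123.00 | 157.87 | 93.50 | 159.73 | 123.30 | 154.20
cl-mem: Write | 217.70 | 179.70 | 80.90 | 180.70 | 391.50 | 362.80
cl-mem: Copy | 192.40 | 185.37 | 81.03 | 183.47 | 206.13 | 211.93
clpeak: Global Memory Bandwidth | 272.17 | 209.39 | 89.25 | 208.96 | 431.26 | -
clpeak: Single-Precision Float | 4755.89 | 5737.62 | 2588.66 | 6175.41 | 7068.81 | -
clpeak: Double-Precision Double | 599.06 | 364.04 | 164.50 | 391.77 | 447.26 | -
clpeak: Integer Compute INT | 1595.41 | 1162.88 | 524.70 | 1251.63 | 1429.32 | -
clpeak: Transfer Bandwidth enqueueWriteBuffer | 30.33 | 30.64 | 30.38 | 30.38 | 30.74 | -
clpeak: Kernel Latency | 6.67 | 5.86 | 5.82 | 5.63 | 5.94 | -
darktable: Boat - OpenCL | 3.85 | 4.16 | 6.91 | 4.08 | 3.42 | 3.78
darktable: Masskrug - OpenCL | 0.17 | 0.14 | 0.27 | 0.18 | 0.13 | -
darktable: Server Room - OpenCL | 0.15 | 0.13 | 0.27 | 0.17 | 0.13 | -
fahbench | - | - | - | - | - | -
luxmark: GPU - Luxball HDR | 10400 | 9885 | 4468 | 10400 | 13212 | -
luxmark: GPU - Hotel | 1013 | 1070 | 509 | 1218 | 1367 | -
mixbench: Single Precision | - | 5450.06 | 2465.62 | 5863.73 | 6508.34 | 92.74
mixbench: Double Precision | - | 362.29 | 164.00 | 389.97 | 440.85 | 12.66
mixbench: Integer | - | 1140.24 | 508.69 | 1227.32 | 1386.50 | 19.70
shoc: OpenCL - Max SP Flops | 4790.31 | 5812.64 | 2634.53 | 6260.48 | 7144.88 | 12653.80
shoc: OpenCL - Texture Read Bandwidth | * | 193.43 | 115.81 | 207.97 | 245.68 | 422.09
shoc: OpenCL - FFT SP | * | 462.35 | 209.76 | 498.10 | 550.79 | 885.04
shoc: OpenCL - MD5 Hash | * | 7.33 | 3.34 | 7.96 | 9.09 | 16.00
shoc: OpenCL - Triad | * | 9.61 | 4.95 | 9.68 | 10.17 | 11.82
viennacl: OpenCL LU Factorization | * | 12.05 | 12.08 | 11.98 | 21.76 | 30.68

* After Max SP Flops the source lists three more Radeon R9 290 values (253.64, 6.11, 20.97) without indicating which of these five tests they belong to.

GeForce GTX results:
Test | 780 Ti | 960 | 970 | 980 | 980 Ti | 1050 | 1060 | 1070 | 1080 | 1080 Ti
cl-mem: Read | 271.67 | 81.40 | 143.55 | 164.47 | 266.17 | 95.00 | 151.60 | 205.43 | 227.20 | 338.07
cl-mem: Write | 252.00 | 70.80 | 129.70 | 152.30 | 238.40 | 85.73 | 138.70 | 191.43 | 213.13 | 335.47
cl-mem: Copy | 237.10 | 70.60 | 125.27 | 142.60 | 216.37 | 87.50 | 137.70 | 186.63 | 206.53 | 316.80
clpeak: Global Memory Bandwidth | 252.98 | 81.12 | 143.41 | 164.10 | 263.32 | 92.43 | 146.23 | 196.36 | 218.15 | 329.43
clpeak: Single-Precision Float | 3847.88 | 2429.10 | 3728.55 | 4288.05 | 5292.12 | 1937.00 | 4157.55 | 6359.20 | 8249.62 | 11780.26
clpeak: Double-Precision Double | 246.21 | 92.44 | 137.56 | 159.68 | 195.67 | 66.88 | 150.41 | 225.45 | 295.22 | 415.06
clpeak: Integer Compute INT | 961.13 | 781.16 | 1128.64 | 1296.11 | 1583.85 | 568.64 | 1223.58 | 1631.03 | 2349.86 | 3200.30
clpeak: Transfer Bandwidth enqueueWriteBuffer | 12.36 | 12.46 | 12.49 | 12.48 | 12.21 | 6.44 | 12.37 | 12.61 | 12.59 | 12.61
clpeak: Kernel Latency | 5.56 | 3.88 | 4.05 | 4.07 | 4.33 | 3.82 | 4.10 | 3.55 | 4.06 | 3.55
darktable: Boat - OpenCL | 15.24 | 19.30 | 16.26 | 15.23 | 4.19 | 18.18 | 4.66 | 3.78 | 3.67 | 3.13
darktable: Masskrug - OpenCL | 0.27 | 0.24 | 0.21 | 0.21 | 0.22 | 0.22 | 0.18 | 0.18 | 0.17 | 0.17
darktable: Server Room - OpenCL | 0.25 | 0.25 | 0.22 | 0.21 | 0.22 | 0.23 | 0.19 | 0.19 | 0.18 | 0.18
fahbench | 72.77 | 58.35 | 85.45 | 97.38 | 108.15 | 49.78 | 97.24 | 132.79 | 145.38 | 186.74
luxmark: GPU - Luxball HDR | 9516 | 6148 | 10704 | 11959 | 14811 | 6569 | 11572 | 16186 | 12776 | 19662
luxmark: GPU - Hotel | 1199 | 1130 | 1649 | 1756 | 2142 | 1022 | 1742 | 2286 | 2725 | 3532
mixbench: Single Precision | 4263.71 | 2765.13 | 4125.24 | 4700.91 | 5563.69 | 2040.46 | 4389.15 | 6402.60 | 8493.48 | 89.23
mixbench: Double Precision | 245.11 | 92.74 | 136.18 | 159.86 | 194.72 | 66.76 | 152.16 | 223.87 | 295.34 | 6.39
mixbench: Integer | 968.93 | 835.01 | 1220.80 | 1402.27 | 1717.51 | 614.52 | 1366.84 | 2027.96 | 2662.46 | 27.88
shoc: OpenCL - Max SP Flops | 4944.72 | 2960.49 | 4361.65 | 5051.89 | 6208.65 | 2125.78 | 4829.99 | 7125.88 | 9446.74 | 13274.17
shoc: OpenCL - Texture Read Bandwidth | 287.39 | 277.11 | 288.87 | 332.11 | 351.45 | 274.91 | 382.21 | 454.47 | 520.13 | 596.82
shoc: OpenCL - FFT SP | 429.87 | 209.15 | 382.02 | 447.62 | 693.44 | 204.36 | 322.16 | 470.12 | 597.69 | 974.37
shoc: OpenCL - MD5 Hash | 4.66 | 4.46 | 6.54 | 7.54 | 9.27 | 3.24 | 7.36 | 10.61 | 14.29 | 19.74
shoc: OpenCL - Triad | 12.35 | 11.24 | 11.91 | 12.03 | 12.29 | 6.12 | 11.96 | 12.20 | 12.31 | 12.51
viennacl: OpenCL LU Factorization | 54.59 | 47.64 | 53.03 | 54.96 | 56.91 | 41.80 | 54.03 | 58.95 | 61.26 | 63.67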

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13 - Benchmark: Read (GB/s, More Is Better)
  GeForce GTX 960: 81.40
  Radeon RX 560: 93.50
  GeForce GTX 1050: 95.00
  Radeon R9 290: 123.00
  Radeon R9 Fury: 123.30
  GeForce GTX 970: 143.55
  GeForce GTX 1060: 151.60
  Radeon Vega FE: 154.20
  Radeon RX 480: 157.87
  Radeon RX 580: 159.73
  GeForce GTX 980: 164.47
  GeForce GTX 1070: 205.43
  GeForce GTX 1080: 227.20
  GeForce GTX 980 Ti: 266.17
  GeForce GTX 780 Ti: 271.67
  GeForce GTX 1080 Ti: 338.07
Standard errors as listed in the source (15 entries for 16 systems): SE +/- 0.00, N = 2; SE +/- 0.00, N = 3; SE +/- 0.30, N = 3; SE +/- 0.12, N = 3; SE +/- 0.05, N = 2; SE +/- 0.21, N = 3; SE +/- 0.10, N = 3; SE +/- 2.42, N = 3; SE +/- 0.41, N = 3; SE +/- 0.03, N = 3; SE +/- 0.03, N = 3; SE +/- 1.03, N = 3; SE +/- 0.03, N = 3; SE +/- 0.03, N = 3; SE +/- 0.22, N = 3
1. (CC) gcc options: -O2 -flto -lOpenCL
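
The "SE +/- x, N = n" annotations report a standard error over N runs. As a rough illustration, the usual standard error of the mean is the sample standard deviation divided by the square root of N. The sketch below assumes that definition (the page does not say which formula the Phoronix Test Suite applies), and the three run values are invented for the example rather than taken from this result file.

```python
import math
import statistics
from typing import Sequence


def standard_error(samples: Sequence[float]) -> float:
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical GB/s readings from three runs of one benchmark (not source data).
runs = [94.5, 95.0, 95.5]

print(f"mean = {statistics.mean(runs):.2f}, SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```

A small SE relative to the mean, as in most of the results here, indicates that repeated runs agreed closely.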

cl-mem 2017-01-13 - Benchmark: Read (GB/s Per Watt, More Is Better)
  Radeon R9 290: 0.56
  GeForce GTX 960: 0.72
  Radeon R9 Fury: 0.74
  Radeon RX 560: 0.99
  GeForce GTX 970: 1.02
  GeForce GTX 1050: 1.04
  Radeon RX 480: 1.06
  GeForce GTX 980: 1.17
  Radeon RX 580: 1.24
  GeForce GTX 780 Ti: 1.28
  GeForce GTX 1060: 1.35
  GeForce GTX 1070: 1.55
  GeForce GTX 980 Ti: 1.57
  GeForce GTX 1080: 1.79

cl-mem 2017-01-13 - System Power Consumption Monitor (Watts, Fewer Is Better)
  Radeon R9 290: Min: 208.1 / Avg: 219.88 / Max: 227.1
  GeForce GTX 1080 Ti: Min: 213.3 / Avg: 215.1 / Max: 216.9
  GeForce GTX 780 Ti: Min: 149 / Avg: 212.7 / Max: 266.1
  GeForce GTX 980 Ti: Min: 63.5 / Avg: 169.7 / Max: 225.2
  Radeon R9 Fury: Min: 111.7 / Avg: 167.53 / Max: 194.9
  Radeon RX 480: Min: 67.2 / Avg: 149.58 / Max: 172.7
  GeForce GTX 970: Min: 86.1 / Avg: 140.75 / Max: 165.4
  GeForce GTX 980: Min: 102 / Avg: 140.4 / Max: 165.1
  GeForce GTX 1070: Min: 50.4 / Avg: 132.24 / Max: 154.3
  Radeon RX 580: Min: 61.5 / Avg: 128.3 / Max: 177.9
  GeForce GTX 1080: Min: 99.7 / Avg: 126.93 / Max: 164.8
  GeForce GTX 1060: Min: 77.6 / Avg: 112.65 / Max: 130.4
  GeForce GTX 960: Min: 53 / Avg: 112.52 / Max: 126.6
  Radeon RX 560: Min: 61.8 / Avg: 94.24 / Max: 99.4
  GeForce GTX 1050: Min: 44.6 / Avg: 91.7 / Max: 98.4
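
The GB/s Per Watt figures for the Read benchmark appear to be the benchmark result divided by the average system power draw recorded during the test. That relationship is inferred from the numbers here, not stated on the page. The sketch below checks it for three cards using the Read results and the average wattages listed above; the function name is made up for illustration.

```python
def per_watt(result: float, avg_watts: float) -> float:
    """Performance-per-watt: benchmark result divided by average system power."""
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return result / avg_watts


# (Read result in GB/s, average system power in watts), copied from the listings above.
read_and_power = {
    "Radeon R9 290": (123.00, 219.88),
    "GeForce GTX 960": (81.40, 112.52),
    "GeForce GTX 1080": (227.20, 126.93),
}

for name, (gbps, watts) in read_and_power.items():
    print(f"{name}: {per_watt(gbps, watts):.2f} GB/s per watt")
```

The three values round to 0.56, 0.72 and 1.79, matching the per-watt results reported for those cards. Because the monitor measures whole-system power, the ratio reflects the test platform as well as the GPU.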

cl-mem 2017-01-13 - Benchmark: Write (GB/s, More Is Better)
  GeForce GTX 960: 70.80 (SE +/- 0.00, N = 2)
  Radeon RX 560: 80.90 (SE +/- 0.12, N = 3)
  GeForce GTX 1050: 85.73 (SE +/- 0.03, N = 3)
  GeForce GTX 970: 129.70 (SE +/- 0.00, N = 3)
  GeForce GTX 1060: 138.70 (SE +/- 0.31, N = 3)
  GeForce GTX 980: 152.30 (SE +/- 0.00, N = 3)
  Radeon RX 480: 179.70 (SE +/- 0.10, N = 3)
  Radeon RX 580: 180.70 (SE +/- 0.80, N = 3)
  GeForce GTX 1070: 191.43 (SE +/- 0.09, N = 3)
  GeForce GTX 1080: 213.13 (SE +/- 0.87, N = 3)
  Radeon R9 290: 217.70 (SE +/- 1.08, N = 3)
  GeForce GTX 980 Ti: 238.40 (SE +/- 0.00, N = 3)
  GeForce GTX 780 Ti: 252.00 (SE +/- 0.12, N = 3)
  GeForce GTX 1080 Ti: 335.47 (SE +/- 0.22, N = 3)
  Radeon Vega FE: 362.80 (SE +/- 0.21, N = 3)
  Radeon R9 Fury: 391.50 (SE +/- 4.06, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13 - Benchmark: Write (GB/s Per Watt, More Is Better)
  GeForce GTX 960: 0.59
  GeForce GTX 970: 0.81
  Radeon RX 560: 0.91
  GeForce GTX 1050: 0.97
  Radeon RX 480: 1.07
  GeForce GTX 780 Ti: 1.08
  GeForce GTX 1060: 1.10
  GeForce GTX 980: 1.11
  Radeon R9 290: 1.12
  GeForce GTX 980 Ti: 1.14
  Radeon RX 580: 1.34
  GeForce GTX 1080: 1.38
  GeForce GTX 1070: 1.43
  GeForce GTX 1080 Ti: 2.05
  Radeon R9 Fury: 2.41

cl-mem 2017-01-13 - System Power Consumption Monitor (Watts, Fewer Is Better)
  GeForce GTX 780 Ti: Min: 205.2 / Avg: 232.33 / Max: 267.5
  GeForce GTX 980 Ti: Min: 164 / Avg: 209.78 / Max: 231
  Radeon R9 290: Min: 110.3 / Avg: 194.92 / Max: 227.7
  Radeon RX 480: Min: 158.5 / Avg: 168.35 / Max: 173.8
  GeForce GTX 1080 Ti: Min: 131.4 / Avg: 163.5 / Max: 225.5
  Radeon R9 Fury: Min: 63.2 / Avg: 162.7 / Max: 196.7
  GeForce GTX 970: Min: 145.8 / Avg: 161.02 / Max: 165.8
  GeForce GTX 1080: Min: 120.4 / Avg: 154.88 / Max: 168.9
  GeForce GTX 980: Min: 55.9 / Avg: 137.83 / Max: 165.7
  Radeon RX 580: Min: 61.6 / Avg: 134.4 / Max: 178.4
  GeForce GTX 1070: Min: 50.8 / Avg: 133.6 / Max: 156.7
  GeForce GTX 1060: Min: 113.9 / Avg: 126.18 / Max: 129.8
  GeForce GTX 960: Min: 75.2 / Avg: 119.81 / Max: 126.4
  Radeon RX 560: Min: 50.7 / Avg: 89.22 / Max: 99.8
  GeForce GTX 1050: Min: 51 / Avg: 88.33 / Max: 98.2

cl-mem 2017-01-13 - Benchmark: Copy (GB/s, More Is Better)
  GeForce GTX 960: 70.60 (SE +/- 0.00, N = 2)
  Radeon RX 560: 81.03 (SE +/- 0.03, N = 3)
  GeForce GTX 1050: 87.50 (SE +/- 0.15, N = 3)
  GeForce GTX 970: 125.27 (SE +/- 0.03, N = 3)
  GeForce GTX 1060: 137.70 (SE +/- 0.40, N = 3)
  GeForce GTX 980: 142.60 (SE +/- 0.00, N = 3)
  Radeon RX 580: 183.47 (SE +/- 0.38, N = 3)
  Radeon RX 480: 185.37 (SE +/- 0.03, N = 3)
  GeForce GTX 1070: 186.63 (SE +/- 0.03, N = 3)
  Radeon R9 290: 192.40 (SE +/- 0.64, N = 3)
  Radeon R9 Fury: 206.13 (SE +/- 0.48, N = 3)
  GeForce GTX 1080: 206.53 (SE +/- 0.84, N = 3)
  Radeon Vega FE: 211.93 (SE +/- 0.32, N = 3)
  GeForce GTX 980 Ti: 216.37 (SE +/- 0.03, N = 3)
  GeForce GTX 780 Ti: 237.10 (SE +/- 0.00, N = 3)
  GeForce GTX 1080 Ti: 316.80 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13 - Benchmark: Copy (GB/s Per Watt, More Is Better)
  GeForce GTX 960: 0.60
  GeForce GTX 970: 0.88
  Radeon RX 560: 0.89
  GeForce GTX 1050: 0.90
  GeForce GTX 980: 0.91
  GeForce GTX 980 Ti: 1.04
  GeForce GTX 1060: 1.07
  Radeon R9 290: 1.11
  Radeon RX 580: 1.11
  GeForce GTX 780 Ti: 1.22
  Radeon R9 Fury: 1.28
  Radeon RX 480: 1.35
  GeForce GTX 1070: 1.39
  GeForce GTX 1080: 1.44

cl-mem 2017-01-13 - System Power Consumption Monitor (Watts, Fewer Is Better)
  GeForce GTX 1080 Ti: Min: 215.6 / Avg: 216.75 / Max: 217.9
  GeForce GTX 980 Ti: Min: 136.5 / Avg: 208.38 / Max: 235
  GeForce GTX 780 Ti: Min: 59.1 / Avg: 193.83 / Max: 264
  Radeon R9 290: Min: 110.7 / Avg: 173.45 / Max: 211.5
  Radeon RX 580: Min: 126.3 / Avg: 165.33 / Max: 179.6
  Radeon R9 Fury: Min: 63.6 / Avg: 161.35 / Max: 231
  GeForce GTX 980: Min: 110.7 / Avg: 156.45 / Max: 166
  GeForce GTX 1080: Min: 99.9 / Avg: 143.28 / Max: 167.2
  GeForce GTX 970: Min: 57.9 / Avg: 143.07 / Max: 165.3
  Radeon RX 480: Min: 73.8 / Avg: 137.13 / Max: 171.7
  GeForce GTX 1070: Min: 51.8 / Avg: 133.92 / Max: 156.5
  GeForce GTX 1060: Min: 123.2 / Avg: 128.26 / Max: 130.1
  GeForce GTX 960: Min: 75.3 / Avg: 117.34 / Max: 125.9
  GeForce GTX 1050: Min: 97.2 / Avg: 97.62 / Max: 98.3
  Radeon RX 560: Min: 51.1 / Avg: 91.53 / Max: 100.1

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak - OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
  GeForce GTX 960: 81.12 (SE +/- 0.01, N = 3)
  Radeon RX 560: 89.25 (SE +/- 0.01, N = 3)
  GeForce GTX 1050: 92.43 (SE +/- 0.18, N = 3)
  GeForce GTX 970: 143.41 (SE +/- 0.03, N = 3)
  GeForce GTX 1060: 146.23 (SE +/- 0.45, N = 3)
  GeForce GTX 980: 164.10 (SE +/- 0.05, N = 3)
  GeForce GTX 1070: 196.36 (SE +/- 0.15, N = 3)
  Radeon RX 580: 208.96 (SE +/- 0.10, N = 3)
  Radeon RX 480: 209.39 (SE +/- 0.03, N = 3)
  GeForce GTX 1080: 218.15 (SE +/- 3.85, N = 3)
  GeForce GTX 780 Ti: 252.98 (SE +/- 10.22, N = 3)
  GeForce GTX 980 Ti: 263.32 (SE +/- 0.10, N = 3)
  Radeon R9 290: 272.17 (SE +/- 0.02, N = 3)
  GeForce GTX 1080 Ti: 329.43 (SE +/- 0.82, N = 3)
  Radeon R9 Fury: 431.26 (SE +/- 0.90, N = 3)

clpeak - OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
  GeForce GTX 1050: 1937.00 (SE +/- 0.29, N = 3)
  GeForce GTX 960: 2429.10 (SE +/- 70.89, N = 3)
  Radeon RX 560: 2588.66 (SE +/- 0.20, N = 3)
  GeForce GTX 970: 3728.55 (SE +/- 0.43, N = 3)
  GeForce GTX 780 Ti: 3847.88 (SE +/- 0.62, N = 3)
  GeForce GTX 1060: 4157.55 (SE +/- 74.08, N = 3)
  GeForce GTX 980: 4288.05 (SE +/- 0.64, N = 3)
  Radeon R9 290: 4755.89 (SE +/- 0.30, N = 3)
  GeForce GTX 980 Ti: 5292.12 (SE +/- 13.41, N = 3)
  Radeon RX 480: 5737.62 (SE +/- 0.06, N = 3)
  Radeon RX 580: 6175.41 (SE +/- 0.38, N = 3)
  GeForce GTX 1070: 6359.20 (SE +/- 0.57, N = 3)
  Radeon R9 Fury: 7068.81 (SE +/- 0.51, N = 3)
  GeForce GTX 1080: 8249.62 (SE +/- 157.37, N = 3)
  GeForce GTX 1080 Ti: 11780.26 (SE +/- 0.82, N = 3)

clpeak - OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
  GeForce GTX 1050: 66.88 (SE +/- 0.08, N = 3)
  GeForce GTX 960: 92.44 (SE +/- 0.01, N = 3)
  GeForce GTX 970: 137.56 (SE +/- 0.02, N = 3)
  GeForce GTX 1060: 150.41 (SE +/- 0.30, N = 3)
  GeForce GTX 980: 159.68 (SE +/- 0.07, N = 3)
  Radeon RX 560: 164.50 (SE +/- 0.02, N = 3)
  GeForce GTX 980 Ti: 195.67 (SE +/- 0.19, N = 3)
  GeForce GTX 1070: 225.45 (SE +/- 1.12, N = 3)
  GeForce GTX 780 Ti: 246.21 (SE +/- 0.29, N = 3)
  GeForce GTX 1080: 295.22 (SE +/- 0.76, N = 3)
  Radeon RX 480: 364.04 (SE +/- 0.00, N = 3)
  Radeon RX 580: 391.77 (SE +/- 0.07, N = 3)
  GeForce GTX 1080 Ti: 415.06 (SE +/- 1.65, N = 3)
  Radeon R9 Fury: 447.26 (SE +/- 0.00, N = 3)
  Radeon R9 290: 599.06 (SE +/- 0.00, N = 3)

clpeak - OpenCL Test: Double-Precision Double (GFLOPS Per Watt, More Is Better)
  GeForce GTX 960: 0.76
  GeForce GTX 1050: 0.78
  GeForce GTX 970: 1.03
  GeForce GTX 980 Ti: 1.03
  GeForce GTX 980: 1.05
  GeForce GTX 780 Ti: 1.12
  GeForce GTX 1060: 1.42
  GeForce GTX 1070: 1.63
  Radeon RX 560: 1.87
  GeForce GTX 1080: 2.04
  GeForce GTX 1080 Ti: 2.31
  Radeon R9 Fury: 2.52
  Radeon RX 480: 2.59
  Radeon RX 580: 2.83
  Radeon R9 290: 3.35

clpeak - System Power Consumption Monitor (Watts, Fewer Is Better)
  GeForce GTX 780 Ti: Min: 220.2 / Avg: 220.42 / Max: 221
  GeForce GTX 980 Ti: Min: 139.5 / Avg: 189.78 / Max: 196.9
  GeForce GTX 1080 Ti: Min: 57.5 / Avg: 179.46 / Max: 201.8
  Radeon R9 290: Min: 110.3 / Avg: 178.68 / Max: 199.3
  Radeon R9 Fury: Min: 63.9 / Avg: 177.34 / Max: 225.5
  GeForce GTX 980: Min: 57 / Avg: 152.7 / Max: 162.3
  GeForce GTX 1080: Min: 52.7 / Avg: 144.57 / Max: 160.3
  Radeon RX 480: Min: 73.8 / Avg: 140.82 / Max: 162.3
  Radeon RX 580: Min: 95.5 / Avg: 138.34 / Max: 170
  GeForce GTX 1070: Min: 133.6 / Avg: 138.18 / Max: 139.4
  GeForce GTX 970: Min: 61.8 / Avg: 134.1 / Max: 144.6
  GeForce GTX 960: Min: 85.3 / Avg: 121.06 / Max: 124.6
  GeForce GTX 1060: Min: 49.1 / Avg: 105.58 / Max: 118
  Radeon RX 560: Min: 51 / Avg: 87.81 / Max: 93.9
  GeForce GTX 1050: Min: 44.7 / Avg: 85.97 / Max: 90.4

clpeak - OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
  Radeon RX 560: 524.70 (SE +/- 0.03, N = 3)
  GeForce GTX 1050: 568.64 (SE +/- 15.64, N = 3)
  GeForce GTX 960: 781.16 (SE +/- 6.09, N = 3)
  GeForce GTX 780 Ti: 961.13 (SE +/- 17.95, N = 3)
  GeForce GTX 970: 1128.64 (SE +/- 22.00, N = 3)
  Radeon RX 480: 1162.88 (SE +/- 0.03, N = 3)
  GeForce GTX 1060: 1223.58 (SE +/- 52.05, N = 3)
  Radeon RX 580: 1251.63 (SE +/- 0.01, N = 3)
  GeForce GTX 980: 1296.11 (SE +/- 0.62, N = 3)
  Radeon R9 Fury: 1429.32 (SE +/- 0.05, N = 3)
  GeForce GTX 980 Ti: 1583.85 (SE +/- 16.78, N = 3)
  Radeon R9 290: 1595.41 (SE +/- 0.05, N = 3)
  GeForce GTX 1070: 1631.03 (SE +/- 19.23, N = 3)
  GeForce GTX 1080: 2349.86 (SE +/- 29.30, N = 3)
  GeForce GTX 1080 Ti: 3200.30 (SE +/- 117.22, N = 3)

clpeak - System Power Consumption Monitor (Watts, Fewer Is Better)
Systems in source order (15): GeForce GTX 1080 Ti, GeForce GTX 980, Radeon R9 290, Radeon R9 Fury, GeForce GTX 1080, GeForce GTX 980 Ti, Radeon RX 480, Radeon RX 580, GeForce GTX 970, GeForce GTX 960, GeForce GTX 780 Ti, GeForce GTX 1070, Radeon RX 560, GeForce GTX 1050, GeForce GTX 1060
Readings in source order (13 entries for 15 systems):
  Min: 254.1 / Avg: 254.4 / Max: 254.7
  Min: 158.9 / Avg: 196.69 / Max: 311.4
  Min: 91.2 / Avg: 171.7 / Max: 293.2
  Min: 51.8 / Avg: 159 / Max: 266.2
  Min: 63.5 / Avg: 152.7 / Max: 241.9
  Min: 74.5 / Avg: 139.09 / Max: 198.1
  Min: 62.1 / Avg: 135.37 / Max: 215.7
  Min: 82.9 / Avg: 130.65 / Max: 178.4
  Min: 53.3 / Avg: 128.25 / Max: 203.2
  Min: 59.7 / Avg: 109 / Max: 158.3
  Min: 50.7 / Avg: 88 / Max: 125.3
  Min: 57.7 / Avg: 83.39 / Max: 110.6
  Min: 44.6 / Avg: 55.5 / Max: 66.4

clpeak - OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
  GeForce GTX 1050: 6.44 (SE +/- 0.00, N = 3)
  GeForce GTX 980 Ti: 12.21 (SE +/- 0.09, N = 3)
  GeForce GTX 780 Ti: 12.36 (SE +/- 0.07, N = 3)
  GeForce GTX 1060: 12.37 (SE +/- 0.03, N = 3)
  GeForce GTX 960: 12.46 (SE +/- 0.01, N = 3)
  GeForce GTX 980: 12.48 (SE +/- 0.03, N = 3)
  GeForce GTX 970: 12.49 (SE +/- 0.01, N = 3)
  GeForce GTX 1080: 12.59 (SE +/- 0.01, N = 3)
  GeForce GTX 1070: 12.61 (SE +/- 0.00, N = 3)
  GeForce GTX 1080 Ti: 12.61 (SE +/- 0.02, N = 3)
  Radeon R9 290: 30.33 (SE +/- 0.07, N = 3)
  Radeon RX 560: 30.38 (SE +/- 0.00, N = 3)
  Radeon RX 580: 30.38 (SE +/- 0.01, N = 3)
  Radeon RX 480: 30.64 (SE +/- 0.33, N = 3)
  Radeon R9 Fury: 30.74 (SE +/- 0.33, N = 3)

clpeak - OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS Per Watt, More Is Better)
  GeForce GTX 980 Ti: 0.08
  GeForce GTX 1050: 0.08
  GeForce GTX 1080 Ti: 0.08
  GeForce GTX 980: 0.11
  GeForce GTX 1070: 0.11
  GeForce GTX 1080: 0.11
  GeForce GTX 970: 0.12
  GeForce GTX 1060: 0.13
  Radeon R9 290: 0.18
  Radeon R9 Fury: 0.29
  Radeon RX 480: 0.31
  Radeon RX 580: 0.32
  Radeon RX 560: 0.39

clpeak - System Power Consumption Monitor (Watts, Fewer Is Better)
Systems in source order (15): GeForce GTX 780 Ti, Radeon R9 290, GeForce GTX 980 Ti, GeForce GTX 1080 Ti, GeForce GTX 1080, GeForce GTX 980, GeForce GTX 1070, Radeon R9 Fury, GeForce GTX 970, Radeon RX 480, Radeon RX 580, GeForce GTX 1060, Radeon RX 560, GeForce GTX 1050, GeForce GTX 960
Readings in source order (14 entries for 15 systems):
  Min: 170.7 / Avg: 175.55 / Max: 180.4
  Min: 146.4 / Avg: 167.81 / Max: 175
  Min: 134.8 / Avg: 156.02 / Max: 164.4
  Min: 149.6 / Avg: 155.28 / Max: 157.1
  Min: 99 / Avg: 118.32 / Max: 126.1
  Min: 90.6 / Avg: 117.27 / Max: 130.7
  Min: 103.2 / Avg: 114.13 / Max: 119.3
  Min: 63.2 / Avg: 106.3 / Max: 120.3
  Min: 97.8 / Avg: 103.53 / Max: 108
  Min: 76 / Avg: 99.48 / Max: 105.5
  Min: 61.8 / Avg: 95.67 / Max: 107
  Min: 75.5 / Avg: 92.25 / Max: 105.6
  Min: 51.2 / Avg: 76.94 / Max: 84.1
  Min: 65.2 / Avg: 76.8 / Max: 89.4

clpeak - OpenCL Test: Kernel Latency (us, Fewer Is Better)
  Radeon R9 290: 6.67 (SE +/- 0.39, N = 3)
  Radeon R9 Fury: 5.94 (SE +/- 0.01, N = 3)
  Radeon RX 480: 5.86 (SE +/- 0.01, N = 3)
  Radeon RX 560: 5.82 (SE +/- 0.01, N = 3)
  Radeon RX 580: 5.63 (SE +/- 0.01, N = 3)
  GeForce GTX 780 Ti: 5.56 (SE +/- 0.04, N = 3)
  GeForce GTX 980 Ti: 4.33 (SE +/- 0.14, N = 3)
  GeForce GTX 1060: 4.10 (SE +/- 0.36, N = 3)
  GeForce GTX 980: 4.07 (SE +/- 0.03, N = 3)
  GeForce GTX 1080: 4.06 (SE +/- 0.27, N = 3)
  GeForce GTX 970: 4.05 (SE +/- 0.05, N = 3)
  GeForce GTX 960: 3.88 (SE +/- 0.01, N = 3)
  GeForce GTX 1050: 3.82 (SE +/- 0.01, N = 3)
  GeForce GTX 1080 Ti: 3.55 (SE +/- 0.01, N = 3)
  GeForce GTX 1070: 3.55 (SE +/- 0.01, N = 3)

Darktable

Darktable 2.0.3 - Test: Boat - Acceleration: OpenCL (Seconds, Fewer Is Better)
  GeForce GTX 960: 19.30 (SE +/- 0.01, N = 3)
  GeForce GTX 1050: 18.18 (SE +/- 0.02, N = 3)
  GeForce GTX 970: 16.26 (SE +/- 0.03, N = 3)
  GeForce GTX 780 Ti: 15.24 (SE +/- 0.02, N = 3)
  GeForce GTX 980: 15.23 (SE +/- 0.02, N = 3)
  Radeon RX 560: 6.91 (SE +/- 0.03, N = 3)
  GeForce GTX 1060: 4.66 (SE +/- 0.01, N = 3)
  GeForce GTX 980 Ti: 4.19 (SE +/- 0.00, N = 3)
  Radeon RX 480: 4.16 (SE +/- 0.02, N = 3)
  Radeon RX 580: 4.08 (SE +/- 0.00, N = 3)
  Radeon R9 290: 3.85 (SE +/- 0.01, N = 3)
  Radeon Vega FE: 3.78 (SE +/- 0.03, N = 3)
  GeForce GTX 1070: 3.78 (SE +/- 0.00, N = 3)
  GeForce GTX 1080: 3.67 (SE +/- 0.01, N = 3)
  Radeon R9 Fury: 3.42 (SE +/- 0.05, N = 3)
  GeForce GTX 1080 Ti: 3.13 (SE +/- 0.01, N = 3)

Darktable 2.0.3 - Test: Masskrug - Acceleration: OpenCL (Seconds, Fewer Is Better)
  GeForce GTX 780 Ti: 0.27 (SE +/- 0.01, N = 3)
  Radeon RX 560: 0.27 (SE +/- 0.01, N = 3)
  GeForce GTX 960: 0.24 (SE +/- 0.00, N = 3)
  GeForce GTX 1050: 0.22 (SE +/- 0.00, N = 3)
  GeForce GTX 980 Ti: 0.22 (SE +/- 0.01, N = 3)
  GeForce GTX 980: 0.21 (SE +/- 0.00, N = 3)
  GeForce GTX 970: 0.21 (SE +/- 0.00, N = 3)
  GeForce GTX 1070: 0.18 (SE +/- 0.00, N = 3)
  GeForce GTX 1060: 0.18 (SE +/- 0.00, N = 3)
  Radeon RX 580: 0.18 (SE +/- 0.02, N = 3)
  GeForce GTX 1080 Ti: 0.17 (SE +/- 0.00, N = 3)
  GeForce GTX 1080: 0.17 (SE +/- 0.00, N = 3)
  Radeon R9 290: 0.17 (SE +/- 0.01, N = 3)
  Radeon RX 480: 0.14 (SE +/- 0.01, N = 3)
  Radeon R9 Fury: 0.13 (SE +/- 0.00, N = 3)

Darktable 2.0.3 - Test: Server Room - Acceleration: OpenCL (Seconds, Fewer Is Better)
  Radeon RX 560: 0.27 (SE +/- 0.01, N = 3)
  GeForce GTX 960: 0.25 (SE +/- 0.01, N = 3)
  GeForce GTX 780 Ti: 0.25 (SE +/- 0.00, N = 3)
  GeForce GTX 1050: 0.23 (SE +/- 0.00, N = 3)
  GeForce GTX 980 Ti: 0.22 (SE +/- 0.00, N = 3)
  GeForce GTX 970: 0.22 (SE +/- 0.00, N = 3)
  GeForce GTX 980: 0.21 (SE +/- 0.00, N = 3)
  GeForce GTX 1070: 0.19 (SE +/- 0.00, N = 3)
  GeForce GTX 1060: 0.19 (SE +/- 0.00, N = 3)
  GeForce GTX 1080 Ti: 0.18 (SE +/- 0.00, N = 3)
  GeForce GTX 1080: 0.18 (SE +/- 0.00, N = 3)
  Radeon RX 580: 0.17 (SE +/- 0.00, N = 3)
  Radeon R9 290: 0.15 (SE +/- 0.02, N = 3)
  Radeon R9 Fury: 0.13 (SE +/- 0.00, N = 3)
  Radeon RX 480: 0.13 (SE +/- 0.00, N = 3)

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
  GeForce GTX 1050: 49.78 (SE +/- 0.01, N = 3)
  GeForce GTX 960: 58.35 (SE +/- 0.02, N = 3)
  GeForce GTX 780 Ti: 72.77 (SE +/- 0.09, N = 3)
  GeForce GTX 970: 85.45 (SE +/- 0.05, N = 3)
  GeForce GTX 1060: 97.24 (SE +/- 0.11, N = 3)
  GeForce GTX 980: 97.38 (SE +/- 0.05, N = 3)
  GeForce GTX 980 Ti: 108.15 (SE +/- 0.09, N = 3)
  GeForce GTX 1070: 132.79 (SE +/- 0.17, N = 3)
  GeForce GTX 1080: 145.38 (SE +/- 0.08, N = 3)
  GeForce GTX 1080 Ti: 186.74 (SE +/- 0.26, N = 3)

FAHBench 2.3.2 (Ns Per Day Per Watt, More Is Better)
  GeForce GTX 780 Ti: 0.36
  GeForce GTX 960: 0.49
  GeForce GTX 1050: 0.55
  GeForce GTX 970: 0.60
  GeForce GTX 980 Ti: 0.61
  GeForce GTX 980: 0.64
  GeForce GTX 1060: 0.82
  GeForce GTX 1070: 0.95
  GeForce GTX 1080: 0.98
  GeForce GTX 1080 Ti: 1.03

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.0 - OpenCL Device: GPU - Scene: Luxball HDR (Score, More Is Better)
  Radeon RX 560: 4468
  GeForce GTX 960: 6148
  GeForce GTX 1050: 6569
  GeForce GTX 780 Ti: 9516
  Radeon RX 480: 9885
  Radeon R9 290: 10400
  Radeon RX 580: 10400
  GeForce GTX 970: 10704
  GeForce GTX 1060: 11572
  GeForce GTX 980: 11959
  GeForce GTX 1080: 12776
  Radeon R9 Fury: 13212
  GeForce GTX 980 Ti: 14811
  GeForce GTX 1070: 16186
  GeForce GTX 1080 Ti: 19662
Standard errors as listed in the source (14 entries for 15 systems): SE +/- 8.69, N = 3; SE +/- 20.00, N = 3; SE +/- 15.33, N = 3; SE +/- 21.08, N = 3; SE +/- 3.67, N = 3; SE +/- 3.00, N = 3; SE +/- 3.33, N = 3; SE +/- 38.33, N = 3; SE +/- 26.67, N = 3; SE +/- 51.02, N = 3; SE +/- 66.33, N = 3; SE +/- 118.82, N = 3; SE +/- 3.84, N = 3; SE +/- 13.57, N = 3

LuxMark 3.0 - OpenCL Device: GPU - Scene: Luxball HDR (Score Per Watt, More Is Better)
  GeForce GTX 780 Ti: 32.25
  Radeon R9 290: 41.05
  GeForce GTX 960: 42.25
  Radeon RX 560: 42.74
  Radeon RX 580: 52.49
  Radeon RX 480: 53.02
  GeForce GTX 970: 55.99
  Radeon R9 Fury: 57.12
  GeForce GTX 980: 57.70
  GeForce GTX 980 Ti: 58.36
  GeForce GTX 1050: 60.24
  GeForce GTX 1080: 69.19
  GeForce GTX 1080 Ti: 76.06
  GeForce GTX 1060: 76.35
  GeForce GTX 1070: 86.60

LuxMark 3.0 - System Power Consumption Monitor (Watts, Fewer Is Better)
Systems in source order (16): GeForce GTX 1080 Ti, GeForce GTX 980 Ti, GeForce GTX 980, GeForce GTX 970, Radeon R9 285, GeForce GTX 780 Ti, Radeon R9 290, Radeon R9 Fury, Radeon RX 580, GeForce GTX 1070, Radeon RX 480, GeForce GTX 1080, GeForce GTX 1060, GeForce GTX 960, GeForce GTX 1050, Radeon RX 560
Readings in source order (15 entries for 16 systems):
  Min: 103.3 / Avg: 258.5 / Max: 265.6
  Min: 120.1 / Avg: 253.78 / Max: 259
  Min: 136.6 / Avg: 207.25 / Max: 212.8
  Min: 88.4 / Avg: 191.17 / Max: 194.5
  Min: 127.9 / Avg: 295.03 / Max: 310.7
  Min: 108.1 / Avg: 253.37 / Max: 262.7
  Min: 60.8 / Avg: 231.32 / Max: 244.7
  Min: 61.7 / Avg: 198.14 / Max: 205.1
  Min: 78.2 / Avg: 186.9 / Max: 196.9
  Min: 71.5 / Avg: 186.43 / Max: 197.7
  Min: 81.4 / Avg: 184.66 / Max: 189.3
  Min: 50 / Avg: 151.57 / Max: 157.1
  Min: 73.2 / Avg: 145.52 / Max: 149.6
  Min: 62.1 / Avg: 109.06 / Max: 119.8
  Min: 50.4 / Avg: 104.54 / Max: 107.3

LuxMark 3.0 - OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)
  Radeon RX 560: 509 (SE +/- 2.33, N = 3)
  Radeon R9 290: 1013 (SE +/- 0.67, N = 3)
  GeForce GTX 1050: 1022 (SE +/- 1.00, N = 3)
  Radeon RX 480: 1070 (SE +/- 0.67, N = 3)
  GeForce GTX 960: 1130 (SE +/- 7.64, N = 3)
  GeForce GTX 780 Ti: 1199 (SE +/- 5.33, N = 3)
  Radeon RX 580: 1218 (SE +/- 3.33, N = 3)
  Radeon R9 Fury: 1367 (SE +/- 3.84, N = 3)
  GeForce GTX 970: 1649 (SE +/- 36.71, N = 3)
  GeForce GTX 1060: 1742 (SE +/- 9.21, N = 3)
  GeForce GTX 980: 1756 (SE +/- 55.00, N = 3)
  GeForce GTX 980 Ti: 2142 (SE +/- 36.99, N = 3)
  GeForce GTX 1070: 2286 (SE +/- 6.67, N = 3)
  GeForce GTX 1080: 2725 (SE +/- 28.15, N = 3)
  GeForce GTX 1080 Ti: 3532 (SE +/- 54.49, N = 3)

LuxMark 3.0 - OpenCL Device: GPU - Scene: Hotel (Score Per Watt, More Is Better)
  GeForce GTX 780 Ti: 4.28
  Radeon R9 290: 4.54
  Radeon RX 560: 5.37
  Radeon RX 480: 6.64
  Radeon R9 Fury: 6.76
  Radeon RX 580: 7.27
  GeForce GTX 960: 7.32
  GeForce GTX 970: 8.60
  GeForce GTX 980: 8.62
  GeForce GTX 980 Ti: 8.82
  GeForce GTX 1050: 9.47
  GeForce GTX 1060: 11.42
  GeForce GTX 1070: 12.47
  GeForce GTX 1080: 13.40
  GeForce GTX 1080 Ti: 13.50

LuxMark 3.0 - System Power Consumption Monitor (Watts, Fewer Is Better)
Systems in source order (16): Radeon R9 285, GeForce GTX 780 Ti, GeForce GTX 1080 Ti, GeForce GTX 980 Ti, Radeon R9 290, GeForce GTX 980, GeForce GTX 1080, Radeon R9 Fury, GeForce GTX 970, GeForce GTX 1070, Radeon RX 580, Radeon RX 480, GeForce GTX 960, GeForce GTX 1060, GeForce GTX 1050, Radeon RX 560
Readings in source order (15 entries for 16 systems):
  Min: 79.8 / Avg: 279.87 / Max: 294.8
  Min: 75.8 / Avg: 261.7 / Max: 278.5
  Min: 61 / Avg: 242.75 / Max: 254.4
  Min: 164 / Avg: 223.01 / Max: 246.2
  Min: 56.4 / Avg: 203.63 / Max: 213.1
  Min: 61.4 / Avg: 203.35 / Max: 209.7
  Min: 87.9 / Avg: 202.29 / Max: 248.6
  Min: 57.1 / Avg: 191.79 / Max: 197.1
  Min: 52.1 / Avg: 183.39 / Max: 188.4
  Min: 62.8 / Avg: 167.57 / Max: 197.1
  Min: 103.4 / Avg: 161.24 / Max: 183.2
  Min: 68.3 / Avg: 154.46 / Max: 158.8
  Min: 61.8 / Avg: 152.59 / Max: 156.6
  Min: 44.6 / Avg: 107.9 / Max: 115.3
  Min: 51.4 / Avg: 94.7 / Max: 102.8

Mixbench

A benchmark suite for GPUs on mixed operational intensity kernels. Learn more via the OpenBenchmarking.org test page.

Mixbench 2016-06-06 - Benchmark: Single Precision (GFLOPS, More Is Better)
  GeForce GTX 1080 Ti: 89.23 (SE +/- 3.63, N = 3)
  Radeon Vega FE: 92.74 (SE +/- 0.06, N = 3)
  GeForce GTX 1050: 2040.46 (SE +/- 0.81, N = 3)
  Radeon RX 560: 2465.62 (SE +/- 0.14, N = 3)
  GeForce GTX 960: 2765.13 (SE +/- 0.91, N = 3)
  GeForce GTX 970: 4125.24 (SE +/- 2.66, N = 3)
  GeForce GTX 780 Ti: 4263.71 (SE +/- 5.00, N = 3)
  GeForce GTX 1060: 4389.15 (SE +/- 2.95, N = 3)
  GeForce GTX 980: 4700.91 (SE +/- 5.25, N = 3)
  Radeon RX 480: 5450.06 (SE +/- 2.06, N = 3)
  GeForce GTX 980 Ti: 5563.69 (SE +/- 255.36, N = 3)
  Radeon RX 580: 5863.73 (SE +/- 4.20, N = 3)
  GeForce GTX 1070: 6402.60 (SE +/- 54.41, N = 3)
  Radeon R9 Fury: 6508.34 (SE +/- 1.46, N = 3)
  GeForce GTX 1080: 8493.48 (SE +/- 48.08, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2016-06-06 - Benchmark: Single Precision (GFLOPS Per Watt, More Is Better)
  Radeon RX 560: 30.69
  GeForce GTX 780 Ti: 31.19
  GeForce GTX 980: 45.63
  GeForce GTX 1060: 49.63
  Radeon RX 480: 53.37
  Radeon R9 Fury: 58.60
  Radeon RX 580: 70.48

Mixbench 2016-06-06 - Benchmark: Double Precision (GFLOPS, More Is Better)
  GeForce GTX 1080 Ti: 6.39 (SE +/- 0.04, N = 3)
  Radeon Vega FE: 12.66 (SE +/- 0.00, N = 3)
  GeForce GTX 1050: 66.76 (SE +/- 0.05, N = 3)
  GeForce GTX 960: 92.74 (SE +/- 0.25, N = 3)
  GeForce GTX 970: 136.18 (SE +/- 0.71, N = 3)
  GeForce GTX 1060: 152.16 (SE +/- 0.12, N = 3)
  GeForce GTX 980: 159.86 (SE +/- 0.01, N = 3)
  Radeon RX 560: 164.00 (SE +/- 0.00, N = 3)
  GeForce GTX 980 Ti: 194.72 (SE +/- 2.32, N = 3)
  GeForce GTX 1070: 223.87 (SE +/- 0.00, N = 3)
  GeForce GTX 780 Ti: 245.11 (SE +/- 0.15, N = 3)
  GeForce GTX 1080: 295.34 (SE +/- 0.92, N = 3)
  Radeon RX 480: 362.29 (SE +/- 0.01, N = 3)
  Radeon RX 580: 389.97 (SE +/- 0.00, N = 3)
  Radeon R9 Fury: 440.85 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench 2016-06-06 - Benchmark: Integer (GIOPS, More Is Better)

Radeon Vega FE: 19.70 (SE +/- 0.01, N = 3)
GeForce GTX 1080 Ti: 27.88 (SE +/- 1.30, N = 3)
Radeon RX 560: 508.69 (SE +/- 0.12, N = 3)
GeForce GTX 1050: 614.52 (SE +/- 0.97, N = 3)
GeForce GTX 960: 835.01 (SE +/- 2.57, N = 3)
GeForce GTX 780 Ti: 968.93 (SE +/- 0.88, N = 3)
Radeon RX 480: 1140.24 (SE +/- 0.04, N = 3)
GeForce GTX 970: 1220.80 (SE +/- 0.50, N = 3)
Radeon RX 580: 1227.32 (SE +/- 0.04, N = 3)
GeForce GTX 1060: 1366.84 (SE +/- 0.54, N = 3)
Radeon R9 Fury: 1386.50 (SE +/- 0.06, N = 3)
GeForce GTX 980: 1402.27 (SE +/- 0.90, N = 3)
GeForce GTX 980 Ti: 1717.51 (SE +/- 1.33, N = 3)
GeForce GTX 1070: 2027.96 (SE +/- 2.60, N = 3)
GeForce GTX 1080: 2662.46 (SE +/- 3.22, N = 3)

1. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)

GeForce GTX 1050: 2125.78 (SE +/- 0.09, N = 3)
Radeon RX 560: 2634.53 (SE +/- 0.11, N = 3)
GeForce GTX 960: 2960.49 (SE +/- 7.79, N = 3)
GeForce GTX 970: 4361.65 (SE +/- 1.82, N = 3)
Radeon R9 290: 4790.31 (SE +/- 0.03, N = 3)
GeForce GTX 1060: 4829.99 (SE +/- 6.12, N = 3)
GeForce GTX 780 Ti: 4944.72 (SE +/- 19.58, N = 3)
GeForce GTX 980: 5051.89 (SE +/- 3.16, N = 3)
Radeon RX 480: 5812.64 (SE +/- 1.92, N = 3)
GeForce GTX 980 Ti: 6208.65 (SE +/- 15.98, N = 3)
Radeon RX 580: 6260.48 (SE +/- 1.35, N = 3)
GeForce GTX 1070: 7125.88 (SE +/- 32.37, N = 3)
Radeon R9 Fury: 7144.88 (SE +/- 0.03, N = 3)
GeForce GTX 1080: 9446.74 (SE +/- 38.82, N = 3)
Radeon Vega FE: 12653.80 (SE +/- 8.00, N = 3)
GeForce GTX 1080 Ti: 13274.17 (SE +/- 65.79, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS Per Watt, More Is Better)

GeForce GTX 780 Ti: 21.88
GeForce GTX 1050: 23.27
Radeon R9 290: 23.40
GeForce GTX 960: 23.72
Radeon RX 560: 28.37
GeForce GTX 970: 29.25
GeForce GTX 980: 30.96
GeForce GTX 980 Ti: 31.24
Radeon R9 Fury: 33.16
Radeon RX 480: 36.55
Radeon RX 580: 37.73
GeForce GTX 1060: 40.80
GeForce GTX 1070: 50.59
GeForce GTX 1080: 60.50
GeForce GTX 1080 Ti: 65.10

SHOC Scalable HeterOgeneous Computing 2015-11-10 - System Power Consumption Monitor (Watts, Fewer Is Better)

GeForce GTX 780 Ti: Min: 59.2 / Avg: 226.01 / Max: 336.8
Radeon R9 Fury: Min: 121.6 / Avg: 215.47 / Max: 344.8
Radeon R9 290: Min: 109.8 / Avg: 204.67 / Max: 296.3
GeForce GTX 1080 Ti: Min: 52.6 / Avg: 203.91 / Max: 304.8
GeForce GTX 980 Ti: Min: 151.8 / Avg: 198.76 / Max: 304.5
Radeon RX 580: Min: 61.4 / Avg: 165.94 / Max: 235
GeForce GTX 980: Min: 55.9 / Avg: 163.16 / Max: 251
Radeon RX 480: Min: 66.7 / Avg: 159.05 / Max: 205.4
GeForce GTX 1080: Min: 127.9 / Avg: 156.13 / Max: 262.7
GeForce GTX 970: Min: 57.8 / Avg: 149.11 / Max: 234.2
GeForce GTX 1070: Min: 78.5 / Avg: 140.86 / Max: 210.3
GeForce GTX 960: Min: 117.1 / Avg: 124.83 / Max: 188.6
GeForce GTX 1060: Min: 73.6 / Avg: 118.38 / Max: 167.3
Radeon RX 560: Min: 50.6 / Avg: 92.86 / Max: 123.4
GeForce GTX 1050: Min: 65.4 / Avg: 91.36 / Max: 114.7
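
The GFLOPS Per Watt figures for Max SP Flops appear consistent with dividing each card's GFLOPS result by its average system power draw from the monitor chart above. That relationship is inferred from the published numbers rather than taken from Phoronix Test Suite documentation, so treat the following Python sketch as an assumption. The function name `perf_per_watt` is hypothetical, and only three cards are copied in for illustration.

```python
# Values copied from the Max SP Flops (GFLOPS) and power (Avg Watts) charts.
max_sp_gflops = {
    "GeForce GTX 780 Ti": 4944.72,
    "Radeon R9 Fury": 7144.88,
    "GeForce GTX 1080 Ti": 13274.17,
}
avg_system_watts = {
    "GeForce GTX 780 Ti": 226.01,
    "Radeon R9 Fury": 215.47,
    "GeForce GTX 1080 Ti": 203.91,
}


def perf_per_watt(result: float, avg_watts: float) -> float:
    """Assumed formula: benchmark result divided by average system power."""
    if avg_watts <= 0:
        raise ValueError("average power must be positive")
    return result / avg_watts


# Round to two decimals, matching the precision shown in the charts.
per_watt = {
    card: round(perf_per_watt(max_sp_gflops[card], avg_system_watts[card]), 2)
    for card in max_sp_gflops
}

for card, value in sorted(per_watt.items(), key=lambda item: item[1]):
    print(f"{card}: {value:.2f} GFLOPS per Watt")
```

Because the power figure is for the whole system rather than the GPU alone, these ratios compare test configurations, not the cards in isolation.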

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)

Radeon RX 560: 115.81 (SE +/- 0.14, N = 3)
Radeon RX 480: 193.43 (SE +/- 0.23, N = 3)
Radeon RX 580: 207.97 (SE +/- 0.08, N = 3)
Radeon R9 Fury: 245.68 (SE +/- 0.89, N = 3)
Radeon R9 290: 253.64 (SE +/- 1.41, N = 3)
GeForce GTX 1050: 274.91 (SE +/- 1.17, N = 3)
GeForce GTX 960: 277.11 (SE +/- 0.45, N = 3)
GeForce GTX 780 Ti: 287.39 (SE +/- 0.11, N = 3)
GeForce GTX 970: 288.87 (SE +/- 0.24, N = 3)
GeForce GTX 980: 332.11 (SE +/- 1.03, N = 3)
GeForce GTX 980 Ti: 351.45 (SE +/- 0.74, N = 3)
GeForce GTX 1060: 382.21 (SE +/- 0.59, N = 3)
Radeon Vega FE: 422.09 (SE +/- 1.01, N = 3)
GeForce GTX 1070: 454.47 (SE +/- 0.27, N = 3)
GeForce GTX 1080: 520.13 (SE +/- 0.88, N = 3)
GeForce GTX 1080 Ti: 596.82 (SE +/- 0.23, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s Per Watt, More Is Better)

GeForce GTX 780 Ti: 1.20
Radeon R9 Fury: 1.38
Radeon RX 560: 1.39
Radeon R9 290: 1.41
Radeon RX 480: 1.43
Radeon RX 580: 1.61
GeForce GTX 980 Ti: 1.72
GeForce GTX 970: 1.97
GeForce GTX 980: 2.08
GeForce GTX 960: 2.22
GeForce GTX 1050: 3.08
GeForce GTX 1080 Ti: 3.14
GeForce GTX 1060: 3.24
GeForce GTX 1070: 3.28
GeForce GTX 1080: 3.68

SHOC Scalable HeterOgeneous Computing 2015-11-10 - System Power Consumption Monitor (Watts, Fewer Is Better)

GeForce GTX 780 Ti: Min: 94.2 / Avg: 240.33 / Max: 287.5
GeForce GTX 980 Ti: Min: 143.3 / Avg: 204.53 / Max: 231.7
GeForce GTX 1080 Ti: Min: 66.8 / Avg: 190.32 / Max: 229.1
Radeon R9 290: Min: 111.2 / Avg: 180.14 / Max: 214.7
Radeon R9 Fury: Min: 167.4 / Avg: 178.62 / Max: 202
GeForce GTX 980: Min: 55.7 / Avg: 159.95 / Max: 185.7
GeForce GTX 970: Min: 58 / Avg: 146.29 / Max: 167.8
GeForce GTX 1080: Min: 52.2 / Avg: 141.33 / Max: 169.7
GeForce GTX 1070: Min: 50.7 / Avg: 138.56 / Max: 159.1
Radeon RX 480: Min: 69.5 / Avg: 134.82 / Max: 175
Radeon RX 580: Min: 62.8 / Avg: 129.51 / Max: 166.9
GeForce GTX 960: Min: 96.7 / Avg: 124.57 / Max: 141.9
GeForce GTX 1060: Min: 49.9 / Avg: 117.9 / Max: 137.8
GeForce GTX 1050: Min: 44.8 / Avg: 89.24 / Max: 101.5
Radeon RX 560: Min: 51.3 / Avg: 83.15 / Max: 94.2

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)

GeForce GTX 1050: 204.36 (SE +/- 12.49, N = 3)
GeForce GTX 960: 209.15 (SE +/- 0.45, N = 3)
Radeon RX 560: 209.76 (SE +/- 0.06, N = 3)
GeForce GTX 1060: 322.16 (SE +/- 7.21, N = 3)
GeForce GTX 970: 382.02 (SE +/- 6.83, N = 3)
GeForce GTX 780 Ti: 429.87 (SE +/- 21.22, N = 3)
GeForce GTX 980: 447.62 (SE +/- 0.99, N = 3)
Radeon RX 480: 462.35 (SE +/- 0.10, N = 3)
GeForce GTX 1070: 470.12 (SE +/- 8.94, N = 3)
Radeon RX 580: 498.10 (SE +/- 0.07, N = 3)
Radeon R9 Fury: 550.79 (SE +/- 0.06, N = 3)
GeForce GTX 1080: 597.69 (SE +/- 6.81, N = 3)
GeForce GTX 980 Ti: 693.44 (SE +/- 21.12, N = 3)
Radeon Vega FE: 885.04 (SE +/- 0.88, N = 3)
GeForce GTX 1080 Ti: 974.37 (SE +/- 5.16, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)

GeForce GTX 1050: 3.24 (SE +/- 0.00, N = 3)
Radeon RX 560: 3.34 (SE +/- 0.00, N = 3)
GeForce GTX 960: 4.46 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 4.66 (SE +/- 0.01, N = 3)
Radeon R9 290: 6.11 (SE +/- 0.00, N = 3)
GeForce GTX 970: 6.54 (SE +/- 0.00, N = 3)
Radeon RX 480: 7.33 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 7.36 (SE +/- 0.01, N = 3)
GeForce GTX 980: 7.54 (SE +/- 0.00, N = 3)
Radeon RX 580: 7.96 (SE +/- 0.00, N = 3)
Radeon R9 Fury: 9.09 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 9.27 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 10.61 (SE +/- 0.01, N = 3)
GeForce GTX 1080: 14.29 (SE +/- 0.03, N = 3)
Radeon Vega FE: 16.00 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 19.74 (SE +/- 0.06, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10 - System Power Consumption Monitor (Watts, Fewer Is Better)

Cards charted: GeForce GTX 1080 Ti, GeForce GTX 1080, Radeon R9 Fury, GeForce GTX 1070, Radeon R9 290, GeForce GTX 960, GeForce GTX 780 Ti, Radeon RX 480, GeForce GTX 1060, GeForce GTX 1050, Radeon RX 560, GeForce GTX 970, GeForce GTX 980, Radeon RX 580, GeForce GTX 980 Ti

Readings, in chart order (15 cards are named but only 11 readings are present, so they are not paired with cards here):
Min: 63.4 / Avg: 204.7 / Max: 346
Min: 109.8 / Avg: 199.5 / Max: 289.2
Min: 168.7 / Avg: 174.7 / Max: 180.7
Min: 130.1 / Avg: 172.7 / Max: 215.3
Min: 112.9 / Avg: 157.65 / Max: 202.4
Min: 82.3 / Avg: 132.4 / Max: 182.5
Min: 116.8 / Avg: 117.17 / Max: 117.4
Min: 114.5 / Avg: 114.7 / Max: 114.9
Min: 98.4 / Avg: 109.4 / Max: 120.4
Min: 104 / Avg: 105.7 / Max: 107.4
Min: 61.6 / Avg: 65.85 / Max: 70.1

SHOC Scalable HeterOgeneous Computing 2015-11-10 - Target: OpenCL - Benchmark: Triad (GB/s, More Is Better)

Radeon RX 560: 4.95 (SE +/- 0.01, N = 3)
GeForce GTX 1050: 6.12 (SE +/- 0.00, N = 3)
Radeon RX 480: 9.61 (SE +/- 0.04, N = 3)
Radeon RX 580: 9.68 (SE +/- 0.04, N = 3)
Radeon R9 Fury: 10.17 (SE +/- 0.22, N = 3)
GeForce GTX 960: 11.24 (SE +/- 0.00, N = 3)
Radeon Vega FE: 11.82 (SE +/- 0.07, N = 3)
GeForce GTX 970: 11.91 (SE +/- 0.01, N = 3)
GeForce GTX 1060: 11.96 (SE +/- 0.01, N = 3)
GeForce GTX 980: 12.03 (SE +/- 0.00, N = 3)
GeForce GTX 1070: 12.20 (SE +/- 0.00, N = 3)
GeForce GTX 980 Ti: 12.29 (SE +/- 0.00, N = 3)
GeForce GTX 1080: 12.31 (SE +/- 0.00, N = 3)
GeForce GTX 780 Ti: 12.35 (SE +/- 0.00, N = 3)
GeForce GTX 1080 Ti: 12.51 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

System Power Consumption Monitor

System Power Consumption Monitor - Phoronix Test Suite System Monitoring (Watts)

GeForce GTX 780 Ti: Min: 57.6 / Avg: 202.67 / Max: 336.8
GeForce GTX 1080 Ti: Min: 51.8 / Avg: 191.22 / Max: 330.8
GeForce GTX 980 Ti: Min: 60.5 / Avg: 188.98 / Max: 336.2
Radeon R9 Fury: Min: 60.8 / Avg: 177.77 / Max: 346
Radeon R9 290: Min: 108.1 / Avg: 170.72 / Max: 311.4
GeForce GTX 980: Min: 54.8 / Avg: 152.53 / Max: 254.7
GeForce GTX 1080: Min: 48.9 / Avg: 149.85 / Max: 266.2
GeForce GTX 970: Min: 55.7 / Avg: 145.83 / Max: 234.2
Radeon RX 580: Min: 61.1 / Avg: 144.94 / Max: 235
Radeon RX 480: Min: 66.5 / Avg: 142.76 / Max: 205.4
GeForce GTX 1070: Min: 47.8 / Avg: 138.1 / Max: 211.9
GeForce GTX 960: Min: 52.1 / Avg: 123.58 / Max: 210.4
GeForce GTX 1060: Min: 48.3 / Avg: 116.84 / Max: 190.4
GeForce GTX 1050: Min: 44.2 / Avg: 92.91 / Max: 132
Radeon RX 560: Min: 50.2 / Avg: 92.38 / Max: 145.4

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2 - OpenCL LU Factorization (GFLOPS, More Is Better)

Radeon RX 580: 11.98 (SE +/- 0.05, N = 3)
Radeon RX 480: 12.05 (SE +/- 0.06, N = 3)
Radeon RX 560: 12.08 (SE +/- 0.12, N = 3)
Radeon R9 290: 20.97 (SE +/- 0.27, N = 3)
Radeon R9 Fury: 21.76 (SE +/- 0.75, N = 3)
Radeon Vega FE: 30.68 (SE +/- 0.20, N = 3)
GeForce GTX 1050: 41.80 (SE +/- 0.01, N = 3)
GeForce GTX 960: 47.64 (SE +/- 0.00, N = 3)
GeForce GTX 970: 53.03 (SE +/- 0.12, N = 3)
GeForce GTX 1060: 54.03 (SE +/- 0.37, N = 3)
GeForce GTX 780 Ti: 54.59 (SE +/- 2.80, N = 3)
GeForce GTX 980: 54.96 (SE +/- 0.03, N = 3)
GeForce GTX 980 Ti: 56.91 (SE +/- 0.34, N = 3)
GeForce GTX 1070: 58.95 (SE +/- 0.02, N = 3)
GeForce GTX 1080: 61.26 (SE +/- 0.14, N = 3)
GeForce GTX 1080 Ti: 63.67 (SE +/- 0.04, N = 3)

1. (CXX) g++ options: -rdynamic -lOpenCL

ViennaCL 1.4.2 - System Power Consumption Monitor (Watts, Fewer Is Better)

Cards charted: GeForce GTX 1080, GeForce GTX 1080 Ti, GeForce GTX 980 Ti, Radeon R9 290, GeForce GTX 970, GeForce GTX 1070, GeForce GTX 960, GeForce GTX 780 Ti, Radeon RX 480, GeForce GTX 1050, Radeon R9 Fury, GeForce GTX 980, GeForce GTX 1060, Radeon RX 580, Radeon RX 560

Readings, in chart order (15 cards are named but only 14 readings are present, so they are not paired with cards here):
Min: 130.6 / Avg: 146.5 / Max: 162.4
Min: 134.4 / Avg: 142.1 / Max: 149.8
Min: 109.8 / Avg: 132.4 / Max: 158.7
Min: 98.7 / Avg: 125.25 / Max: 151.8
Min: 91.5 / Avg: 111.55 / Max: 131.6
Min: 85.8 / Avg: 108.05 / Max: 130.3
Min: 58.3 / Avg: 105.5 / Max: 152.7
Min: 72.7 / Avg: 93.3 / Max: 104.2
Min: 79.6 / Avg: 86.2 / Max: 92.8
Min: 63.1 / Avg: 86.03 / Max: 126.9
Min: 56 / Avg: 80.3 / Max: 104.6
Min: 68.9 / Avg: 80.05 / Max: 91.2
Min: 62.2 / Avg: 78.77 / Max: 96.1
Min: 50.6 / Avg: 72.03 / Max: 82.1