AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL Comparison

Radeon and NVIDIA Linux OpenCL compute comparison for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1711114-TY-1711111AL70&grs&sro.

AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 TestIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-AIntel Device 3ec216384MB525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GBMSI AMD Radeon RX 580 8192MBRealtek GenericDELL P2415QIntel ConnectionUbuntu 17.044.10.0-38-generic (x86_64)Unity 7.5.0X Server 1.19.3modesetting 1.19.34.5.13496OpenCL 2.0 AMD-APP (2482.3)1.0.42GCC 6.3.0 20170406ext43840x2160Sapphire AMD Radeon 4096MBAMD Radeon RX Vega 8176MBamdgpu 1.3.99NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA 384.984.5.0Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1354/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1480/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (16 Cores)MSI X370 SLI PLUS (MS-7A33) v2.0AMD Device 14502 x 8192 MB DDR4-1200MT/s512GB ADATA SU800 + 120GB KINGSTON SV300S3 + Samsung SSD 960 EVO 250GBMSI NVIDIA GeForce GTX 1060 6GB 6144MB (1569/4006MHz)NVIDIA GP106 HD AudioSAMSUNGRealtek RTL8111/8168/8411Ubuntu 17.104.13.0-16-generic (x86_64)KDE Frameworks 5X Server 1.19.5NVIDIA 387.22GCC 7.2.01920x1080OpenBenchmarking.orgKernel Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: amdgpu.vm_fragment_size=9Compiler Details- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 56: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 64: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GTX 1060 Test: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Details- Radeon RX 580: Scaling Governor: intel_pstate performance- Radeon R9 Fury: Scaling Governor: intel_pstate performance- Radeon RX Vega 56: Scaling Governor: intel_pstate performance- Radeon RX Vega 64: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1050: Scaling Governor: intel_pstate performance- GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1070 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Ti: Scaling Governor: intel_pstate performance- GTX 1060 Test: Scaling Governor: acpi-cpufreq ondemandGraphics Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: GLAMOROpenCL Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GTX 1060 Test: GPU Compute Cores: 1280System Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1070 Ti: GPU Compute Cores: 2432.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.- GTX 1060 Test: GPU Compute Cores: 1280.

AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL Comparisondarktable: Boat - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Max SP Flopsshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPjuliagpu: GPUcl-mem: Readviennacl: OpenCL LU Factorizationcl-mem: Writecl-mem: Copyluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRdarktable: Masskrug - OpenCLshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Triaddarktable: Server Room - OpenCLdarktable: Masskrug - OpenCLdarktable: Boat - OpenCLRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 Test3.570.826261.977.73521.55248225442.30208.4014.13177.30174.9025407268145004.18216.5612.023.081.057141.338.84793.70269983077.70448.2024.55381.60330.7024898352207114.27249.3411.3910702.8014.23917.73331818340.60339.1041.47321.90304.7037501107221871377.5212.4712923.6016.991072.78348619117.20397.2042.58378.20363.9042621263023328431.2911.943.711.516250.359.28652.04138987453.50266.2059.24238.80216.5024958367150694.49351.9812.2624.896.222123.343.24197.206715780895.1043.4186.3087.1011343383658613.46267.7111.4413.385.872702.954.11197.9381114034.5094.2047.5585.2086.8013423706726613.77303.8811.464.231.2848547.31333.30121054859.20153.4056.09140139.1021085380116504.60377.9511.943.381.097256.7610.60481.09149885098.20205.4059.66191.80186.7028717430162434.29451.3812.183.321.099127.6413.80514.97171465266205.5063.24190.60186.5032137747161514.28503.4112.183.211.089593.3814.30582.56176402393.6022963.58215.60208.9031646457128994.29524.6012.282.721.0213411.9019.90990.47202544036.40338.1065.24336.30317.10388410037201634.22598.5812.4620525182115381.395.734.16OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460120180240300Min: 56.9 / Avg: 97.37 / Max: 116.1Min: 50 / Avg: 96.89 / Max: 107Min: 38.9 / Avg: 118.7 / Max: 163.6Min: 41.5 / Avg: 139.43 / Max: 169.9Min: 73.1 / Avg: 152.73 / Max: 248.4Min: 69.1 / Avg: 153.58 / Max: 182.1Min: 114.3 / Avg: 202.15 / Max: 323.2Min: 104 / Avg: 192.19 / Max: 240.9Min: 51.4 / Avg: 193.91 / Max: 296.3Min: 51.8 / Avg: 157.21 / Max: 212.5Min: 62.8 / Avg: 250.03 / Max: 273.6Min: 62.6 / Avg: 311.77 / Max: 357.7

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 58061218243024.8913.384.233.383.323.212.723.713.083.57

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460120180240300Min: 37.2 / Avg: 112.78 / Max: 118.8Min: 37 / Avg: 115.85 / Max: 121.6Min: 90.8 / Avg: 157.64 / Max: 163.2Min: 50.4 / Avg: 191.85 / Max: 202.1Min: 45.2 / Avg: 198.87 / Max: 210.9Min: 40.6 / Avg: 203.13 / Max: 212.7Min: 47.8 / Avg: 259.41 / Max: 276.4Min: 88.6 / Avg: 240.44 / Max: 253.6Min: 55.6 / Avg: 194.55 / Max: 219.8Min: 53.8 / Avg: 180.54 / Max: 202.4Min: 64.1 / Avg: 239.06 / Max: 257.4Min: 67.3 / Avg: 309.79 / Max: 336.4

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460120180240300Min: 36.8 / Avg: 105.98 / Max: 111.1Min: 36.6 / Avg: 106.5 / Max: 111.7Min: 38.6 / Avg: 135.41 / Max: 142.2Min: 41.7 / Avg: 164.43 / Max: 172.7Min: 45.3 / Avg: 170.09 / Max: 179.1Min: 40.7 / Avg: 166.47 / Max: 174.4Min: 47.7 / Avg: 228.36 / Max: 239.7Min: 50.6 / Avg: 228.51 / Max: 239.3Min: 69.1 / Avg: 195.52 / Max: 211Min: 54 / Avg: 168.06 / Max: 180.1Min: 122.2 / Avg: 247.77 / Max: 263.6Min: 68.1 / Avg: 308.3 / Max: 332.6

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 5802468106.225.871.281.091.091.081.021.511.050.82

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460120180240300Min: 56.6 / Avg: 111.62 / Max: 114.8Min: 61.2 / Avg: 113.65 / Max: 117Min: 86.9 / Avg: 149.68 / Max: 155.8Min: 80.3 / Avg: 181.7 / Max: 190.9Min: 85.6 / Avg: 186.58 / Max: 194.4Min: 86 / Avg: 178.27 / Max: 185.1Min: 117.7 / Avg: 247.08 / Max: 256.7Min: 119.9 / Avg: 242.06 / Max: 250.3Min: 94.2 / Avg: 227.63 / Max: 241.9Min: 52.4 / Avg: 182.06 / Max: 201.3Min: 192.4 / Avg: 254.85 / Max: 276.2Min: 64.7 / Avg: 320.36 / Max: 336.9

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 643K6K9K12K15K2123.342702.954854.007256.769127.649593.3813411.906250.357141.336261.9710702.8012923.601. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 645101520253.244.117.3110.6013.8014.3019.909.288.847.7314.2316.991. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 642004006008001000197.20197.93333.30481.09514.97582.56990.47652.04793.70521.55917.731072.781. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6470M140M210M280M350M67157808.0081114034.50121054859.20149885098.20171465266.00176402393.60202544036.40138987453.50269983077.70248225442.30331818340.60348619117.201. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6410020030040050095.1094.20153.40205.40205.50229.00338.10266.20448.20208.40339.10397.201. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64153045607543.4147.5556.0959.6663.2463.5865.2459.2424.5514.1341.4742.581. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 648016024032040086.3085.20140.00191.80190.60215.60336.30238.80381.60177.30321.90378.201. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 648016024032040087.1086.80139.10186.70186.50208.90317.10216.50330.70174.90304.70363.901. (CC) gcc options: -O2 -flto -lOpenCL

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGTX 1060 TestGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 649001800270036004500SE +/- 5.17, N = 32052113413422108287132133164388424952489254037504262

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGTX 1060 TestGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 643K6K9K12K15KSE +/- 1.20, N = 35182338337065380743077476457100378367835272681107212630

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGTX 1060 TestGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 645K10K15K20K25KSE +/- 172.59, N = 5115386586726611650162431615112899201631506920711145002187123328

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 5804812162013.4613.774.604.294.284.294.224.494.274.18

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64153045607521.8127.9040.8952.0559.7662.4666.3532.5236.8339.8342.8141.45

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64130260390520650267.71303.88377.95451.38503.41524.60598.58351.98249.34216.56377.52431.291. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 644812162010.0611.5813.3714.9716.1615.5814.9710.3812.7914.0715.6913.76

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 642040608010059.0163.9377.8389.3986.5672.3681.6162.2590.9979.6585.8272.82

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64102030405031.9234.8039.7345.1945.5538.7943.9536.6242.7243.2544.6940.97

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 64369121511.4411.4611.9412.1812.1812.2812.4612.2611.3912.0212.4711.941. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Server Room - Acceleration: OpenCLGTX 1060 Test0.31280.62560.93841.25121.564SE +/- 0.01, N = 31.39

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Masskrug - Acceleration: OpenCLGTX 1060 Test1.28932.57863.86795.15726.4465SE +/- 0.08, N = 55.73

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Boat - Acceleration: OpenCLGTX 1060 Test0.9361.8722.8083.7444.68SE +/- 0.04, N = 34.16

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 980 TiRadeon R9 FuryRadeon RX 580Radeon RX Vega 56Radeon RX Vega 6460120180240300Min: 35.5 / Avg: 93.14 / Max: 164.4Min: 36.4 / Avg: 91.32 / Max: 148.3Min: 38.5 / Avg: 113.75 / Max: 163.6Min: 40.9 / Avg: 135.13 / Max: 202.1Min: 44.6 / Avg: 143.93 / Max: 248.4Min: 39.9 / Avg: 143.08 / Max: 212.7Min: 45.3 / Avg: 191.14 / Max: 323.2Min: 48.8 / Avg: 183.62 / Max: 282.5Min: 51.4 / Avg: 153.02 / Max: 296.3Min: 51.6 / Avg: 133.26 / Max: 212.5Min: 62.8 / Avg: 197.85 / Max: 276.2Min: 62.6 / Avg: 237.02 / Max: 357.7


Phoronix Test Suite v10.8.4