AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL Comparison

Radeon and NVIDIA Linux OpenCL compute comparison for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1711114-TY-1711111AL70.

AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL ComparisonProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 TestIntel Core i7-8700K @ 4.70GHz (6 Cores / 12 Threads)ASUS PRIME Z370-AIntel Device 3ec216384MB525GB Crucial_CT525MX3 + Samsung SSD 950 PRO 256GBMSI AMD Radeon RX 580 8192MBRealtek GenericDELL P2415QIntel ConnectionUbuntu 17.044.10.0-38-generic (x86_64)Unity 7.5.0X Server 1.19.3modesetting 1.19.34.5.13496OpenCL 2.0 AMD-APP (2482.3)1.0.42GCC 6.3.0 20170406ext43840x2160Sapphire AMD Radeon 4096MBAMD Radeon RX Vega 8176MBamdgpu 1.3.99NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA 384.984.5.0Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1354/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1480/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (16 Cores)MSI X370 SLI PLUS (MS-7A33) v2.0AMD Device 14502 x 8192 MB DDR4-1200MT/s512GB ADATA SU800 + 120GB KINGSTON SV300S3 + Samsung SSD 960 EVO 250GBMSI NVIDIA GeForce GTX 1060 6GB 6144MB (1569/4006MHz)NVIDIA GP106 HD AudioSAMSUNGRealtek RTL8111/8168/8411Ubuntu 17.104.13.0-16-generic (x86_64)KDE Frameworks 5X Server 1.19.5NVIDIA 387.22GCC 7.2.01920x1080OpenBenchmarking.orgKernel Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: amdgpu.vm_fragment_size=9Compiler Details- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 56: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX Vega 64: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GTX 1060 Test: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -vProcessor Details- Radeon RX 580: Scaling Governor: intel_pstate performance- Radeon R9 Fury: Scaling Governor: intel_pstate performance- Radeon RX Vega 56: Scaling Governor: intel_pstate performance- Radeon RX Vega 64: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1050: Scaling Governor: intel_pstate performance- GeForce GTX 1050 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1070 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Ti: Scaling Governor: intel_pstate performance- GTX 1060 Test: Scaling Governor: acpi-cpufreq ondemandGraphics Details- Radeon RX 580, Radeon R9 Fury, Radeon RX Vega 56, Radeon RX Vega 64: GLAMOROpenCL Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1070 Ti: GPU Compute Cores: 2432- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584- GTX 1060 Test: GPU Compute Cores: 1280System Details- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1070 Ti: GPU Compute Cores: 2432.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.- GTX 1060 Test: GPU Compute Cores: 1280.

AMDGPU-PRO 17.40 ROCm, NVIDIA 384.98 12-Way NVIDIA AMD GPU Linux OpenCL Comparisonshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Triaddarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLluxmark: GPU - Luxball HDRluxmark: GPU - Microphoneluxmark: GPU - Hotelcl-mem: Readcl-mem: Writecl-mem: Copyjuliagpu: GPUviennacl: OpenCL LU Factorizationdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 Test6261.97216.56521.557.7312.023.574.180.821450072682540208.40177.30174.90248225442.3014.137141.33249.34793.708.8411.393.084.271.052071183522489448.20381.60330.70269983077.7024.5510702.80377.52917.7314.2312.4721871110723750339.10321.90304.70331818340.6041.4712923.60431.291072.7816.9911.9423328126304262397.20378.20363.90348619117.2042.586250.35351.98652.049.2812.263.714.491.511506983672495266.20238.80216.50138987453.5059.242123.34267.71197.203.2411.4424.8913.466.2265863383113495.1086.3087.106715780843.412702.95303.88197.934.1111.4613.3813.775.8772663706134294.2085.2086.8081114034.5047.554854377.95333.307.3111.944.234.601.281165053802108153.40140139.10121054859.2056.097256.76451.38481.0910.6012.183.384.291.091624374302871205.40191.80186.70149885098.2059.669127.64503.41514.9713.8012.183.324.281.091615177473213205.50190.60186.5017146526663.249593.38524.60582.5614.3012.283.214.291.081289964573164229215.60208.90176402393.6063.5813411.90598.58990.4719.9012.462.724.221.0220163100373884338.10336.30317.10202544036.4065.2411538518220524.165.731.39OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti3K6K9K12K15K6261.977141.3310702.8012923.606250.352123.342702.954854.007256.769127.649593.3813411.901. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti153045607539.8336.8342.8141.4532.5221.8127.9040.8952.0559.7662.4666.35

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60120180240300Min: 51.8 / Avg: 157.21 / Max: 212.5Min: 51.4 / Avg: 193.91 / Max: 296.3Min: 62.8 / Avg: 250.03 / Max: 273.6Min: 62.6 / Avg: 311.77 / Max: 357.7Min: 104 / Avg: 192.19 / Max: 240.9Min: 56.9 / Avg: 97.37 / Max: 116.1Min: 50 / Avg: 96.89 / Max: 107Min: 38.9 / Avg: 118.7 / Max: 163.6Min: 41.5 / Avg: 139.43 / Max: 169.9Min: 73.1 / Avg: 152.73 / Max: 248.4Min: 69.1 / Avg: 153.58 / Max: 182.1Min: 114.3 / Avg: 202.15 / Max: 323.2

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti130260390520650216.56249.34377.52431.29351.98267.71303.88377.95451.38503.41524.60598.581. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti2004006008001000521.55793.70917.731072.78652.04197.20197.93333.30481.09514.97582.56990.471. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti5101520257.738.8414.2316.999.283.244.117.3110.6013.8014.3019.901. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti369121512.0211.3912.4711.9412.2611.4411.4611.9412.1812.1812.2812.461. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLRadeon RX 580Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti6121824303.573.083.7124.8913.384.233.383.323.212.72

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLRadeon RX 580Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti481216204.184.274.4913.4613.774.604.294.284.294.22

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLRadeon RX 580Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti2468100.821.051.516.225.871.281.091.091.081.02

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 Test5K10K15K20K25KSE +/- 172.59, N = 5145002071121871233281506965867266116501624316151128992016311538

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti2040608010079.6590.9985.8272.8262.2559.0163.9377.8389.3986.5672.3681.61

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60120180240300Min: 52.4 / Avg: 182.06 / Max: 201.3Min: 94.2 / Avg: 227.63 / Max: 241.9Min: 192.4 / Avg: 254.85 / Max: 276.2Min: 64.7 / Avg: 320.36 / Max: 336.9Min: 119.9 / Avg: 242.06 / Max: 250.3Min: 56.6 / Avg: 111.62 / Max: 114.8Min: 61.2 / Avg: 113.65 / Max: 117Min: 86.9 / Avg: 149.68 / Max: 155.8Min: 80.3 / Avg: 181.7 / Max: 190.9Min: 85.6 / Avg: 186.58 / Max: 194.4Min: 86 / Avg: 178.27 / Max: 185.1Min: 117.7 / Avg: 247.08 / Max: 256.7

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 Test3K6K9K12K15KSE +/- 1.20, N = 37268835211072126308367338337065380743077476457100375182

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti102030405043.2542.7244.6940.9736.6231.9234.8039.7345.1945.5538.7943.95

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60120180240300Min: 54 / Avg: 168.06 / Max: 180.1Min: 69.1 / Avg: 195.52 / Max: 211Min: 122.2 / Avg: 247.77 / Max: 263.6Min: 68.1 / Avg: 308.3 / Max: 332.6Min: 50.6 / Avg: 228.51 / Max: 239.3Min: 36.8 / Avg: 105.98 / Max: 111.1Min: 36.6 / Avg: 106.5 / Max: 111.7Min: 38.6 / Avg: 135.41 / Max: 142.2Min: 41.7 / Avg: 164.43 / Max: 172.7Min: 45.3 / Avg: 170.09 / Max: 179.1Min: 40.7 / Avg: 166.47 / Max: 174.4Min: 47.7 / Avg: 228.36 / Max: 239.7

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 TiGTX 1060 Test9001800270036004500SE +/- 5.17, N = 32540248937504262249511341342210828713213316438842052

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore Per Watt, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti4812162014.0712.7915.6913.7610.3810.0611.5813.3714.9716.1615.5814.97

LuxMark

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterLuxMark 3.0System Power Consumption MonitorRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60120180240300Min: 53.8 / Avg: 180.54 / Max: 202.4Min: 55.6 / Avg: 194.55 / Max: 219.8Min: 64.1 / Avg: 239.06 / Max: 257.4Min: 67.3 / Avg: 309.79 / Max: 336.4Min: 88.6 / Avg: 240.44 / Max: 253.6Min: 37.2 / Avg: 112.78 / Max: 118.8Min: 37 / Avg: 115.85 / Max: 121.6Min: 90.8 / Avg: 157.64 / Max: 163.2Min: 50.4 / Avg: 191.85 / Max: 202.1Min: 45.2 / Avg: 198.87 / Max: 210.9Min: 40.6 / Avg: 203.13 / Max: 212.7Min: 47.8 / Avg: 259.41 / Max: 276.4

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti100200300400500208.40448.20339.10397.20266.2095.1094.20153.40205.40205.50229.00338.101. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti80160240320400177.30381.60321.90378.20238.8086.3085.20140.00191.80190.60215.60336.301. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti80160240320400174.90330.70304.70363.90216.5087.1086.80139.10186.70186.50208.90317.101. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPURadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti70M140M210M280M350M248225442.30269983077.70331818340.60348619117.20138987453.5067157808.0081114034.50121054859.20149885098.20171465266.00176402393.60202544036.401. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

ViennaCL

OpenCL LU Factorization

OpenBenchmarking.orgGFLOPS, More Is BetterViennaCL 1.4.2OpenCL LU FactorizationRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti153045607514.1324.5541.4742.5859.2443.4147.5556.0959.6663.2463.5865.241. (CXX) g++ options: -rdynamic -lOpenCL

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringRadeon RX 580Radeon R9 FuryRadeon RX Vega 56Radeon RX Vega 64GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1070 TiGeForce GTX 1080GeForce GTX 1080 Ti60120180240300Min: 51.6 / Avg: 133.26 / Max: 212.5Min: 51.4 / Avg: 153.02 / Max: 296.3Min: 62.8 / Avg: 197.85 / Max: 276.2Min: 62.6 / Avg: 237.02 / Max: 357.7Min: 48.8 / Avg: 183.62 / Max: 282.5Min: 35.5 / Avg: 93.14 / Max: 164.4Min: 36.4 / Avg: 91.32 / Max: 148.3Min: 38.5 / Avg: 113.75 / Max: 163.6Min: 40.9 / Avg: 135.13 / Max: 202.1Min: 44.6 / Avg: 143.93 / Max: 248.4Min: 39.9 / Avg: 143.08 / Max: 212.7Min: 45.3 / Avg: 191.14 / Max: 323.2

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Boat - Acceleration: OpenCLGTX 1060 Test0.9361.8722.8083.7444.68SE +/- 0.04, N = 34.16

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Masskrug - Acceleration: OpenCLGTX 1060 Test1.28932.57863.86795.15726.4465SE +/- 0.08, N = 55.73

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.5Test: Server Room - Acceleration: OpenCLGTX 1060 Test0.31280.62560.93841.25121.564SE +/- 0.01, N = 31.39


Phoronix Test Suite v10.8.4