Radeon ROCm 2.0 OpenCL Compute Versus NVIDIA Linux

ROCm 2.0 Linux GPGPU/compute benchmarks for a future article on Phoronix.com by Michael Larabel.

TITAN RTX

Processor: Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads), Motherboard: ASUS PRIME Z390-A (0602 BIOS), Chipset: Intel Cannon Lake PCH Shared SRAM, Memory: 16384MB, Disk: 2000GB SABRENT + Samsung SSD 970 EVO 250GB, Graphics: NVIDIA TITAN RTX 24GB (1350/7000MHz), Audio: Realtek ALC1220, Monitor: Acer B286HK, Network: Intel Connection

OS: Ubuntu 18.04, Kernel: 4.19.5-041905-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.19.6, Display Driver: NVIDIA 415.23, OpenGL: 4.6.0, OpenCL: OpenCL 1.2 CUDA 10.0.132, Vulkan: 1.1.84, Compiler: GCC 7.3.0 + LLVM 6.0.0 + CUDA 10.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance
OpenCL Notes: GPU Compute Cores: 4608
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp

GTX TITAN X GM200

Changed Graphics to NVIDIA GeForce GTX TITAN X 12GB (1001/3505MHz).

OpenCL Change: GPU Compute Cores: 3072

GTX 1080

Changed Graphics to NVIDIA GeForce GTX 1080 8GB (1607/5005MHz).

OpenCL Change: GPU Compute Cores: 2560

RTX 2080

Changed Graphics to Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz).

OpenCL Change: GPU Compute Cores: 2944

GTX 1080 Ti

Changed Graphics to NVIDIA GeForce GTX 1080 Ti 11GB (1480/5508MHz).

OpenCL Change: GPU Compute Cores: 3584

GTX 1070

Changed Graphics to NVIDIA GeForce GTX 1070 8GB (1506/4006MHz).

OpenCL Change: GPU Compute Cores: 1920

GTX 980 Ti

Changed Graphics to NVIDIA GeForce GTX 980 Ti 6GB (999/3505MHz).

OpenCL Change: GPU Compute Cores: 2816

RTX 2080 Ti

Changed Graphics to NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz).

OpenCL Change: GPU Compute Cores: 4352

RX Vega 64

Processor: Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads), Motherboard: ASUS PRIME Z390-A (0602 BIOS), Chipset: Intel Cannon Lake PCH Shared SRAM, Memory: 16384MB, Disk: 2000GB SABRENT + Samsung SSD 970 EVO 250GB, Graphics: AMD Radeon RX Vega 8GB (1630/945MHz), Audio: Realtek ALC1220, Monitor: Acer B286HK, Network: Intel Connection

OS: Ubuntu 18.04, Kernel: 4.15.0-43-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.19.6, OpenGL: 4.5 Mesa 19.0.0-devel (git-17218a0406) (LLVM 8.0.0), OpenCL: OpenCL 2.1 AMD-APP (2783.0), Vulkan: 1.1.90, Compiler: GCC 7.3.0 + LLVM 6.0.0 + CUDA 10.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp

RX 580

Processor: Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads), Motherboard: ASUS PRIME Z390-A (0602 BIOS), Chipset: Intel Cannon Lake PCH Shared SRAM, Memory: 16384MB, Disk: 2000GB SABRENT + Samsung SSD 970 EVO 250GB, Graphics: MSI AMD Radeon RX 470/480 8GB (1366/2000MHz), Audio: Realtek ALC1220, Monitor: Acer B286HK, Network: Intel Connection

OS: Ubuntu 18.04, Kernel: 4.19.5-041905-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.19.6, OpenGL: 4.5 Mesa 19.0.0-devel (git-17218a0406) (LLVM 8.0.0), OpenCL: OpenCL 2.1 AMD-APP (2783.0), Vulkan: 1.1.90, Compiler: GCC 7.3.0 + LLVM 6.0.0 + CUDA 10.0, File-System: ext4, Screen Resolution: 3840x2160

RX Vega 56

Processor: Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads), Motherboard: ASUS PRIME Z390-A (0602 BIOS), Chipset: Intel Cannon Lake PCH Shared SRAM, Memory: 16384MB, Disk: 2000GB SABRENT + Samsung SSD 970 EVO 250GB, Graphics: AMD Radeon RX Vega 8GB (1590/800MHz), Audio: Realtek ALC1220, Monitor: Acer B286HK, Network: Intel Connection

GTX 1060

Processor: Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads), Motherboard: ASUS PRIME Z390-A (0602 BIOS), Chipset: Intel Cannon Lake PCH Shared SRAM, Memory: 16384MB, Disk: 2000GB SABRENT + Samsung SSD 970 EVO 250GB, Graphics: NVIDIA GeForce GTX 1060 6GB (1506/4006MHz), Audio: Realtek ALC1220, Monitor: Acer B286HK, Network: Intel Connection

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance
OpenCL Notes: GPU Compute Cores: 1280
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp

Radeon ROCm 2.0 OpenCL Compute Versus NVIDIA Linux

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

TITAN RTX

GTX TITAN X GM200

GTX 1080

RTX 2080

GTX 1080 Ti

GTX 1070

GTX 980 Ti

RTX 2080 Ti

RX Vega 64

RX 580

RX Vega 56

GTX 1060

Darktable

Parboil

Rodinia

Mixbench

clpeak

FAHBench

LuxMark

SHOC Scalable HeterOgeneous Computing

cl-mem

13 Results Shown

TITAN RTX

GTX TITAN X GM200

GTX 1080

RTX 2080

GTX 1080 Ti

GTX 1070

GTX 980 Ti

RTX 2080 Ti

RX Vega 64

RX 580

RX Vega 56

GTX 1060