OpenCL ROCm 2.0 Vs. AMDGPU-PRO Linux Benchmarks

Radeon RX Vega 64 ROCm 2.0 OpenCL versus PAL OpenCL driver in AMDGPU-PRO 18.50. Benchmarks by Michael Larabel for a future article on Phoronix.com.

ROCm 2.0

Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads), Motherboard: ASUS ROG ZENITH EXTREME (1601 BIOS), Chipset: AMD Family 17h, Memory: 32768MB, Disk: 16GB Voyager 3.0 + Samsung SSD 970 EVO 500GB, Graphics: AMD Radeon RX Vega 8GB (1630/945MHz), Audio: Realtek ALC1220, Monitor: ASUS VP28U, Network: Intel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11ad

OS: Ubuntu 18.04, Kernel: 4.15.0-43-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.19.6, Display Driver: amdgpu 18.0.1, OpenGL: 4.5 Mesa 18.0.5 (LLVM 6.0.0), Compiler: GCC 7.3.0, File-System: ext4, Screen Resolution: 3840x2160

Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq ondemand
Graphics Notes: GLAMOR
Python Notes: Python 2.7.15rc1 + Python 3.6.7
Security Notes: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp

AMDGPU-PRO 18.50 PAL

OS: Ubuntu 18.04, Kernel: 4.15.0-43-generic (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.19.6, Display Driver: amdgpu 18.1.99, OpenGL: 4.6.13542, Compiler: GCC 7.3.0, File-System: ext4, Screen Resolution: 3840x2160

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

clpeak

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

Darktable

Darktable is an open-source photography / workflow application this will use any system-installed Darktable program or on Windows will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

Darktable

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

Darktable

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

OpenCL ROCm 2.0 vs. AMDGPU-PRO Linux

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

ROCm 2.0

AMDGPU-PRO 18.50 PAL

PlaidML

LeelaChessZero

PlaidML

clpeak

SHOC Scalable HeterOgeneous Computing

Darktable

PlaidML

JuliaGPU

Darktable

cl-mem

Darktable

Rodinia

SHOC Scalable HeterOgeneous Computing

Darktable

27 Results Shown

ROCm 2.0

AMDGPU-PRO 18.50 PAL