OpenCL ROCm vs. AMDGPU-PRO 17.40 + 2MB FS

AMDGPU-PRO 17.40 vs. ROCm. Linux OpenCL compute performance. Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1710282-AL-OPENCLNEW89&sor.

ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionAMDGPU-PRO 17.40ROCm RX 560 RX 580 R9 Fury RX Vega 56 RX Vega 64 RX 560 RX 580 R9 Fury RX Vega 56 RX Vega 64Intel Core i7-7700K @ 4.50GHz (4 Cores / 8 Threads)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 591f16384MB128GB THNSN5128GPU7 TOSHIBAAMD Radeon RX 560 4096MBRealtek ALC892DELL P2415QRealtek RTL8111/8168/8411Ubuntu 16.044.10.0-37-generic (x86_64)Unity 7.4.0amdgpu 1.3.994.5.13496OpenCL 2.0 AMD-APP (2482.3)GCC 5.4.0 20160609ext43840x2160MSI AMD Radeon RX 580 8192MBmodesetting 1.19.3Sapphire AMD Radeon 4096MBAMD Radeon RX Vega 8176MBamdgpu 1.3.99AMD POLARIS11 4096MB4.11.0-kfd-compute-rocm-rel-1.6-180 (x86_64)amdgpu 1.3.04.5 Mesa 17.0.7 Gallium 0.4 (LLVM 4.0.0)OpenCL 2.0 AMD-APP (2508.0)MSI AMD POLARIS10 8192MBmodesetting 1.19.3Sapphire AMD Radeon R9 FURY / NANO 4096MBLLVMpipe3.3 Mesa 17.0.7 Gallium 0.4 (LLVM 4.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate performanceGraphics Details- AMDGPU-PRO 17.40: RX 560, AMDGPU-PRO 17.40: RX 580, AMDGPU-PRO 17.40: R9 Fury, AMDGPU-PRO 17.40: RX Vega 56, AMDGPU-PRO 17.40: RX Vega 64, ROCm: RX 560, ROCm: RX 580, ROCm: R9 Fury: GLAMOREnvironment Details- ROCm: RX Vega 56, ROCm: RX Vega 64: LIBGL_ALWAYS_SOFTWARE=1

cl-mem: Copycl-mem: Readcl-mem: Writeethminer: GPU OpenCLluxmark: GPU - Hotelluxmark: GPU - Luxball HDRAMDGPU-PRO 17.40ROCm RX 560 RX 580 R9 Fury RX Vega 56 RX Vega 64 RX 560 RX 580 R9 Fury RX Vega 56 RX Vega 6477.0790.2080.301091247210906314173.20208.33177.9721963298252914622330.10447.90374.0725445444246820764306.63337.00321.6029370321370821543363.77397.50378.233230924742442217880.9793.5080.8384978335846309184.27166.43180.3316895180138414481206.30123.50390.9724804647156119647204.17178.47317.5029552366156123058222.20189.13389.1332990822183523626OpenBenchmarking.org

cl-mem

Benchmark: Copy

AMDGPU-PRO 17.40ROCmOpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRX Vega 64R9 FuryRX Vega 56RX 580RX 56080160240320400SE +/- 0.03, N = 3SE +/- 1.55, N = 3SE +/- 1.56, N = 3SE +/- 0.75, N = 3SE +/- 0.52, N = 3SE +/- 0.10, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3363.77330.10306.63173.2077.07222.20206.30204.17184.2780.971. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

AMDGPU-PRO 17.40ROCmOpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadR9 FuryRX Vega 64RX Vega 56RX 580RX 560100200300400500SE +/- 0.12, N = 3SE +/- 0.00, N = 3SE +/- 1.71, N = 3SE +/- 0.20, N = 3SE +/- 0.66, N = 3SE +/- 0.19, N = 3SE +/- 0.15, N = 3SE +/- 1.95, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3447.90397.50337.00208.3390.20189.13178.47166.43123.5093.501. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

ROCmAMDGPU-PRO 17.40OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteR9 FuryRX Vega 64RX Vega 56RX 580RX 56080160240320400SE +/- 3.08, N = 3SE +/- 0.09, N = 3SE +/- 3.10, N = 3SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 2.49, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.06, N = 3390.97389.13317.50180.3380.83378.23374.07321.60177.9780.301. (CC) gcc options: -O2 -flto -lOpenCL

Ethereum Ethminer

Device: GPU OpenCL

ROCmAMDGPU-PRO 17.40OpenBenchmarking.orgH/s, More Is BetterEthereum Ethminer 1.2.9Device: GPU OpenCLRX Vega 64RX Vega 56R9 FuryRX 580RX 5607M14M21M28M35MSE +/- 578374.82, N = 3SE +/- 126928.78, N = 3SE +/- 121114.30, N = 3SE +/- 65584.53, N = 3SE +/- 39402.49, N = 3SE +/- 446721.48, N = 3SE +/- 131902.80, N = 3SE +/- 100519.99, N = 3SE +/- 51510.63, N = 3SE +/- 39105.19, N = 33299082229552366248046471689518084978333230924729370321254454442196329810912472

LuxMark

OpenCL Device: GPU - Scene: Hotel

AMDGPU-PRO 17.40ROCmOpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelRX Vega 64RX Vega 56RX 580R9 FuryRX 5609001800270036004500SE +/- 15.59, N = 3SE +/- 20.37, N = 3SE +/- 4.81, N = 3SE +/- 8.17, N = 3SE +/- 2.33, N = 3SE +/- 7.22, N = 3SE +/- 3.18, N = 3SE +/- 3.00, N = 3SE +/- 2.85, N = 3SE +/- 1.20, N = 3424437082529246810901835156115611384584

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

ROCmAMDGPU-PRO 17.40OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRRX Vega 64RX Vega 56R9 FuryRX 580RX 5605K10K15K20K25KSE +/- 157.33, N = 3SE +/- 15.19, N = 3SE +/- 99.50, N = 3SE +/- 17.85, N = 3SE +/- 12.14, N = 3SE +/- 194.29, N = 3SE +/- 3.84, N = 3SE +/- 112.25, N = 3SE +/- 62.00, N = 3SE +/- 8.76, N = 3236262305819647144816309221782154320764146226314


Phoronix Test Suite v10.8.4