AMDGPU-PRO 17.40 vs. ROCm. Linux OpenCL compute performance. Tests for a future article.
Compare your own system(s) to this result file with the Phoronix Test Suite by running the command:

    phoronix-test-suite benchmark 1710282-AL-OPENCLNEW89

OpenCL ROCm vs. AMDGPU-PRO 17.40 + 2MB FS
HTML result view exported from: https://openbenchmarking.org/result/1710282-AL-OPENCLNEW89&rdt&grr .
System configuration

Configurations tested:
  AMDGPU-PRO 17.40: RX Vega 64, R9 Fury, RX 560, RX 580, RX Vega 56
  ROCm:             RX 580, RX 560, R9 Fury, RX Vega 56, RX Vega 64

Base configuration (AMDGPU-PRO 17.40: RX Vega 64):
  Processor:          Intel Core i7-7700K @ 4.50GHz (4 Cores / 8 Threads)
  Motherboard:        MSI Z270-A PRO (MS-7A71) v1.0
  Chipset:            Intel Device 591f
  Memory:             16384MB
  Disk:               128GB THNSN5128GPU7 TOSHIBA
  Graphics:           AMD Radeon RX Vega 8176MB
  Audio:              Realtek ALC892
  Monitor:            DELL P2415Q
  Network:            Realtek RTL8111/8168/8411
  OS:                 Ubuntu 16.04
  Kernel:             4.10.0-37-generic (x86_64)
  Desktop:            Unity 7.4.0
  Display Driver:     amdgpu 1.3.99
  OpenGL:             4.5.13496
  OpenCL:             OpenCL 2.0 AMD-APP (2482.3)
  Compiler:           GCC 5.4.0 20160609
  File-System:        ext4
  Screen Resolution:  3840x2160

Later configurations list only the fields that differ from the one before:
  AMDGPU-PRO 17.40: R9 Fury
    Graphics:         Sapphire AMD Radeon 4096MB
    Display Driver:   modesetting 1.19.3
  AMDGPU-PRO 17.40: RX 560
    Graphics:         AMD Radeon RX 560 4096MB
    Display Driver:   amdgpu 1.3.99
  AMDGPU-PRO 17.40: RX 580
    Graphics:         MSI AMD Radeon RX 580 8192MB
    Display Driver:   modesetting 1.19.3
  AMDGPU-PRO 17.40: RX Vega 56
    Graphics:         AMD Radeon RX Vega 8176MB
    Display Driver:   amdgpu 1.3.99
  ROCm: RX 580
    Graphics:         MSI AMD POLARIS10 8192MB
    Kernel:           4.11.0-kfd-compute-rocm-rel-1.6-180 (x86_64)
    Display Driver:   modesetting 1.19.3
    OpenGL:           4.5 Mesa 17.0.7 Gallium 0.4 (LLVM 4.0.0)
    OpenCL:           OpenCL 2.0 AMD-APP (2508.0)
  ROCm: RX 560
    Graphics:         AMD POLARIS11 4096MB
    Display Driver:   amdgpu 1.3.0
  ROCm: R9 Fury
    Graphics:         Sapphire AMD Radeon R9 FURY / NANO 4096MB
    Display Driver:   modesetting 1.19.3
  ROCm: RX Vega 56, ROCm: RX Vega 64
    Graphics:         LLVMpipe
    OpenGL:           3.3 Mesa 17.0.7 Gallium 0.4 (LLVM 4.0 256 bits)

Compiler Details
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify
  --disable-werror --enable-checking=release --enable-clocale=gnu
  --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk
  --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++
  --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes
  --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc
  --enable-plugin --enable-shared --enable-threads=posix
  --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64
  --with-arch-32=i686 --with-arch-directory=amd64
  --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32
  --with-tune=generic -v

Processor Details
  Scaling Governor: intel_pstate performance

Graphics Details
  GLAMOR: AMDGPU-PRO 17.40: RX Vega 64, AMDGPU-PRO 17.40: R9 Fury,
          AMDGPU-PRO 17.40: RX 560, AMDGPU-PRO 17.40: RX 580,
          AMDGPU-PRO 17.40: RX Vega 56, ROCm: RX 580, ROCm: RX 560,
          ROCm: R9 Fury

Environment Details
  LIBGL_ALWAYS_SOFTWARE=1: ROCm: RX Vega 56, ROCm: RX Vega 64
Results summary

  Configuration                  luxmark:      luxmark:   ethminer:    cl-mem:  cl-mem:  cl-mem:
                                 GPU -         GPU -      GPU OpenCL   Write    Read     Copy
                                 Luxball HDR   Hotel
  AMDGPU-PRO 17.40: RX Vega 64   22178         4244       32309247     378.23   397.50   363.77
  AMDGPU-PRO 17.40: R9 Fury      20764         2468       25445444     374.07   447.90   330.10
  AMDGPU-PRO 17.40: RX 560       6314          1090       10912472     80.30    90.20    77.07
  AMDGPU-PRO 17.40: RX 580       14622         2529       21963298     177.97   208.33   173.20
  AMDGPU-PRO 17.40: RX Vega 56   21543         3708       29370321     321.60   337.00   306.63
  ROCm: RX 580                   14481         1384       16895180     180.33   166.43   184.27
  ROCm: RX 560                   6309          584        8497833      80.83    93.50    80.97
  ROCm: R9 Fury                  19647         1561       24804647     390.97   123.50   206.30
  ROCm: RX Vega 56               23058         1561       29552366     317.50   178.47   204.17
  ROCm: RX Vega 64               23626         1835       32990822     389.13   189.13   222.20
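The summary figures above can be compared stack against stack with a little arithmetic. The following is a minimal sketch, not part of the Phoronix Test Suite: it copies the summary values into two dictionaries and expresses each ROCm result as a percentage of the AMDGPU-PRO 17.40 result for the same card. The names `TESTS`, `AMDGPU_PRO`, `ROCM`, and `rocm_vs_pro` are hypothetical and chosen only for illustration. Since every test here is reported as "More Is Better", a value above 100 means ROCm scored higher.

```python
# Test names in the column order of the summary table.
TESTS = [
    "luxmark: GPU - Luxball HDR",
    "luxmark: GPU - Hotel",
    "ethminer: GPU OpenCL",
    "cl-mem: Write",
    "cl-mem: Read",
    "cl-mem: Copy",
]

# Results copied from the summary table, one list per card.
AMDGPU_PRO = {
    "RX Vega 64": [22178, 4244, 32309247, 378.23, 397.50, 363.77],
    "R9 Fury":    [20764, 2468, 25445444, 374.07, 447.90, 330.10],
    "RX 560":     [6314, 1090, 10912472, 80.30, 90.20, 77.07],
    "RX 580":     [14622, 2529, 21963298, 177.97, 208.33, 173.20],
    "RX Vega 56": [21543, 3708, 29370321, 321.60, 337.00, 306.63],
}

ROCM = {
    "RX 580":     [14481, 1384, 16895180, 180.33, 166.43, 184.27],
    "RX 560":     [6309, 584, 8497833, 80.83, 93.50, 80.97],
    "R9 Fury":    [19647, 1561, 24804647, 390.97, 123.50, 206.30],
    "RX Vega 56": [23058, 1561, 29552366, 317.50, 178.47, 204.17],
    "RX Vega 64": [23626, 1835, 32990822, 389.13, 189.13, 222.20],
}


def rocm_vs_pro(gpu):
    """Return each ROCm result as a percentage of the AMDGPU-PRO result."""
    return {
        test: round(100.0 * rocm / pro, 1)
        for test, rocm, pro in zip(TESTS, ROCM[gpu], AMDGPU_PRO[gpu])
    }


if __name__ == "__main__":
    for gpu in AMDGPU_PRO:
        print(gpu)
        for test, pct in rocm_vs_pro(gpu).items():
            print(f"  {test}: {pct}%")
```

Running the script prints one block per card. The ratios are computed from the mean values only, so differences of a percent or two should be read alongside the standard errors reported for each test.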
LuxMark 3.0 - OpenCL Device: GPU - Scene: Luxball HDR
Score, More Is Better

  Configuration                  Score    SE +/-    N
  AMDGPU-PRO 17.40: RX Vega 64   22178    194.29    3
  AMDGPU-PRO 17.40: R9 Fury      20764    112.25    3
  AMDGPU-PRO 17.40: RX 560       6314     8.76      3
  AMDGPU-PRO 17.40: RX 580       14622    62.00     3
  AMDGPU-PRO 17.40: RX Vega 56   21543    3.84      3
  ROCm: RX 580                   14481    17.85     3
  ROCm: RX 560                   6309     12.14     3
  ROCm: R9 Fury                  19647    99.50     3
  ROCm: RX Vega 56               23058    15.19     3
  ROCm: RX Vega 64               23626    157.33    3
LuxMark 3.0 - OpenCL Device: GPU - Scene: Hotel
Score, More Is Better

  Configuration                  Score    SE +/-    N
  AMDGPU-PRO 17.40: RX Vega 64   4244     15.59     3
  AMDGPU-PRO 17.40: R9 Fury      2468     8.17      3
  AMDGPU-PRO 17.40: RX 560       1090     2.33      3
  AMDGPU-PRO 17.40: RX 580       2529     4.81      3
  AMDGPU-PRO 17.40: RX Vega 56   3708     20.37     3
  ROCm: RX 580                   1384     2.85      3
  ROCm: RX 560                   584      1.20      3
  ROCm: R9 Fury                  1561     3.00      3
  ROCm: RX Vega 56               1561     3.18      3
  ROCm: RX Vega 64               1835     7.22      3
Ethereum Ethminer 1.2.9 - Device: GPU OpenCL
H/s, More Is Better

  Configuration                  H/s         SE +/-       N
  AMDGPU-PRO 17.40: RX Vega 64   32309247    446721.48    3
  AMDGPU-PRO 17.40: R9 Fury      25445444    100519.99    3
  AMDGPU-PRO 17.40: RX 560       10912472    39105.19     3
  AMDGPU-PRO 17.40: RX 580       21963298    51510.63     3
  AMDGPU-PRO 17.40: RX Vega 56   29370321    131902.80    3
  ROCm: RX 580                   16895180    65584.53     3
  ROCm: RX 560                   8497833     39402.49     3
  ROCm: R9 Fury                  24804647    121114.30    3
  ROCm: RX Vega 56               29552366    126928.78    3
  ROCm: RX Vega 64               32990822    578374.82    3
cl-mem 2017-01-13 - Benchmark: Write
GB/s, More Is Better

  Configuration                  GB/s      SE +/-    N
  AMDGPU-PRO 17.40: RX Vega 64   378.23    0.03      3
  AMDGPU-PRO 17.40: R9 Fury      374.07    2.49      3
  AMDGPU-PRO 17.40: RX 560       80.30     0.06      3
  AMDGPU-PRO 17.40: RX 580       177.97    0.33      3
  AMDGPU-PRO 17.40: RX Vega 56   321.60    0.00      3
  ROCm: RX 580                   180.33    0.26      3
  ROCm: RX 560                   80.83     0.03      3
  ROCm: R9 Fury                  390.97    3.08      3
  ROCm: RX Vega 56               317.50    3.10      3
  ROCm: RX Vega 64               389.13    0.09      3

  1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem 2017-01-13 - Benchmark: Read
GB/s, More Is Better

  Configuration                  GB/s      SE +/-    N
  AMDGPU-PRO 17.40: RX Vega 64   397.50    0.00      3
  AMDGPU-PRO 17.40: R9 Fury      447.90    0.12      3
  AMDGPU-PRO 17.40: RX 560       90.20     0.66      3
  AMDGPU-PRO 17.40: RX 580       208.33    0.20      3
  AMDGPU-PRO 17.40: RX Vega 56   337.00    1.71      3
  ROCm: RX 580                   166.43    1.95      3
  ROCm: RX 560                   93.50     0.00      3
  ROCm: R9 Fury                  123.50    0.06      3
  ROCm: RX Vega 56               178.47    0.15      3
  ROCm: RX Vega 64               189.13    0.19      3

  1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem 2017-01-13 - Benchmark: Copy
GB/s, More Is Better

  Configuration                  GB/s      SE +/-    N
  AMDGPU-PRO 17.40: RX Vega 64   363.77    0.03      3
  AMDGPU-PRO 17.40: R9 Fury      330.10    1.55      3
  AMDGPU-PRO 17.40: RX 560       77.07     0.52      3
  AMDGPU-PRO 17.40: RX 580       173.20    0.75      3
  AMDGPU-PRO 17.40: RX Vega 56   306.63    1.56      3
  ROCm: RX 580                   184.27    0.03      3
  ROCm: RX 560                   80.97     0.09      3
  ROCm: R9 Fury                  206.30    0.15      3
  ROCm: RX Vega 56               204.17    0.07      3
  ROCm: RX Vega 64               222.20    0.10      3

  1. (CC) gcc options: -O2 -flto -lOpenCL
Phoronix Test Suite v10.8.4