OpenCL Radeon Open-Source vs. Catalyst, hd7970
Radeon R600g and RadeonSI Gallium3D Clover OpenCL benchmarks with clpeak on Linux 4.5 + Mesa 11.2-devel, compared against Catalyst. Benchmark results by Michael Larabel of Phoronix.com for a future article.
hd7970: Intel Core i5-3330 testing with an ASRock H61 Pro BTC and Intel HD 2500 3072MB on SUSE LINUX 42.1 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1602026-GA-OPENCLAMD14,1602036-HA-HD797015261&sro&gru .
System details (for each configuration after the first, only the fields that differ from the configuration listed before it are shown)

Open-Source: HD 6870
  Processor: Intel Core i7-5960X @ 3.50GHz (16 Cores)
  Motherboard: Gigabyte X99-UD4-CF
  Chipset: Intel Xeon E7 v3/Xeon
  Memory: 16384MB
  Disk: 2 x 120GB SanDisk SDSSDA12 + 120GB Samsung SSD 850
  Graphics: Sapphire AMD Radeon HD 6870 1024MB
  Audio: Realtek ALC1150
  Monitor: DELL P2415Q
  Network: Intel Connection
  OS: Ubuntu 15.10
  Kernel: 4.5.0-rc2amdgpu-pp+ (x86_64) 20160201
  Desktop: Unity
  Display Server: X Server 1.17.2
  Display Driver: radeon 7.6.99
  OpenGL: 3.3 Mesa 11.2.0-devel (padoka PPA) Gallium 0.4
  Compiler: GCC 5.2.1 20151010
  File-System: ext4
  Screen Resolution: 3840x2160

Open-Source: HD 7950
  Graphics: XFX AMD Radeon HD 7950/8950 OEM / R9 280 3072MB
  OpenGL: 4.1 Mesa 11.2.0-devel (padoka PPA) Gallium 0.4

Open-Source: R9 285
  Graphics: XFX AMD Radeon R9 285/380 2048MB
  Display Driver: amdgpu 1.0.99

Open-Source: R9 290
  Graphics: XFX AMD Radeon R9 290 4096MB
  Display Driver: radeon 7.6.99

Open-Source: R7 370
  Graphics: MSI AMD Radeon R7 370 / R9 270/370 OEM 4096MB

Catalyst: HD 6870
  Graphics: Sapphire AMD Radeon HD 6800 1024MB
  Kernel: 4.2.0-27-generic (x86_64)
  Display Driver: fglrx 15.20.3
  OpenGL: 4.5.13399

Catalyst: HD 7950
  Graphics: XFX AMD Radeon HD 7900 3072MB

Catalyst: R9 285
  Graphics: Supported device 6939 2048MB

Catalyst: R9 290
  Graphics: Supported device 67B1 4096MB

Catalyst: R7 370
  Graphics: MSI AMD Radeon R7 370 4096MB

hd7970
  Processor: Intel Core i5-3330 @ 3.20GHz (4 Cores)
  Motherboard: ASRock H61 Pro BTC
  Chipset: Intel Xeon E3-1200 v2/3rd
  Memory: 8192MB
  Disk: 65GB
  Graphics: Intel HD 2500 3072MB (1050MHz)
  Audio: VIA VT1705
  Monitor: H225W
  Network: Realtek RTL8111/8168/8411
  OS: SUSE LINUX 42.1
  Kernel: 4.5.0-rc1-3-default (x86_64)
  Desktop: KDE Frameworks 5
  Display Server: X Server 1.18.99.1
  Display Driver: intel 2.99.917
  OpenGL: 4.1 Mesa 11.2.0-devel (git-43a401a pontostroy:X11) Gallium 0.4
  Compiler: GCC 4.8.5 + Clang 3.9.0 + LLVM 3.9.0svn
  File-System: xfs
  Screen Resolution: 1920x1080

Kernel Details
  Open-Source: HD 6870, HD 7950, R9 285, R9 290, R7 370: amdgpu.powerplay=1

Compiler Details
  Open-Source: HD 6870, HD 7950, R9 285, R9 290, R7 370 and Catalyst: HD 6870, HD 7950, R9 285, R9 290, R7 370:
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  Open-Source: HD 6870, HD 7950, R9 285, R9 290, R7 370: Scaling Governor: intel_pstate performance
  Catalyst: HD 6870, HD 7950, R9 285, R9 290, R7 370: Scaling Governor: intel_pstate performance
  hd7970: Scaling Governor: intel_pstate powersave

Graphics Details
  Open-Source: HD 6870: EXA
  Open-Source: HD 7950, R9 285, R9 290, R7 370: GLAMOR

Environment Details
  Catalyst: HD 6870, HD 7950, R9 285, R9 290, R7 370: LIBGL_DRIVERS_PATH=/usr/lib/fglrx/dri:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/dri:/usr/lib32/fglrx/dri:/usr/lib/i386-linux-gnu/dri
clpeak results summary ("-" marks a test with no result for that configuration)

Configuration          Read BW   Write BW  Global Mem BW  Single-Precision  Double-Precision  Integer   Kernel Latency
                       (GBPS)    (GBPS)    (GBPS)         (GFLOPS)          (GFLOPS)          (GIOPS)   (us)
Open-Source: HD 6870   3.58      5.26      -              360.92            -                 182.44    145.85
Open-Source: HD 7950   10.40     15.72     -              2345.12           547.03            491.62    105.54
Open-Source: R9 285    10.09     14.82     -              3102.60           204.75            650.80    143.39
Open-Source: R9 290    9.82      12.32     -              4608.55           596.55            1578.01   118.71
Open-Source: R7 370    10.41     15.77     -              2031.91           131.49            418.74    110.24
Catalyst: HD 6870      8.80      9.49      -              398.29            -                 199.38    24.90
Catalyst: HD 7950      10.47     13.88     -              2292.54           536.84            516.74    34.03
Catalyst: R9 285       9.00      13.21     -              3177.62           202.71            647.08    26.78
Catalyst: R9 290       10.64     14.81     -              4721.18           598.81            1591.74   37.77
Catalyst: R7 370       10.67     14.97     -              2095.91           128.73            415.67    35.42
hd7970                 5.83      7.99      168.85         4149.22           1058.38           847.70    69.45
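To read the summary as a driver comparison, each open-source result can be expressed as a percentage of the Catalyst result on the same card. The following is a minimal Python sketch for the single-precision figures only; the dictionary layout and the helper name `percent_of_catalyst` are illustrative assumptions and are not part of the Phoronix Test Suite.

```python
# Single-precision GFLOPS copied from the results summary above.
# Keys and structure are hypothetical, chosen only for this illustration.
single_precision = {
    "HD 6870": {"open_source": 360.92, "catalyst": 398.29},
    "HD 7950": {"open_source": 2345.12, "catalyst": 2292.54},
    "R9 285": {"open_source": 3102.60, "catalyst": 3177.62},
    "R9 290": {"open_source": 4608.55, "catalyst": 4721.18},
    "R7 370": {"open_source": 2031.91, "catalyst": 2095.91},
}


def percent_of_catalyst(open_source, catalyst):
    """Return the open-source result as a percentage of the Catalyst result."""
    return 100.0 * open_source / catalyst


for gpu, result in single_precision.items():
    pct = percent_of_catalyst(result["open_source"], result["catalyst"])
    print(f"{gpu}: open-source reaches {pct:.1f}% of Catalyst")
```

The same helper works for any "More Is Better" column; for Kernel Latency, where fewer is better, a value above 100% would indicate the slower driver.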
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, More Is Better)
  Catalyst: HD 6870      8.80    SE +/- 0.08, N = 3
  Catalyst: HD 7950      10.47   SE +/- 0.11, N = 3
  Catalyst: R7 370       10.67   SE +/- 0.04, N = 3
  Catalyst: R9 285       9.00    SE +/- 0.07, N = 3
  Catalyst: R9 290       10.64   SE +/- 0.13, N = 3
  Open-Source: HD 6870   3.58    SE +/- 0.10, N = 6
  Open-Source: HD 7950   10.40   SE +/- 0.03, N = 3
  Open-Source: R7 370    10.41   SE +/- 0.02, N = 3
  Open-Source: R9 285    10.09   SE +/- 0.30, N = 6
  Open-Source: R9 290    9.82    SE +/- 0.25, N = 6
  hd7970                 5.83    SE +/- 0.01, N = 3
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
  Catalyst: HD 6870      9.49    SE +/- 0.12, N = 3
  Catalyst: HD 7950      13.88   SE +/- 0.04, N = 3
  Catalyst: R7 370       14.97   SE +/- 0.00, N = 3
  Catalyst: R9 285       13.21   SE +/- 0.19, N = 3
  Catalyst: R9 290       14.81   SE +/- 0.03, N = 3
  Open-Source: HD 6870   5.26    SE +/- 0.00, N = 3
  Open-Source: HD 7950   15.72   SE +/- 0.05, N = 3
  Open-Source: R7 370    15.77   SE +/- 0.00, N = 3
  Open-Source: R9 285    14.82   SE +/- 0.59, N = 6
  Open-Source: R9 290    12.32   SE +/- 0.06, N = 3
  hd7970                 7.99    SE +/- 0.01, N = 3
clpeak OpenCL Test: Global Memory Bandwidth (GBPS, More Is Better)
  hd7970                 168.85  SE +/- 0.73, N = 3
clpeak OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
  Catalyst: HD 6870      398.29    SE +/- 0.00, N = 3
  Catalyst: HD 7950      2292.54   SE +/- 44.66, N = 6
  Catalyst: R7 370       2095.91   SE +/- 2.25, N = 3
  Catalyst: R9 285       3177.62   SE +/- 0.24, N = 3
  Catalyst: R9 290       4721.18   SE +/- 0.95, N = 3
  Open-Source: HD 6870   360.92    SE +/- 0.08, N = 3
  Open-Source: HD 7950   2345.12   SE +/- 47.92, N = 6
  Open-Source: R7 370    2031.91   SE +/- 0.22, N = 3
  Open-Source: R9 285    3102.60   SE +/- 1.52, N = 3
  Open-Source: R9 290    4608.55   SE +/- 2.13, N = 3
  hd7970                 4149.22   SE +/- 2.09, N = 3
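The SE figures give a rough way to judge whether a gap between the two drivers is larger than run-to-run noise. The sketch below divides the difference of two means by their combined standard error. It assumes the SE values in the single-precision chart are listed in the same order as the configurations, and it treats "less than about two combined standard errors" as a rule of thumb for noise; neither assumption is stated in the result file, and with N = 3 or 6 runs the estimate is coarse. The function name `gap_in_standard_errors` is hypothetical.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Arguments: open-source mean, open-source SE, Catalyst mean, Catalyst SE (GFLOPS).
hd7950_gap = gap_in_standard_errors(2345.12, 47.92, 2292.54, 44.66)
r9_290_gap = gap_in_standard_errors(4608.55, 2.13, 4721.18, 0.95)

print(f"HD 7950: means are {hd7950_gap:.1f} combined SEs apart")
print(f"R9 290: means are {r9_290_gap:.1f} combined SEs apart")
```

Under these assumptions the HD 7950 gap falls below one combined standard error, so the apparent open-source lead on that card is not clearly distinguishable from noise, while the R9 290 gap is far larger than its measurement spread.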
clpeak OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
  Catalyst: HD 7950      536.84    SE +/- 11.37, N = 6
  Catalyst: R7 370       128.73    SE +/- 0.36, N = 3
  Catalyst: R9 285       202.71    SE +/- 0.22, N = 3
  Catalyst: R9 290       598.81    SE +/- 0.01, N = 3
  Open-Source: HD 7950   547.03    SE +/- 16.28, N = 6
  Open-Source: R7 370    131.49    SE +/- 0.00, N = 3
  Open-Source: R9 285    204.75    SE +/- 0.03, N = 3
  Open-Source: R9 290    596.55    SE +/- 0.10, N = 3
  hd7970                 1058.38   SE +/- 0.12, N = 3
clpeak OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
  Catalyst: HD 6870      199.38    SE +/- 0.02, N = 3
  Catalyst: HD 7950      516.74    SE +/- 21.03, N = 6
  Catalyst: R7 370       415.67    SE +/- 3.65, N = 3
  Catalyst: R9 285       647.08    SE +/- 1.77, N = 3
  Catalyst: R9 290       1591.74   SE +/- 0.05, N = 3
  Open-Source: HD 6870   182.44    SE +/- 0.16, N = 3
  Open-Source: HD 7950   491.62    SE +/- 7.09, N = 6
  Open-Source: R7 370    418.74    SE +/- 0.01, N = 3
  Open-Source: R9 285    650.80    SE +/- 0.05, N = 3
  Open-Source: R9 290    1578.01   SE +/- 0.49, N = 3
  hd7970                 847.70    SE +/- 0.06, N = 3
clpeak OpenCL Test: Kernel Latency (us, Fewer Is Better)
  Catalyst: HD 6870      24.90    SE +/- 0.28, N = 3
  Catalyst: HD 7950      34.03    SE +/- 0.68, N = 3
  Catalyst: R7 370       35.42    SE +/- 0.52, N = 5
  Catalyst: R9 285       26.78    SE +/- 0.38, N = 3
  Catalyst: R9 290       37.77    SE +/- 2.12, N = 6
  Open-Source: HD 6870   145.85   SE +/- 8.80, N = 6
  Open-Source: HD 7950   105.54   SE +/- 1.78, N = 6
  Open-Source: R7 370    110.24   SE +/- 3.85, N = 6
  Open-Source: R9 285    143.39   SE +/- 14.32, N = 6
  Open-Source: R9 290    118.71   SE +/- 2.08, N = 6
  hd7970                 69.45    SE +/- 0.81, N = 3
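Kernel latency is where the two drivers differ most across all five cards. One way to summarize that is the per-card ratio of open-source to Catalyst latency, combined with a geometric mean. The Python sketch below does this with the values from the chart above; the variable names are illustrative, and the choice of a geometric mean as the summary statistic is an assumption made here, not something the result file computes.

```python
from statistics import geometric_mean

# (open-source latency, Catalyst latency) in microseconds, from the chart above.
latency_us = {
    "HD 6870": (145.85, 24.90),
    "HD 7950": (105.54, 34.03),
    "R9 285": (143.39, 26.78),
    "R9 290": (118.71, 37.77),
    "R7 370": (110.24, 35.42),
}

# Ratio > 1 means the open-source driver has the higher (worse) latency.
ratios = {gpu: open_source / catalyst
          for gpu, (open_source, catalyst) in latency_us.items()}

for gpu, ratio in ratios.items():
    print(f"{gpu}: open-source latency is {ratio:.2f}x Catalyst")

overall = geometric_mean(list(ratios.values()))
print(f"Geometric mean over {len(ratios)} cards: {overall:.2f}x")
```

A geometric mean is used because the inputs are ratios; an arithmetic mean would weight the two largest ratios (HD 6870 and R9 285) more heavily.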
Phoronix Test Suite v10.8.4