OpenCL Radeon Open-Source vs. Catalyst
Radeon R600g and RadeonSI Gallium3D Clover OpenCL benchmarks with clpeak using Linux 4.5 + Mesa 11.2-devel, compared to Catalyst. Benchmark results by Michael Larabel of Phoronix.com for a future article.
HTML result view exported from: https://openbenchmarking.org/result/1602034-GA-1602026GA07&sor&grs .
System Details
(After the first configuration, a field is listed only where it differs from the configuration before it.)

Open-Source: HD 6870
  Processor: Intel Core i7-5960X @ 3.50GHz (16 Cores)
  Motherboard: Gigabyte X99-UD4-CF
  Chipset: Intel Xeon E7 v3/Xeon
  Memory: 16384MB
  Disk: 2 x 120GB SanDisk SDSSDA12 + 120GB Samsung SSD 850
  Graphics: Sapphire AMD Radeon HD 6870 1024MB
  Audio: Realtek ALC1150
  Monitor: DELL P2415Q
  Network: Intel Connection
  OS: Ubuntu 15.10
  Kernel: 4.5.0-rc2amdgpu-pp+ (x86_64) 20160201
  Desktop: Unity
  Display Server: X Server 1.17.2
  Display Driver: radeon 7.6.99
  OpenGL: 3.3 Mesa 11.2.0-devel (padoka PPA) Gallium 0.4
  Compiler: GCC 5.2.1 20151010
  File-System: ext4
  Screen Resolution: 3840x2160

Open-Source: HD 7950
  Graphics: XFX AMD Radeon HD 7950/8950 OEM / R9 280 3072MB
  OpenGL: 4.1 Mesa 11.2.0-devel (padoka PPA) Gallium 0.4

Open-Source: R9 285
  Graphics: XFX AMD Radeon R9 285/380 2048MB
  Display Driver: amdgpu 1.0.99

Open-Source: R9 290
  Graphics: XFX AMD Radeon R9 290 4096MB
  Display Driver: radeon 7.6.99

Open-Source: R7 370
  Graphics: MSI AMD Radeon R7 370 / R9 270/370 OEM 4096MB

Catalyst: HD 6870
  Graphics: Sapphire AMD Radeon HD 6800 1024MB
  Kernel: 4.2.0-27-generic (x86_64)
  Display Driver: fglrx 15.20.3
  OpenGL: 4.5.13399

Catalyst: HD 7950
  Graphics: XFX AMD Radeon HD 7900 3072MB

Catalyst: R9 285
  Graphics: Supported device 6939 2048MB

Catalyst: R9 290
  Graphics: Supported device 67B1 4096MB

Catalyst: R7 370
  Graphics: MSI AMD Radeon R7 370 4096MB

Catalyst: R9 290x
  Processor: Intel Core i7-4790K @ 4.50GHz (8 Cores)
  Motherboard: ASUS SABERTOOTH Z97 MARK 1
  Chipset: Intel 4th Gen Core DRAM
  Disk: 512GB OCZ VERTEX4 + 256GB CORSAIR FORCE LX
  Graphics: Supported device 67B0 4096MB (1030/1250MHz)
  Audio: AMD Hawaii HDMI Audio
  Monitor: ASUS PB278
  Network: Realtek RTL8111/8168/8411
  OS: Mageia 5
  Kernel: 4.1.15-desktop-2.mga5 (x86_64)
  Desktop: KDE 4.14.5
  Display Server: X Server 1.16.4
  Display Driver: fglrx 15.30.3
  OpenGL: 4.4.13416
  Compiler: GCC 4.9.2 + Open64 + Clang 3.5.2 + LLVM 3.5.2
  Screen Resolution: 2560x1440

Kernel Details
  Open-Source: HD 6870, HD 7950, R9 285, R9 290, R7 370: amdgpu.powerplay=1

Compiler Details
  Open-Source: HD 6870, HD 7950, R9 285, R9 290, R7 370; Catalyst: HD 6870, HD 7950, R9 285, R9 290, R7 370:
  --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
  Scaling Governor: intel_pstate performance

Graphics Details
  Open-Source: HD 6870: EXA
  Open-Source: HD 7950, R9 285, R9 290, R7 370: GLAMOR

Environment Details
  Catalyst: HD 6870, HD 7950, R9 285, R9 290, R7 370: LIBGL_DRIVERS_PATH=/usr/lib/fglrx/dri:/usr/lib/x86_64-linux-gnu/dri:/usr/lib/dri:/usr/lib32/fglrx/dri:/usr/lib/i386-linux-gnu/dri
  Catalyst: R9 290x: LIBGL_DRIVERS_PATH=/usr/lib/dri:/usr/lib64/dri
clpeak Results Overview
Columns: Double = Double-Precision Double (GFLOPS), Write = Transfer Bandwidth enqueueWriteBuffer (GBPS), Read = Transfer Bandwidth enqueueReadBuffer (GBPS), Float = Single-Precision Float (GFLOPS), INT = Integer Compute INT (GIOPS), Latency = Kernel Latency (us). A "-" marks a test with no reported result.

Configuration            Double     Write    Read     Float      INT        Latency
Open-Source: HD 6870     -          5.26     3.58     360.92     182.44     145.85
Open-Source: HD 7950     547.03     15.72    10.40    2345.12    491.62     105.54
Open-Source: R9 285      204.75     14.82    10.09    3102.60    650.80     143.39
Open-Source: R9 290      596.55     12.32    9.82     4608.55    1578.01    118.71
Open-Source: R7 370      131.49     15.77    10.41    2031.91    418.74     110.24
Catalyst: HD 6870        -          9.49     8.80     398.29     199.38     24.90
Catalyst: HD 7950        536.84     13.88    10.47    2292.54    516.74     34.03
Catalyst: R9 285         202.71     13.21    9.00     3177.62    647.08     26.78
Catalyst: R9 290         598.81     14.81    10.64    4721.18    1591.74    37.77
Catalyst: R7 370         128.73     14.97    10.67    2095.91    415.67     35.42
Catalyst: R9 290x        716.17     26.42    12.22    5697.10    1871.32    17.96
clpeak OpenCL Test: Double-Precision Double (GFLOPS, More Is Better)
  Catalyst: R9 290x 716.17; R9 290 598.81; HD 7950 536.84; R9 285 202.71; R7 370 128.73
  Open-Source: R9 290 596.55; HD 7950 547.03; R9 285 204.75; R7 370 131.49
  Standard error across results: 0.00 to 16.28 (N = 3 or 6 runs per result).
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, More Is Better)
  Catalyst: R9 290x 26.42; R7 370 14.97; R9 290 14.81; HD 7950 13.88; R9 285 13.21; HD 6870 9.49
  Open-Source: R7 370 15.77; HD 7950 15.72; R9 285 14.82; R9 290 12.32; HD 6870 5.26
  Standard error across results: 0.00 to 0.59 (N = 3 or 6 runs per result).
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, More Is Better)
  Catalyst: R9 290x 12.22; R7 370 10.67; R9 290 10.64; HD 7950 10.47; R9 285 9.00; HD 6870 8.80
  Open-Source: R7 370 10.41; HD 7950 10.40; R9 285 10.09; R9 290 9.82; HD 6870 3.58
  Standard error across results: 0.02 to 0.30 (N = 3 or 6 runs per result).
clpeak OpenCL Test: Single-Precision Float (GFLOPS, More Is Better)
  Catalyst: R9 290x 5697.10; R9 290 4721.18; R9 285 3177.62; HD 7950 2292.54; R7 370 2095.91; HD 6870 398.29
  Open-Source: R9 290 4608.55; R9 285 3102.60; HD 7950 2345.12; R7 370 2031.91; HD 6870 360.92
  Standard error across results: 0.00 to 47.92 (N = 3 or 6 runs per result).
clpeak OpenCL Test: Integer Compute INT (GIOPS, More Is Better)
  Catalyst: R9 290x 1871.32; R9 290 1591.74; R9 285 647.08; HD 7950 516.74; R7 370 415.67; HD 6870 199.38
  Open-Source: R9 290 1578.01; R9 285 650.80; HD 7950 491.62; R7 370 418.74; HD 6870 182.44
  Standard error across results: 0.01 to 21.03 (N = 3 or 6 runs per result).
clpeak OpenCL Test: Kernel Latency (us, Fewer Is Better)
  Catalyst: R9 290x 17.96; HD 6870 24.90; R9 285 26.78; HD 7950 34.03; R7 370 35.42; R9 290 37.77
  Open-Source: HD 7950 105.54; R7 370 110.24; R9 290 118.71; R9 285 143.39; HD 6870 145.85
  Standard error across results: 0.28 to 14.32 (N = 3, 5, or 6 runs per result).
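One way to read the paired results is to express each open-source result as a percentage of the Catalyst result on the same card. The sketch below does that for two of the six tests, using values copied from this result file. The names RESULTS, open_vs_catalyst, and summarize are illustrative and are not part of clpeak or the Phoronix Test Suite. The R9 290x is left out because it has no open-source counterpart here.

```python
# Illustrative sketch: open-source result as a percentage of Catalyst.
# Values are copied from this result file; each tuple is (open_source, catalyst).
RESULTS = {
    "Single-Precision Float (GFLOPS, more is better)": {
        "HD 6870": (360.92, 398.29),
        "HD 7950": (2345.12, 2292.54),
        "R9 285": (3102.60, 3177.62),
        "R9 290": (4608.55, 4721.18),
        "R7 370": (2031.91, 2095.91),
    },
    "Kernel Latency (us, fewer is better)": {
        "HD 6870": (145.85, 24.90),
        "HD 7950": (105.54, 34.03),
        "R9 285": (143.39, 26.78),
        "R9 290": (118.71, 37.77),
        "R7 370": (110.24, 35.42),
    },
}


def open_vs_catalyst(open_source, catalyst):
    """Return the open-source result as a percentage of the Catalyst result."""
    return 100.0 * open_source / catalyst


def summarize(results):
    """Return (test, gpu, percent) rows for every paired result."""
    rows = []
    for test, gpus in results.items():
        for gpu, (open_source, catalyst) in gpus.items():
            rows.append((test, gpu, open_vs_catalyst(open_source, catalyst)))
    return rows


if __name__ == "__main__":
    for test, gpu, percent in summarize(RESULTS):
        print(f"{test:50s} {gpu:8s} {percent:7.1f}% of Catalyst")
```

For throughput tests a value below 100% means the open-source stack is slower than Catalyst. For kernel latency the direction is reversed: a value above 100% means the open-source stack has the higher (worse) latency.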
Phoronix Test Suite v10.8.5