OpenCL tests for a future article on Phoronix.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
    phoronix-test-suite benchmark 1704286-RI-1704211TR78
HTML result view exported from: https://openbenchmarking.org/result/1704286-RI-1704211TR78&grs&sor .
OpenCL Testing: System Information

Configurations: GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1050, GeForce GTX 1080 Ti #1, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, GeForce GTX 1080 Ti #2, Radeon RX 480, Radeon RX 580, Radeon R9 Fury, Gigabyte AMD Radeon RX 480.

Entries are listed only where they differ from the configuration before them.

GeForce GTX 970
  Processor:         Intel Core i7-7700K @ 4.50GHz (8 Cores)
  Motherboard:       MSI Z270-A PRO (MS-7A71) v1.0
  Chipset:           Intel Device 591f + Z270
  Memory:            16384MB
  Disk:              Samsung SSD 950 PRO 256GB
  Graphics:          eVGA NVIDIA GeForce GTX 970 4096MB (1164/3505MHz)
  Audio:             Realtek ALC892
  Network:           Realtek RTL8111/8168/8411
  OS:                Ubuntu 17.04
  Kernel:            4.10.0-19-generic (x86_64)
  Desktop:           Unity 7.5.0
  Display Server:    X Server 1.19.3
  Display Driver:    NVIDIA 381.09
  OpenGL:            4.5.0
  Vulkan:            1.0.42
  Compiler:          GCC 6.3.0 20170406
  File-System:       ext4
  Screen Resolution: 3840x2160

Graphics entries for the remaining GeForce configurations:
  GeForce GTX 980:        NVIDIA GeForce GTX 980 4096MB (135/324MHz)
  GeForce GTX 980 Ti:     NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)
  GeForce GTX 1050:       Zotac NVIDIA GeForce GTX 1050 2048MB (1316/3504MHz)
  GeForce GTX 1080 Ti #1: NVIDIA GeForce GTX 1080 Ti 11264MB (1468/5508MHz)
  GeForce GTX 1050 Ti:    eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)
  GeForce GTX 1060:       NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)
  GeForce GTX 1070:       NVIDIA GeForce GTX 1070 8192MB (250/4006MHz)
  GeForce GTX 1080:       NVIDIA GeForce GTX 1080 8192MB (84/5005MHz)
  GeForce GTX 1080 Ti #2: NVIDIA GeForce GTX 1080 Ti 11264MB (1472/5508MHz)

Radeon RX 480
  Graphics:          AMD Radeon RX 470/480 8192MB
  Monitor:           Acer B286HK
  Kernel:            4.8.0-040800-generic (x86_64)
  Display Driver:    modesetting 1.19.3
  OpenCL:            OpenCL 2.0 AMD-APP (2348.3)

Radeon RX 580
  Graphics:          MSI AMD Radeon RX 470/480 8192MB

Radeon R9 Fury
  Graphics:          Sapphire AMD Radeon R9 FURY / NANO 4096MB

Gigabyte AMD Radeon RX 480
  Processor:         AMD FX-8320 Eight-Core @ 3.51GHz (8 Cores)
  Motherboard:       ASUS Crosshair V Formula
  Chipset:           AMD RD890 bridge
  Memory:            4 x 2048 MB DDR3-800MHz
  Disk:              160GB Western Digital WD1600JS-60N + 1000GB Hitachi HDS72101 + 240GB ADATA SP550
  Graphics:          Gigabyte AMD Radeon RX 480 8192MB
  Audio:             Realtek ALC889
  Monitor:           LD24L11FHD
  Network:           Intel 82583V Gigabit Connection
  OS:                Ubuntu 16.04
  Kernel:            4.4.0-75-generic (x86_64)
  Desktop:           KDE Frameworks 5
  Display Server:    X Server 1.18.4
  Display Driver:    modesetting 1.18.4
  OpenGL:            4.5.13474
  Vulkan:            1.0.3
  Compiler:          GCC 5.4.0 20160609
  Screen Resolution: 1920x1080

Compiler Details
- GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1050, GeForce GTX 1080 Ti #1, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, Radeon RX 480, Radeon RX 580, Radeon R9 Fury (the same options are listed for each): --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v
- Gigabyte AMD Radeon RX 480: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details
- GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1050, GeForce GTX 1080 Ti #1, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, GeForce GTX 1080 Ti #2, Radeon RX 480, Radeon RX 580, Radeon R9 Fury: Scaling Governor: intel_pstate powersave

OpenCL Details (the same values are repeated under System Details)
- GeForce GTX 970: GPU Compute Cores: 1664
- GeForce GTX 980: GPU Compute Cores: 2048
- GeForce GTX 980 Ti: GPU Compute Cores: 2816
- GeForce GTX 1050: GPU Compute Cores: 640
- GeForce GTX 1080 Ti #1: GPU Compute Cores: 3584
- GeForce GTX 1050 Ti: GPU Compute Cores: 768
- GeForce GTX 1060: GPU Compute Cores: 1280
- GeForce GTX 1070: GPU Compute Cores: 1920
- GeForce GTX 1080: GPU Compute Cores: 2560

Graphics Details
- Radeon R9 Fury, Gigabyte AMD Radeon RX 480: GLAMOR

OpenCL Testing: Result Overview

A "-" marks a test with no result for that configuration in this file. Units and test versions are given in the per-test sections that follow.

darktable (OpenCL)
Configuration                2.2.1 Boat  2.2.1 Masskrug  2.2.1 Server Room  2.0.3 Boat  2.0.3 Masskrug  2.0.3 Server Room
GeForce GTX 970              -           -               -                  -           -               -
GeForce GTX 980              -           -               -                  -           -               -
GeForce GTX 980 Ti           3.58        5.46            0.97               -           -               -
GeForce GTX 1050             -           -               -                  -           -               -
GeForce GTX 1080 Ti #1       2.78        5.28            0.77               -           -               -
GeForce GTX 1050 Ti          16.30       18.40           12.15              -           -               -
GeForce GTX 1060             4.27        5.53            1.07               -           -               -
GeForce GTX 1070             3.46        5.35            0.87               -           -               -
GeForce GTX 1080             3.28        5.35            0.87               -           -               -
GeForce GTX 1080 Ti #2       -           -               -                  -           -               -
Radeon RX 480                26.36       18.68           9.16               -           -               -
Radeon RX 580                26.37       18.69           9.17               -           -               -
Radeon R9 Fury               26.36       18.69           9.17               -           -               -
Gigabyte AMD Radeon RX 480   -           -               -                  4.95        0.27            0.27

shoc (OpenCL)
Configuration                Texture Read Bandwidth  Bus Speed Download  Bus Speed Readback  MD5 Hash  FFT SP  Max SP Flops  Triad
GeForce GTX 970              280.99                  12.81               13.17               6.57      399.01  4333.19       11.82
GeForce GTX 980              333.16                  12.81               13.17               7.59      459.23  4989.37       11.94
GeForce GTX 980 Ti           351.70                  12.81               13.17               9.34      712.96  6156.85       12.20
GeForce GTX 1050             271.76                  12.81               13.17               3.25      246.28  2112.97       11.40
GeForce GTX 1080 Ti #1       592.43                  12.81               13.17               19.81     986.68  13088.70      12.41
GeForce GTX 1050 Ti          301.62                  12.81               13.17               4.13      207.62  2677.91       11.38
GeForce GTX 1060             380.73                  12.81               13.17               7.34      329.51  4776.31       11.88
GeForce GTX 1070             450.04                  12.81               13.17               10.69     518.16  7080.15       12.11
GeForce GTX 1080             520.35                  12.81               13.17               14.23     634.14  9342.41       12.21
GeForce GTX 1080 Ti #2       -                       -                   -                   -         -       -             -
Radeon RX 480                14.89                   39.81               39.43               0.08      4.05    130.20        8.39
Radeon RX 580                14.89                   40.20               39.12               0.08      4.06    130.21        8.40
Radeon R9 Fury               14.89                   41.20               40.40               0.08      4.03    130.16        8.42
Gigabyte AMD Radeon RX 480   -                       -                   -                   -         -       -             -

juliagpu and cl-mem
Configuration                juliagpu: GPU   cl-mem: Write  cl-mem: Copy  cl-mem: Read
GeForce GTX 970              111315195.20    133.30         125.70        143.63
GeForce GTX 980              121825827.07    154.80         142.77        164.57
GeForce GTX 980 Ti           137239997.10    242.50         216.90        265.83
GeForce GTX 1050             66598918.93     86.13          87.13         94.83
GeForce GTX 1080 Ti #1       202729946.73    341.33         316.63        337.87
GeForce GTX 1050 Ti          80390064.17     85.03          86.67         94.10
GeForce GTX 1060             119921484.47    145.20         139.10        153.23
GeForce GTX 1070             151370700.07    195.73         186.70        204.90
GeForce GTX 1080             174049828.87    219.97         208.97        229.03
GeForce GTX 1080 Ti #2       -               -              -             -
Radeon RX 480                -               1.40           1.50          2.67
Radeon RX 580                -               1.40           1.50          2.70
Radeon R9 Fury               -               1.40           1.50          2.70
Gigabyte AMD Radeon RX 480   -               188.67         178.87        213.33

Darktable 2.2.1
Test: Boat - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       2.78    SE +/- 0.01, N = 3
GeForce GTX 1080             3.28    SE +/- 0.01, N = 3
GeForce GTX 1070             3.46    SE +/- 0.01, N = 3
GeForce GTX 980 Ti           3.58    SE +/- 0.00, N = 3
GeForce GTX 1060             4.27    SE +/- 0.00, N = 3
GeForce GTX 1050 Ti          16.30   SE +/- 0.01, N = 3
Radeon RX 480                26.36   SE +/- 0.00, N = 3
Radeon R9 Fury               26.36   SE +/- 0.01, N = 3
Radeon RX 580                26.37   SE +/- 0.01, N = 3

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       592.43   SE +/- 1.29, N = 3
GeForce GTX 1080             520.35   SE +/- 2.01, N = 3
GeForce GTX 1070             450.04   SE +/- 2.14, N = 3
GeForce GTX 1060             380.73   SE +/- 0.39, N = 3
GeForce GTX 980 Ti           351.70   SE +/- 1.47, N = 3
GeForce GTX 980              333.16   SE +/- 1.29, N = 3
GeForce GTX 1050 Ti          301.62   SE +/- 3.36, N = 3
GeForce GTX 970              280.99   SE +/- 0.19, N = 3
GeForce GTX 1050             271.76   SE +/- 2.55, N = 3
Radeon R9 Fury               14.89    SE +/- 0.00, N = 3
Radeon RX 580                14.89    SE +/- 0.00, N = 3
Radeon RX 480                14.89    SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable 2.2.1
Test: Masskrug - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       5.28    SE +/- 0.00, N = 3
GeForce GTX 1070             5.35    SE +/- 0.02, N = 3
GeForce GTX 1080             5.35    SE +/- 0.00, N = 3
GeForce GTX 980 Ti           5.46    SE +/- 0.01, N = 3
GeForce GTX 1060             5.53    SE +/- 0.01, N = 3
GeForce GTX 1050 Ti          18.40   SE +/- 0.00, N = 3
Radeon RX 480                18.68   SE +/- 0.01, N = 3
Radeon RX 580                18.69   SE +/- 0.02, N = 3
Radeon R9 Fury               18.69   SE +/- 0.00, N = 3

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Bus Speed Download
GB/s, More Is Better (OpenBenchmarking.org)

Radeon R9 Fury               41.20   SE +/- 0.30, N = 3
Radeon RX 580                40.20   SE +/- 0.53, N = 3
Radeon RX 480                39.81   SE +/- 0.57, N = 6
GeForce GTX 1080             12.81   SE +/- 0.00, N = 3
GeForce GTX 1070             12.81   SE +/- 0.00, N = 3
GeForce GTX 1060             12.81   SE +/- 0.00, N = 3
GeForce GTX 1050 Ti          12.81   SE +/- 0.00, N = 3
GeForce GTX 1080 Ti #1       12.81   SE +/- 0.00, N = 3
GeForce GTX 1050             12.81   SE +/- 0.00, N = 3
GeForce GTX 980 Ti           12.81   SE +/- 0.00, N = 3
GeForce GTX 980              12.81   SE +/- 0.00, N = 3
GeForce GTX 970              12.81   SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Bus Speed Readback
GB/s, More Is Better (OpenBenchmarking.org)

Radeon R9 Fury               40.40   SE +/- 0.36, N = 3
Radeon RX 480                39.43   SE +/- 0.36, N = 3
Radeon RX 580                39.12   SE +/- 0.28, N = 3
GeForce GTX 1080             13.17   SE +/- 0.00, N = 3
GeForce GTX 1070             13.17   SE +/- 0.00, N = 3
GeForce GTX 1060             13.17   SE +/- 0.00, N = 3
GeForce GTX 1050 Ti          13.17   SE +/- 0.00, N = 3
GeForce GTX 1080 Ti #1       13.17   SE +/- 0.00, N = 3
GeForce GTX 1050             13.17   SE +/- 0.00, N = 3
GeForce GTX 980 Ti           13.17   SE +/- 0.00, N = 3
GeForce GTX 980              13.17   SE +/- 0.00, N = 3
GeForce GTX 970              13.17   SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

JuliaGPU 1.2pts1
OpenCL Device: GPU
Samples/sec, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       202729946.73   SE +/- 235270.97, N = 3
GeForce GTX 1080             174049828.87   SE +/- 171757.93, N = 3
GeForce GTX 1070             151370700.07   SE +/- 74030.18, N = 3
GeForce GTX 980 Ti           137239997.10   SE +/- 153929.03, N = 3
GeForce GTX 980              121825827.07   SE +/- 162823.13, N = 3
GeForce GTX 1060             119921484.47   SE +/- 296618.42, N = 3
GeForce GTX 970              111315195.20   SE +/- 75709.36, N = 3
GeForce GTX 1050 Ti          80390064.17    SE +/- 89962.63, N = 3
GeForce GTX 1050             66598918.93    SE +/- 80114.08, N = 3

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: MD5 Hash
GHash/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       19.81   SE +/- 0.06, N = 3
GeForce GTX 1080             14.23   SE +/- 0.01, N = 3
GeForce GTX 1070             10.69   SE +/- 0.01, N = 3
GeForce GTX 980 Ti           9.34    SE +/- 0.00, N = 3
GeForce GTX 980              7.59    SE +/- 0.00, N = 3
GeForce GTX 1060             7.34    SE +/- 0.01, N = 3
GeForce GTX 970              6.57    SE +/- 0.00, N = 3
GeForce GTX 1050 Ti          4.13    SE +/- 0.01, N = 3
GeForce GTX 1050             3.25    SE +/- 0.00, N = 3
Radeon R9 Fury               0.08    SE +/- 0.00, N = 3
Radeon RX 580                0.08    SE +/- 0.00, N = 3
Radeon RX 480                0.08    SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       986.68   SE +/- 2.48, N = 3
GeForce GTX 980 Ti           712.96   SE +/- 15.11, N = 6
GeForce GTX 1080             634.14   SE +/- 1.90, N = 3
GeForce GTX 1070             518.16   SE +/- 7.10, N = 3
GeForce GTX 980              459.23   SE +/- 1.07, N = 3
GeForce GTX 970              399.01   SE +/- 2.00, N = 3
GeForce GTX 1060             329.51   SE +/- 3.36, N = 3
GeForce GTX 1050             246.28   SE +/- 8.54, N = 6
GeForce GTX 1050 Ti          207.62   SE +/- 6.55, N = 6
Radeon RX 580                4.06     SE +/- 0.02, N = 3
Radeon RX 480                4.05     SE +/- 0.01, N = 3
Radeon R9 Fury               4.03     SE +/- 0.00, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

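The SE figures for FFT SP are easier to compare across GPUs when expressed as a percentage of the reported result. The following is a minimal Python sketch using only the FFT SP numbers from this result file. The 2% cut-off and the names `fft_sp`, `relative_se` and `THRESHOLD_PCT` are illustrative assumptions, not something the Phoronix Test Suite defines.

```python
# FFT SP results copied from this result file:
# configuration -> (GFLOPS, reported standard error, number of runs N).
fft_sp = {
    "GeForce GTX 1080 Ti #1": (986.68, 2.48, 3),
    "GeForce GTX 980 Ti": (712.96, 15.11, 6),
    "GeForce GTX 1080": (634.14, 1.90, 3),
    "GeForce GTX 1070": (518.16, 7.10, 3),
    "GeForce GTX 980": (459.23, 1.07, 3),
    "GeForce GTX 970": (399.01, 2.00, 3),
    "GeForce GTX 1060": (329.51, 3.36, 3),
    "GeForce GTX 1050": (246.28, 8.54, 6),
    "GeForce GTX 1050 Ti": (207.62, 6.55, 6),
    "Radeon RX 580": (4.06, 0.02, 3),
    "Radeon RX 480": (4.05, 0.01, 3),
    "Radeon R9 Fury": (4.03, 0.00, 3),
}

THRESHOLD_PCT = 2.0  # hypothetical cut-off, chosen only for illustration


def relative_se(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


rel = {gpu: relative_se(mean, se) for gpu, (mean, se, _n) in fft_sp.items()}
noisy = [gpu for gpu, pct in rel.items() if pct > THRESHOLD_PCT]

# Print the noisiest results first.
for gpu, pct in sorted(rel.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{gpu:24s} {pct:5.2f}%  (N = {fft_sp[gpu][2]})")
```

With these inputs, the three configurations above the cut-off (GeForce GTX 980 Ti, GeForce GTX 1050 and GeForce GTX 1050 Ti) are the same three that report N = 6.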
cl-mem 2017-01-13
Benchmark: Write
GB/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       341.33   SE +/- 0.23, N = 3
GeForce GTX 980 Ti           242.50   SE +/- 0.10, N = 3
GeForce GTX 1080             219.97   SE +/- 0.27, N = 3
GeForce GTX 1070             195.73   SE +/- 0.19, N = 3
Gigabyte AMD Radeon RX 480   188.67   SE +/- 0.23, N = 3
GeForce GTX 980              154.80   SE +/- 0.00, N = 3
GeForce GTX 1060             145.20   SE +/- 0.15, N = 3
GeForce GTX 970              133.30   SE +/- 0.00, N = 3
GeForce GTX 1050             86.13    SE +/- 0.03, N = 3
GeForce GTX 1050 Ti          85.03    SE +/- 0.07, N = 3
Radeon R9 Fury               1.40     SE +/- 0.00, N = 3
Radeon RX 580                1.40     SE +/- 0.00, N = 3
Radeon RX 480                1.40     SE +/- 0.00, N = 3

1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13
Benchmark: Copy
GB/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       316.63   SE +/- 0.47, N = 3
GeForce GTX 980 Ti           216.90   SE +/- 0.00, N = 3
GeForce GTX 1080             208.97   SE +/- 0.29, N = 3
GeForce GTX 1070             186.70   SE +/- 0.15, N = 3
Gigabyte AMD Radeon RX 480   178.87   SE +/- 0.69, N = 3
GeForce GTX 980              142.77   SE +/- 0.03, N = 3
GeForce GTX 1060             139.10   SE +/- 0.10, N = 3
GeForce GTX 970              125.70   SE +/- 0.00, N = 3
GeForce GTX 1050             87.13    SE +/- 0.26, N = 3
GeForce GTX 1050 Ti          86.67    SE +/- 0.03, N = 3
Radeon R9 Fury               1.50     SE +/- 0.00, N = 3
Radeon RX 580                1.50     SE +/- 0.00, N = 3
Radeon RX 480                1.50     SE +/- 0.00, N = 3

1. (CC) gcc options: -O2 -flto -lOpenCL

Darktable 2.2.1
Test: Server Room - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       0.77    SE +/- 0.00, N = 3
GeForce GTX 1070             0.87    SE +/- 0.01, N = 3
GeForce GTX 1080             0.87    SE +/- 0.00, N = 3
GeForce GTX 980 Ti           0.97    SE +/- 0.00, N = 3
GeForce GTX 1060             1.07    SE +/- 0.00, N = 3
Radeon RX 480                9.16    SE +/- 0.00, N = 3
Radeon RX 580                9.17    SE +/- 0.00, N = 3
Radeon R9 Fury               9.17    SE +/- 0.01, N = 3
GeForce GTX 1050 Ti          12.15   SE +/- 0.00, N = 3

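The three Darktable 2.2.1 tests (Boat, Masskrug, Server Room) can be folded into a single figure per GPU. The sketch below uses a geometric mean of the three times, which is one common way to combine timings of different magnitudes. This summary is an illustration and is not part of the exported result. The name `darktable_221` is hypothetical, and the numbers are copied from the three Darktable 2.2.1 sections of this file.

```python
from statistics import geometric_mean  # available in Python 3.8+

# Seconds per test (Boat, Masskrug, Server Room), Darktable 2.2.1, fewer is better.
darktable_221 = {
    "GeForce GTX 1080 Ti #1": (2.78, 5.28, 0.77),
    "GeForce GTX 1080": (3.28, 5.35, 0.87),
    "GeForce GTX 1070": (3.46, 5.35, 0.87),
    "GeForce GTX 980 Ti": (3.58, 5.46, 0.97),
    "GeForce GTX 1060": (4.27, 5.53, 1.07),
    "GeForce GTX 1050 Ti": (16.30, 18.40, 12.15),
    "Radeon RX 480": (26.36, 18.68, 9.16),
    "Radeon RX 580": (26.37, 18.69, 9.17),
    "Radeon R9 Fury": (26.36, 18.69, 9.17),
}

# One combined time per GPU; a lower value means faster overall.
summary = {gpu: geometric_mean(times) for gpu, times in darktable_221.items()}
ranking = sorted(summary, key=summary.get)

for gpu in ranking:
    print(f"{gpu:24s} {summary[gpu]:6.2f} s (geometric mean)")
```

A geometric mean keeps the long Boat and Masskrug runs from drowning out the sub-second Server Room results, which an arithmetic mean of seconds would do.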
cl-mem 2017-01-13
Benchmark: Read
GB/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       337.87   SE +/- 0.55, N = 3
GeForce GTX 980 Ti           265.83   SE +/- 0.03, N = 3
GeForce GTX 1080             229.03   SE +/- 0.39, N = 3
Gigabyte AMD Radeon RX 480   213.33   SE +/- 1.17, N = 3
GeForce GTX 1070             204.90   SE +/- 0.15, N = 3
GeForce GTX 980              164.57   SE +/- 0.03, N = 3
GeForce GTX 1060             153.23   SE +/- 0.09, N = 3
GeForce GTX 970              143.63   SE +/- 0.03, N = 3
GeForce GTX 1050             94.83    SE +/- 0.03, N = 3
GeForce GTX 1050 Ti          94.10    SE +/- 0.00, N = 3
Radeon R9 Fury               2.70     SE +/- 0.00, N = 3
Radeon RX 580                2.70     SE +/- 0.00, N = 3
Radeon RX 480                2.67     SE +/- 0.03, N = 3

1. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Max SP Flops
GFLOPS, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       13088.70   SE +/- 25.00, N = 3
GeForce GTX 1080             9342.41    SE +/- 14.19, N = 3
GeForce GTX 1070             7080.15    SE +/- 21.97, N = 3
GeForce GTX 980 Ti           6156.85    SE +/- 2.28, N = 3
GeForce GTX 980              4989.37    SE +/- 19.83, N = 3
GeForce GTX 1060             4776.31    SE +/- 14.72, N = 3
GeForce GTX 970              4333.19    SE +/- 1.10, N = 3
GeForce GTX 1050 Ti          2677.91    SE +/- 5.27, N = 3
GeForce GTX 1050             2112.97    SE +/- 0.29, N = 3
Radeon RX 580                130.21     SE +/- 0.01, N = 3
Radeon RX 480                130.20     SE +/- 0.02, N = 3
Radeon R9 Fury               130.16     SE +/- 0.02, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

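Dividing the Max SP Flops result by the GPU Compute Cores count reported under OpenCL Details gives a rough per-core throughput. The sketch below is a back-of-the-envelope illustration in Python that ignores clock speeds. The Radeon configurations are left out because this file lists no core count for them. The names `cores`, `gflops` and `per_core` are hypothetical.

```python
# GPU Compute Cores as listed under OpenCL Details in this result file.
cores = {
    "GeForce GTX 970": 1664,
    "GeForce GTX 980": 2048,
    "GeForce GTX 980 Ti": 2816,
    "GeForce GTX 1050": 640,
    "GeForce GTX 1080 Ti #1": 3584,
    "GeForce GTX 1050 Ti": 768,
    "GeForce GTX 1060": 1280,
    "GeForce GTX 1070": 1920,
    "GeForce GTX 1080": 2560,
}

# SHOC Max SP Flops results (GFLOPS) from this result file.
gflops = {
    "GeForce GTX 970": 4333.19,
    "GeForce GTX 980": 4989.37,
    "GeForce GTX 980 Ti": 6156.85,
    "GeForce GTX 1050": 2112.97,
    "GeForce GTX 1080 Ti #1": 13088.70,
    "GeForce GTX 1050 Ti": 2677.91,
    "GeForce GTX 1060": 4776.31,
    "GeForce GTX 1070": 7080.15,
    "GeForce GTX 1080": 9342.41,
}

# GFLOPS per compute core for every configuration with a known core count.
per_core = {gpu: gflops[gpu] / cores[gpu] for gpu in cores}

for gpu, value in sorted(per_core.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{gpu:24s} {value:.2f} GFLOPS per core")
```

With these inputs the GTX 10-series cards land at roughly 3.3 to 3.7 GFLOPS per core and the GTX 900-series cards at roughly 2.2 to 2.6, so core count alone does not explain the ranking in this test.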
SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Triad
GB/s, More Is Better (OpenBenchmarking.org)

GeForce GTX 1080 Ti #1       12.41   SE +/- 0.00, N = 3
GeForce GTX 1080             12.21   SE +/- 0.00, N = 3
GeForce GTX 980 Ti           12.20   SE +/- 0.00, N = 3
GeForce GTX 1070             12.11   SE +/- 0.00, N = 3
GeForce GTX 980              11.94   SE +/- 0.00, N = 3
GeForce GTX 1060             11.88   SE +/- 0.00, N = 3
GeForce GTX 970              11.82   SE +/- 0.00, N = 3
GeForce GTX 1050             11.40   SE +/- 0.00, N = 3
GeForce GTX 1050 Ti          11.38   SE +/- 0.01, N = 3
Radeon R9 Fury               8.42    SE +/- 0.08, N = 3
Radeon RX 580                8.40    SE +/- 0.11, N = 6
Radeon RX 480                8.39    SE +/- 0.02, N = 3

1. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable 2.0.3
Test: Server Room - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

Gigabyte AMD Radeon RX 480   0.27    SE +/- 0.00, N = 3

Darktable 2.0.3
Test: Masskrug - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

Gigabyte AMD Radeon RX 480   0.27    SE +/- 0.00, N = 3

Darktable 2.0.3
Test: Boat - Acceleration: OpenCL
Seconds, Fewer Is Better (OpenBenchmarking.org)

Gigabyte AMD Radeon RX 480   4.95    SE +/- 0.07, N = 3

Phoronix Test Suite v10.8.4