OpenCL Testing

OpenCL tests for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1704286-RI-1704211TR78&grw&sro&rro.

OpenCL TestingProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #2Radeon RX 480Radeon RX 580Radeon R9 FuryGigabyte AMD Radeon RX 480Intel Core i7-7700K @ 4.50GHz (8 Cores)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 591f + Z27016384MBSamsung SSD 950 PRO 256GBeVGA NVIDIA GeForce GTX 970 4096MB (1164/3505MHz)Realtek ALC892Realtek RTL8111/8168/8411Ubuntu 17.044.10.0-19-generic (x86_64)Unity 7.5.0X Server 1.19.3NVIDIA 381.094.5.01.0.42GCC 6.3.0 20170406ext43840x2160NVIDIA GeForce GTX 980 4096MB (135/324MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)Zotac NVIDIA GeForce GTX 1050 2048MB (1316/3504MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1468/5508MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)NVIDIA GeForce GTX 1070 8192MB (250/4006MHz)NVIDIA GeForce GTX 1080 8192MB (84/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1472/5508MHz)AMD Radeon RX 470/480 8192MBAcer B286HK4.8.0-040800-generic (x86_64)modesetting 1.19.3OpenCL 2.0 AMD-APP (2348.3)MSI AMD Radeon RX 470/480 8192MBSapphire AMD Radeon R9 FURY / NANO 4096MBAMD FX-8320 Eight-Core @ 3.51GHz (8 Cores)ASUS Crosshair V FormulaAMD RD890 bridge4 x 2048 MB DDR3-800MHz160GB Western Digital WD1600JS-60N + 1000GB Hitachi HDS72101 + 240GB ADATA SP550Gigabyte AMD Radeon RX 480 8192MBRealtek ALC889LD24L11FHDIntel 82583V Gigabit ConnectionUbuntu 16.044.4.0-75-generic (x86_64)KDE Frameworks 5X Server 1.18.4modesetting 1.18.44.5.134741.0.3GCC 5.4.0 201606091920x1080OpenBenchmarking.orgCompiler Details- GeForce GTX 970: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti #1: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX 480: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Gigabyte AMD Radeon RX 480: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1050, GeForce GTX 1080 Ti #1, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, GeForce GTX 1080 Ti #2, Radeon RX 480, Radeon RX 580, Radeon R9 Fury: Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1080 Ti #1: GPU Compute Cores: 3584- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560System Details- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1080 Ti #1: GPU Compute Cores: 3584.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.Graphics Details- Radeon R9 Fury, Gigabyte AMD Radeon RX 480: GLAMOR

OpenCL Testingdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writejuliagpu: GPUGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #2Radeon RX 480Radeon RX 580Radeon R9 FuryGigabyte AMD Radeon RX 48011.82399.016.574333.1912.8113.17280.99125.70143.63133.30111315195.2011.94459.237.594989.3712.8113.17333.16142.77164.57154.80121825827.073.585.460.9712.20712.969.346156.8512.8113.17351.70216.90265.83242.50137239997.1011.40246.283.252112.9712.8113.17271.7687.1394.8386.1366598918.932.785.280.7712.41986.6819.8113088.7012.8113.17592.43316.63337.87341.33202729946.7316.3018.4012.1511.38207.624.132677.9112.8113.17301.6286.6794.1085.0380390064.174.275.531.0711.88329.517.344776.3112.8113.17380.73139.10153.23145.20119921484.473.465.350.8712.11518.1610.697080.1512.8113.17450.04186.70204.90195.73151370700.073.285.350.8712.21634.1414.239342.4112.8113.17520.35208.97229.03219.97174049828.8726.3618.689.168.394.050.08130.2039.8139.4314.891.502.671.4026.3718.699.178.404.060.08130.2140.2039.1214.891.502.701.4026.3618.699.178.424.030.08130.1641.2040.4014.891.502.701.404.950.270.27178.87213.33188.67OpenBenchmarking.org

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Boat - Acceleration: OpenCLGigabyte AMD Radeon RX 4801.11382.22763.34144.45525.569SE +/- 0.07, N = 34.95

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Masskrug - Acceleration: OpenCLGigabyte AMD Radeon RX 4800.06080.12160.18240.24320.304SE +/- 0.00, N = 30.27

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Server Room - Acceleration: OpenCLGigabyte AMD Radeon RX 4800.06080.12160.18240.24320.304SE +/- 0.00, N = 30.27

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 Ti612182430SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 326.3726.3626.363.582.783.283.464.2716.30

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 Ti510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 318.6918.6818.695.465.285.355.355.5318.40

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 Ti3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 39.179.169.170.970.770.870.871.0712.15

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 10503691215SE +/- 0.11, N = 6SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 38.408.398.4212.2011.9411.8212.4112.2112.1111.8811.3811.401. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 10502004006008001000SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 15.11, N = 6SE +/- 1.07, N = 3SE +/- 2.00, N = 3SE +/- 2.48, N = 3SE +/- 1.90, N = 3SE +/- 7.10, N = 3SE +/- 3.36, N = 3SE +/- 6.55, N = 6SE +/- 8.54, N = 64.064.054.03712.96459.23399.01986.68634.14518.16329.51207.62246.281. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1050510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.080.080.089.347.596.5719.8114.2310.697.344.133.251. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 10503K6K9K12K15KSE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 2.28, N = 3SE +/- 19.83, N = 3SE +/- 1.10, N = 3SE +/- 25.00, N = 3SE +/- 14.19, N = 3SE +/- 21.97, N = 3SE +/- 14.72, N = 3SE +/- 5.27, N = 3SE +/- 0.29, N = 3130.21130.20130.166156.854989.374333.1913088.709342.417080.154776.312677.912112.971. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1050918273645SE +/- 0.53, N = 3SE +/- 0.57, N = 6SE +/- 0.30, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 340.2039.8141.2012.8112.8112.8112.8112.8112.8112.8112.8112.811. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1050918273645SE +/- 0.28, N = 3SE +/- 0.36, N = 3SE +/- 0.36, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 339.1239.4340.4013.1713.1713.1713.1713.1713.1713.1713.1713.171. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX 580Radeon RX 480Radeon R9 FuryGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 1050130260390520650SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.47, N = 3SE +/- 1.29, N = 3SE +/- 0.19, N = 3SE +/- 1.29, N = 3SE +/- 2.01, N = 3SE +/- 2.14, N = 3SE +/- 0.39, N = 3SE +/- 3.36, N = 3SE +/- 2.55, N = 314.8914.8914.89351.70333.16280.99592.43520.35450.04380.73301.62271.761. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRadeon RX 580Radeon RX 480Radeon R9 FuryGigabyte AMD Radeon RX 480GeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 105070140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.69, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.47, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.26, N = 31.501.501.50178.87216.90142.77125.70316.63208.97186.70139.1086.6787.131. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRadeon RX 580Radeon RX 480Radeon R9 FuryGigabyte AMD Radeon RX 480GeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 105070140210280350SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 1.17, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.55, N = 3SE +/- 0.39, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 32.702.672.70213.33265.83164.57143.63337.87229.03204.90153.2394.1094.831. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRadeon RX 580Radeon RX 480Radeon R9 FuryGigabyte AMD Radeon RX 480GeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 105070140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.23, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.23, N = 3SE +/- 0.27, N = 3SE +/- 0.19, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 31.401.401.40188.67242.50154.80133.30341.33219.97195.73145.2085.0386.131. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 1080 Ti #1GeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 1050 TiGeForce GTX 105040M80M120M160M200MSE +/- 153929.03, N = 3SE +/- 162823.13, N = 3SE +/- 75709.36, N = 3SE +/- 235270.97, N = 3SE +/- 171757.93, N = 3SE +/- 74030.18, N = 3SE +/- 296618.42, N = 3SE +/- 89962.63, N = 3SE +/- 80114.08, N = 3137239997.10121825827.07111315195.20202729946.73174049828.87151370700.07119921484.4780390064.1766598918.931. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm


Phoronix Test Suite v10.8.5