OpenCL Testing

OpenCL tests for a future article on Phoronix.

HTML result view exported from: https://openbenchmarking.org/result/1704286-RI-1704211TR78&grw&export=txt&sor&rro.

OpenCL TestingProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #2Radeon RX 480Radeon RX 580Radeon R9 FuryGigabyte AMD Radeon RX 480Intel Core i7-7700K @ 4.50GHz (8 Cores)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 591f + Z27016384MBSamsung SSD 950 PRO 256GBeVGA NVIDIA GeForce GTX 970 4096MB (1164/3505MHz)Realtek ALC892Realtek RTL8111/8168/8411Ubuntu 17.044.10.0-19-generic (x86_64)Unity 7.5.0X Server 1.19.3NVIDIA 381.094.5.01.0.42GCC 6.3.0 20170406ext43840x2160NVIDIA GeForce GTX 980 4096MB (135/324MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)Zotac NVIDIA GeForce GTX 1050 2048MB (1316/3504MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1468/5508MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1505/4006MHz)NVIDIA GeForce GTX 1070 8192MB (250/4006MHz)NVIDIA GeForce GTX 1080 8192MB (84/5005MHz)NVIDIA GeForce GTX 1080 Ti 11264MB (1472/5508MHz)AMD Radeon RX 470/480 8192MBAcer B286HK4.8.0-040800-generic (x86_64)modesetting 1.19.3OpenCL 2.0 AMD-APP (2348.3)MSI AMD Radeon RX 470/480 8192MBSapphire AMD Radeon R9 FURY / NANO 4096MBAMD FX-8320 Eight-Core @ 3.51GHz (8 Cores)ASUS Crosshair V FormulaAMD RD890 bridge4 x 2048 MB DDR3-800MHz160GB Western Digital WD1600JS-60N + 1000GB Hitachi HDS72101 + 240GB ADATA SP550Gigabyte AMD Radeon RX 480 8192MBRealtek ALC889LD24L11FHDIntel 82583V Gigabit ConnectionUbuntu 16.044.4.0-75-generic (x86_64)KDE Frameworks 5X Server 1.18.4modesetting 1.18.44.5.134741.0.3GCC 5.4.0 201606091920x1080OpenBenchmarking.orgCompiler Details- GeForce GTX 970: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080 Ti #1: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX 480: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon RX 580: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Radeon R9 Fury: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v- Gigabyte AMD Radeon RX 480: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- GeForce GTX 970, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1050, GeForce GTX 1080 Ti #1, GeForce GTX 1050 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, GeForce GTX 1080 Ti #2, Radeon RX 480, Radeon RX 580, Radeon R9 Fury: Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1080 Ti #1: GPU Compute Cores: 3584- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560System Details- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1080 Ti #1: GPU Compute Cores: 3584.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.Graphics Details- Radeon R9 Fury, Gigabyte AMD Radeon RX 480: GLAMOR

OpenCL Testingdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthcl-mem: Copycl-mem: Readcl-mem: Writejuliagpu: GPUGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #2Radeon RX 480Radeon RX 580Radeon R9 FuryGigabyte AMD Radeon RX 48011.82399.016.574333.1912.8113.17280.99125.70143.63133.30111315195.2011.94459.237.594989.3712.8113.17333.16142.77164.57154.80121825827.073.585.460.9712.20712.969.346156.8512.8113.17351.70216.90265.83242.50137239997.1011.40246.283.252112.9712.8113.17271.7687.1394.8386.1366598918.932.785.280.7712.41986.6819.8113088.7012.8113.17592.43316.63337.87341.33202729946.7316.3018.4012.1511.38207.624.132677.9112.8113.17301.6286.6794.1085.0380390064.174.275.531.0711.88329.517.344776.3112.8113.17380.73139.10153.23145.20119921484.473.465.350.8712.11518.1610.697080.1512.8113.17450.04186.70204.90195.73151370700.073.285.350.8712.21634.1414.239342.4112.8113.17520.35208.97229.03219.97174049828.8726.3618.689.168.394.050.08130.2039.8139.4314.891.502.671.4026.3718.699.178.404.060.08130.2140.2039.1214.891.502.701.4026.3618.699.178.424.030.08130.1641.2040.4014.891.502.701.404.950.270.27178.87213.33188.67OpenBenchmarking.org

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Boat - Acceleration: OpenCLGigabyte AMD Radeon RX 4801.11382.22763.34144.45525.569SE +/- 0.07, N = 34.95

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Masskrug - Acceleration: OpenCLGigabyte AMD Radeon RX 4800.06080.12160.18240.24320.304SE +/- 0.00, N = 30.27

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.0.3Test: Server Room - Acceleration: OpenCLGigabyte AMD Radeon RX 4800.06080.12160.18240.24320.304SE +/- 0.00, N = 30.27

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLRadeon RX 580Radeon R9 FuryRadeon RX 480GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #1612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 326.3726.3626.3616.304.273.583.463.282.78

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLRadeon R9 FuryRadeon RX 580Radeon RX 480GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 1080 Ti #1510152025SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 318.6918.6918.6818.405.535.465.355.355.28

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLGeForce GTX 1050 TiRadeon R9 FuryRadeon RX 580Radeon RX 480GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 1080 Ti #13691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.159.179.179.161.070.970.870.870.77

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 970GeForce GTX 1060GeForce GTX 980GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1080 Ti #13691215SE +/- 0.02, N = 3SE +/- 0.11, N = 6SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 38.398.408.4211.3811.4011.8211.8811.9412.1112.2012.2112.411. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPRadeon R9 FuryRadeon RX 480Radeon RX 580GeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 1060GeForce GTX 970GeForce GTX 980GeForce GTX 1070GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1080 Ti #12004006008001000SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 6.55, N = 6SE +/- 8.54, N = 6SE +/- 3.36, N = 3SE +/- 2.00, N = 3SE +/- 1.07, N = 3SE +/- 7.10, N = 3SE +/- 1.90, N = 3SE +/- 15.11, N = 6SE +/- 2.48, N = 34.034.054.06207.62246.28329.51399.01459.23518.16634.14712.96986.681. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #1510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 30.080.080.083.254.136.577.347.599.3410.6914.2319.811. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsRadeon R9 FuryRadeon RX 480Radeon RX 580GeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #13K6K9K12K15KSE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.29, N = 3SE +/- 5.27, N = 3SE +/- 1.10, N = 3SE +/- 14.72, N = 3SE +/- 19.83, N = 3SE +/- 2.28, N = 3SE +/- 21.97, N = 3SE +/- 14.19, N = 3SE +/- 25.00, N = 3130.16130.20130.212112.972677.914333.194776.314989.376156.857080.159342.4113088.701. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon RX 480Radeon RX 580Radeon R9 Fury918273645SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.57, N = 6SE +/- 0.53, N = 3SE +/- 0.30, N = 312.8112.8112.8112.8112.8112.8112.8112.8112.8139.8140.2041.201. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackGeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1080 Ti #1GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080Radeon RX 580Radeon RX 480Radeon R9 Fury918273645SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.28, N = 3SE +/- 0.36, N = 3SE +/- 0.36, N = 313.1713.1713.1713.1713.1713.1713.1713.1713.1739.1239.4340.401. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050GeForce GTX 970GeForce GTX 1050 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #1130260390520650SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 2.55, N = 3SE +/- 0.19, N = 3SE +/- 3.36, N = 3SE +/- 1.29, N = 3SE +/- 1.47, N = 3SE +/- 0.39, N = 3SE +/- 2.14, N = 3SE +/- 2.01, N = 3SE +/- 1.29, N = 314.8914.8914.89271.76280.99301.62333.16351.70380.73450.04520.35592.431. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 970GeForce GTX 1060GeForce GTX 980Gigabyte AMD Radeon RX 480GeForce GTX 1070GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1080 Ti #170140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.26, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.69, N = 3SE +/- 0.15, N = 3SE +/- 0.29, N = 3SE +/- 0.00, N = 3SE +/- 0.47, N = 31.501.501.5086.6787.13125.70139.10142.77178.87186.70208.97216.90316.631. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 970GeForce GTX 1060GeForce GTX 980GeForce GTX 1070Gigabyte AMD Radeon RX 480GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1080 Ti #170140210280350SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 1.17, N = 3SE +/- 0.39, N = 3SE +/- 0.03, N = 3SE +/- 0.55, N = 32.672.702.7094.1094.83143.63153.23164.57204.90213.33229.03265.83337.871. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRadeon RX 480Radeon RX 580Radeon R9 FuryGeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 970GeForce GTX 1060GeForce GTX 980Gigabyte AMD Radeon RX 480GeForce GTX 1070GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1080 Ti #170140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.15, N = 3SE +/- 0.00, N = 3SE +/- 0.23, N = 3SE +/- 0.19, N = 3SE +/- 0.27, N = 3SE +/- 0.10, N = 3SE +/- 0.23, N = 31.401.401.4085.0386.13133.30145.20154.80188.67195.73219.97242.50341.331. (CC) gcc options: -O2 -flto -lOpenCL

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 970GeForce GTX 1060GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti #140M80M120M160M200MSE +/- 80114.08, N = 3SE +/- 89962.63, N = 3SE +/- 75709.36, N = 3SE +/- 296618.42, N = 3SE +/- 162823.13, N = 3SE +/- 153929.03, N = 3SE +/- 74030.18, N = 3SE +/- 171757.93, N = 3SE +/- 235270.97, N = 366598918.9380390064.17111315195.20119921484.47121825827.07137239997.10151370700.07174049828.87202729946.731. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm


Phoronix Test Suite v10.8.5