OpenCL NVIDIA GeForce GTX 1080 Ti Linux

NVIDIA GeForce GTX 1080 Ti Linux performance for OpenCL. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1703105-RI-OPENCLNVI39&grs&sor.

OpenCL NVIDIA GeForce GTX 1080 Ti LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiIntel Core i7-7700K @ 4.50GHz (8 Cores)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 591f16384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 680 2048MB (1006/3004MHz)Realtek ALC892Realtek RTL8111/8168/8411Ubuntu 17.044.9.13-040913-generic (x86_64)Unity 7.5.0X Server 1.18.4NVIDIA 378.134.5.01.0.39GCC 6.3.0 20170221ext43840x2160NVIDIA GeForce GTX 780 Ti 3072MB (324/324MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1602/5005MHz)Device 11264MB (1471/5508MHz)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate performanceOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.

OpenCL NVIDIA GeForce GTX 1080 Ti Linuxshoc: OpenCL - MD5 Hashdarktable: Boat - OpenCLshoc: OpenCL - FFT SPdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLshoc: OpenCL - Max SP Flopscl-mem: Copyblender: BMW27 - OpenCLcl-mem: Readshoc: OpenCL - Texture Read Bandwidthcl-mem: WriteGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 Ti2.2518.49268.8718.0614.86119.40343.29134.87240.33143.274.5814.39442.9718.0215.884947.56235.93218.25271.90287.06250.937.5714.47447.5418.0518.014989.55142.57217.78164.57333.54151.979.323.65711.305.421.006173.28216.50194.62265.67350.80238.177.354.33346.325.461.114774.37139.03233.63153.50379.85139.5010.563.50520.865.300.897077.27186.67176.37205.33448.07191.2714.283.35642.105.300.899428.50208.97172.04228.93525.83215.0719.742.84983.155.250.8113109.00317.20131.24338.20593.60337.77OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 680510152025SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 319.7414.2810.569.327.577.354.582.251. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 980GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1080 TiGeForce GTX 780 Ti60120180240300Min: 107.5 / Avg: 115.02 / Max: 176.2Min: 39.4 / Avg: 140.08 / Max: 206.5Min: 40.9 / Avg: 153.22 / Max: 258.1Min: 77.1 / Avg: 160.14 / Max: 247.6Min: 89.2 / Avg: 161.51 / Max: 237.1Min: 109.4 / Avg: 194.38 / Max: 301.1Min: 181.3 / Avg: 201.95 / Max: 334.5Min: 47 / Avg: 222.36 / Max: 332

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 980GeForce GTX 1080GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1080 TiGeForce GTX 780 Ti50100150200250Min: 37.4 / Avg: 110.62 / Max: 132.4Min: 40.1 / Avg: 134.23 / Max: 160Min: 55.9 / Avg: 152.2 / Max: 186.5Min: 147.3 / Avg: 154.42 / Max: 177.2Min: 94.2 / Avg: 168.38 / Max: 196.5Min: 48.3 / Avg: 187.11 / Max: 233.6Min: 46.1 / Avg: 192.33 / Max: 234.6Min: 48.5 / Avg: 222.9 / Max: 286.9

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 980GeForce GTX 1080 TiGeForce GTX 980 TiGeForce GTX 680GeForce GTX 780 Ti50100150200250Min: 37.4 / Avg: 108.02 / Max: 127.2Min: 41.6 / Avg: 126.63 / Max: 156.9Min: 41.5 / Avg: 136.2 / Max: 164.5Min: 44.1 / Avg: 139.8 / Max: 165.8Min: 92.5 / Avg: 152 / Max: 191.2Min: 51.8 / Avg: 165.53 / Max: 218.8Min: 69.3 / Avg: 169.87 / Max: 202.1Min: 47.9 / Avg: 211.5 / Max: 270.5

Blender

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterBlender 2.78aSystem Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 980GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1080 TiGeForce GTX 780 Ti50100150200250Min: 67.9 / Avg: 135.76 / Max: 146.3Min: 39.9 / Avg: 161.89 / Max: 173.9Min: 89.3 / Avg: 172.35 / Max: 183.4Min: 93.3 / Avg: 181.24 / Max: 193.6Min: 93.8 / Avg: 196.95 / Max: 205.5Min: 125.1 / Avg: 215.89 / Max: 238.2Min: 116.3 / Avg: 224.39 / Max: 246.6Min: 47.4 / Avg: 255.78 / Max: 277.2

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 980GeForce GTX 1080GeForce GTX 680GeForce GTX 980 TiGeForce GTX 780 TiGeForce GTX 1080 Ti50100150200250Min: 38.3 / Avg: 107.44 / Max: 127.1Min: 40.6 / Avg: 126.05 / Max: 155.7Min: 127.9 / Avg: 158.42 / Max: 166.5Min: 144.3 / Avg: 159.63 / Max: 165.7Min: 58.8 / Avg: 172.43 / Max: 201.3Min: 50 / Avg: 174.4 / Max: 216.5Min: 49.3 / Avg: 192.67 / Max: 264.9Min: 219.6 / Avg: 220.47 / Max: 221.6

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 980GeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 680GeForce GTX 1080 TiGeForce GTX 780 Ti50100150200250Min: 38.9 / Avg: 103.9 / Max: 127.3Min: 44.9 / Avg: 131.96 / Max: 165.2Min: 90.1 / Avg: 138.65 / Max: 165.2Min: 117.8 / Avg: 143.98 / Max: 154.5Min: 126.6 / Avg: 179.93 / Max: 215.4Min: 149.1 / Avg: 182.82 / Max: 194.7Min: 123.5 / Avg: 188.07 / Max: 220.4Min: 66.4 / Avg: 214.78 / Max: 265.9

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 780 TiGeForce GTX 980GeForce GTX 680510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.843.353.503.654.3314.3914.4718.49

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1080 TiGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 6802004006008001000SE +/- 1.87, N = 3SE +/- 9.60, N = 3SE +/- 1.18, N = 3SE +/- 3.38, N = 3SE +/- 1.54, N = 3SE +/- 22.07, N = 3SE +/- 1.31, N = 3SE +/- 7.60, N = 3983.15711.30642.10520.86447.54442.97346.32268.871. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLGeForce GTX 1080 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 780 TiGeForce GTX 980GeForce GTX 68048121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.255.305.305.425.4618.0218.0518.06

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLGeForce GTX 1080 TiGeForce GTX 1070GeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 680GeForce GTX 780 TiGeForce GTX 98048121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.810.890.891.001.1114.8615.8818.01

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 980GeForce GTX 780 Ti142842567064.9161.5450.5241.5131.7631.1622.25

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 780 TiGeForce GTX 10603K6K9K12K15KSE +/- 24.13, N = 3SE +/- 61.28, N = 3SE +/- 40.25, N = 3SE +/- 17.97, N = 3SE +/- 23.46, N = 3SE +/- 20.67, N = 3SE +/- 21.45, N = 313109.009428.507077.276173.284989.554947.564774.371. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1080GeForce GTX 1070GeForce GTX 1080 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 680GeForce GTX 780 Ti0.77181.54362.31543.08723.8593.433.413.343.092.191.871.431.29

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 1060GeForce GTX 68070140210280350SE +/- 0.10, N = 3SE +/- 0.19, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3317.20235.93216.50208.97186.67142.57139.03119.401. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 780 TiGeForce GTX 980GeForce GTX 6800.49950.9991.49851.9982.49752.221.581.511.441.291.191.090.84

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.78aBlend File: BMW27 - Compute: OpenCLGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 780 TiGeForce GTX 1060GeForce GTX 68070140210280350131.24172.04176.37194.62217.78218.25233.63343.29

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1060GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 780 TiGeForce GTX 980GeForce GTX 6800.38030.76061.14091.52121.90151.691.511.341.301.201.101.080.65

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 1060GeForce GTX 68070140210280350SE +/- 0.61, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 3338.20271.90265.67228.93205.33164.57153.50134.871. (CC) gcc options: -O2 -flto -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1080 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 1060GeForce GTX 980 TiGeForce GTX 980GeForce GTX 780 TiGeForce GTX 680130260390520650SE +/- 1.33, N = 3SE +/- 0.57, N = 3SE +/- 1.21, N = 3SE +/- 0.73, N = 3SE +/- 1.08, N = 3SE +/- 0.42, N = 3SE +/- 0.03, N = 3SE +/- 0.17, N = 3593.60525.83448.07379.85350.80333.54287.06240.331. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 680GeForce GTX 106070140210280350SE +/- 0.27, N = 3SE +/- 1.22, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.18, N = 3SE +/- 0.00, N = 3337.77250.93238.17215.07191.27151.97143.27139.501. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1070GeForce GTX 1080 TiGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 980GeForce GTX 6800.36680.73361.10041.46721.8341.631.531.521.431.431.411.040.78

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 980GeForce GTX 680GeForce GTX 980 TiGeForce GTX 1080 TiGeForce GTX 780 Ti60120180240300Min: 37.4 / Avg: 111.65 / Max: 180.2Min: 39.4 / Avg: 135.16 / Max: 207Min: 38.4 / Avg: 149.33 / Max: 258.1Min: 44.1 / Avg: 155.94 / Max: 247.6Min: 58.5 / Avg: 162.74 / Max: 237.9Min: 48.3 / Avg: 189.36 / Max: 301.1Min: 44.6 / Avg: 191.64 / Max: 334.5Min: 47 / Avg: 212.59 / Max: 332


Phoronix Test Suite v10.8.4