OpenCL NVIDIA GeForce GTX 1080 Ti Linux

NVIDIA GeForce GTX 1080 Ti Linux performance for OpenCL. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2002182-HU-1703105RI77&sro.

OpenCL NVIDIA GeForce GTX 1080 Ti LinuxProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionSystem LayerGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiCirrus Logic GD 5446Intel Core i7-7700K @ 4.50GHz (8 Cores)MSI Z270-A PRO (MS-7A71) v1.0Intel Device 591f16384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 680 2048MB (1006/3004MHz)Realtek ALC892Realtek RTL8111/8168/8411Ubuntu 17.044.9.13-040913-generic (x86_64)Unity 7.5.0X Server 1.18.4NVIDIA 378.134.5.01.0.39GCC 6.3.0 20170221ext43840x2160NVIDIA GeForce GTX 780 Ti 3072MB (324/324MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1504/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1602/5005MHz)Device 11264MB (1471/5508MHz)Intel Xeon E5-2680 v4 (5 Cores)Selectel OpenStack Nova (1.10.2-1ubuntu1 BIOS)Intel 440FX 82441FX PMC1 x 10240 MB RAM QEMU54GB QEMU HDDCirrus Logic GD 5446 8GBRed Hat Virtio deviceUbuntu 18.044.15.0-76-generic (x86_64)X Server 1.19.6GCC 7.4.0 + CUDA 9.1800x600vm-otherOpenBenchmarking.orgCompiler Details- GeForce GTX 680, GeForce GTX 780 Ti, GeForce GTX 980, GeForce GTX 980 Ti, GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1080, GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic -v Processor Details- GeForce GTX 680: Scaling Governor: intel_pstate performance- GeForce GTX 780 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 980: Scaling Governor: intel_pstate performance- GeForce GTX 980 Ti: Scaling Governor: intel_pstate performance- GeForce GTX 1060: Scaling Governor: intel_pstate performance- GeForce GTX 1070: Scaling Governor: intel_pstate performance- GeForce GTX 1080: Scaling Governor: intel_pstate performance- GeForce GTX 1080 Ti: Scaling Governor: intel_pstate performance- Cirrus Logic GD 5446: CPU Microcode: 0x1OpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1080 Ti: GPU Compute Cores: 3584System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1080 Ti: GPU Compute Cores: 3584.Security Details- Cirrus Logic GD 5446: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown

OpenCL NVIDIA GeForce GTX 1080 Ti Linuxshoc: OpenCL - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Room - OpenCLblender: BMW27 - OpenCLcl-mem: Readcl-mem: Writecl-mem: CopyGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiCirrus Logic GD 5446240.33268.872.2518.4918.0614.86343.29134.87143.27119.404947.56287.06442.974.5814.3918.0215.88218.25271.90250.93235.934989.55333.54447.547.5714.4718.0518.01217.78164.57151.97142.576173.28350.80711.309.323.655.421.00194.62265.67238.17216.504774.37379.85346.327.354.335.461.11233.63153.50139.50139.037077.27448.07520.8610.563.505.300.89176.37205.33191.27186.679428.50525.83642.1014.283.355.300.89172.04228.93215.07208.9713109.00593.60983.1519.742.845.250.81131.24338.20337.77317.2044.4OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti3K6K9K12K15KSE +/- 21.45, N = 3SE +/- 40.25, N = 3SE +/- 61.28, N = 3SE +/- 24.13, N = 3SE +/- 20.67, N = 3SE +/- 23.46, N = 3SE +/- 17.97, N = 34774.377077.279428.5013109.004947.564989.556173.281. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti142842567041.5150.5261.5464.9122.2531.1631.76

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti60120180240300Min: 107.5 / Avg: 115.02 / Max: 176.2Min: 39.4 / Avg: 140.08 / Max: 206.5Min: 40.9 / Avg: 153.22 / Max: 258.1Min: 181.3 / Avg: 201.95 / Max: 334.5Min: 89.2 / Avg: 161.51 / Max: 237.1Min: 47 / Avg: 222.36 / Max: 332Min: 77.1 / Avg: 160.14 / Max: 247.6Min: 109.4 / Avg: 194.38 / Max: 301.1

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti130260390520650SE +/- 0.73, N = 3SE +/- 1.21, N = 3SE +/- 0.57, N = 3SE +/- 1.33, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 3SE +/- 0.42, N = 3SE +/- 1.08, N = 3379.85448.07525.83593.60240.33287.06333.54350.801. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s Per Watt, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti0.77181.54362.31543.08723.8593.433.343.413.091.431.292.191.87

SHOC Scalable HeterOgeneous Computing

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti50100150200250Min: 37.4 / Avg: 110.62 / Max: 132.4Min: 40.1 / Avg: 134.23 / Max: 160Min: 147.3 / Avg: 154.42 / Max: 177.2Min: 46.1 / Avg: 192.33 / Max: 234.6Min: 94.2 / Avg: 168.38 / Max: 196.5Min: 48.5 / Avg: 222.9 / Max: 286.9Min: 55.9 / Avg: 152.2 / Max: 186.5Min: 48.3 / Avg: 187.11 / Max: 233.6

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti2004006008001000SE +/- 1.31, N = 3SE +/- 3.38, N = 3SE +/- 1.18, N = 3SE +/- 1.87, N = 3SE +/- 7.60, N = 3SE +/- 22.07, N = 3SE +/- 1.54, N = 3SE +/- 9.60, N = 3346.32520.86642.10983.15268.87442.97447.54711.301. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 37.3510.5614.2819.742.254.587.579.321. (CXX) g++ options: -O2 -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Boat - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.333.503.352.8418.4914.3914.473.65

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Masskrug - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.465.305.305.2518.0618.0218.055.42

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.2.1Test: Server Room - Acceleration: OpenCLGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.110.890.890.8114.8615.8818.011.00

Blender

Blend File: BMW27 - Compute: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.78aBlend File: BMW27 - Compute: OpenCLCirrus Logic GD 5446GeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti7014021028035044.40233.63176.37172.04131.24343.29218.25217.78194.62

Blender

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterBlender 2.78aSystem Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti50100150200250Min: 67.9 / Avg: 135.76 / Max: 146.3Min: 39.9 / Avg: 161.89 / Max: 173.9Min: 89.3 / Avg: 172.35 / Max: 183.4Min: 116.3 / Avg: 224.39 / Max: 246.6Min: 93.8 / Avg: 196.95 / Max: 205.5Min: 47.4 / Avg: 255.78 / Max: 277.2Min: 93.3 / Avg: 181.24 / Max: 193.6Min: 125.1 / Avg: 215.89 / Max: 238.2

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti70140210280350SE +/- 0.00, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 3SE +/- 0.61, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3153.50205.33228.93338.20134.87271.90164.57265.671. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: ReadGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti0.36680.73361.10041.46721.8341.431.631.431.530.781.411.041.52

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti50100150200250Min: 38.3 / Avg: 107.44 / Max: 127.1Min: 40.6 / Avg: 126.05 / Max: 155.7Min: 144.3 / Avg: 159.63 / Max: 165.7Min: 219.6 / Avg: 220.47 / Max: 221.6Min: 58.8 / Avg: 172.43 / Max: 201.3Min: 49.3 / Avg: 192.67 / Max: 264.9Min: 127.9 / Avg: 158.42 / Max: 166.5Min: 50 / Avg: 174.4 / Max: 216.5

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti70140210280350SE +/- 0.00, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.27, N = 3SE +/- 0.18, N = 3SE +/- 1.22, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3139.50191.27215.07337.77143.27250.93151.97238.171. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: WriteGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti0.49950.9991.49851.9982.49751.291.511.582.220.841.191.091.44

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti50100150200250Min: 37.4 / Avg: 108.02 / Max: 127.2Min: 41.6 / Avg: 126.63 / Max: 156.9Min: 41.5 / Avg: 136.2 / Max: 164.5Min: 92.5 / Avg: 152 / Max: 191.2Min: 69.3 / Avg: 169.87 / Max: 202.1Min: 47.9 / Avg: 211.5 / Max: 270.5Min: 44.1 / Avg: 139.8 / Max: 165.8Min: 51.8 / Avg: 165.53 / Max: 218.8

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti70140210280350SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.19, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3139.03186.67208.97317.20119.40235.93142.57216.501. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s Per Watt, More Is Bettercl-mem 2017-01-13Benchmark: CopyGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti0.38030.76061.14091.52121.90151.341.301.511.690.651.101.081.20

cl-mem

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13System Power Consumption MonitorGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti50100150200250Min: 38.9 / Avg: 103.9 / Max: 127.3Min: 117.8 / Avg: 143.98 / Max: 154.5Min: 90.1 / Avg: 138.65 / Max: 165.2Min: 123.5 / Avg: 188.07 / Max: 220.4Min: 149.1 / Avg: 182.82 / Max: 194.7Min: 66.4 / Avg: 214.78 / Max: 265.9Min: 44.9 / Avg: 131.96 / Max: 165.2Min: 126.6 / Avg: 179.93 / Max: 215.4

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System MonitoringGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 1080 TiGeForce GTX 680GeForce GTX 780 TiGeForce GTX 980GeForce GTX 980 Ti60120180240300Min: 37.4 / Avg: 111.65 / Max: 180.2Min: 39.4 / Avg: 135.16 / Max: 207Min: 38.4 / Avg: 149.33 / Max: 258.1Min: 44.6 / Avg: 191.64 / Max: 334.5Min: 58.5 / Avg: 162.74 / Max: 237.9Min: 47 / Avg: 212.59 / Max: 332Min: 44.1 / Avg: 155.94 / Max: 247.6Min: 48.3 / Avg: 189.36 / Max: 301.1


Phoronix Test Suite v10.8.4