NVIDIA GTX 1060 CUDA OpenCL Benchmarks

NVIDIA GeForce GTX 1060 Linux benchmark comparison on Ubuntu 16.04 LTS. Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1607179-LO-GTX1060CO58&grs&sro.

NVIDIA GTX 1060 CUDA OpenCL BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionDisplay ServerGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 680 2043MB (1006/3004MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-22-generic (x86_64)Unity 7.4.0NVIDIA 367.184.5.0OpenCL 1.2 CUDA 8.0.01.0.8GCC 5.3.1 20160413ext43840x2160NVIDIA GeForce GTX 760 2043MB (980/3004MHz)NVIDIA GeForce GTX 770 2043MB (1045/3505MHz)NVIDIA GeForce GTX 780 Ti 3067MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2043MB (1202/3304MHz)eVGA NVIDIA GeForce GTX 960 2043MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4091MB (1163/3505MHz)NVIDIA GeForce GTX 980 4091MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6139MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12283MB (1001/3505MHz)Device 8187MB (1503/4006MHz)GCC 5.3.1 20160413 + CUDA 8.0Device 8187MB (1605/5005MHz)OpenCL 1.2 CUDA 8.0.0GCC 5.3.1 20160413Device 6144MB (35/4006MHz)4.4.0-31-generic (x86_64)X Server 1.18.3NVIDIA 367.27GCC 5.4.0 20160609 + CUDA 8.0OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate performanceOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 770: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X: GPU Compute Cores: 3072- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1060: GPU Compute Cores: 1280System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 770: GPU Compute Cores: 1536.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X: GPU Compute Cores: 3072.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1060: GPU Compute Cores: 1280.

NVIDIA GTX 1060 CUDA OpenCL Benchmarksmixbench: Integerluxmark: GPU - Hoteljuliagpu: GPUmixbench: Single Precisionshoc: CUDA - Max SP Flopsshoc: OpenCL - Texture Read Bandwidthcuda-mini-nbody: Originalshoc: CUDA - FFT SPshoc: OpenCL - MD5 HashGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060548.3969050081701.172721.76245.461.39426.1857540097662.032123.85169.141.01579.9072952218685.372892.37254.451.44963.90126481940505.604393.74286.833.79634.60113667434492.302140.89240.652.70837.96127683477125.432785.862944.94268.2482.29189.143.851221.101859107727475.634123.914316.43282.8652.04265.175.461387.471842117989151.134736.404999.85329.7546.51292.786.491703.812433132582358.605872.216144.29351.2135.35302.767.781928.272671140885161.736584.436886.69356.1733.09322.578.402026.762988145091231.376589.577047.10455.6639.12372.3210.612714.033339166710991.878746.299397.41525.2730.51461.2811.851362.782113114800164.404421.434759.31374.2157.32306.497.14OpenBenchmarking.org

Mixbench

Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2016-06-06Benchmark: IntegerGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X6001200180024003000SE +/- 5.81, N = 3SE +/- 15.83, N = 3SE +/- 1.99, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.79, N = 3SE +/- 10.11, N = 3SE +/- 0.31, N = 3SE +/- 0.93, N = 3SE +/- 0.26, N = 3SE +/- 18.45, N = 3SE +/- 13.54, N = 3SE +/- 1.49, N = 31362.782026.762714.03548.39426.18579.90963.90634.60837.961221.101387.471703.811928.271. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X7001400210028003500SE +/- 0.67, N = 3SE +/- 28.49, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 6.67, N = 3SE +/- 15.72, N = 3SE +/- 28.73, N = 3SE +/- 3.06, N = 3SE +/- 2.40, N = 3SE +/- 11.79, N = 32113298833396905757291264113612761859184224332671

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X40M80M120M160M200MSE +/- 163631.84, N = 3SE +/- 406321.65, N = 3SE +/- 798182.24, N = 3SE +/- 171285.36, N = 3SE +/- 17993.17, N = 3SE +/- 12263.11, N = 3SE +/- 102386.58, N = 3SE +/- 212242.91, N = 3SE +/- 231781.82, N = 3SE +/- 112147.99, N = 3SE +/- 276417.82, N = 3SE +/- 549882.41, N = 3SE +/- 331776.18, N = 3114800164.40145091231.37166710991.8750081701.1740097662.0352218685.3781940505.6067434492.3083477125.43107727475.63117989151.13132582358.60140885161.731. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

Mixbench

Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Single PrecisionGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X2K4K6K8K10KSE +/- 7.95, N = 3SE +/- 1.49, N = 3SE +/- 7.69, N = 3SE +/- 2.47, N = 3SE +/- 0.62, N = 3SE +/- 4.35, N = 3SE +/- 9.12, N = 3SE +/- 6.64, N = 3SE +/- 0.26, N = 3SE +/- 31.43, N = 3SE +/- 4.88, N = 3SE +/- 1.67, N = 3SE +/- 3.02, N = 34421.436589.578746.292721.762123.852892.374393.742140.892785.864123.914736.405872.216584.431. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Max SP FlopsGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X2K4K6K8K10KSE +/- 0.24, N = 3SE +/- 1.49, N = 3SE +/- 88.40, N = 3SE +/- 7.67, N = 3SE +/- 1.66, N = 3SE +/- 11.01, N = 3SE +/- 21.31, N = 3SE +/- 41.66, N = 34759.317047.109397.412944.944316.434999.856144.296886.691. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X110220330440550SE +/- 0.79, N = 3SE +/- 0.08, N = 3SE +/- 2.95, N = 3SE +/- 3.27, N = 3SE +/- 0.21, N = 3SE +/- 0.50, N = 3SE +/- 0.12, N = 3SE +/- 0.45, N = 3SE +/- 1.51, N = 3SE +/- 0.22, N = 3SE +/- 1.23, N = 3SE +/- 0.36, N = 3SE +/- 1.86, N = 3374.21455.66525.27245.46169.14254.45286.83240.65268.24282.86329.75351.21356.17-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X20406080100SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.21, N = 3SE +/- 0.18, N = 357.3239.1230.5182.2952.0446.5135.3533.09

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X100200300400500SE +/- 2.06, N = 3SE +/- 0.68, N = 3SE +/- 2.81, N = 3SE +/- 1.12, N = 3SE +/- 0.05, N = 3SE +/- 0.60, N = 3SE +/- 4.36, N = 5SE +/- 0.29, N = 3306.49372.32461.28189.14265.17292.78302.76322.571. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080GeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 37.1410.6111.851.391.011.443.792.703.855.466.497.788.40-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL1. (CXX) g++ options: -O2 -lSHOCCommon -lrt


Phoronix Test Suite v10.8.4