NVIDIA GTX 1060 CUDA OpenCL Benchmarks

NVIDIA GeForce GTX 1060 Linux benchmark comparison on Ubuntu 16.04 LTS. Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1607179-LO-GTX1060CO58.

NVIDIA GTX 1060 CUDA OpenCL BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionDisplay ServerGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MBSamsung SSD 950 PRO 256GBNVIDIA GeForce GTX 680 2043MB (1006/3004MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-22-generic (x86_64)Unity 7.4.0NVIDIA 367.184.5.0OpenCL 1.2 CUDA 8.0.01.0.8GCC 5.3.1 20160413ext43840x2160NVIDIA GeForce GTX 760 2043MB (980/3004MHz)NVIDIA GeForce GTX 770 2043MB (1045/3505MHz)NVIDIA GeForce GTX 780 Ti 3067MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2043MB (1202/3304MHz)eVGA NVIDIA GeForce GTX 960 2043MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4091MB (1163/3505MHz)NVIDIA GeForce GTX 980 4091MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6139MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12283MB (1001/3505MHz)Device 8187MB (1503/4006MHz)GCC 5.3.1 20160413 + CUDA 8.0Device 8187MB (1605/5005MHz)OpenCL 1.2 CUDA 8.0.0GCC 5.3.1 20160413Device 6144MB (35/4006MHz)4.4.0-31-generic (x86_64)X Server 1.18.3NVIDIA 367.27GCC 5.4.0 20160609 + CUDA 8.0OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v Processor Details- Scaling Governor: intel_pstate performanceOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 770: GPU Compute Cores: 1536- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X: GPU Compute Cores: 3072- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- GeForce GTX 1060: GPU Compute Cores: 1280System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 770: GPU Compute Cores: 1536.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X: GPU Compute Cores: 3072.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.- GeForce GTX 1060: GPU Compute Cores: 1280.

NVIDIA GTX 1060 CUDA OpenCL Benchmarksmixbench: Integermixbench: Single Precisionshoc: OpenCL - MD5 Hashshoc: OpenCL - Texture Read Bandwidthjuliagpu: GPUluxmark: GPU - Hotelshoc: CUDA - FFT SPshoc: CUDA - Max SP Flopscuda-mini-nbody: OriginalGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060548.392721.761.39245.4650081701.17690426.182123.851.01169.1440097662.03575579.902892.371.44254.4552218685.37729963.904393.743.79286.8381940505.601264634.602140.892.70240.6567434492.301136837.962785.863.85268.2483477125.431276189.142944.9482.291221.104123.915.46282.86107727475.631859265.174316.4352.041387.474736.406.49329.75117989151.131842292.784999.8546.511703.815872.217.78351.21132582358.602433302.766144.2935.351928.276584.438.40356.17140885161.732671322.576886.6933.092026.766589.5710.61455.66145091231.372988372.327047.1039.122714.038746.2911.85525.27166710991.873339461.289397.4130.511362.784421.437.14374.21114800164.402113306.494759.3157.32OpenBenchmarking.org

Mixbench

Benchmark: Integer

OpenBenchmarking.orgGIOPS, More Is BetterMixbench 2016-06-06Benchmark: IntegerGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 10606001200180024003000SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.79, N = 3SE +/- 10.11, N = 3SE +/- 0.31, N = 3SE +/- 0.93, N = 3SE +/- 0.26, N = 3SE +/- 18.45, N = 3SE +/- 13.54, N = 3SE +/- 1.49, N = 3SE +/- 15.83, N = 3SE +/- 1.99, N = 3SE +/- 5.81, N = 3548.39426.18579.90963.90634.60837.961221.101387.471703.811928.272026.762714.031362.781. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

Mixbench

Benchmark: Single Precision

OpenBenchmarking.orgGFLOPS, More Is BetterMixbench 2016-06-06Benchmark: Single PrecisionGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 10602K4K6K8K10KSE +/- 2.47, N = 3SE +/- 0.62, N = 3SE +/- 4.35, N = 3SE +/- 9.12, N = 3SE +/- 6.64, N = 3SE +/- 0.26, N = 3SE +/- 31.43, N = 3SE +/- 4.88, N = 3SE +/- 1.67, N = 3SE +/- 3.02, N = 3SE +/- 1.49, N = 3SE +/- 7.69, N = 3SE +/- 7.95, N = 32721.762123.852892.374393.742140.892785.864123.914736.405872.216584.436589.578746.294421.431. (CXX) g++ options: -lm -lstdc++ -lOpenCL -lrt -O2

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 10603691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.391.011.443.792.703.855.466.497.788.4010.6111.857.14-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060110220330440550SE +/- 3.27, N = 3SE +/- 0.21, N = 3SE +/- 0.50, N = 3SE +/- 0.12, N = 3SE +/- 0.45, N = 3SE +/- 1.51, N = 3SE +/- 0.22, N = 3SE +/- 1.23, N = 3SE +/- 0.36, N = 3SE +/- 1.86, N = 3SE +/- 0.08, N = 3SE +/- 2.95, N = 3SE +/- 0.79, N = 3245.46169.14254.45286.83240.65268.24282.86329.75351.21356.17455.66525.27374.21-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft-lSHOCCommonOpenCL -lOpenCL-lcudadevrt -lcudart_static -lpthread -ldl -lcufft1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 106040M80M120M160M200MSE +/- 171285.36, N = 3SE +/- 17993.17, N = 3SE +/- 12263.11, N = 3SE +/- 102386.58, N = 3SE +/- 212242.91, N = 3SE +/- 231781.82, N = 3SE +/- 112147.99, N = 3SE +/- 276417.82, N = 3SE +/- 549882.41, N = 3SE +/- 331776.18, N = 3SE +/- 406321.65, N = 3SE +/- 798182.24, N = 3SE +/- 163631.84, N = 350081701.1740097662.0352218685.3781940505.6067434492.3083477125.43107727475.63117989151.13132582358.60140885161.73145091231.37166710991.87114800164.401. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 680GeForce GTX 760GeForce GTX 770GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 10607001400210028003500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 6.67, N = 3SE +/- 15.72, N = 3SE +/- 28.73, N = 3SE +/- 3.06, N = 3SE +/- 2.40, N = 3SE +/- 11.79, N = 3SE +/- 28.49, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 36905757291264113612761859184224332671298833392113

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 1060100200300400500SE +/- 1.12, N = 3SE +/- 0.05, N = 3SE +/- 0.60, N = 3SE +/- 4.36, N = 5SE +/- 0.29, N = 3SE +/- 0.68, N = 3SE +/- 2.81, N = 3SE +/- 2.06, N = 3189.14265.17292.78302.76322.57372.32461.28306.491. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Max SP FlopsGeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 10602K4K6K8K10KSE +/- 7.67, N = 3SE +/- 1.66, N = 3SE +/- 11.01, N = 3SE +/- 21.31, N = 3SE +/- 41.66, N = 3SE +/- 1.49, N = 3SE +/- 88.40, N = 3SE +/- 0.24, N = 32944.944316.434999.856144.296886.697047.109397.414759.311. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070GeForce GTX 1080GeForce GTX 106020406080100SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.21, N = 3SE +/- 0.18, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 382.2952.0446.5135.3533.0939.1230.5157.32


Phoronix Test Suite v10.8.4