CUDA 2016 NVIDIA Linux Ubuntu

NVIDIA CUDA Linux 2016 compute benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1702097-TA-1612261TA27&sor&grt.

CUDA 2016 NVIDIA Linux UbuntuProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkMonitorOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGeForce GTX 650GeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080deepTest1Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores)MSI C236A WORKSTATION (MS-7998) v1.0Intel Sky Lake16384MB256GB TOSHIBA-RD400MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)Realtek ALC1150Intel ConnectionUbuntu 16.044.4.0-57-generic (x86_64)Unity 7.4.0X Server 1.18.4NVIDIA 375.264.5.01.0.24GCC 5.4.0 20160609 + CUDA 8.0ext43840x2160NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)NVIDIA GeForce GTX 760 2048MB (1124/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)Zotac NVIDIA GeForce GTX 1050 2048MB (1681/3504MHz)eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz)NVIDIA GeForce GTX 1060 6GB 6144MB (557/4006MHz)NVIDIA GeForce GTX 1070 8192MB (1069/4006MHz)NVIDIA GeForce GTX 1080 8192MB (1538/5005MHz)Intel Core i7-4790K @ 4.40GHz (8 Cores)Gigabyte Z97X-UD3H-CFIntel 4th Gen Core DRAM1000GB Western Digital WD10EZEX-00W + 512GB ADATA SP900Gigabyte NVIDIA GeForce GTX 560 1024MB (810/2010MHz)Intel Xeon E3-1200 v3/4thC27F390Intel Connection I217-VUbuntu 14.044.2.0-30-generic (x86_64)Unity 7.2.6X Server 1.17.2NVIDIA 367.484.3.01.0.8GCC 4.8.4 + CUDA 8.01920x1080OpenBenchmarking.orgCompiler Details- GeForce GTX 650: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 680: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 750: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 760: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 780 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 950: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 960: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 970: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1050: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1050 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1060: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1070: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1080: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- deepTest1: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 650: GPU Compute Cores: 384- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 750: GPU Compute Cores: 512- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX 1050: GPU Compute Cores: 640- GeForce GTX 1050 Ti: GPU Compute Cores: 768- GeForce GTX 1060: GPU Compute Cores: 1280- GeForce GTX 1070: GPU Compute Cores: 1920- GeForce GTX 1080: GPU Compute Cores: 2560- deepTest1: GPU Compute Cores: 336System Details- GeForce GTX 650: GPU Compute Cores: 384.- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 750: GPU Compute Cores: 512.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX 1050: GPU Compute Cores: 640.- GeForce GTX 1050 Ti: GPU Compute Cores: 768.- GeForce GTX 1060: GPU Compute Cores: 1280.- GeForce GTX 1070: GPU Compute Cores: 1920.- GeForce GTX 1080: GPU Compute Cores: 2560.- deepTest1: GPU Compute Cores: 336.Graphics Details- deepTest1: SNA

CUDA 2016 NVIDIA Linux Ubuntuaskap: Griddingaskap: Degriddingcaffe: CUDA AlexNetcaffe: CUDA Googlenetcuda-mini-nbody: Originalshoc: CUDA - FFT SPshoc: CUDA - MD5 Hashshoc: CUDA - Max SP Flopsshoc: CUDA - Texture Read BandwidthGeForce GTX 650GeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX 1050GeForce GTX 1050 TiGeForce GTX 1060GeForce GTX 1070GeForce GTX 1080deepTest152573.77133624182.08116.131.281160.99160.5866411.8316400928595.1065105.9361.443399.145625.6830595.4768771.30104.15178.562.692210.77364.333132.425325.1227360.7359805.4782.70194.103.832941.88379.435255.519399.8417005.6740125.4752.51266.735.444320.28349.906006.4510798.1314977.2035955.4746.60292.366.475002.78335.278320.5017010.8011722.1031440.4036.06308.497.736145.80349.0936985873.9230845.0369616.53115.24171.272.492109.11433.213715.365961.6226985.9060253.57101.85199.713.032688.23453.155625.689861.3316266.2337468.7758.42304.625.644765.98503.117607.3113312.8011451.9027658.1739.70377.278.407096.44501.088236.4514273.009738.6524039.5733.06462.6011.909385.11526.213273.753582.06OpenBenchmarking.org

ASKAP tConvolveCuda

Processing: Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 950deepTest1GeForce GTX 9602K4K6K8K10KSE +/- 0.00, N = 3SE +/- 84.05, N = 3SE +/- 0.00, N = 3SE +/- 44.82, N = 3SE +/- 39.34, N = 3SE +/- 34.80, N = 3SE +/- 17.36, N = 3SE +/- 0.00, N = 3SE +/- 14.40, N = 3SE +/- 13.36, N = 3SE +/- 0.00, N = 38320.508236.457607.316006.455625.685255.513715.363698.003399.143273.753132.421. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda

Processing: Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGeForce GTX 980 TiGeForce GTX 1080GeForce GTX 1070GeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 950GeForce GTX 960deepTest14K8K12K16K20KSE +/- 369.80, N = 3SE +/- 259.50, N = 3SE +/- 0.00, N = 3SE +/- 147.93, N = 3SE +/- 0.00, N = 3SE +/- 109.30, N = 3SE +/- 44.82, N = 3SE +/- 42.88, N = 3SE +/- 39.34, N = 3SE +/- 0.00, N = 3SE +/- 15.99, N = 317010.8014273.0013312.8010798.139861.339399.845961.625873.925625.685325.123582.061. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

Caffe AlexNet

Build: CUDA AlexNet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CUDA AlexNetGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 1050 TiGeForce GTX 960GeForce GTX 780 TiGeForce GTX 950GeForce GTX 1050GeForce GTX 680GeForce GTX 76014K28K42K56K70KSE +/- 11.97, N = 3SE +/- 2.23, N = 3SE +/- 12.80, N = 3SE +/- 2.22, N = 3SE +/- 7.27, N = 3SE +/- 8.15, N = 3SE +/- 10.92, N = 3SE +/- 5.41, N = 3SE +/- 3.70, N = 3SE +/- 34.72, N = 3SE +/- 2.73, N = 3SE +/- 76.71, N = 3SE +/- 12.08, N = 39738.6511451.9011722.1014977.2016266.2317005.6726985.9027360.7328595.1030595.4730845.0352573.7766411.831. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

Caffe AlexNet

Build: CUDA Googlenet

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe AlexNet 2016-06-11Build: CUDA GooglenetGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 960GeForce GTX 1050 TiGeForce GTX 780 TiGeForce GTX 950GeForce GTX 1050GeForce GTX 680GeForce GTX 76040K80K120K160K200KSE +/- 5.57, N = 3SE +/- 66.90, N = 3SE +/- 95.74, N = 3SE +/- 77.16, N = 3SE +/- 4.75, N = 3SE +/- 15.33, N = 3SE +/- 13.56, N = 3SE +/- 33.10, N = 3SE +/- 20.21, N = 3SE +/- 9.66, N = 3SE +/- 7.85, N = 3SE +/- 31.51, N = 3SE +/- 93.86, N = 324039.5727658.1731440.4035955.4737468.7740125.4759805.4760253.5765105.9368771.3069616.53133624.00164009.001. (CXX) g++ options: -pthread -fPIC -O2 -lcaffe -lglog -lgflags -lprotobuf -lboost_system -lboost_filesystem -lm -lhdf5_hl -lhdf5 -lleveldb -lsnappy -llmdb -lopencv_core -lopencv_highgui -lopencv_imgproc -lboost_thread -lstdc++ -lcblas -latlas

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 1080GeForce GTX 980 TiGeForce GTX 1070GeForce GTX 980GeForce GTX 970GeForce GTX 1060GeForce GTX 780 TiGeForce GTX 960GeForce GTX 1050 TiGeForce GTX 950GeForce GTX 1050GeForce GTX 7504080120160200SE +/- 0.09, N = 3SE +/- 0.48, N = 3SE +/- 0.05, N = 3SE +/- 0.23, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.16, N = 3SE +/- 0.31, N = 3SE +/- 0.14, N = 3SE +/- 0.40, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 333.0636.0639.7046.6052.5158.4261.4482.70101.85104.15115.24182.08

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 1060GeForce GTX 980GeForce GTX 970GeForce GTX 1050 TiGeForce GTX 960GeForce GTX 950GeForce GTX 1050GeForce GTX 750100200300400500SE +/- 1.33, N = 3SE +/- 2.67, N = 3SE +/- 0.35, N = 3SE +/- 0.78, N = 3SE +/- 0.74, N = 3SE +/- 1.20, N = 3SE +/- 1.12, N = 3SE +/- 1.44, N = 3SE +/- 0.50, N = 3SE +/- 0.73, N = 3SE +/- 0.20, N = 3462.60377.27308.49304.62292.36266.73199.71194.10178.56171.27116.131. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 960GeForce GTX 1050 TiGeForce GTX 950GeForce GTX 1050GeForce GTX 7503691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.908.407.736.475.645.443.833.032.692.491.281. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Max SP FlopsGeForce GTX 1080GeForce GTX 1070GeForce GTX 980 TiGeForce GTX 980GeForce GTX 1060GeForce GTX 970GeForce GTX 960GeForce GTX 1050 TiGeForce GTX 950GeForce GTX 1050GeForce GTX 7502K4K6K8K10KSE +/- 64.36, N = 3SE +/- 50.45, N = 3SE +/- 20.77, N = 3SE +/- 9.75, N = 3SE +/- 21.65, N = 3SE +/- 3.47, N = 3SE +/- 8.34, N = 3SE +/- 5.43, N = 3SE +/- 6.61, N = 3SE +/- 0.30, N = 3SE +/- 0.08, N = 39385.117096.446145.805002.784765.984320.282941.882688.232210.772109.111160.991. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthGeForce GTX 1080GeForce GTX 1060GeForce GTX 1070GeForce GTX 1050 TiGeForce GTX 1050GeForce GTX 960GeForce GTX 950GeForce GTX 970GeForce GTX 980 TiGeForce GTX 980GeForce GTX 750110220330440550SE +/- 1.23, N = 3SE +/- 0.09, N = 3SE +/- 1.65, N = 3SE +/- 1.17, N = 3SE +/- 1.01, N = 3SE +/- 0.09, N = 3SE +/- 0.34, N = 3SE +/- 0.05, N = 3SE +/- 0.22, N = 3SE +/- 0.52, N = 3SE +/- 0.46, N = 3526.21503.11501.08453.15433.21379.43364.33349.90349.09335.27160.581. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft


Phoronix Test Suite v10.8.4