CUDA 2016 NVIDIA Linux Ubuntu NVIDIA CUDA Linux 2016 compute benchmarks by Michael Larabel for a future article. GeForce GTX 650: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 680: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 680 2048MB (1006/3004MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 750: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 760: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 760 2048MB (1124/3004MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 780 Ti: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 950: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 960: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 970: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 980: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 980 4096MB (1126/3505MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 980 Ti: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 1050: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: Zotac NVIDIA GeForce GTX 1050 2048MB (1681/3504MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 1050 Ti: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1341/3504MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 1060: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 1060 6GB 6144MB (557/4006MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 1070: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 1070 8192MB (1069/4006MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 GeForce GTX 1080: Processor: Intel Xeon E3-1280 v5 @ 4.00GHz (8 Cores), Motherboard: MSI C236A WORKSTATION (MS-7998) v1.0, Chipset: Intel Sky Lake, Memory: 16384MB, Disk: 256GB TOSHIBA-RD400, Graphics: NVIDIA GeForce GTX 1080 8192MB (1538/5005MHz), Audio: Realtek ALC1150, Network: Intel Connection OS: Ubuntu 16.04, Kernel: 4.4.0-57-generic (x86_64), Desktop: Unity 7.4.0, Display Server: X Server 1.18.4, Display Driver: NVIDIA 375.26, OpenGL: 4.5.0, Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 3840x2160 ubu_deep: Processor: 2 x Intel 0000 @ 3.00GHz (48 Cores), Motherboard: Supermicro X10DRi-LN4+ v1.01, Chipset: Intel Xeon E7 v4/Xeon, Memory: 64512MB, Disk: 1000GB My Passport 0820, Audio: NVIDIA Device 10f0, Network: Intel I350 Gigabit Connection OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Vulkan: 1.0.8, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 1024x768 ubu_ml3: Processor: 2 x Intel 0000 @ 3.00GHz (48 Cores), Motherboard: Supermicro X10DRi-LN4+ v1.01, Chipset: Intel Xeon E7 v4/Xeon, Memory: 64512MB, Disk: 1000GB My Passport 0820, Audio: NVIDIA Device 10f0, Network: Intel I350 Gigabit Connection OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Vulkan: 1.0.8, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 1024x768 ubu_ml_375.26: Processor: 2 x Intel 0000 @ 3.00GHz (48 Cores), Motherboard: Supermicro X10DRi-LN4+ v1.01, Chipset: Intel Xeon E7 v4/Xeon, Memory: 64512MB, Disk: 1000GB My Passport 0820, Audio: NVIDIA Device 10f0, Network: Intel I350 Gigabit Connection OS: Ubuntu 16.04, Kernel: 4.4.0-59-generic (x86_64), Vulkan: 1.0.24, Compiler: GCC 5.4.0 20160609 + CUDA 8.0, File-System: ext4, Screen Resolution: 1024x768 SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: CUDA - Benchmark: FFT SP GFLOPS > Higher Is Better GeForce GTX 750 ..... 116.13 |============ GeForce GTX 950 ..... 178.56 |=================== GeForce GTX 960 ..... 194.10 |===================== GeForce GTX 970 ..... 266.73 |============================ GeForce GTX 980 ..... 292.36 |=============================== GeForce GTX 980 Ti .. 308.49 |================================= GeForce GTX 1050 .... 171.27 |================== GeForce GTX 1050 Ti . 199.71 |===================== GeForce GTX 1060 .... 304.62 |================================ GeForce GTX 1070 .... 377.27 |======================================== GeForce GTX 1080 .... 462.60 |================================================= SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: CUDA - Benchmark: MD5 Hash GHash/s > Higher Is Better GeForce GTX 750 ..... 1.28 |===== GeForce GTX 950 ..... 2.69 |=========== GeForce GTX 960 ..... 3.83 |================ GeForce GTX 970 ..... 5.44 |======================= GeForce GTX 980 ..... 6.47 |=========================== GeForce GTX 980 Ti .. 7.73 |================================ GeForce GTX 1050 .... 2.49 |========== GeForce GTX 1050 Ti . 3.03 |============= GeForce GTX 1060 .... 5.64 |======================== GeForce GTX 1070 .... 8.40 |=================================== GeForce GTX 1080 .... 11.90 |================================================== SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: CUDA - Benchmark: Max SP Flops GFLOPS > Higher Is Better GeForce GTX 750 ..... 1160.99 |====== GeForce GTX 950 ..... 2210.77 |=========== GeForce GTX 960 ..... 2941.88 |=============== GeForce GTX 970 ..... 4320.28 |====================== GeForce GTX 980 ..... 5002.78 |========================== GeForce GTX 980 Ti .. 6145.80 |=============================== GeForce GTX 1050 .... 2109.11 |=========== GeForce GTX 1050 Ti . 2688.23 |============== GeForce GTX 1060 .... 4765.98 |======================== GeForce GTX 1070 .... 7096.44 |==================================== GeForce GTX 1080 .... 9385.11 |================================================ SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: CUDA - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better GeForce GTX 750 ..... 160.58 |=============== GeForce GTX 950 ..... 364.33 |================================== GeForce GTX 960 ..... 379.43 |=================================== GeForce GTX 970 ..... 349.90 |================================= GeForce GTX 980 ..... 335.27 |=============================== GeForce GTX 980 Ti .. 349.09 |================================= GeForce GTX 1050 .... 433.21 |======================================== GeForce GTX 1050 Ti . 453.15 |========================================== GeForce GTX 1060 .... 503.11 |=============================================== GeForce GTX 1070 .... 501.08 |=============================================== GeForce GTX 1080 .... 526.21 |================================================= ASKAP tConvolveCuda 2015-11-10 Processing: Gridding Million Grid Points Per Second > Higher Is Better GeForce GTX 950 ..... 3399.14 |==================== GeForce GTX 960 ..... 3132.42 |================== GeForce GTX 970 ..... 5255.51 |============================== GeForce GTX 980 ..... 6006.45 |=================================== GeForce GTX 980 Ti .. 8320.50 |================================================ GeForce GTX 1050 .... 3698.00 |===================== GeForce GTX 1050 Ti . 3715.36 |===================== GeForce GTX 1060 .... 5625.68 |================================ GeForce GTX 1070 .... 7607.31 |============================================ GeForce GTX 1080 .... 8236.45 |================================================ ASKAP tConvolveCuda 2015-11-10 Processing: Degridding Million Grid Points Per Second > Higher Is Better GeForce GTX 950 ..... 5625.68 |================ GeForce GTX 960 ..... 5325.12 |=============== GeForce GTX 970 ..... 9399.84 |========================== GeForce GTX 980 ..... 10798.13 |============================== GeForce GTX 980 Ti .. 17010.80 |=============================================== GeForce GTX 1050 .... 5873.92 |================ GeForce GTX 1050 Ti . 5961.62 |================ GeForce GTX 1060 .... 9861.33 |=========================== GeForce GTX 1070 .... 13312.80 |===================================== GeForce GTX 1080 .... 14273.00 |======================================= CUDA Mini-Nbody 2015-11-10 Test: Original Seconds < Lower Is Better GeForce GTX 750 ..... 182.08 |================================================= GeForce GTX 780 Ti .. 61.44 |================= GeForce GTX 950 ..... 104.15 |============================ GeForce GTX 960 ..... 82.70 |====================== GeForce GTX 970 ..... 52.51 |============== GeForce GTX 980 ..... 46.60 |============= GeForce GTX 980 Ti .. 36.06 |========== GeForce GTX 1050 .... 115.24 |=============================== GeForce GTX 1050 Ti . 101.85 |=========================== GeForce GTX 1060 .... 58.42 |================ GeForce GTX 1070 .... 39.70 |=========== GeForce GTX 1080 .... 33.06 |========= Caffe AlexNet 2016-06-11 Build: CUDA AlexNet Milli-Seconds < Lower Is Better GeForce GTX 680 ..... 52573.77 |===================================== GeForce GTX 760 ..... 66411.83 |=============================================== GeForce GTX 780 Ti .. 28595.10 |==================== GeForce GTX 950 ..... 30595.47 |====================== GeForce GTX 960 ..... 27360.73 |=================== GeForce GTX 970 ..... 17005.67 |============ GeForce GTX 980 ..... 14977.20 |=========== GeForce GTX 980 Ti .. 11722.10 |======== GeForce GTX 1050 .... 30845.03 |====================== GeForce GTX 1050 Ti . 26985.90 |=================== GeForce GTX 1060 .... 16266.23 |============ GeForce GTX 1070 .... 11451.90 |======== GeForce GTX 1080 .... 9738.65 |======= ubu_deep ............ 9734.94 |======= ubu_ml3 ............. 9462.11 |======= ubu_ml_375.26 ....... 9791.41 |======= Caffe AlexNet 2016-06-11 Build: CUDA Googlenet Milli-Seconds < Lower Is Better GeForce GTX 680 ..... 133624.00 |===================================== GeForce GTX 760 ..... 164009.00 |============================================== GeForce GTX 780 Ti .. 65105.93 |================== GeForce GTX 950 ..... 68771.30 |=================== GeForce GTX 960 ..... 59805.47 |================= GeForce GTX 970 ..... 40125.47 |=========== GeForce GTX 980 ..... 35955.47 |========== GeForce GTX 980 Ti .. 31440.40 |========= GeForce GTX 1050 .... 69616.53 |==================== GeForce GTX 1050 Ti . 60253.57 |================= GeForce GTX 1060 .... 37468.77 |=========== GeForce GTX 1070 .... 27658.17 |======== GeForce GTX 1080 .... 24039.57 |======= ubu_deep ............ 25701.77 |======= ubu_ml3 ............. 25817.43 |======= ubu_ml_375.26 ....... 25693.67 |=======