OpenCL CUDA NVIDIA GPGPU Linux Tests

A Pascal test with NVIDIA Linux driver on 16.10 using new

GeForce GTX 680:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX 680 2048MB (1006/3004MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 750:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 760:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX 760 2048MB (980/3004MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 780 Ti:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 950:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 960:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 970:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 980:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX 980 4096MB (1126/3505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX 980 Ti:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GeForce GTX TITAN X:

  Processor: Intel Core i5-6600K @ 3.50GHz (4 Cores), Motherboard: MSI Z170A GAMING PRO (MS-7984) v1.0, Chipset: Intel Device 191f, Memory: 16384MB, Disk: 256GB TS256GSSD370S, Graphics: NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz), Audio: Intel Device a170, Network: Intel Device 15b8

  OS: Ubuntu 14.04, Kernel: 3.19.0-33-generic (x86_64), Desktop: Unity 7.2.5, Display Server: X Server 1.17.1, Display Driver: NVIDIA 352.39, OpenGL: 4.3.0, Compiler: GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5, File-System: ext4, Screen Resolution: 3840x2160

GTX 1050 Ti:

  Processor: Intel Core i3-7100 @ 3.90GHz (4 Cores), Motherboard: MSI B250M PRO-VD(MS-7A74) v1.0, Chipset: Intel Device 590f, Memory: 16384MB, Disk: 120GB Samsung SSD 750 + 1000GB Seagate ST1000DM010-2EP1, Graphics: Intel Kabylake GT2 3072MB, Audio: Realtek ALC887-VD, Monitor: Compaq W17q, Network: Realtek RTL8111/8168/8411

  OS: Ubuntu 16.10, Kernel: 4.8.0-37-generic (x86_64), Desktop: LXDE 0.8.2, Display Server: X Server 1.18.4, OpenGL: 4.3 Mesa 12.0.3, OpenCL: OpenCL 1.2 CUDA 8.0.0, Vulkan: 1.0.24, Compiler: GCC 5.4.1 20160929 + CUDA 8.0, File-System: ext4, Screen Resolution: 1440x900

CUDA Mini-Nbody 2015-11-10
Test: Original
Seconds < Lower Is Better
GeForce GTX 750 ..... 180.66 |=================================================
GeForce GTX 780 Ti .. 61.03  |=================
GeForce GTX 950 ..... 105.30 |=============================
GeForce GTX 960 ..... 82.01  |======================
GeForce GTX 970 ..... 54.32  |===============
GeForce GTX 980 ..... 45.38  |============
GeForce GTX 980 Ti .. 34.58  |=========
GeForce GTX TITAN X . 32.37  |=========
GTX 1050 Ti ......... 101.91 |============================

CUDA Mini-Nbody 2015-11-10
Test: Cache Blocking
Seconds < Lower Is Better
GeForce GTX 750 ..... 98.19 |==================================================
GeForce GTX 780 Ti .. 29.99 |===============
GeForce GTX 950 ..... 49.89 |=========================
GeForce GTX 960 ..... 37.08 |===================
GeForce GTX 970 ..... 28.53 |===============
GeForce GTX 980 ..... 25.13 |=============
GeForce GTX 980 Ti .. 19.77 |==========
GeForce GTX TITAN X . 18.65 |=========
GTX 1050 Ti ......... 45.84 |=======================

CUDA Mini-Nbody 2015-11-10
Test: Loop Unrolling
Seconds < Lower Is Better
GeForce GTX 750 ..... 89.34 |==================================================
GeForce GTX 780 Ti .. 27.05 |===============
GeForce GTX 950 ..... 47.54 |===========================
GeForce GTX 960 ..... 35.35 |====================
GeForce GTX 970 ..... 26.42 |===============
GeForce GTX 980 ..... 23.88 |=============
GeForce GTX 980 Ti .. 18.46 |==========
GeForce GTX TITAN X . 17.59 |==========
GTX 1050 Ti ......... 42.44 |========================

CUDA Mini-Nbody 2015-11-10
Test: SOA Data Layout
Seconds < Lower Is Better
GeForce GTX 750 ..... 199.95 |=================================================
GeForce GTX 780 Ti .. 54.39  |=============
GeForce GTX 950 ..... 108.50 |===========================
GeForce GTX 960 ..... 79.97  |====================
GeForce GTX 970 ..... 55.87  |==============
GeForce GTX 980 ..... 50.15  |============
GeForce GTX 980 Ti .. 40.94  |==========
GeForce GTX TITAN X . 37.43  |=========
GTX 1050 Ti ......... 92.63  |=======================

CUDA Mini-Nbody 2015-11-10
Test: Flush Denormals To Zero
Seconds < Lower Is Better
GeForce GTX 750 ..... 199.83 |=================================================
GeForce GTX 780 Ti .. 53.26  |=============
GeForce GTX 950 ..... 108.48 |===========================
GeForce GTX 960 ..... 79.84  |====================
GeForce GTX 970 ..... 55.80  |==============
GeForce GTX 980 ..... 49.53  |============
GeForce GTX 980 Ti .. 40.85  |==========
GeForce GTX TITAN X . 37.37  |=========
GTX 1050 Ti ......... 91.34  |======================
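
Because every result is a run time in seconds, the cards may be easier to compare as speedup factors against a common baseline. The following is a minimal sketch in Python, not part of the original results: the helper name `speedup_vs` is hypothetical, the choice of the GeForce GTX 750 as baseline is an assumption, and the timings are copied from the "Test: Original" results above.

```python
# Run times in seconds from the "Test: Original" results (lower is better).
original_seconds = {
    "GeForce GTX 750": 180.66,
    "GeForce GTX 780 Ti": 61.03,
    "GeForce GTX 950": 105.30,
    "GeForce GTX 960": 82.01,
    "GeForce GTX 970": 54.32,
    "GeForce GTX 980": 45.38,
    "GeForce GTX 980 Ti": 34.58,
    "GeForce GTX TITAN X": 32.37,
    "GTX 1050 Ti": 101.91,
}


def speedup_vs(baseline, timings):
    """Hypothetical helper: map each card to baseline_time / card_time.

    A factor above 1.0 means the card finished faster than the baseline.
    """
    base = timings[baseline]
    return {card: round(base / secs, 2) for card, secs in timings.items()}


# Assumption: use the slowest card in the table as the baseline.
speedups = speedup_vs("GeForce GTX 750", original_seconds)

# Print the cards from fastest to slowest.
for card, factor in sorted(speedups.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{card:<20} {factor:.2f}x")
```

The same helper can be pointed at any of the other four tests by substituting that test's timings; the ratios are only meaningful within a single test, since each variant runs different kernel code.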