VGAoutput CompareVGAGPUCUDA

VGAoutputCUDA:

  Processor: 2 x Intel Xeon E5-2660 v3 @ 3.30GHz (40 Cores), Motherboard: Supermicro X10DRG-OT+-CPU v1.00, Chipset: Intel Xeon E7 v3/Xeon, Memory: 8 x 16384 MB 2133MHz, Disk: 240GB INTEL SSDSC2BB24, Graphics: LLVMpipe, Audio: NVIDIA Device 10ef, Network: Intel 10-Gigabit X540-AT2

  OS: CentOS Linux 7, Kernel: 3.10.0-514.6.2.el7.x86_64 (x86_64), Desktop: GNOME Shell 3.14.4, Display Server: X Server 1.17.2, Display Driver: modesetting 1.17.2, OpenGL: 2.1 Mesa 11.2.2 Gallium 0.4 (LLVM 3.8 256 bits), Vulkan: 1.0.24, Compiler: GCC 4.8.5 20150623 + CUDA 8.0, File-System: xfs, Screen Resolution: 1024x768

CompareVGAGPUCUDA:

  Processor: 2 x Intel Xeon E5-2660 v3 @ 3.30GHz (40 Cores), Motherboard: Supermicro X10DRG-OT+-CPU v1.00, Chipset: Intel Xeon E7 v3/Xeon, Memory: 8 x 16384 MB 2133MHz, Disk: 240GB INTEL SSDSC2BB24, Graphics: TITAN X (Pascal) 12288MB (1288/5005MHz), Audio: NVIDIA Device 10ef, Network: Intel 10-Gigabit X540-AT2

  OS: CentOS Linux 7, Kernel: 3.10.0-514.6.2.el7.x86_64 (x86_64), Desktop: GNOME Shell 3.14.4, Display Server: X Server 1.17.2, Display Driver: NVIDIA 375.26, OpenGL: 4.4.0, Vulkan: 1.0.24, Compiler: GCC 4.8.5 20150623 + CUDA 8.0, File-System: xfs, Screen Resolution: 1920x1080

ASKAP tConvolveCuda 2015-11-10
Processing: Gridding
Million Grid Points Per Second > Higher Is Better
VGAoutputCUDA ..... 10650.20 |===============================================
CompareVGAGPUCUDA . 11094.00 |=================================================

ASKAP tConvolveCuda 2015-11-10
Processing: Degridding
Million Grid Points Per Second > Higher Is Better
VGAoutputCUDA ..... 21050.13 |================================================
CompareVGAGPUCUDA . 21619.07 |=================================================

CUDA Mini-Nbody 2015-11-10
Test: Original
Seconds < Lower Is Better
VGAoutputCUDA ..... 35.45 |====================================================
CompareVGAGPUCUDA . 24.80 |====================================

CUDA Mini-Nbody 2015-11-10
Test: Cache Blocking
Seconds < Lower Is Better
VGAoutputCUDA ..... 25.13 |====================================================
CompareVGAGPUCUDA . 14.56 |==============================

CUDA Mini-Nbody 2015-11-10
Test: Loop Unrolling
Seconds < Lower Is Better
VGAoutputCUDA ..... 25.98 |====================================================
CompareVGAGPUCUDA . 15.48 |===============================

CUDA Mini-Nbody 2015-11-10
Test: SOA Data Layout
Seconds < Lower Is Better
VGAoutputCUDA ..... 36.63 |====================================================
CompareVGAGPUCUDA . 27.03 |======================================

CUDA Mini-Nbody 2015-11-10
Test: Flush Denormals To Zero
Seconds < Lower Is Better
VGAoutputCUDA ..... 36.56 |====================================================
CompareVGAGPUCUDA . 27.17 |=======================================