cuda-mini-nbody_jul13_1525

ARMv8 rev 3 testing with a quill and NVIDIA Tegra X2 on Ubuntu 16.04 via the Phoronix Test Suite.

test1:

  Processor: ARMv8 rev 3 @ 2.04GHz (6 Cores), Motherboard: quill, Memory: 8192MB, Disk: 500GB Portable SSD T5 + 31GB 032G34, Graphics: NVIDIA Tegra X2, Monitor: U28E850

  OS: Ubuntu 16.04, Kernel: 4.4.38-tegra (aarch64), Desktop: Unity 7.4.5, Display Server: X Server 1.18.4, Display Driver: NVIDIA 28.2.1, Compiler: GCC 5.4.0 20160609 + CUDA 9.0, File-System: ext4, Screen Resolution: 5760x1200

CUDA Mini-Nbody 2015-11-10
Test: Original
Seconds < Lower Is Better
test1 . 394.75 |===============================================================

CUDA Mini-Nbody 2015-11-10
Test: Cache Blocking
Seconds < Lower Is Better
test1 . 159.85 |===============================================================

CUDA Mini-Nbody 2015-11-10
Test: Loop Unrolling
Seconds < Lower Is Better
test1 . 152.05 |===============================================================

CUDA Mini-Nbody 2015-11-10
Test: SOA Data Layout
Seconds < Lower Is Better
test1 . 359.32 |===============================================================

CUDA Mini-Nbody 2015-11-10
Test: Flush Denormals To Zero
Seconds < Lower Is Better
test1 . 358.20 |===============================================================
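
Every bar above has the same width because only one system (test1) was benchmarked, so the chart does not show how the five variants compare. The sketch below is one way to express each time as a speedup over the "Original" variant. The times are copied from the results above. The names `times_s`, `speedup_vs_baseline`, and `speedups` are hypothetical and are not part of the Phoronix Test Suite or of CUDA Mini-Nbody.

```python
# Times in seconds copied from the test1 results above (lower is better).
times_s = {
    "Original": 394.75,
    "Cache Blocking": 159.85,
    "Loop Unrolling": 152.05,
    "SOA Data Layout": 359.32,
    "Flush Denormals To Zero": 358.20,
}


def speedup_vs_baseline(times, baseline="Original"):
    """Hypothetical helper: return baseline_time / variant_time per variant."""
    base = times[baseline]
    return {name: base / t for name, t in times.items()}


speedups = speedup_vs_baseline(times_s)

if __name__ == "__main__":
    # Print each variant with its time and its speedup over the baseline.
    for name, factor in speedups.items():
        print(f"{name:<24} {times_s[name]:>8.2f} s  {factor:.2f}x")
```

Computed this way, Cache Blocking and Loop Unrolling each come out roughly 2.5x faster than Original on this run, while SOA Data Layout and Flush Denormals To Zero are each about 1.1x faster. These are single-run figures from one system, so the ratios are only indicative.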