nbody Intel Core i5-6600K testing with a MSI Z170A GAMING M5 (MS-7977) v1.0 (1.I0 BIOS) and MSI NVIDIA GeForce RTX 3080 10GB on Ubuntu 20.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2012197-FI-NBODY897883 .
nbody Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution MSI NVIDIA GeForce RTX 3080 Intel Core i5-6600K @ 3.90GHz (4 Cores) MSI Z170A GAMING M5 (MS-7977) v1.0 (1.I0 BIOS) Intel Xeon E3-1200 v5/E3-1500 16GB 1024GB INTEL SSDPEKNW010T8 + 6001GB Western Digital WD60EFRX-68L + 3001GB Western Digital WD30EZRX-22D + 8002GB Seagate ST8000DM004-2CX1 MSI NVIDIA GeForce RTX 3080 10GB (1755/9501MHz) Realtek ALC1150 U28E570 Qualcomm Atheros Killer E2400 Ubuntu 20.10 5.8.0-34-generic (x86_64) GNOME Shell 3.38.1 X Server 1.20.9 NVIDIA 455.45.01 4.6.0 1.2.142 GCC 10.2.0 + CUDA 11.0 ext4 3840x2160 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 2.3 - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT disabled
nbody cuda-mini-nbody: Original cuda-mini-nbody: Cache Blocking cuda-mini-nbody: Loop Unrolling cuda-mini-nbody: SOA Data Layout cuda-mini-nbody: Flush Denormals To Zero MSI NVIDIA GeForce RTX 3080 507.621 831.127 872.983 619.836 654.090 OpenBenchmarking.org
CUDA Mini-Nbody Test: Original OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Original MSI NVIDIA GeForce RTX 3080 110 220 330 440 550 SE +/- 1.76, N = 3 507.62
CUDA Mini-Nbody Test: Cache Blocking OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Cache Blocking MSI NVIDIA GeForce RTX 3080 200 400 600 800 1000 SE +/- 0.45, N = 3 831.13
CUDA Mini-Nbody Test: Loop Unrolling OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Loop Unrolling MSI NVIDIA GeForce RTX 3080 200 400 600 800 1000 SE +/- 0.71, N = 3 872.98
CUDA Mini-Nbody Test: SOA Data Layout OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: SOA Data Layout MSI NVIDIA GeForce RTX 3080 130 260 390 520 650 SE +/- 0.90, N = 3 619.84
CUDA Mini-Nbody Test: Flush Denormals To Zero OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Flush Denormals To Zero MSI NVIDIA GeForce RTX 3080 140 280 420 560 700 SE +/- 0.79, N = 3 654.09
Phoronix Test Suite v10.8.4