nbody

Intel Core i5-6600K testing with a MSI Z170A GAMING M5 (MS-7977) v1.0 (1.I0 BIOS) and MSI NVIDIA GeForce RTX 3080 10GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012197-FI-NBODY897883.

nbodyProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionMSI NVIDIA GeForce RTX 3080Intel Core i5-6600K @ 3.90GHz (4 Cores)MSI Z170A GAMING M5 (MS-7977) v1.0 (1.I0 BIOS)Intel Xeon E3-1200 v5/E3-150016GB1024GB INTEL SSDPEKNW010T8 + 6001GB Western Digital WD60EFRX-68L + 3001GB Western Digital WD30EZRX-22D + 8002GB Seagate ST8000DM004-2CX1MSI NVIDIA GeForce RTX 3080 10GB (1755/9501MHz)Realtek ALC1150U28E570Qualcomm Atheros Killer E2400Ubuntu 20.105.8.0-34-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9NVIDIA 455.45.014.6.01.2.142GCC 10.2.0 + CUDA 11.0ext43840x2160OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xe2 - Thermald 2.3- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT disabled + mds: Mitigation of Clear buffers; SMT disabled + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: disabled RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT disabled

nbodycuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To ZeroMSI NVIDIA GeForce RTX 3080507.621831.127872.983619.836654.090OpenBenchmarking.org

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalMSI NVIDIA GeForce RTX 3080110220330440550SE +/- 1.76, N = 3507.62

CUDA Mini-Nbody

Test: Cache Blocking

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingMSI NVIDIA GeForce RTX 30802004006008001000SE +/- 0.45, N = 3831.13

CUDA Mini-Nbody

Test: Loop Unrolling

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingMSI NVIDIA GeForce RTX 30802004006008001000SE +/- 0.71, N = 3872.98

CUDA Mini-Nbody

Test: SOA Data Layout

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutMSI NVIDIA GeForce RTX 3080130260390520650SE +/- 0.90, N = 3619.84

CUDA Mini-Nbody

Test: Flush Denormals To Zero

OpenBenchmarking.org(NBody^2)/s, More Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroMSI NVIDIA GeForce RTX 3080140280420560700SE +/- 0.79, N = 3654.09


Phoronix Test Suite v10.8.4