cudatest AMD Ryzen Threadripper 1920X 12-Core testing with a ASRock X399 Taichi (P3.50 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Arch Linux via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1905034-SP-CUDATEST998&grr .
cudatest Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 AMD Ryzen Threadripper 1920X 12-Core @ 3.50GHz (12 Cores / 24 Threads) ASRock X399 Taichi (P3.50 BIOS) AMD Family 17h 32768MB 960GB Force MP510 + 2000GB TOSHIBA HDWD120 eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz) NVIDIA TU106 HD Audio 2 x Intel I211 + Intel Dual Band-AC 3168NGW Arch Linux 5.0.10-arch1-1-ARCH (x86_64) Xfce 4.12 X Server 1.20.4 NVIDIA 418.56 GCC 8.3.0 + CUDA 10.1 ext4 1920x1080 eVGA NVIDIA GeForce RTX 2070 8GB (315/405MHz) eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz) OpenBenchmarking.org Compiler Details - --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-libmpx --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu Processor Details - Scaling Governor: acpi-cpufreq schedutil Security Details - __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
cudatest cuda-mini-nbody: Flush Denormals To Zero cuda-mini-nbody: Original cuda-mini-nbody: SOA Data Layout cuda-mini-nbody: Cache Blocking cuda-mini-nbody: Loop Unrolling cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 242 249 251 333 334 254 242 249 325 333 257 248 252 335 339 257 250 252 337 339 OpenBenchmarking.org
CUDA Mini-Nbody Test: Flush Denormals To Zero OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Flush Denormals To Zero cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 60 120 180 240 300 SE +/- 3.50, N = 5 SE +/- 1.29, N = 3 SE +/- 0.52, N = 3 SE +/- 0.27, N = 3 242 254 257 257
CUDA Mini-Nbody Test: Original OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Original cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 50 100 150 200 250 SE +/- 0.10, N = 3 SE +/- 3.08, N = 3 SE +/- 0.14, N = 3 SE +/- 1.55, N = 3 249 242 248 250
CUDA Mini-Nbody Test: SOA Data Layout OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: SOA Data Layout cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 60 120 180 240 300 SE +/- 0.63, N = 3 SE +/- 0.90, N = 3 SE +/- 0.52, N = 3 SE +/- 0.51, N = 3 251 249 252 252
CUDA Mini-Nbody Test: Cache Blocking OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Cache Blocking cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 70 140 210 280 350 SE +/- 2.88, N = 3 SE +/- 5.02, N = 5 SE +/- 0.37, N = 3 SE +/- 0.64, N = 3 333 325 335 337
CUDA Mini-Nbody Test: Loop Unrolling OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Loop Unrolling cudatest1 cudatest2 cudatest eVGA NVIDIA GeForce RTX 2070 70 140 210 280 350 SE +/- 2.89, N = 3 SE +/- 1.76, N = 3 SE +/- 0.50, N = 3 SE +/- 0.50, N = 3 334 333 339 339
Phoronix Test Suite v10.8.4