cudatest

AMD Ryzen Threadripper 1920X 12-Core testing with an ASRock X399 Taichi (P3.50 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Arch Linux via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1905034-SP-CUDATEST998 .
cudatest: System Details

Configurations: cudatest1, cudatest2, cudatest, eVGA NVIDIA GeForce RTX 2070

Processor: AMD Ryzen Threadripper 1920X 12-Core @ 3.50GHz (12 Cores / 24 Threads)
Motherboard: ASRock X399 Taichi (P3.50 BIOS)
Chipset: AMD Family 17h
Memory: 32768MB
Disk: 960GB Force MP510 + 2000GB TOSHIBA HDWD120
Graphics: eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz); additional entries listed for later configurations: eVGA NVIDIA GeForce RTX 2070 8GB (315/405MHz), eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz)
Audio: NVIDIA TU106 HD Audio
Network: 2 x Intel I211 + Intel Dual Band-AC 3168NGW
OS: Arch Linux
Kernel: 5.0.10-arch1-1-ARCH (x86_64)
Desktop: Xfce 4.12
Display Server: X Server 1.20.4
Display Driver: NVIDIA 418.56
Compiler: GCC 8.3.0 + CUDA 10.1
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-libmpx --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu
Processor Details: Scaling Governor: acpi-cpufreq schedutil
Security Details: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

cudatest: Result Overview, cuda-mini-nbody, (NBody^2)/s

Test                      cudatest1   cudatest2   cudatest   eVGA NVIDIA GeForce RTX 2070
Original                  249         242         248        250
Cache Blocking            333         325         335        337
Loop Unrolling            334         333         339        339
SOA Data Layout           251         249         252        252
Flush Denormals To Zero   242         254         257        257

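The overview figures are easier to compare as a relative change against the Original kernel. The following is a minimal sketch of that arithmetic, assuming the values are transcribed correctly from the overview above; the names `TESTS`, `RESULTS`, and `gain_over_original` are hypothetical helpers for illustration and are not part of the Phoronix Test Suite.

```python
# Mean throughput in (NBody^2)/s, transcribed from the result overview.
# TESTS, RESULTS and gain_over_original are illustrative names only.
TESTS = [
    "Original",
    "Cache Blocking",
    "Loop Unrolling",
    "SOA Data Layout",
    "Flush Denormals To Zero",
]

# One list per configuration, in the same order as TESTS.
RESULTS = {
    "cudatest1": [249, 333, 334, 251, 242],
    "cudatest2": [242, 325, 333, 249, 254],
    "cudatest": [248, 335, 339, 252, 257],
    "eVGA NVIDIA GeForce RTX 2070": [250, 337, 339, 252, 257],
}


def gain_over_original(scores):
    """Percent change of each score relative to the first (Original) score."""
    baseline = scores[0]
    return [round(100.0 * (s - baseline) / baseline, 1) for s in scores]


if __name__ == "__main__":
    for config, scores in RESULTS.items():
        print(config)
        for test, gain in zip(TESTS, gain_over_original(scores)):
            print(f"  {test}: {gain:+.1f}%")
```

Under these assumptions, Cache Blocking and Loop Unrolling come out roughly a third faster than Original on every configuration, while SOA Data Layout and Flush Denormals To Zero stay within a few percent of it.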
CUDA Mini-Nbody 2015-11-10, Test: Original
(NBody^2)/s, More Is Better
cudatest1: 249 (SE +/- 0.10, N = 3)
cudatest2: 242 (SE +/- 3.08, N = 3)
cudatest: 248 (SE +/- 0.14, N = 3)
eVGA NVIDIA GeForce RTX 2070: 250 (SE +/- 1.55, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Cache Blocking
(NBody^2)/s, More Is Better
cudatest1: 333 (SE +/- 2.88, N = 3)
cudatest2: 325 (SE +/- 5.02, N = 5)
cudatest: 335 (SE +/- 0.37, N = 3)
eVGA NVIDIA GeForce RTX 2070: 337 (SE +/- 0.64, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Loop Unrolling
(NBody^2)/s, More Is Better
cudatest1: 334 (SE +/- 2.89, N = 3)
cudatest2: 333 (SE +/- 1.76, N = 3)
cudatest: 339 (SE +/- 0.50, N = 3)
eVGA NVIDIA GeForce RTX 2070: 339 (SE +/- 0.50, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: SOA Data Layout
(NBody^2)/s, More Is Better
cudatest1: 251 (SE +/- 0.63, N = 3)
cudatest2: 249 (SE +/- 0.90, N = 3)
cudatest: 252 (SE +/- 0.52, N = 3)
eVGA NVIDIA GeForce RTX 2070: 252 (SE +/- 0.51, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Flush Denormals To Zero
(NBody^2)/s, More Is Better
cudatest1: 242 (SE +/- 3.50, N = 5)
cudatest2: 254 (SE +/- 1.29, N = 3)
cudatest: 257 (SE +/- 0.52, N = 3)
eVGA NVIDIA GeForce RTX 2070: 257 (SE +/- 0.27, N = 3)

Phoronix Test Suite v10.8.4