cudatest AMD Ryzen Threadripper 1920X 12-Core testing with a ASRock X399 Taichi (P3.50 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Arch Linux via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1905030-SP-CUDATEST693 .
cudatest Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server Display Driver Compiler File-System Screen Resolution cudatest1 cudatest2 cudatest AMD Ryzen Threadripper 1920X 12-Core @ 3.50GHz (12 Cores / 24 Threads) ASRock X399 Taichi (P3.50 BIOS) AMD Family 17h 32768MB 960GB Force MP510 + 2000GB TOSHIBA HDWD120 eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz) NVIDIA TU106 HD Audio 2 x Intel I211 + Intel Dual Band-AC 3168NGW Arch Linux 5.0.10-arch1-1-ARCH (x86_64) Xfce 4.12 X Server 1.20.4 NVIDIA 418.56 GCC 8.3.0 + CUDA 10.1 ext4 1920x1080 eVGA NVIDIA GeForce RTX 2070 8GB (315/405MHz) eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz) OpenBenchmarking.org Compiler Details - --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-libmpx --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu Processor Details - Scaling Governor: acpi-cpufreq schedutil Security Details - __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp
cudatest cuda-mini-nbody: Original cuda-mini-nbody: Cache Blocking cuda-mini-nbody: Loop Unrolling cuda-mini-nbody: SOA Data Layout cuda-mini-nbody: Flush Denormals To Zero cudatest1 cudatest2 cudatest 249 333 334 251 242 242 325 333 249 254 248 335 339 252 257 OpenBenchmarking.org
CUDA Mini-Nbody Test: Original OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Original cudatest1 cudatest2 cudatest 50 100 150 200 250 SE +/- 0.10, N = 3 SE +/- 3.08, N = 3 SE +/- 0.14, N = 3 249 242 248
CUDA Mini-Nbody Test: Cache Blocking OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Cache Blocking cudatest1 cudatest2 cudatest 70 140 210 280 350 SE +/- 2.88, N = 3 SE +/- 5.02, N = 5 SE +/- 0.37, N = 3 333 325 335
CUDA Mini-Nbody Test: Loop Unrolling OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Loop Unrolling cudatest1 cudatest2 cudatest 70 140 210 280 350 SE +/- 2.89, N = 3 SE +/- 1.76, N = 3 SE +/- 0.50, N = 3 334 333 339
CUDA Mini-Nbody Test: SOA Data Layout OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: SOA Data Layout cudatest1 cudatest2 cudatest 60 120 180 240 300 SE +/- 0.63, N = 3 SE +/- 0.90, N = 3 SE +/- 0.52, N = 3 251 249 252
CUDA Mini-Nbody Test: Flush Denormals To Zero OpenBenchmarking.org (NBody^2)/s, More Is Better CUDA Mini-Nbody 2015-11-10 Test: Flush Denormals To Zero cudatest1 cudatest2 cudatest 60 120 180 240 300 SE +/- 3.50, N = 5 SE +/- 1.29, N = 3 SE +/- 0.52, N = 3 242 254 257
Phoronix Test Suite v10.8.4