cudatest

AMD Ryzen Threadripper 1920X 12-Core testing with an ASRock X399 Taichi (P3.50 BIOS) and eVGA NVIDIA GeForce RTX 2070 8GB on Arch Linux via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/1905034-SP-CUDATEST998.

System details (result identifiers: cudatest1, cudatest2, cudatest, eVGA NVIDIA GeForce RTX 2070)

Processor: AMD Ryzen Threadripper 1920X 12-Core @ 3.50GHz (12 Cores / 24 Threads)
Motherboard: ASRock X399 Taichi (P3.50 BIOS)
Chipset: AMD Family 17h
Memory: 32768MB
Disk: 960GB Force MP510 + 2000GB TOSHIBA HDWD120
Graphics: eVGA NVIDIA GeForce RTX 2070 8GB (300/405MHz; later results also report 315/405MHz and 300/405MHz)
Audio: NVIDIA TU106 HD Audio
Network: 2 x Intel I211 + Intel Dual Band-AC 3168NGW
OS: Arch Linux
Kernel: 5.0.10-arch1-1-ARCH (x86_64)
Desktop: Xfce 4.12
Display Server: X Server 1.20.4
Display Driver: NVIDIA 418.56
Compiler: GCC 8.3.0 + CUDA 10.1
File-System: ext4
Screen Resolution: 1920x1080

Compiler Details: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++ --enable-libmpx --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu

Processor Details: Scaling Governor: acpi-cpufreq schedutil

Security Details: __user pointer sanitization + Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + SSB disabled via prctl and seccomp

Result overview, (NBody^2)/s, More Is Better

cuda-mini-nbody test       cudatest1  cudatest2  cudatest  eVGA NVIDIA GeForce RTX 2070
Original                   249        242        248       250
Cache Blocking             333        325        335       337
Loop Unrolling             334        333        339       339
SOA Data Layout            251        249        252       252
Flush Denormals To Zero    242        254        257       257
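The overview is easier to compare when each test variant is expressed relative to the Original kernel on the same configuration. The following is a minimal sketch of that arithmetic, not part of the Phoronix Test Suite output: the `results` dictionary simply copies the figures from the result overview above, and the helper name `speedup_vs_original` is an illustrative assumption.

```python
# Results in (NBody^2)/s, copied from the result overview above.
# The dictionary layout and function name are illustrative assumptions.
CONFIGS = ["cudatest1", "cudatest2", "cudatest", "eVGA NVIDIA GeForce RTX 2070"]

results = {
    "Original":                dict(zip(CONFIGS, [249, 242, 248, 250])),
    "Cache Blocking":          dict(zip(CONFIGS, [333, 325, 335, 337])),
    "Loop Unrolling":          dict(zip(CONFIGS, [334, 333, 339, 339])),
    "SOA Data Layout":         dict(zip(CONFIGS, [251, 249, 252, 252])),
    "Flush Denormals To Zero": dict(zip(CONFIGS, [242, 254, 257, 257])),
}


def speedup_vs_original(results, baseline="Original"):
    """Return {test: {config: ratio}}, dividing each result by the
    baseline test's result on the same configuration."""
    base = results[baseline]
    return {
        test: {cfg: value / base[cfg] for cfg, value in row.items()}
        for test, row in results.items()
    }


if __name__ == "__main__":
    for test, row in speedup_vs_original(results).items():
        ratios = "  ".join(f"{row[cfg]:.2f}x" for cfg in CONFIGS)
        print(f"{test:<26}{ratios}")
```

With these figures, Cache Blocking and Loop Unrolling land at roughly 1.34x to 1.38x of Original on every configuration, while SOA Data Layout and Flush Denormals To Zero stay within a few percent of it.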

CUDA Mini-Nbody

Test: Original

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better

Configuration                  Result  SE +/-  N
cudatest1                      249     0.10    3
cudatest2                      242     3.08    3
cudatest                       248     0.14    3
eVGA NVIDIA GeForce RTX 2070   250     1.55    3

CUDA Mini-Nbody

Test: Cache Blocking

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better

Configuration                  Result  SE +/-  N
cudatest1                      333     2.88    3
cudatest2                      325     5.02    5
cudatest                       335     0.37    3
eVGA NVIDIA GeForce RTX 2070   337     0.64    3

CUDA Mini-Nbody

Test: Loop Unrolling

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better

Configuration                  Result  SE +/-  N
cudatest1                      334     2.89    3
cudatest2                      333     1.76    3
cudatest                       339     0.50    3
eVGA NVIDIA GeForce RTX 2070   339     0.50    3

CUDA Mini-Nbody

Test: SOA Data Layout

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better

Configuration                  Result  SE +/-  N
cudatest1                      251     0.63    3
cudatest2                      249     0.90    3
cudatest                       252     0.52    3
eVGA NVIDIA GeForce RTX 2070   252     0.51    3

CUDA Mini-Nbody

Test: Flush Denormals To Zero

CUDA Mini-Nbody 2015-11-10, (NBody^2)/s, More Is Better

Configuration                  Result  SE +/-  N
cudatest1                      242     3.50    5
cudatest2                      254     1.29    3
cudatest                       257     0.52    3
eVGA NVIDIA GeForce RTX 2070   257     0.27    3


Phoronix Test Suite v10.8.4