GPUoutput

Testing the difference between VGA and DVI video output on CUDA performance

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1702285-RI-GPUOUTPUT50
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GPUoutputCUDA
February 28 2017
 
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GPUoutputOpenBenchmarking.orgPhoronix Test Suite2 x Intel Xeon E5-2660 v3 @ 3.30GHz (40 Cores)Supermicro X10DRG-OT+-CPU v1.00Intel Xeon E7 v3/Xeon8 x 16384 MB 2133MHz240GB INTEL SSDSC2BB24TITAN X (Pascal) 12288MB (1417/5005MHz)NVIDIA Device 10efIntel 10-Gigabit X540-AT2CentOS Linux 73.10.0-514.6.2.el7.x86_64 (x86_64)GNOME Shell 3.14.4NVIDIA 375.264.4.01.0.24GCC 4.8.5 20150623 + CUDA 8.0xfs1920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionGPUoutput BenchmarksSystem Logs- --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=generic - Scaling Governor: intel_pstate powersave- SELinux: Enabled.

GPUoutputaskap: Griddingaskap: Degriddingcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To ZeroGPUoutputCUDA10946.0721619.0724.6514.6015.5227.2127.17OpenBenchmarking.org

ASKAP tConvolveCuda

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGPUoutputCUDA2K4K6K8K10KSE +/- 147.93, N = 310946.071. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGPUoutputCUDA5K10K15K20K25KSE +/- 568.93, N = 321619.071. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGPUoutputCUDA612182430SE +/- 0.06, N = 324.65

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingGPUoutputCUDA48121620SE +/- 0.01, N = 314.60

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingGPUoutputCUDA48121620SE +/- 0.04, N = 315.52

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutGPUoutputCUDA612182430SE +/- 0.30, N = 327.21

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroGPUoutputCUDA612182430SE +/- 0.15, N = 327.17

7 Results Shown

ASKAP tConvolveCuda:
  Gridding
  Degridding
CUDA Mini-Nbody:
  Original
  Cache Blocking
  Loop Unrolling
  SOA Data Layout
  Flush Denormals To Zero