VGAoutput

CompareVGAGPUCUDA

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1702284-RI-1702273RI95
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
VGAoutputCUDA
February 27 2017
 
CompareVGAGPUCUDA
February 28 2017
 
Invert Hiding All Results Option
 
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


VGAoutputProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionVGAoutputCUDACompareVGAGPUCUDA2 x Intel Xeon E5-2660 v3 @ 3.30GHz (40 Cores)Supermicro X10DRG-OT+-CPU v1.00Intel Xeon E7 v3/Xeon8 x 16384 MB 2133MHz240GB INTEL SSDSC2BB24LLVMpipeNVIDIA Device 10efIntel 10-Gigabit X540-AT2CentOS Linux 73.10.0-514.6.2.el7.x86_64 (x86_64)GNOME Shell 3.14.4X Server 1.17.2modesetting 1.17.22.1 Mesa 11.2.2 Gallium 0.4 (LLVM 3.8 256 bits)1.0.24GCC 4.8.5 20150623 + CUDA 8.0xfs1024x768TITAN X (Pascal) 12288MB (1288/5005MHz)NVIDIA 375.264.4.01920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=generic Processor Details- Scaling Governor: intel_pstate powersaveSystem Details- SELinux: Enabled.

VGAoutputCUDA vs. CompareVGAGPUCUDA ComparisonPhoronix Test SuiteBaseline+18.2%+18.2%+36.4%+36.4%+54.6%+54.6%72.6%67.8%42.9%35.5%34.6%4.2%2.7%Cache BlockingLoop UnrollingOriginalSOA Data LayoutF.D.T.ZGriddingDegriddingCUDA Mini-NbodyCUDA Mini-NbodyCUDA Mini-NbodyCUDA Mini-NbodyCUDA Mini-NbodyASKAP tConvolveCudaASKAP tConvolveCudaVGAoutputCUDACompareVGAGPUCUDA

VGAoutputaskap: Griddingaskap: Degriddingcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To ZeroVGAoutputCUDACompareVGAGPUCUDA10650.2021050.1335.4525.1325.9836.6336.561109421619.0724.8014.5615.4827.0327.17OpenBenchmarking.org

ASKAP tConvolveCuda

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingVGAoutputCUDACompareVGAGPUCUDA2K4K6K8K10KSE +/- 0.00, N = 3SE +/- 0.00, N = 310650.2011094.001. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingVGAoutputCUDACompareVGAGPUCUDA2K4K6K8K10KMin: 10650.2 / Avg: 10650.2 / Max: 10650.2Min: 11094 / Avg: 11094 / Max: 110941. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingVGAoutputCUDACompareVGAGPUCUDA5K10K15K20K25KSE +/- 568.93, N = 3SE +/- 568.93, N = 321050.1321619.071. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingVGAoutputCUDACompareVGAGPUCUDA4K8K12K16K20KMin: 20481.2 / Avg: 21050.13 / Max: 22188Min: 20481.2 / Avg: 21619.07 / Max: 221881. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalVGAoutputCUDACompareVGAGPUCUDA816243240SE +/- 0.05, N = 3SE +/- 0.05, N = 335.4524.80
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalVGAoutputCUDACompareVGAGPUCUDA816243240Min: 35.39 / Avg: 35.45 / Max: 35.55Min: 24.73 / Avg: 24.8 / Max: 24.9

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingVGAoutputCUDACompareVGAGPUCUDA612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 325.1314.56
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingVGAoutputCUDACompareVGAGPUCUDA612182430Min: 25.1 / Avg: 25.13 / Max: 25.15Min: 14.55 / Avg: 14.56 / Max: 14.57

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingVGAoutputCUDACompareVGAGPUCUDA612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 325.9815.48
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingVGAoutputCUDACompareVGAGPUCUDA612182430Min: 25.94 / Avg: 25.98 / Max: 26.04Min: 15.46 / Avg: 15.48 / Max: 15.5

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutVGAoutputCUDACompareVGAGPUCUDA816243240SE +/- 0.03, N = 3SE +/- 0.31, N = 336.6327.03
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutVGAoutputCUDACompareVGAGPUCUDA816243240Min: 36.58 / Avg: 36.63 / Max: 36.7Min: 26.41 / Avg: 27.03 / Max: 27.34

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroVGAoutputCUDACompareVGAGPUCUDA816243240SE +/- 0.02, N = 3SE +/- 0.16, N = 336.5627.17
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroVGAoutputCUDACompareVGAGPUCUDA816243240Min: 36.54 / Avg: 36.56 / Max: 36.6Min: 26.87 / Avg: 27.17 / Max: 27.37

7 Results Shown

ASKAP tConvolveCuda:
  Gridding
  Degridding
CUDA Mini-Nbody:
  Original
  Cache Blocking
  Loop Unrolling
  SOA Data Layout
  Flush Denormals To Zero