OpenCL CUDA NVIDIA GPGPU Linux Tests

Running pts/shoc-1.0.0, pts/askap-1.0.0, pts/cuda-mini-nbody-1.0.0, pts/juliagpu-1.3.0, pts/mandelbulbgpu-1.3.0 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1812085-KH-1511113PT12
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 680
November 11 2015
 
GeForce GTX 750
November 11 2015
 
GeForce GTX 760
November 11 2015
 
GeForce GTX 780 Ti
November 11 2015
 
GeForce GTX 950
November 10 2015
 
GeForce GTX 960
November 11 2015
 
GeForce GTX 970
November 11 2015
 
GeForce GTX 980
November 11 2015
 
GeForce GTX 980 Ti
November 10 2015
 
GeForce GTX TITAN X
November 11 2015
 
GeForce GTX Titan Xp Oct-Off
December 08 2018
 
GeForce GTX Titan Xp Oct-Swappiness 100
December 08 2018
 
Invert Behavior (Only Show Selected Data)
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL CUDA NVIDIA GPGPU Linux TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX Titan Xp Oct-OffGeForce GTX Titan Xp Oct-Swappiness 100Intel Core i5-6600K @ 3.50GHz (4 Cores)MSI Z170A GAMING PRO (MS-7984) v1.0Intel Device 191f16384MB256GB TS256GSSD370SNVIDIA GeForce GTX 680 2048MB (1006/3004MHz)Intel Device a170Intel Device 15b8Ubuntu 14.043.19.0-33-generic (x86_64)Unity 7.2.5X Server 1.17.1NVIDIA 352.394.3.0GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5ext43840x2160eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)NVIDIA GeForce GTX 760 2048MB (980/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)Intel Core i9-7920X @ 4.40GHz (24 Cores)ASUS WS X299 SAGEIntel Sky Lake-E DMI3 Registers64512MB10001GB Western Digital WD101KRYZ-01TITAN Xp 12288MB (139/405MHz)Realtek ALC1220Intel ConnectionUbuntu 18.044.15.0-42-generic (x86_64)GNOME Shell 3.28.3NVIDIA 410.484.6.0CUDA 9.11920x1080TITAN Xp 12288MB (1468/5702MHz)OpenBenchmarking.orgCompiler Details- GeForce GTX 680: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 750: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 760: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 780 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 950: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 960: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 970: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX TITAN X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX Titan Xp Oct-Off: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - GeForce GTX Titan Xp Oct-Swappiness 100: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- GeForce GTX 680: Scaling Governor: acpi-cpufreq performance- GeForce GTX 750: Scaling Governor: acpi-cpufreq performance- GeForce GTX 760: Scaling Governor: acpi-cpufreq performance- GeForce GTX 780 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX 950: Scaling Governor: acpi-cpufreq performance- GeForce GTX 960: Scaling Governor: acpi-cpufreq performance- GeForce GTX 970: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX TITAN X: Scaling Governor: acpi-cpufreq performance- GeForce GTX Titan Xp Oct-Off: Scaling Governor: intel_pstate powersave- GeForce GTX Titan Xp Oct-Swappiness 100: Scaling Governor: intel_pstate powersaveOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 750: GPU Compute Cores: 512- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X: GPU Compute Cores: 3072- GeForce GTX Titan Xp Oct-Off: GPU Compute Cores: 3840- GeForce GTX Titan Xp Oct-Swappiness 100: GPU Compute Cores: 3840System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 750: GPU Compute Cores: 512.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X: GPU Compute Cores: 3072.- GeForce GTX Titan Xp Oct-Off: GPU Compute Cores: 3840.- GeForce GTX Titan Xp Oct-Swappiness 100: GPU Compute Cores: 3840.

GeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX Titan Xp Oct-OffGeForce GTX Titan Xp Oct-Swappiness 100Result OverviewPhoronix Test Suite100%258%416%574%732%SHOC Scalable HeterOgeneous ComputingMandelbulbGPUJuliaGPU

OpenCL CUDA NVIDIA GPGPU Linux Testsluxmark: GPU - Luxball HDRluxmark: GPU - Microphoneluxmark: GPU - Hotelmandelbulbgpu: GPUjuliagpu: GPUcuda-mini-nbody: Flush Denormals To Zerocuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Originalaskap: Degriddingaskap: Griddingshoc: OpenCL - Texture Read Bandwidthshoc: CUDA - Texture Read Bandwidthshoc: OpenCL - MD5 Hashshoc: OpenCL - FFT SPshoc: CUDA - MD5 Hashshoc: CUDA - FFT SPGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX Titan Xp Oct-OffGeForce GTX Titan Xp Oct-Swappiness 1004554212757731636512.9748074789.03242.161.9174.97349120060275.5336136874.00199.83199.9589.3498.19180.66121.14158.421.0754.691.08113.644253194146325392138.5038310650.50170.261.4078.449639430299247400001.9078839770.1353.2654.3927.0529.9961.03286.623.78126.715313242376937156070.8764913682.63108.48108.5047.5449.89105.305706.073399.14239.19326.232.3463.222.36172.285474246089744953399.4780042041.7379.8479.9735.3537.0882.015290.323144.85269.98351.313.3662.783.38212.4397374458134658811317.17104144917.2355.8055.8726.4228.5354.329509.145325.12283.36325.164.77117.234.79263.14107134776149263616558.77113830604.2749.5350.1523.8825.1345.38110946051.27332.60336.485.68140.125.70289.63138026268185571656708.83127978049.5340.8540.9418.4619.7734.5817380.608320.50345.55348.926.79170.366.81311.46140816360190675614774.13136037921.4337.3737.4317.5918.6532.3717380.608458.77354.09356.527.41173.897.42324.0987713632.10149218672.5021.9922.3412.0211.4920.4025818.7713464.82626.12623.5815.81280.2916.01458.7186921286.30149335524.4722.2222.2711.9011.3720.2225011.9313546.37629.83635.3515.81279.7216.01450.23OpenBenchmarking.org

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 6803K6K9K12K15KSE +/- 4.70, N = 3SE +/- 44.35, N = 3SE +/- 1.20, N = 3SE +/- 24.85, N = 3SE +/- 0.88, N = 3SE +/- 16.67, N = 3SE +/- 35.97, N = 3SE +/- 1.45, N = 3SE +/- 11.67, N = 3SE +/- 12.17, N = 31408113802107139737547453139639425334914554

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 68014002800420056007000SE +/- 3.00, N = 3SE +/- 18.50, N = 3SE +/- 0.67, N = 3SE +/- 7.64, N = 3SE +/- 1.15, N = 3SE +/- 4.26, N = 3SE +/- 12.00, N = 3SE +/- 0.67, N = 3SE +/- 3.06, N = 3636062684776445824602423430219412127

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 680400800120016002000SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 2.00, N = 31906185514921346897769992463577

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 68020M40M60M80M100MSE +/- 950597.06, N = 3SE +/- 188108.10, N = 3SE +/- 166919.37, N = 3SE +/- 168304.91, N = 3SE +/- 140370.89, N = 3SE +/- 91420.68, N = 3SE +/- 75512.83, N = 3SE +/- 29855.85, N = 3SE +/- 48150.35, N = 3SE +/- 28089.31, N = 3SE +/- 9818.73, N = 3SE +/- 36731.70, N = 386921286.3087713632.1075614774.1371656708.8363616558.7758811317.1744953399.4737156070.8747400001.9025392138.5020060275.5331636512.971. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 68030M60M90M120M150MSE +/- 527550.54, N = 3SE +/- 386285.03, N = 3SE +/- 318277.32, N = 3SE +/- 473156.02, N = 3SE +/- 218639.12, N = 3SE +/- 84325.23, N = 3SE +/- 157475.07, N = 3SE +/- 58084.93, N = 3SE +/- 293396.06, N = 3SE +/- 14125.16, N = 3SE +/- 22546.70, N = 3SE +/- 59682.63, N = 3149335524.47149218672.50136037921.43127978049.53113830604.27104144917.2380042041.7364913682.6378839770.1338310650.5036136874.0048074789.031. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

CUDA Mini-Nbody

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 7504080120160200SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 322.2221.9937.3740.8549.5355.8079.84108.4853.26199.83

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 7504080120160200SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.20, N = 3SE +/- 0.11, N = 3SE +/- 0.21, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.04, N = 322.2722.3437.4340.9450.1555.8779.97108.5054.39199.95

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 75020406080100SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.25, N = 3SE +/- 0.15, N = 3SE +/- 0.21, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.9012.0217.5918.4623.8826.4235.3547.5427.0589.34

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 75020406080100SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.21, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 3SE +/- 0.00, N = 311.3711.4918.6519.7725.1328.5337.0849.8929.9998.19

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 7504080120160200SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.35, N = 3SE +/- 0.57, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.43, N = 3SE +/- 0.21, N = 3SE +/- 0.50, N = 3SE +/- 0.05, N = 320.2220.4032.3734.5845.3854.3282.01105.3061.03180.66

ASKAP tConvolveCuda

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 9506K12K18K24K30KSE +/- 806.83, N = 3SE +/- 806.83, N = 3SE +/- 369.80, N = 3SE +/- 369.80, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 34.80, N = 3SE +/- 41.05, N = 325011.9325818.7717380.6017380.6011094.009509.145290.325706.071. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 9503K6K9K12K15KSE +/- 233.57, N = 3SE +/- 333.87, N = 6SE +/- 130.14, N = 4SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 12.43, N = 3SE +/- 14.40, N = 313546.3713464.828458.778320.506051.275325.123144.853399.141. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 680140280420560700SE +/- 0.78, N = 3SE +/- 1.27, N = 3SE +/- 1.56, N = 3SE +/- 0.21, N = 3SE +/- 0.20, N = 3SE +/- 0.06, N = 3SE +/- 0.56, N = 3SE +/- 0.73, N = 3SE +/- 0.02, N = 3SE +/- 0.28, N = 3SE +/- 0.23, N = 3SE +/- 1.02, N = 3629.83626.12354.09345.55332.60283.36269.98239.19286.62170.26121.14242.16-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 750140280420560700SE +/- 2.35, N = 3SE +/- 8.97, N = 5SE +/- 0.12, N = 3SE +/- 1.22, N = 3SE +/- 1.15, N = 3SE +/- 0.28, N = 3SE +/- 0.14, N = 3SE +/- 0.85, N = 3SE +/- 0.42, N = 3635.35623.58356.52348.92336.48325.16351.31326.23158.42-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 68048121620SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 315.8115.817.416.795.684.773.362.343.781.401.071.91-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 780 TiGeForce GTX 760GeForce GTX 750GeForce GTX 68060120180240300SE +/- 1.72, N = 3SE +/- 1.77, N = 3SE +/- 0.19, N = 3SE +/- 0.65, N = 3SE +/- 1.30, N = 3SE +/- 0.52, N = 3SE +/- 1.20, N = 3SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.31, N = 3SE +/- 0.08, N = 3SE +/- 0.87, N = 3279.72280.29173.89170.36140.12117.2362.7863.22126.7178.4454.6974.97-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 75048121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 316.0116.017.426.815.704.793.382.361.08-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX Titan Xp Oct-Swappiness 100GeForce GTX Titan Xp Oct-OffGeForce GTX TITAN XGeForce GTX 980 TiGeForce GTX 980GeForce GTX 970GeForce GTX 960GeForce GTX 950GeForce GTX 750100200300400500SE +/- 3.82, N = 3SE +/- 1.03, N = 3SE +/- 1.19, N = 3SE +/- 0.32, N = 3SE +/- 3.09, N = 3SE +/- 2.44, N = 3SE +/- 1.49, N = 3SE +/- 0.47, N = 3SE +/- 0.69, N = 3450.23458.71324.09311.46289.63263.14212.43172.28113.64-std=c++14-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft