OpenCL CUDA NVIDIA GPGPU Linux Tests

All Maxwell and various Kepler graphics cards tested on the NVIDIA Linux driver. Benchmarks by Michael Larabel for a future article on Phoronix.com just delivering various GPGPU benchmarks for reference purposes.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1603283-GA-1511113PT64
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

HPC - High Performance Computing 2 Tests
OpenCL 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 680
November 11 2015
 
GeForce GTX 750
November 11 2015
 
GeForce GTX 760
November 11 2015
 
GeForce GTX 780 Ti
November 11 2015
 
GeForce GTX 950
November 10 2015
 
GeForce GTX 960
November 11 2015
 
GeForce GTX 970
November 11 2015
 
GeForce GTX 980
November 11 2015
 
GeForce GTX 980 Ti
November 10 2015
 
GeForce GTX TITAN X
November 11 2015
 
dell_bisag
March 23 2016
 
Bisag_Node
March 28 2016
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL CUDA NVIDIA GPGPU Linux TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_NodeIntel Core i5-6600K @ 3.50GHz (4 Cores)MSI Z170A GAMING PRO (MS-7984) v1.0Intel Device 191f16384MB256GB TS256GSSD370SNVIDIA GeForce GTX 680 2048MB (1006/3004MHz)Intel Device a170Intel Device 15b8Ubuntu 14.043.19.0-33-generic (x86_64)Unity 7.2.5X Server 1.17.1NVIDIA 352.394.3.0GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5ext43840x2160eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)NVIDIA GeForce GTX 760 2048MB (980/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)Intel Core i7-4790 @ 3.60GHz (8 Cores)Dell 048DY8Intel 4th Gen Core DRAM2 x 1000GB Western Digital WD10EZEX-75MNVIDIA Device 11b4Realtek ALC280Intel Connection I217-LM3.13.0-24-generic (x86_64)X Server 1.15.1modesetting 0.8.11.3 Mesa 4.0.4GCC 4.8.4 + CUDA 7.51364x768NVIDIA Quadro K4200Unity 7.2.0OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -vProcessor Details- GeForce GTX 680: Scaling Governor: acpi-cpufreq performance- GeForce GTX 750: Scaling Governor: acpi-cpufreq performance- GeForce GTX 760: Scaling Governor: acpi-cpufreq performance- GeForce GTX 780 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX 950: Scaling Governor: acpi-cpufreq performance- GeForce GTX 960: Scaling Governor: acpi-cpufreq performance- GeForce GTX 970: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX TITAN X: Scaling Governor: acpi-cpufreq performance- dell_bisag: Scaling Governor: acpi-cpufreq ondemand- Bisag_Node: Scaling Governor: acpi-cpufreq ondemandOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 750: GPU Compute Cores: 512- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X: GPU Compute Cores: 3072System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 750: GPU Compute Cores: 512.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X: GPU Compute Cores: 3072.

GeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_NodeResult OverviewPhoronix Test Suite100%248%396%544%SHOC Scalable HeterOgeneous ComputingLuxMarkSHOC Scalable HeterOgeneous ComputingSHOC Scalable HeterOgeneous ComputingOpenCL - MD5 HashGPU - Luxball HDROpenCL - FFT SPOpenCL - T.R.B

OpenCL CUDA NVIDIA GPGPU Linux Testsshoc: CUDA - FFT SPshoc: CUDA - MD5 Hashshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: CUDA - Texture Read Bandwidthshoc: OpenCL - Texture Read Bandwidthaskap: Griddingaskap: Degriddingcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerojuliagpu: GPUmandelbulbgpu: GPUluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node74.971.91242.1648074789.0331636512.9757721274554113.641.0854.691.07158.42121.14180.6698.1989.34199.95199.8336136874.0020060275.53349178.441.40170.2638310650.5025392138.5046319414253126.713.78286.6261.0329.9927.0554.3953.2678839770.1347400001.9099243029639172.282.3663.222.34326.23239.193399.145706.07105.3049.8947.54108.50108.4864913682.6337156070.8776924235313212.433.3862.783.36351.31269.983144.855290.3282.0137.0835.3579.9779.8480042041.7344953399.4789724605474263.144.79117.234.77325.16283.365325.129509.1454.3228.5326.4255.8755.80104144917.2358811317.17134644589737289.635.70140.125.68336.48332.606051.271109445.3825.1323.8850.1549.53113830604.2763616558.771492477610713311.466.81170.366.79348.92345.558320.5017380.6034.5819.7718.4640.9440.85127978049.5371656708.831855626813802324.097.42173.897.41356.52354.098458.7717380.6032.3718.6517.5937.4337.37136037921.4375614774.13190663601408160.211.28155.773961583344159.181.10113.723.763.694.393.603.7635214222899OpenBenchmarking.org

SHOC Scalable HeterOgeneous Computing

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X70140210280350SE +/- 0.69, N = 3SE +/- 0.47, N = 3SE +/- 1.49, N = 3SE +/- 2.44, N = 3SE +/- 3.09, N = 3SE +/- 0.32, N = 3SE +/- 1.19, N = 3113.64172.28212.43263.14289.63311.46324.091. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X60120180240300Min: 112.3 / Avg: 113.64 / Max: 114.58Min: 171.35 / Avg: 172.28 / Max: 172.85Min: 209.45 / Avg: 212.43 / Max: 213.96Min: 258.45 / Avg: 263.14 / Max: 266.66Min: 283.58 / Avg: 289.63 / Max: 293.77Min: 310.83 / Avg: 311.46 / Max: 311.86Min: 321.78 / Avg: 324.09 / Max: 325.731. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.082.363.384.795.706.817.421. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X3691215Min: 1.08 / Avg: 1.08 / Max: 1.08Min: 2.35 / Avg: 2.36 / Max: 2.36Min: 3.37 / Avg: 3.38 / Max: 3.38Min: 4.78 / Avg: 4.79 / Max: 4.79Min: 5.7 / Avg: 5.7 / Max: 5.71Min: 6.8 / Avg: 6.81 / Max: 6.81Min: 7.42 / Avg: 7.42 / Max: 7.421. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node4080120160200SE +/- 0.87, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 1.20, N = 3SE +/- 0.52, N = 3SE +/- 1.30, N = 3SE +/- 0.65, N = 3SE +/- 0.19, N = 3SE +/- 0.22, N = 3SE +/- 0.92, N = 674.9754.6978.44126.7163.2262.78117.23140.12170.36173.8960.2159.181. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node306090120150Min: 73.32 / Avg: 74.97 / Max: 76.26Min: 54.6 / Avg: 54.69 / Max: 54.86Min: 78 / Avg: 78.44 / Max: 79.04Min: 126.33 / Avg: 126.71 / Max: 126.94Min: 63.06 / Avg: 63.22 / Max: 63.32Min: 60.39 / Avg: 62.78 / Max: 64.2Min: 116.29 / Avg: 117.23 / Max: 118.08Min: 137.96 / Avg: 140.12 / Max: 142.44Min: 169.1 / Avg: 170.36 / Max: 171.3Min: 173.56 / Avg: 173.89 / Max: 174.22Min: 59.94 / Avg: 60.21 / Max: 60.65Min: 56.26 / Avg: 59.18 / Max: 60.891. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.911.071.403.782.343.364.775.686.797.411.281.101. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node3691215Min: 1.91 / Avg: 1.91 / Max: 1.91Min: 1.07 / Avg: 1.07 / Max: 1.07Min: 1.4 / Avg: 1.4 / Max: 1.4Min: 3.77 / Avg: 3.78 / Max: 3.78Min: 2.34 / Avg: 2.34 / Max: 2.35Min: 3.36 / Avg: 3.36 / Max: 3.36Min: 4.77 / Avg: 4.77 / Max: 4.77Min: 5.68 / Avg: 5.68 / Max: 5.68Min: 6.79 / Avg: 6.79 / Max: 6.8Min: 7.4 / Avg: 7.41 / Max: 7.41Min: 1.28 / Avg: 1.28 / Max: 1.28Min: 1.09 / Avg: 1.1 / Max: 1.131. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X80160240320400SE +/- 0.42, N = 3SE +/- 0.85, N = 3SE +/- 0.14, N = 3SE +/- 0.28, N = 3SE +/- 1.15, N = 3SE +/- 1.22, N = 3SE +/- 0.12, N = 3158.42326.23351.31325.16336.48348.92356.521. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X60120180240300Min: 157.99 / Avg: 158.42 / Max: 159.25Min: 325.05 / Avg: 326.23 / Max: 327.87Min: 351.16 / Avg: 351.31 / Max: 351.59Min: 324.62 / Avg: 325.16 / Max: 325.57Min: 334.27 / Avg: 336.48 / Max: 338.12Min: 347.68 / Avg: 348.92 / Max: 351.36Min: 356.3 / Avg: 356.52 / Max: 356.71. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node80160240320400SE +/- 1.02, N = 3SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.02, N = 3SE +/- 0.73, N = 3SE +/- 0.56, N = 3SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.21, N = 3SE +/- 1.56, N = 3SE +/- 1.00, N = 3SE +/- 12.78, N = 6242.16121.14170.26286.62239.19269.98283.36332.60345.55354.09155.77113.721. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft
OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node60120180240300Min: 240.5 / Avg: 242.16 / Max: 244.03Min: 120.69 / Avg: 121.14 / Max: 121.48Min: 169.78 / Avg: 170.26 / Max: 170.74Min: 286.58 / Avg: 286.62 / Max: 286.65Min: 238.31 / Avg: 239.19 / Max: 240.65Min: 269.42 / Avg: 269.98 / Max: 271.09Min: 283.23 / Avg: 283.36 / Max: 283.42Min: 332.35 / Avg: 332.6 / Max: 332.99Min: 345.15 / Avg: 345.55 / Max: 345.88Min: 351.56 / Avg: 354.09 / Max: 356.94Min: 153.84 / Avg: 155.77 / Max: 157.16Min: 51.61 / Avg: 113.72 / Max: 136.551. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

ASKAP tConvolveCuda

This is a CUDA benchmark of ATNF's ASKAP Benchmark with currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X2K4K6K8K10KSE +/- 14.40, N = 3SE +/- 12.43, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 130.14, N = 43399.143144.855325.126051.278320.508458.771. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X15003000450060007500Min: 3370.33 / Avg: 3399.14 / Max: 3413.54Min: 3132.42 / Avg: 3144.85 / Max: 3169.71Min: 5325.12 / Avg: 5325.12 / Max: 5325.12Min: 6051.27 / Avg: 6051.27 / Max: 6051.27Min: 8320.5 / Avg: 8320.5 / Max: 8320.5Min: 8068.36 / Avg: 8458.77 / Max: 8588.91. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X4K8K12K16K20KSE +/- 41.05, N = 3SE +/- 34.80, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 369.80, N = 3SE +/- 369.80, N = 35706.075290.329509.1411094.0017380.6017380.601. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl
OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X3K6K9K12K15KMin: 5665.02 / Avg: 5706.07 / Max: 5788.17Min: 5220.71 / Avg: 5290.32 / Max: 5325.12Min: 9509.14 / Avg: 9509.14 / Max: 9509.14Min: 11094 / Avg: 11094 / Max: 11094Min: 16641 / Avg: 17380.6 / Max: 17750.4Min: 16641 / Avg: 17380.6 / Max: 17750.41. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node4080120160200SE +/- 0.05, N = 3SE +/- 0.50, N = 3SE +/- 0.21, N = 3SE +/- 0.43, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.57, N = 3SE +/- 0.35, N = 3SE +/- 0.15, N = 6180.6661.03105.3082.0154.3245.3834.5832.373.76
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node306090120150Min: 180.57 / Avg: 180.66 / Max: 180.75Min: 60.06 / Avg: 61.03 / Max: 61.72Min: 104.94 / Avg: 105.3 / Max: 105.69Min: 81.52 / Avg: 82.01 / Max: 82.85Min: 54.17 / Avg: 54.32 / Max: 54.57Min: 45.28 / Avg: 45.38 / Max: 45.57Min: 33.89 / Avg: 34.58 / Max: 35.7Min: 31.93 / Avg: 32.37 / Max: 33.06Min: 3.47 / Avg: 3.76 / Max: 4.48

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node20406080100SE +/- 0.00, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 698.1929.9949.8937.0828.5325.1319.7718.653.69
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node20406080100Min: 98.18 / Avg: 98.19 / Max: 98.2Min: 29.68 / Avg: 29.99 / Max: 30.53Min: 49.85 / Avg: 49.89 / Max: 49.93Min: 37.06 / Avg: 37.08 / Max: 37.11Min: 28.52 / Avg: 28.53 / Max: 28.55Min: 25.03 / Avg: 25.13 / Max: 25.21Min: 19.5 / Avg: 19.77 / Max: 20.19Min: 18.54 / Avg: 18.65 / Max: 18.85Min: 3.47 / Avg: 3.69 / Max: 3.83

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node20406080100SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.25, N = 3SE +/- 0.07, N = 389.3427.0547.5435.3526.4223.8818.4617.594.39
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node20406080100Min: 89.29 / Avg: 89.34 / Max: 89.43Min: 26.97 / Avg: 27.05 / Max: 27.14Min: 47.48 / Avg: 47.54 / Max: 47.57Min: 35.29 / Avg: 35.35 / Max: 35.4Min: 26.38 / Avg: 26.42 / Max: 26.45Min: 23.47 / Avg: 23.88 / Max: 24.16Min: 18.23 / Avg: 18.46 / Max: 18.75Min: 17.21 / Avg: 17.59 / Max: 18.07Min: 4.3 / Avg: 4.39 / Max: 4.53

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node4080120160200SE +/- 0.04, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.21, N = 3SE +/- 0.11, N = 3SE +/- 0.20, N = 3SE +/- 0.05, N = 6199.9554.39108.5079.9755.8750.1540.9437.433.60
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node4080120160200Min: 199.9 / Avg: 199.95 / Max: 200.04Min: 54.14 / Avg: 54.39 / Max: 54.69Min: 108.47 / Avg: 108.5 / Max: 108.55Min: 79.85 / Avg: 79.97 / Max: 80.13Min: 55.81 / Avg: 55.87 / Max: 55.96Min: 49.74 / Avg: 50.15 / Max: 50.36Min: 40.77 / Avg: 40.94 / Max: 41.16Min: 37.04 / Avg: 37.43 / Max: 37.65Min: 3.39 / Avg: 3.6 / Max: 3.75

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node4080120160200SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 5199.8353.26108.4879.8455.8049.5340.8537.373.76
OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XBisag_Node4080120160200Min: 199.79 / Avg: 199.83 / Max: 199.87Min: 53.08 / Avg: 53.26 / Max: 53.39Min: 108.46 / Avg: 108.48 / Max: 108.51Min: 79.83 / Avg: 79.84 / Max: 79.85Min: 55.68 / Avg: 55.8 / Max: 55.91Min: 49.2 / Avg: 49.53 / Max: 49.82Min: 40.64 / Avg: 40.85 / Max: 40.96Min: 37.23 / Avg: 37.37 / Max: 37.49Min: 3.65 / Avg: 3.76 / Max: 3.95

JuliaGPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X30M60M90M120M150MSE +/- 59682.63, N = 3SE +/- 22546.70, N = 3SE +/- 14125.16, N = 3SE +/- 293396.06, N = 3SE +/- 58084.93, N = 3SE +/- 157475.07, N = 3SE +/- 84325.23, N = 3SE +/- 218639.12, N = 3SE +/- 473156.02, N = 3SE +/- 318277.32, N = 348074789.0336136874.0038310650.5078839770.1364913682.6380042041.73104144917.23113830604.27127978049.53136037921.431. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm
OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X20M40M60M80M100MMin: 47972762.6 / Avg: 48074789.03 / Max: 48179458.6Min: 36098425.4 / Avg: 36136874 / Max: 36176502.7Min: 38282530.1 / Avg: 38310650.5 / Max: 38327054.3Min: 78472428.5 / Avg: 78839770.13 / Max: 79419722.2Min: 64826614.5 / Avg: 64913682.63 / Max: 65023819.5Min: 79729904.9 / Avg: 80042041.73 / Max: 80234485.2Min: 104029122.7 / Avg: 104144917.23 / Max: 104309002.8Min: 113566736.4 / Avg: 113830604.27 / Max: 114264514.2Min: 127186077.4 / Avg: 127978049.53 / Max: 128822605.5Min: 135496222.7 / Avg: 136037921.43 / Max: 136598293.21. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelbulbGPU

MandelbulbGPU is an OpenCL benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X16M32M48M64M80MSE +/- 36731.70, N = 3SE +/- 9818.73, N = 3SE +/- 28089.31, N = 3SE +/- 48150.35, N = 3SE +/- 29855.85, N = 3SE +/- 75512.83, N = 3SE +/- 91420.68, N = 3SE +/- 140370.89, N = 3SE +/- 168304.91, N = 3SE +/- 166919.37, N = 331636512.9720060275.5325392138.5047400001.9037156070.8744953399.4758811317.1763616558.7771656708.8375614774.131. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL
OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X13M26M39M52M65MMin: 31570064.4 / Avg: 31636512.97 / Max: 31696868.3Min: 20041319.8 / Avg: 20060275.53 / Max: 20074195.5Min: 25336028.4 / Avg: 25392138.5 / Max: 25422595.7Min: 47303715.3 / Avg: 47400001.9 / Max: 47449572Min: 37096733.9 / Avg: 37156070.87 / Max: 37191523.7Min: 44802584.9 / Avg: 44953399.47 / Max: 45035719.5Min: 58637108.5 / Avg: 58811317.17 / Max: 58946501.9Min: 63433732.8 / Avg: 63616558.77 / Max: 63892479.1Min: 71353889.7 / Avg: 71656708.83 / Max: 71935417.5Min: 75285884.7 / Avg: 75614774.13 / Max: 75828817.61. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenGL benchmark using LuxRender / SLG2. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node400800120016002000SE +/- 2.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 6.62, N = 6SE +/- 0.88, N = 35774639927698971346149218551906396352
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node30060090012001500Min: 575 / Avg: 577 / Max: 581Min: 463 / Avg: 463.33 / Max: 464Min: 992 / Avg: 992 / Max: 992Min: 769 / Avg: 769 / Max: 769Min: 896 / Avg: 896.67 / Max: 898Min: 1346 / Avg: 1346 / Max: 1346Min: 1490 / Avg: 1492.33 / Max: 1494Min: 1855 / Avg: 1855.33 / Max: 1856Min: 1906 / Avg: 1906.33 / Max: 1907Min: 383 / Avg: 395.83 / Max: 419Min: 351 / Avg: 352.33 / Max: 354

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node14002800420056007000SE +/- 3.06, N = 3SE +/- 0.67, N = 3SE +/- 12.00, N = 3SE +/- 4.26, N = 3SE +/- 1.15, N = 3SE +/- 7.64, N = 3SE +/- 0.67, N = 3SE +/- 18.50, N = 3SE +/- 3.00, N = 3SE +/- 20.85, N = 3SE +/- 6.33, N = 321271941430224232460445847766268636015831422
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node11002200330044005500Min: 2121 / Avg: 2127 / Max: 2131Min: 1940 / Avg: 1940.67 / Max: 1942Min: 4290 / Avg: 4302 / Max: 4326Min: 2415 / Avg: 2423.33 / Max: 2429Min: 2458 / Avg: 2460 / Max: 2462Min: 4443 / Avg: 4458 / Max: 4468Min: 4775 / Avg: 4775.67 / Max: 4777Min: 6249 / Avg: 6268 / Max: 6305Min: 6357 / Avg: 6360 / Max: 6366Min: 1561 / Avg: 1583.33 / Max: 1625Min: 1410 / Avg: 1422.33 / Max: 1431

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node3K6K9K12K15KSE +/- 12.17, N = 3SE +/- 11.67, N = 3SE +/- 1.45, N = 3SE +/- 35.97, N = 3SE +/- 16.67, N = 3SE +/- 0.88, N = 3SE +/- 24.85, N = 3SE +/- 1.20, N = 3SE +/- 44.35, N = 3SE +/- 4.70, N = 3SE +/- 24.18, N = 3SE +/- 0.58, N = 3455434914253963953135474973710713138021408134412899
OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN Xdell_bisagBisag_Node2K4K6K8K10KMin: 4541 / Avg: 4553.67 / Max: 4578Min: 3468 / Avg: 3491.33 / Max: 3503Min: 4251 / Avg: 4253.33 / Max: 4256Min: 9567 / Avg: 9638.67 / Max: 9680Min: 5280 / Avg: 5313.33 / Max: 5330Min: 5473 / Avg: 5474.33 / Max: 5476Min: 9711 / Avg: 9737.33 / Max: 9787Min: 10711 / Avg: 10712.67 / Max: 10715Min: 13713 / Avg: 13801.67 / Max: 13848Min: 14075 / Avg: 14080.67 / Max: 14090Min: 3415 / Avg: 3440.67 / Max: 3489Min: 2898 / Avg: 2899 / Max: 2900