NVIDIA OpenCL CUDA Compute Comparison

NVIDIA Maxwell and Kepler graphics card OpenCL and CUDA GPGPU compute tests on Ubuntu Linux. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1601182-PTS-NVIDIAOP62
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 650
January 18 2016
 
GeForce GTX 680
January 18 2016
 
GeForce GTX 750
January 18 2016
 
GeForce GTX 750 Ti
January 18 2016
 
GeForce GTX 760
January 18 2016
 
GeForce GTX 770
January 17 2016
 
GeForce GTX 780 Ti
January 17 2016
 
GeForce GTX 950
January 17 2016
 
GeForce GTX 960
January 17 2016
 
GeForce GTX 970
January 18 2016
 
GeForce GTX 980
January 17 2016
 
GeForce GTX 980 Ti
January 17 2016
 
GeForce GTX TITAN X
January 17 2016
 
Invert Hiding All Results Option
 

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA OpenCL CUDA Compute Comparison NVIDIA Maxwell and Kepler graphics card OpenCL and CUDA GPGPU compute tests on Ubuntu Linux. Benchmarks by Michael Larabel for a future article. ,,"GeForce GTX 770","GeForce GTX 780 Ti","GeForce GTX 980 Ti","GeForce GTX 980","GeForce GTX 960","GeForce GTX TITAN X","GeForce GTX 950","GeForce GTX 680","GeForce GTX 970","GeForce GTX 760","GeForce GTX 650","GeForce GTX 750 Ti","GeForce GTX 750" Processor,,Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores),Intel Core i7-5960X @ 3.50GHz (16 Cores) Motherboard,,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF,Gigabyte X99-UD4-CF Chipset,,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon,Intel Xeon E7 v3/Xeon Memory,,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB,16384MB Disk,,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850,120GB Samsung SSD 850 Graphics,,NVIDIA GeForce GTX 770 2048MB (1045/3505MHz),NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz),NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz),NVIDIA GeForce GTX 980 4096MB (1126/3505MHz),eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz),NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz),eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz),NVIDIA GeForce GTX 680 2048MB (1006/3004MHz),eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz),NVIDIA GeForce GTX 760 2048MB (980/3004MHz),MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz),NVIDIA GeForce GTX 750 Ti 2048MB (1019/2700MHz),eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz) Audio,,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150,Realtek ALC1150 Network,,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection,Intel Connection OS,,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10,Ubuntu 15.10 Kernel,,4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64),4.2.0-23-generic (x86_64) Desktop,,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity,Unity Display Server,,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2,X Server 1.17.2 Display Driver,,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39,NVIDIA 352.39 OpenGL,,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0,4.5.0 Compiler,,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5,GCC 4.9.3 + CUDA 7.5 File-System,,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4,ext4 Screen Resolution,,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600,2560x1600 ,,"GeForce GTX 770","GeForce GTX 780 Ti","GeForce GTX 980 Ti","GeForce GTX 980","GeForce GTX 960","GeForce GTX TITAN X","GeForce GTX 950","GeForce GTX 680","GeForce GTX 970","GeForce GTX 760","GeForce GTX 650","GeForce GTX 750 Ti","GeForce GTX 750" "CUDA Mini-Nbody - Test: Flush Denormals To Zero (sec)",LIB,,54.78,43.15,51.08,82.29,39.38,109.54,,58.07,,,161.87,201.60 "CUDA Mini-Nbody - Test: Original (sec)",LIB,,62.17,36.72,47.34,84.72,33.99,105.91,,53.92,,,152.93,181.82 "CUDA Mini-Nbody - Test: Loop Unrolling (sec)",LIB,,28.87,20.84,25.39,37.60,19.46,47.90,,27.72,,,76.95,91.05 "CUDA Mini-Nbody - Test: Cache Blocking (sec)",LIB,,31.31,22.03,27.09,39.31,20.85,50.83,,29.65,,,83.68,100.00 "JuliaGPU - OpenCL Device: GPU (Samples/sec)",HIB,48247014.30,75614786.03,117899119.93,105470459.07,76296539.27,124716357.57,62133382.70,46179144.83,97941750.40,37480386.07,13367332.83,42811492.53,35328613.93 "LuxMark - OpenCL Device: GPU - Scene: Luxball HDR (Score)",HIB,4762,9577,13883,10721,5517,14099,5376,4571,9750,4267,1447,3835,3508 "CUDA Mini-Nbody - Test: SOA Data Layout (sec)",LIB,,55.37,43.01,51.29,82.22,39.46,109.56,,58.18,,,161.95,201.61 "LuxMark - OpenCL Device: GPU - Scene: Microphone (Score)",HIB,2229,4263,6300,4820,2484,6366,2439,2123,4462,1962,670,1384,1323 "LuxMark - OpenCL Device: GPU - Scene: Hotel (Score)",HIB,611,986,1898,1423,900,1969,762,580,1458,459,142,436,380 "JuliaGPU - OpenCL Device: GPU (Samples/sec/Watt)",HIB,295100.81,336604.28,623540.93,653646.05,,628293.99,485480.27,284271.97,648191.60,226839.62,133469.73,420467.67,358967.19 "JuliaGPU - System Power Consumption Monitor (Watts)",LIB,163.49,224.64,189.08,161.36,,198.5,127.98,162.45,151.1,165.23,100.15,101.82,98.42 "LuxMark - OpenCL Device: GPU - Scene: Luxball HDR (Score/Watt)",HIB,21.19,30.88,54.13,51.32,,52.02,36.56,20.91,50.62,18.88,12.88,33.47,31.55 "LuxMark - System Power Consumption Monitor (Watts)",LIB,224.71,310.12,256.5,208.92,,271.01,147.04,218.65,192.61,226.06,112.38,114.57,111.19 "LuxMark - OpenCL Device: GPU - Scene: Microphone (Score/Watt)",HIB,11.08,16.65,27.97,26.11,,25.21,17.50,10.30,24.94,9.15,6.28,13.35,12.99 "LuxMark - System Power Consumption Monitor (Watts)",LIB,201.15,256,225.24,184.61,,252.48,139.38,206.09,178.91,214.32,106.69,103.69,101.86 "LuxMark - OpenCL Device: GPU - Scene: Hotel (Score/Watt)",HIB,2.95,3.66,7.98,7.29,,7.49,5.28,2.78,7.59,2.16,1.33,4.05,3.64 "LuxMark - System Power Consumption Monitor (Watts)",LIB,207.24,269.57,237.83,195.11,,262.95,144.43,208.66,192.01,212.63,107.04,107.75,104.26 "System Power Consumption Monitor - Phoronix Test Suite System Monitoring (Watts)",,180.9,271.71,235.84,196.64,,254.99,150.87,188.73,188.8,192.15,101.73,115.52,112.53 "LuxMark - Performance / Cost - OpenCL Device: GPU - Scene: Luxball HDR (Score/Dollar)",HIB,,,21.39,21.48,25.19,14.11,33.81,,29.64,,,35.18, "LuxMark - Performance / Cost - OpenCL Device: GPU - Scene: Microphone (Score/Dollar)",HIB,,,9.71,9.66,11.34,6.37,15.34,,13.56,,,12.70, "LuxMark - Performance / Cost - OpenCL Device: GPU - Scene: Hotel (Score/Dollar)",HIB,,,2.92,2.85,4.11,1.97,4.79,,4.43,,,4.00,