NVIDIA OpenCL CUDA Compute Comparison

NVIDIA Maxwell and Kepler graphics card OpenCL and CUDA GPGPU compute tests on Ubuntu Linux. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1609214-LO-1601182PT42&grr&sro.

System Details

All GeForce GTX runs (GeForce GTX 650 through GeForce GTX TITAN X) share this system:

  Processor:          Intel Core i7-5960X @ 3.50GHz (16 Cores)
  Motherboard:        Gigabyte X99-UD4-CF
  Chipset:            Intel Xeon E7 v3/Xeon
  Memory:             16384MB
  Disk:               120GB Samsung SSD 850
  Audio:              Realtek ALC1150
  Network:            Intel Connection
  OS:                 Ubuntu 15.10
  Kernel:             4.2.0-23-generic (x86_64)
  Desktop:            Unity
  Display Server:     X Server 1.17.2
  Display Driver:     NVIDIA 352.39
  OpenGL:             4.5.0
  Compiler:           GCC 4.9.3 + CUDA 7.5
  File-System:        ext4
  Screen Resolution:  2560x1600

Graphics card and GPU compute cores per run:

  Run                  Graphics                                              GPU Compute Cores
  GeForce GTX 650      MSI NVIDIA GeForce GTX 650 1024MB (1084/2500MHz)      384
  GeForce GTX 680      NVIDIA GeForce GTX 680 2048MB (1006/3004MHz)          1536
  GeForce GTX 750      eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)     512
  GeForce GTX 750 Ti   NVIDIA GeForce GTX 750 Ti 2048MB (1019/2700MHz)       640
  GeForce GTX 760      NVIDIA GeForce GTX 760 2048MB (980/3004MHz)           1152
  GeForce GTX 770      NVIDIA GeForce GTX 770 2048MB (1045/3505MHz)          1536
  GeForce GTX 780 Ti   NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)        2880
  GeForce GTX 950      eVGA NVIDIA GeForce GTX 950 2048MB (1202/3304MHz)     768
  GeForce GTX 960      eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)     1024
  GeForce GTX 970      eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)     1664
  GeForce GTX 980      NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)          2048
  GeForce GTX 980 Ti   NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)        2816
  GeForce GTX TITAN X  NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)     3072
  scuda                ASUS NVIDIA GeForce GT 730 2048MB (148/405MHz)        384

The scuda run uses a different system:

  Processor:          AMD Phenom II X4 955 @ 3.20GHz (4 Cores)
  Motherboard:        ASUS M4A77TD PRO
  Chipset:            AMD RX780/RX790 + SB7x0/SB8x0/SB9x0
  Memory:             4096MB
  Disk:               60GB OCZ VERTEX2 + 2 x 1000GB Western Digital WD10EARS-00M
  Audio:              AMD SBx00 Azalia
  Network:            Realtek RTL8111/8168/8411
  OS:                 LinuxMint 2
  Kernel:             3.16.0-4-amd64 (x86_64)
  Desktop:            Cinnamon 3.0.6
  Display Server:     X Server 1.16.4
  Display Driver:     NVIDIA 361.42
  OpenGL:             4.4.0
  Compiler:           GCC 4.9.2
  Screen Resolution:  1920x1080

Compiler Details

- GeForce GTX 650 through GeForce GTX TITAN X (identical for all 13 runs): --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v
- scuda: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i586 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details

- GeForce GTX 650 through GeForce GTX TITAN X: Scaling Governor: intel_pstate performance
- scuda: Scaling Governor: acpi-cpufreq ondemand

Result Overview

OpenCL tests (LuxMark: score; JuliaGPU: samples/sec; "-" means no result recorded):

  Run                  luxmark: GPU - Hotel  luxmark: GPU - Microphone  luxmark: GPU - Luxball HDR  juliagpu: GPU
  GeForce GTX 650      142                   670                        1447                        13367332.83
  GeForce GTX 680      580                   2123                       4571                        46179144.83
  GeForce GTX 750      380                   1323                       3508                        35328613.93
  GeForce GTX 750 Ti   436                   1384                       3835                        42811492.53
  GeForce GTX 760      459                   1962                       4267                        37480386.07
  GeForce GTX 770      611                   2229                       4762                        48247014.30
  GeForce GTX 780 Ti   986                   4263                       9577                        75614786.03
  GeForce GTX 950      762                   2439                       5376                        62133382.70
  GeForce GTX 960      900                   2484                       5517                        76296539.27
  GeForce GTX 970      1458                  4462                       9750                        97941750.40
  GeForce GTX 980      1423                  4820                       10721                       105470459.07
  GeForce GTX 980 Ti   1898                  6300                       13883                       117899119.93
  GeForce GTX TITAN X  1969                  6366                       14099                       124716357.57
  scuda                141                   462                        975                         -

cuda-mini-nbody tests (seconds; runs without results are omitted):

  Run                  Loop Unrolling  Cache Blocking  Flush Denormals To Zero  SOA Data Layout  Original
  GeForce GTX 750      91.05           100.00          201.60                   201.61           181.82
  GeForce GTX 750 Ti   76.95           83.68           161.87                   161.95           152.93
  GeForce GTX 780 Ti   28.87           31.31           54.78                    55.37            62.17
  GeForce GTX 950      47.90           50.83           109.54                   109.56           105.91
  GeForce GTX 960      37.60           39.31           82.29                    82.22            84.72
  GeForce GTX 970      27.72           29.65           58.07                    58.18            53.92
  GeForce GTX 980      25.39           27.09           51.08                    51.29            47.34
  GeForce GTX 980 Ti   20.84           22.03           43.15                    43.01            36.72
  GeForce GTX TITAN X  19.46           20.85           39.38                    39.46            33.99

LuxMark

Performance / Cost - OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0. Score Per Dollar, More Is Better.

  Run                  Score Per Dollar  Reported cost
  GeForce GTX 750 Ti   4.00              $109
  GeForce GTX 950      4.79              $159
  GeForce GTX 960      4.11              $219
  GeForce GTX 970      4.43              $329
  GeForce GTX 980      2.85              $499
  GeForce GTX 980 Ti   2.92              $649
  GeForce GTX TITAN X  1.97              $999
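The score-per-dollar figures for the Hotel scene appear to be the LuxMark Hotel score divided by the reported cost. The sketch below reproduces that arithmetic from the scores and costs listed in this report; the variable and function names are illustrative and not part of the Phoronix Test Suite.

```python
# LuxMark 3.0 Hotel scores and reported costs, copied from this report.
hotel_score = {
    "GeForce GTX 750 Ti": 436,
    "GeForce GTX 950": 762,
    "GeForce GTX 960": 900,
    "GeForce GTX 970": 1458,
    "GeForce GTX 980": 1423,
    "GeForce GTX 980 Ti": 1898,
    "GeForce GTX TITAN X": 1969,
}
reported_cost_usd = {
    "GeForce GTX 750 Ti": 109,
    "GeForce GTX 950": 159,
    "GeForce GTX 960": 219,
    "GeForce GTX 970": 329,
    "GeForce GTX 980": 499,
    "GeForce GTX 980 Ti": 649,
    "GeForce GTX TITAN X": 999,
}


def score_per_dollar(scores, costs):
    """Divide each card's score by its cost, rounded to two decimals."""
    return {card: round(scores[card] / costs[card], 2) for card in costs}


per_dollar = score_per_dollar(hotel_score, reported_cost_usd)
for card, value in per_dollar.items():
    print(f"{card}: {value:.2f}")
```

The same division applied to the Microphone and Luxball HDR scores should give the other two performance / cost tables.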

LuxMark

Performance / Cost - OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Score Per Dollar, More Is Better.

  Run                  Score Per Dollar  Reported cost
  GeForce GTX 750 Ti   12.70             $109
  GeForce GTX 950      15.34             $159
  GeForce GTX 960      11.34             $219
  GeForce GTX 970      13.56             $329
  GeForce GTX 980      9.66              $499
  GeForce GTX 980 Ti   9.71              $649
  GeForce GTX TITAN X  6.37              $999

LuxMark

Performance / Cost - OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Score Per Dollar, More Is Better.

  Run                  Score Per Dollar  Reported cost
  GeForce GTX 750 Ti   35.18             $109
  GeForce GTX 950      33.81             $159
  GeForce GTX 960      25.19             $219
  GeForce GTX 970      29.64             $329
  GeForce GTX 980      21.48             $499
  GeForce GTX 980 Ti   21.39             $649
  GeForce GTX TITAN X  14.11             $999

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

Watts.

  Run                  Min    Avg     Max
  GeForce GTX 650      58.7   101.73  113.1
  GeForce GTX 680      66.2   188.73  222.3
  GeForce GTX 750      57.1   112.53  133.4
  GeForce GTX 750 Ti   56.9   115.52  134.1
  GeForce GTX 760      65.3   192.15  229.5
  GeForce GTX 770      60.3   180.9   256.4
  GeForce GTX 780 Ti   65.1   271.71  349.4
  GeForce GTX 950      59.6   150.87  200.1
  GeForce GTX 970      62.2   188.8   252.9
  GeForce GTX 980      63.2   196.64  260.4
  GeForce GTX 980 Ti   65.8   235.84  345.4
  GeForce GTX TITAN X  65.4   254.99  350

LuxMark

System Power Consumption Monitor

LuxMark 3.0. Watts, Fewer Is Better.

  Run                  Min    Avg     Max
  GeForce GTX 650      82.2   107.04  110
  GeForce GTX 680      139.9  208.66  222.3
  GeForce GTX 750      77.8   104.26  106.1
  GeForce GTX 750 Ti   81.9   107.75  109.7
  GeForce GTX 760      154.5  212.63  216.5
  GeForce GTX 770      87.2   207.24  218.2
  GeForce GTX 780 Ti   86.1   269.57  289.2
  GeForce GTX 950      111.9  144.43  149
  GeForce GTX 970      134.4  192.01  193.5
  GeForce GTX 980      85.8   195.11  205.5
  GeForce GTX 980 Ti   89     237.83  249.3
  GeForce GTX TITAN X  176    262.95  266.3

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0. Score Per Watt, More Is Better.

  Run                  Score Per Watt
  GeForce GTX 650      1.33
  GeForce GTX 680      2.78
  GeForce GTX 750      3.64
  GeForce GTX 750 Ti   4.05
  GeForce GTX 760      2.16
  GeForce GTX 770      2.95
  GeForce GTX 780 Ti   3.66
  GeForce GTX 950      5.28
  GeForce GTX 970      7.59
  GeForce GTX 980      7.29
  GeForce GTX 980 Ti   7.98
  GeForce GTX TITAN X  7.49
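The score-per-watt figures for the Hotel scene appear to be the LuxMark Hotel score divided by the average system power recorded in the LuxMark power table immediately before this chart. Note that this is whole-system power, not GPU-only power. A minimal sketch of the arithmetic, using values from this report and illustrative names:

```python
# LuxMark 3.0 Hotel scores and average system watts, copied from this report.
hotel_score = {
    "GeForce GTX 650": 142, "GeForce GTX 680": 580,
    "GeForce GTX 750": 380, "GeForce GTX 750 Ti": 436,
    "GeForce GTX 760": 459, "GeForce GTX 770": 611,
    "GeForce GTX 780 Ti": 986, "GeForce GTX 950": 762,
    "GeForce GTX 970": 1458, "GeForce GTX 980": 1423,
    "GeForce GTX 980 Ti": 1898, "GeForce GTX TITAN X": 1969,
}
avg_system_watts = {
    "GeForce GTX 650": 107.04, "GeForce GTX 680": 208.66,
    "GeForce GTX 750": 104.26, "GeForce GTX 750 Ti": 107.75,
    "GeForce GTX 760": 212.63, "GeForce GTX 770": 207.24,
    "GeForce GTX 780 Ti": 269.57, "GeForce GTX 950": 144.43,
    "GeForce GTX 970": 192.01, "GeForce GTX 980": 195.11,
    "GeForce GTX 980 Ti": 237.83, "GeForce GTX TITAN X": 262.95,
}


def score_per_watt(scores, watts):
    """Divide each card's score by its average system power draw."""
    return {card: round(scores[card] / watts[card], 2) for card in watts}


per_watt = score_per_watt(hotel_score, avg_system_watts)
# Rank cards from most to least efficient on this scene.
for card, value in sorted(per_watt.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{card}: {value:.2f}")
```

The GeForce GTX 960 is absent because the report lists no power readings for it.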

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.0. Score, More Is Better.

  Run                  Score  SE +/- (N = 3)
  GeForce GTX 650      142    0.00
  GeForce GTX 680      580    0.67
  GeForce GTX 750      380    0.67
  GeForce GTX 750 Ti   436    1.86
  GeForce GTX 760      459    0.33
  GeForce GTX 770      611    2.67
  GeForce GTX 780 Ti   986    3.93
  GeForce GTX 950      762    0.88
  GeForce GTX 960      900    1.33
  GeForce GTX 970      1458   0.67
  GeForce GTX 980      1423   6.56
  GeForce GTX 980 Ti   1898   2.00
  GeForce GTX TITAN X  1969   2.08
  scuda                141    0.88

LuxMark

System Power Consumption Monitor

LuxMark 3.0. Watts, Fewer Is Better.

  Run                  Min    Avg     Max
  GeForce GTX 650      80.1   106.69  109.9
  GeForce GTX 680      134.4  206.09  209.4
  GeForce GTX 750      77.3   101.86  103.9
  GeForce GTX 750 Ti   75.4   103.69  106.5
  GeForce GTX 760      204.1  214.32  217.1
  GeForce GTX 770      85     201.15  217.4
  GeForce GTX 780 Ti   85.6   256     289.5
  GeForce GTX 950      135    139.38  143.2
  GeForce GTX 970      175.4  178.91  179.6
  GeForce GTX 980      82     184.61  196.6
  GeForce GTX 980 Ti   86     225.24  237.1
  GeForce GTX TITAN X  245.7  252.48  253.9

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Score Per Watt, More Is Better.

  Run                  Score Per Watt
  GeForce GTX 650      6.28
  GeForce GTX 680      10.30
  GeForce GTX 750      12.99
  GeForce GTX 750 Ti   13.35
  GeForce GTX 760      9.15
  GeForce GTX 770      11.08
  GeForce GTX 780 Ti   16.65
  GeForce GTX 950      17.50
  GeForce GTX 970      24.94
  GeForce GTX 980      26.11
  GeForce GTX 980 Ti   27.97
  GeForce GTX TITAN X  25.21

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.0. Score, More Is Better.

  Run                  Score  SE +/- (N = 3)
  GeForce GTX 650      670    1.00
  GeForce GTX 680      2123   3.48
  GeForce GTX 750      1323   1.00
  GeForce GTX 750 Ti   1384   3.46
  GeForce GTX 760      1962   0.88
  GeForce GTX 770      2229   3.84
  GeForce GTX 780 Ti   4263   19.00
  GeForce GTX 950      2439   3.76
  GeForce GTX 960      2484   1.15
  GeForce GTX 970      4462   4.84
  GeForce GTX 980      4820   7.36
  GeForce GTX 980 Ti   6300   7.57
  GeForce GTX TITAN X  6366   29.09
  scuda                462    3.18

LuxMark

System Power Consumption Monitor

LuxMark 3.0. Watts, Fewer Is Better.

  Run                  Min    Avg     Max
  GeForce GTX 650      109.3  112.38  113.1
  GeForce GTX 680      185.1  218.65  221.8
  GeForce GTX 750      106.4  111.19  111.7
  GeForce GTX 750 Ti   83.9   114.57  116
  GeForce GTX 760      212.1  226.06  229.5
  GeForce GTX 770      169.4  224.71  228.3
  GeForce GTX 780 Ti   245.8  310.12  319.4
  GeForce GTX 950      107.6  147.04  152
  GeForce GTX 970      121    192.61  194.6
  GeForce GTX 980      206.7  208.92  213.3
  GeForce GTX 980 Ti   253    256.5   257.7
  GeForce GTX TITAN X  167.9  271.01  275.5

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Score Per Watt, More Is Better.

  Run                  Score Per Watt
  GeForce GTX 650      12.88
  GeForce GTX 680      20.91
  GeForce GTX 750      31.55
  GeForce GTX 750 Ti   33.47
  GeForce GTX 760      18.88
  GeForce GTX 770      21.19
  GeForce GTX 780 Ti   30.88
  GeForce GTX 950      36.56
  GeForce GTX 970      50.62
  GeForce GTX 980      51.32
  GeForce GTX 980 Ti   54.13
  GeForce GTX TITAN X  52.02

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.0. Score, More Is Better.

  Run                  Score  SE +/- (N = 3)
  GeForce GTX 650      1447   1.53
  GeForce GTX 680      4571   15.00
  GeForce GTX 750      3508   6.69
  GeForce GTX 750 Ti   3835   10.02
  GeForce GTX 760      4267   0.88
  GeForce GTX 770      4762   13.50
  GeForce GTX 780 Ti   9577   43.03
  GeForce GTX 950      5376   2.91
  GeForce GTX 960      5517   15.34
  GeForce GTX 970      9750   2.19
  GeForce GTX 980      10721  15.68
  GeForce GTX 980 Ti   13883  29.69
  GeForce GTX TITAN X  14099  34.70
  scuda                975    7.86

JuliaGPU

System Power Consumption Monitor

JuliaGPU 1.2pts1. Watts, Fewer Is Better.

  Run                  Min    Avg     Max
  GeForce GTX 650      92.2   100.15  100.6
  GeForce GTX 680      159.4  162.45  164.9
  GeForce GTX 750      87.3   98.42   99.6
  GeForce GTX 750 Ti   101.3  101.82  102.5
  GeForce GTX 760      157.8  165.23  169.9
  GeForce GTX 770      160    163.49  166.4
  GeForce GTX 780 Ti   221.8  224.64  225.8
  GeForce GTX 950      120.8  127.98  129.9
  GeForce GTX 970      148.3  151.1   152
  GeForce GTX 980      147.4  161.36  164.3
  GeForce GTX 980 Ti   171.5  189.08  194.2
  GeForce GTX TITAN X  190    198.5   200.7

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1. Samples/sec Per Watt, More Is Better.

  Run                  Samples/sec Per Watt
  GeForce GTX 650      133469.73
  GeForce GTX 680      284271.97
  GeForce GTX 750      358967.19
  GeForce GTX 750 Ti   420467.67
  GeForce GTX 760      226839.62
  GeForce GTX 770      295100.81
  GeForce GTX 780 Ti   336604.28
  GeForce GTX 950      485480.27
  GeForce GTX 970      648191.60
  GeForce GTX 980      653646.05
  GeForce GTX 980 Ti   623540.93
  GeForce GTX TITAN X  628293.99

JuliaGPU

OpenCL Device: GPU

JuliaGPU 1.2pts1. Samples/sec, More Is Better.

  Run                  Samples/sec   SE +/- (N = 3)
  GeForce GTX 650      13367332.83   3982.22
  GeForce GTX 680      46179144.83   45750.50
  GeForce GTX 750      35328613.93   37447.35
  GeForce GTX 750 Ti   42811492.53   55710.34
  GeForce GTX 760      37480386.07   17232.19
  GeForce GTX 770      48247014.30   69746.04
  GeForce GTX 780 Ti   75614786.03   252523.36
  GeForce GTX 950      62133382.70   160512.11
  GeForce GTX 960      76296539.27   141058.55
  GeForce GTX 970      97941750.40   432465.04
  GeForce GTX 980      105470459.07  266882.02
  GeForce GTX 980 Ti   117899119.93  188928.81
  GeForce GTX TITAN X  124716357.57  124748.15

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

CUDA Mini-Nbody

Test: Loop Unrolling

CUDA Mini-Nbody 2015-11-10. Seconds, Fewer Is Better.

  Run                  Seconds  SE +/- (N = 3)
  GeForce GTX 750      91.05    0.07
  GeForce GTX 750 Ti   76.95    0.12
  GeForce GTX 780 Ti   28.87    0.05
  GeForce GTX 950      47.90    0.04
  GeForce GTX 960      37.60    0.02
  GeForce GTX 970      27.72    0.06
  GeForce GTX 980      25.39    0.29
  GeForce GTX 980 Ti   20.84    0.26
  GeForce GTX TITAN X  19.46    0.07

CUDA Mini-Nbody

Test: Cache Blocking

CUDA Mini-Nbody 2015-11-10. Seconds, Fewer Is Better.

  Run                  Seconds  SE +/- (N = 3)
  GeForce GTX 750      100.00   0.07
  GeForce GTX 750 Ti   83.68    0.06
  GeForce GTX 780 Ti   31.31    0.10
  GeForce GTX 950      50.83    0.05
  GeForce GTX 960      39.31    0.03
  GeForce GTX 970      29.65    0.06
  GeForce GTX 980      27.09    0.11
  GeForce GTX 980 Ti   22.03    0.30
  GeForce GTX TITAN X  20.85    0.13

CUDA Mini-Nbody

Test: Flush Denormals To Zero

CUDA Mini-Nbody 2015-11-10. Seconds, Fewer Is Better.

  Run                  Seconds  SE +/- (N = 3)
  GeForce GTX 750      201.60   0.04
  GeForce GTX 750 Ti   161.87   0.04
  GeForce GTX 780 Ti   54.78    0.22
  GeForce GTX 950      109.54   0.03
  GeForce GTX 960      82.29    0.07
  GeForce GTX 970      58.07    0.06
  GeForce GTX 980      51.08    0.17
  GeForce GTX 980 Ti   43.15    0.02
  GeForce GTX TITAN X  39.38    0.07

CUDA Mini-Nbody

Test: SOA Data Layout

CUDA Mini-Nbody 2015-11-10. Seconds, Fewer Is Better.

  Run                  Seconds  SE +/- (N = 3)
  GeForce GTX 750      201.61   0.08
  GeForce GTX 750 Ti   161.95   0.08
  GeForce GTX 780 Ti   55.37    0.20
  GeForce GTX 950      109.56   0.02
  GeForce GTX 960      82.22    0.04
  GeForce GTX 970      58.18    0.04
  GeForce GTX 980      51.29    0.15
  GeForce GTX 980 Ti   43.01    0.14
  GeForce GTX TITAN X  39.46    0.14

CUDA Mini-Nbody

Test: Original

CUDA Mini-Nbody 2015-11-10. Seconds, Fewer Is Better.

  Run                  Seconds  SE +/- (N = 3)
  GeForce GTX 750      181.82   0.15
  GeForce GTX 750 Ti   152.93   0.17
  GeForce GTX 780 Ti   62.17    0.17
  GeForce GTX 950      105.91   0.12
  GeForce GTX 960      84.72    0.20
  GeForce GTX 970      53.92    0.22
  GeForce GTX 980      47.34    0.18
  GeForce GTX 980 Ti   36.72    0.41
  GeForce GTX TITAN X  33.99    0.17
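One way to compare the five CUDA Mini-Nbody variants is to express each as a speedup over the Original test on the same card, computed as Original seconds divided by variant seconds. The sketch below does this for the GeForce GTX TITAN X using the times reported above; the names are illustrative, and this ratio is not something the report itself publishes.

```python
# CUDA Mini-Nbody times in seconds for the GeForce GTX TITAN X, from this report.
titan_x_seconds = {
    "Original": 33.99,
    "Loop Unrolling": 19.46,
    "Cache Blocking": 20.85,
    "Flush Denormals To Zero": 39.38,
    "SOA Data Layout": 39.46,
}


def speedup_vs_original(seconds):
    """Return Original time / variant time; values above 1.0 mean faster."""
    baseline = seconds["Original"]
    return {
        name: baseline / elapsed
        for name, elapsed in seconds.items()
        if name != "Original"
    }


speedup = speedup_vs_original(titan_x_seconds)
for name, ratio in speedup.items():
    print(f"{name}: {ratio:.2f}x")
```

On these numbers the Loop Unrolling and Cache Blocking variants finish faster than Original, while Flush Denormals To Zero and SOA Data Layout take longer.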


Phoronix Test Suite v10.8.4