OpenCL CUDA NVIDIA GPGPU Linux Tests

Benchmarks ran on the Eurocom Q6 for performance evaluation.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1806259-AR-1511113PT86
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

HPC - High Performance Computing 2 Tests
OpenCL 4 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GeForce GTX 680
November 11 2015
 
GeForce GTX 750
November 11 2015
 
GeForce GTX 760
November 11 2015
 
GeForce GTX 780 Ti
November 11 2015
 
GeForce GTX 950
November 10 2015
 
GeForce GTX 960
November 11 2015
 
GeForce GTX 970
November 11 2015
 
GeForce GTX 980
November 11 2015
 
GeForce GTX 980 Ti
November 10 2015
 
GeForce GTX TITAN X
November 11 2015
 
GeForce GTX 1070 with Max-Q design
June 25 2018
  14 Minutes
Invert Hiding All Results Option
  2 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


OpenCL CUDA NVIDIA GPGPU Linux Tests - Phoronix Test Suite

OpenCL CUDA NVIDIA GPGPU Linux Tests

Benchmarks ran on the Eurocom Q6 for performance evaluation.

HTML result view exported from: https://openbenchmarking.org/result/1806259-AR-1511113PT86&sro&grw.

OpenCL CUDA NVIDIA GPGPU Linux TestsProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionOpenCLGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070 with Max-Q designIntel Core i5-6600K @ 3.50GHz (4 Cores)MSI Z170A GAMING PRO (MS-7984) v1.0Intel Device 191f16384MB256GB TS256GSSD370SNVIDIA GeForce GTX 680 2048MB (1006/3004MHz)Intel Device a170Intel Device 15b8Ubuntu 14.043.19.0-33-generic (x86_64)Unity 7.2.5X Server 1.17.1NVIDIA 352.394.3.0GCC 4.8.4 + Clang 3.4-1ubuntu3 + CUDA 7.5ext43840x2160eVGA NVIDIA GeForce GTX 750 1024MB (1019/2505MHz)NVIDIA GeForce GTX 760 2048MB (980/3004MHz)NVIDIA GeForce GTX 780 Ti 3072MB (875/3500MHz)eVGA NVIDIA GeForce GTX 950 2048MB (135/405MHz)eVGA NVIDIA GeForce GTX 960 2048MB (1277/3505MHz)eVGA NVIDIA GeForce GTX 970 4096MB (1163/3505MHz)NVIDIA GeForce GTX 980 4096MB (1126/3505MHz)NVIDIA GeForce GTX 980 Ti 6144MB (999/3505MHz)NVIDIA GeForce GTX TITAN X 12288MB (1001/3505MHz)Intel Core i7-8750H @ 4.10GHz (6 Cores / 12 Threads)Eurocom Q6 (7.005 BIOS)Intel Cannon Lake PCH Shared SRAM32768MB2050GB Crucial_CT2050MX + 1000GB Samsung SSD 960 EVO 1TBNVIDIA GeForce GTX 1070 with Max-Q Design 8192MB (1101/4006MHz)Realtek ALC1220Realtek RTL8111/8168/8411 + Intel Wireless-AC 9260Ubuntu 18.044.17.2 (x86_64)GNOME Shell 3.28.1X Server 1.19.6NVIDIA 396.24.024.6.0OpenCL 1.2 CUDA 9.2.127 + OpenCL 2.1GCC 7.3.0 + Clang 4.0.1-10 + LLVM 4.0.1 + CUDA 9.21920x1080OpenBenchmarking.orgCompiler Details- GeForce GTX 680: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 750: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 760: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 780 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 950: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 960: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 970: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 980 Ti: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX TITAN X: --build=x86_64-linux-gnu --disable-browser-plugin --disable-libmudflap --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-multilib-list=m32,m64,mx32 --with-tune=generic -v- GeForce GTX 1070 with Max-Q design: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- GeForce GTX 680: Scaling Governor: acpi-cpufreq performance- GeForce GTX 750: Scaling Governor: acpi-cpufreq performance- GeForce GTX 760: Scaling Governor: acpi-cpufreq performance- GeForce GTX 780 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX 950: Scaling Governor: acpi-cpufreq performance- GeForce GTX 960: Scaling Governor: acpi-cpufreq performance- GeForce GTX 970: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980: Scaling Governor: acpi-cpufreq performance- GeForce GTX 980 Ti: Scaling Governor: acpi-cpufreq performance- GeForce GTX TITAN X: Scaling Governor: acpi-cpufreq performance- GeForce GTX 1070 with Max-Q design: Scaling Governor: intel_pstate performanceOpenCL Details- GeForce GTX 680: GPU Compute Cores: 1536- GeForce GTX 750: GPU Compute Cores: 512- GeForce GTX 760: GPU Compute Cores: 1152- GeForce GTX 780 Ti: GPU Compute Cores: 2880- GeForce GTX 950: GPU Compute Cores: 768- GeForce GTX 960: GPU Compute Cores: 1024- GeForce GTX 970: GPU Compute Cores: 1664- GeForce GTX 980: GPU Compute Cores: 2048- GeForce GTX 980 Ti: GPU Compute Cores: 2816- GeForce GTX TITAN X: GPU Compute Cores: 3072- GeForce GTX 1070 with Max-Q design: GPU Compute Cores: 2048System Details- GeForce GTX 680: GPU Compute Cores: 1536.- GeForce GTX 750: GPU Compute Cores: 512.- GeForce GTX 760: GPU Compute Cores: 1152.- GeForce GTX 780 Ti: GPU Compute Cores: 2880.- GeForce GTX 950: GPU Compute Cores: 768.- GeForce GTX 960: GPU Compute Cores: 1024.- GeForce GTX 970: GPU Compute Cores: 1664.- GeForce GTX 980: GPU Compute Cores: 2048.- GeForce GTX 980 Ti: GPU Compute Cores: 2816.- GeForce GTX TITAN X: GPU Compute Cores: 3072.Kernel Details- GeForce GTX 1070 with Max-Q design: drm.debug=0xeSecurity Details- GeForce GTX 1070 with Max-Q design: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW Protection

OpenCL CUDA NVIDIA GPGPU Linux Testscuda-mini-nbody: Cache Blockingshoc: CUDA - FFT SPcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zeroshoc: CUDA - MD5 Hashcuda-mini-nbody: Loop Unrollingshoc: OpenCL - FFT SPcuda-mini-nbody: Originalshoc: OpenCL - MD5 Hashshoc: CUDA - Texture Read Bandwidthshoc: OpenCL - Texture Read Bandwidthaskap: Griddingaskap: Degriddingjuliagpu: GPUluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRmandelbulbgpu: GPUGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN XGeForce GTX 1070 with Max-Q design74.971.91242.1648074789.035772127455431636512.9798.19113.64199.95199.831.0889.3454.69180.661.07158.42121.1436136874.00349120060275.5378.441.40170.2638310650.504631941425325392138.5029.9954.3953.2627.05126.7161.033.78286.6278839770.139924302963947400001.9049.89172.28108.50108.482.3647.5463.22105.302.34326.23239.193399.145706.0764913682.637692423531337156070.8737.08212.4379.9779.843.3835.3562.7882.013.36351.31269.983144.855290.3280042041.738972460547444953399.4728.53263.1455.8755.804.7926.42117.2354.324.77325.16283.365325.129509.14104144917.2313464458973758811317.1725.13289.6350.1549.535.7023.88140.1245.385.68336.48332.606051.2711094113830604.27149247761071363616558.7719.77311.4640.9440.856.8118.46170.3634.586.79348.92345.558320.5017380.60127978049.53185562681380271656708.8318.65324.0937.4337.377.4217.59173.8932.377.41356.52354.098458.7717380.60136037921.43190663601408175614774.1321.94306.7142.4241.966.5621.96234.9145.726.51430.42428.557681.8913312.80131929862.8074340055.80OpenBenchmarking.org

CUDA Mini-Nbody

Test: Cache Blocking

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X20406080100SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 321.9498.1929.9949.8937.0828.5325.1319.7718.65

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: FFT SPGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X70140210280350SE +/- 0.65, N = 3SE +/- 0.69, N = 3SE +/- 0.47, N = 3SE +/- 1.49, N = 3SE +/- 2.44, N = 3SE +/- 3.09, N = 3SE +/- 0.32, N = 3SE +/- 1.19, N = 3306.71113.64172.28212.43263.14289.63311.46324.09-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

CUDA Mini-Nbody

Test: SOA Data Layout

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X4080120160200SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.21, N = 3SE +/- 0.11, N = 3SE +/- 0.20, N = 342.42199.9554.39108.5079.9755.8750.1540.9437.43

CUDA Mini-Nbody

Test: Flush Denormals To Zero

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X4080120160200SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 341.96199.8353.26108.4879.8455.8049.5340.8537.37

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: MD5 HashGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.561.082.363.384.795.706.817.42-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

CUDA Mini-Nbody

Test: Loop Unrolling

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X20406080100SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.25, N = 321.9689.3427.0547.5435.3526.4223.8818.4617.59

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPGeForce GTX 1070 with Max-Q designGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X50100150200250SE +/- 0.95, N = 3SE +/- 0.87, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 1.20, N = 3SE +/- 0.52, N = 3SE +/- 1.30, N = 3SE +/- 0.65, N = 3SE +/- 0.19, N = 3234.9174.9754.6978.44126.7163.2262.78117.23140.12170.36173.89-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X4080120160200SE +/- 0.15, N = 3SE +/- 0.05, N = 3SE +/- 0.50, N = 3SE +/- 0.21, N = 3SE +/- 0.43, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.57, N = 3SE +/- 0.35, N = 345.72180.6661.03105.3082.0154.3245.3834.5832.37

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashGeForce GTX 1070 with Max-Q designGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.511.911.071.403.782.343.364.775.686.797.41-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: CUDA - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: CUDA - Benchmark: Texture Read BandwidthGeForce GTX 1070 with Max-Q designGeForce GTX 750GeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X90180270360450SE +/- 3.41, N = 3SE +/- 0.42, N = 3SE +/- 0.85, N = 3SE +/- 0.14, N = 3SE +/- 0.28, N = 3SE +/- 1.15, N = 3SE +/- 1.22, N = 3SE +/- 0.12, N = 3430.42158.42326.23351.31325.16336.48348.92356.52-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthGeForce GTX 1070 with Max-Q designGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X90180270360450SE +/- 1.10, N = 3SE +/- 1.02, N = 3SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.02, N = 3SE +/- 0.73, N = 3SE +/- 0.56, N = 3SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.21, N = 3SE +/- 1.56, N = 3428.55242.16121.14170.26286.62239.19269.98283.36332.60345.55354.09-std=c++141. (CXX) g++ options: -O2 -lSHOCCommon -lcudadevrt -lcudart_static -lrt -lpthread -ldl -lcufft

ASKAP tConvolveCuda

Processing: Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingGeForce GTX 1070 with Max-Q designGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X2K4K6K8K10KSE +/- 74.58, N = 3SE +/- 14.40, N = 3SE +/- 12.43, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 130.14, N = 47681.893399.143144.855325.126051.278320.508458.77-std=c++141. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda

Processing: Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingGeForce GTX 1070 with Max-Q designGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X4K8K12K16K20KSE +/- 0.00, N = 3SE +/- 41.05, N = 3SE +/- 34.80, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 369.80, N = 3SE +/- 369.80, N = 313312.805706.075290.329509.1411094.0017380.6017380.60-std=c++141. (CXX) g++ options: -fPIC -O3 -m64 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

JuliaGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterJuliaGPU 1.2pts1OpenCL Device: GPUGeForce GTX 1070 with Max-Q designGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X30M60M90M120M150MSE +/- 177133.34, N = 3SE +/- 59682.63, N = 3SE +/- 22546.70, N = 3SE +/- 14125.16, N = 3SE +/- 293396.06, N = 3SE +/- 58084.93, N = 3SE +/- 157475.07, N = 3SE +/- 84325.23, N = 3SE +/- 218639.12, N = 3SE +/- 473156.02, N = 3SE +/- 318277.32, N = 3131929862.8048074789.0336136874.0038310650.5078839770.1364913682.6380042041.73104144917.23113830604.27127978049.53136037921.431. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: HotelGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X400800120016002000SE +/- 2.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 35774639927698971346149218551906

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: MicrophoneGeForce GTX 680GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X14002800420056007000SE +/- 3.06, N = 3SE +/- 0.67, N = 3SE +/- 12.00, N = 3SE +/- 4.26, N = 3SE +/- 1.15, N = 3SE +/- 7.64, N = 3SE +/- 0.67, N = 3SE +/- 18.50, N = 3SE +/- 3.00, N = 3212719414302242324604458477662686360

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.0OpenCL Device: GPU - Scene: Luxball HDRGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X3K6K9K12K15KSE +/- 12.17, N = 3SE +/- 11.67, N = 3SE +/- 1.45, N = 3SE +/- 35.97, N = 3SE +/- 16.67, N = 3SE +/- 0.88, N = 3SE +/- 24.85, N = 3SE +/- 1.20, N = 3SE +/- 44.35, N = 3SE +/- 4.70, N = 34554349142539639531354749737107131380214081

MandelbulbGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelbulbGPU 1.0pts1OpenCL Device: GPUGeForce GTX 1070 with Max-Q designGeForce GTX 680GeForce GTX 750GeForce GTX 760GeForce GTX 780 TiGeForce GTX 950GeForce GTX 960GeForce GTX 970GeForce GTX 980GeForce GTX 980 TiGeForce GTX TITAN X16M32M48M64M80MSE +/- 107143.67, N = 3SE +/- 36731.70, N = 3SE +/- 9818.73, N = 3SE +/- 28089.31, N = 3SE +/- 48150.35, N = 3SE +/- 29855.85, N = 3SE +/- 75512.83, N = 3SE +/- 91420.68, N = 3SE +/- 140370.89, N = 3SE +/- 168304.91, N = 3SE +/- 166919.37, N = 374340055.8031636512.9720060275.5325392138.5047400001.9037156070.8744953399.4758811317.1763616558.7771656708.8375614774.131. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL


Phoronix Test Suite v10.8.4