NVIDIA Pascal Fresh Summer 2018 OpenCL Benchmarks

NVIDIA OpenCL compute benchmarks on Ubuntu Linux for a future article by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1808082-KH-1807240RA61

Run Management

Result Identifier      Date              Test Duration
GeForce GTX 1050       July 23 2018      31 Minutes
GeForce GTX 1050 Ti    July 23 2018      31 Minutes
GeForce GTX 1060       July 23 2018      29 Minutes
GeForce GTX 1070       July 23 2018      28 Minutes
GeForce GTX 1070 Ti    July 23 2018      28 Minutes
GeForce GTX 1080 Ti    July 23 2018      28 Minutes
test                   August 08 2018    25 Minutes



System Details

GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1050 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050 (identical apart from Graphics):
  Processor:          Intel Core i7-8086K @ 5.00GHz (6 Cores / 12 Threads)
  Motherboard:        ASUS PRIME Z370-A (0809 BIOS)
  Chipset:            Intel Device 3ec2
  Memory:             16384MB
  Disk:               525GB SABRENT + 118GB INTEL SSDPEK1W120GA
  Audio:              Realtek ALC1220
  Monitor:            DELL P2415Q
  Network:            Intel Connection
  OS:                 Ubuntu 18.04
  Kernel:             4.17.8-041708-generic (x86_64)
  Desktop:            GNOME Shell 3.28.2
  Display Server:     X Server 1.19.6
  Display Driver:     NVIDIA 396.45
  OpenGL:             4.6.0
  OpenCL:             OpenCL 1.2 CUDA 9.2.177
  Compiler:           GCC 7.3.0
  File-System:        ext4
  Screen Resolution:  3840x2160

Graphics per result:
  GeForce GTX 1060:     NVIDIA GeForce GTX 1060 6GB 6144MB (1506/4006MHz)
  GeForce GTX 1070:     NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)
  GeForce GTX 1070 Ti:  Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1607/4006MHz)
  GeForce GTX 1050 Ti:  eVGA NVIDIA GeForce GTX 1050 Ti 4096MB (1354/3504MHz)
  GeForce GTX 1080 Ti:  NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)
  GeForce GTX 1050:     Zotac NVIDIA GeForce GTX 1050 2048MB (1354/3504MHz)

test (fields reported for this result):
  Processor:          AMD Ryzen 7 1700 Eight-Core @ 3.00GHz (16 Cores)
  Motherboard:        Gigabyte AB350-Gaming-CF
  Chipset:            AMD Device 1450
  Disk:               500GB Samsung SSD 850 + 512GB Intenso SSD Sat
  Graphics:           Gigabyte NVIDIA GeForce GTX 1070 8192MB (987/4006MHz)
  Audio:              NVIDIA Device 10f0
  Network:            Realtek RTL8111/8168/8411
  OS:                 elementary 0.4.1
  Kernel:             4.15.0-30-generic (x86_64)
  Display Driver:     NVIDIA 396.51
  Compiler:           GCC 5.5.0 20171010
  File-System:        ext4 (ecryptfs)
  Screen Resolution:  2560x1440

Compiler Details

- GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1050 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
- test: --build=x86_64-linux-gnu --disable-browser-plugin --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-gnu-unique-object --enable-gtk-cairo --enable-java-awt=gtk --enable-java-home --enable-languages=c,ada,c++,java,go,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-arch-directory=amd64 --with-default-libstdcxx-abi=new --with-multilib-list=m32,m64,mx32 --with-tune=generic -v

Processor Details

- GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1050 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050: Scaling Governor: intel_pstate performance
- test: Scaling Governor: acpi-cpufreq ondemand

OpenCL Details

- GeForce GTX 1060: GPU Compute Cores: 1280
- GeForce GTX 1070: GPU Compute Cores: 1920
- GeForce GTX 1070 Ti: GPU Compute Cores: 2432
- GeForce GTX 1050 Ti: GPU Compute Cores: 768
- GeForce GTX 1080 Ti: GPU Compute Cores: 3584
- GeForce GTX 1050: GPU Compute Cores: 640
- test: GPU Compute Cores: 1920

Security Details

- GeForce GTX 1060, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1050 Ti, GeForce GTX 1080 Ti, GeForce GTX 1050: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp Protection

System Details

- test: GPU Compute Cores: 1920

Result Overview

Columns are the GeForce GTX result identifiers plus the test result; "-" marks a test with no value for that result.

Test                                     GTX 1060      GTX 1070      GTX 1070 Ti   GTX 1050 Ti   GTX 1080 Ti   GTX 1050      test
shoc: OpenCL - FFT SP                    350.64        516.41        557.46        223.70        988.42        248.78        -
shoc: OpenCL - MD5 Hash                  7.38          10.74         13.83         4.12          20.20         3.24          -
shoc: OpenCL - Texture Read Bandwidth    417.80        458.99        510.18        336.67        606.73        308.56        -
viennacl: OpenCL LU Factorization        59.01         64.27         66.49         49.55         69.22         45.05         -
cl-mem: Copy                             140.57        188.23        188.20        88.17         319.20        88.70         -
cl-mem: Read                             154.47        206.30        206.50        95.17         339.23        96            -
cl-mem: Write                            145.60        197.63        196.53        89.10         344.10        90.20         -
indigobench: Bedroom                     2.65          3.77          4.15          1.65          5.32          1.42          -
indigobench: Supercar                    8.74          12.41         13.87         5.48          17.55         4.64          -
juliagpu: GPU                            123277207.17  155907964.80  177590252.53  82228551.07   209807323.37  68020911.20   -
mandelgpu: GPU                           106989159.30  153001211.87  189306428.90  61788066.47   264945183.70  48526868.77   -
luxmark: GPU - Hotel                     2630          3846          4461          1634          5707          1385          3927
luxmark: GPU - Microphone                6965          9982          10620         4703          13781         4122          10061
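One way to condense the overview into a single figure per card is a geometric mean of per-test ratios against a baseline. The sketch below is only an illustration: it uses four of the thirteen tests, picks the GeForce GTX 1050 as the baseline, and the helper names `geometric_mean` and `relative_scores` are made up for this example rather than taken from the Phoronix Test Suite.

```python
from math import prod

# Average results copied from the overview; every test here is "more is better".
# Only four tests are included to keep the sketch short (an arbitrary choice).
results = {
    "GeForce GTX 1050":    {"FFT SP": 248.78, "MD5 Hash": 3.24,  "cl-mem Copy": 88.70,  "LuxMark Hotel": 1385},
    "GeForce GTX 1050 Ti": {"FFT SP": 223.70, "MD5 Hash": 4.12,  "cl-mem Copy": 88.17,  "LuxMark Hotel": 1634},
    "GeForce GTX 1060":    {"FFT SP": 350.64, "MD5 Hash": 7.38,  "cl-mem Copy": 140.57, "LuxMark Hotel": 2630},
    "GeForce GTX 1070":    {"FFT SP": 516.41, "MD5 Hash": 10.74, "cl-mem Copy": 188.23, "LuxMark Hotel": 3846},
    "GeForce GTX 1070 Ti": {"FFT SP": 557.46, "MD5 Hash": 13.83, "cl-mem Copy": 188.20, "LuxMark Hotel": 4461},
    "GeForce GTX 1080 Ti": {"FFT SP": 988.42, "MD5 Hash": 20.20, "cl-mem Copy": 319.20, "LuxMark Hotel": 5707},
}


def geometric_mean(values):
    """Geometric mean of an iterable of positive numbers."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


def relative_scores(results, baseline):
    """Geometric mean of each result's per-test ratio against the baseline."""
    base = results[baseline]
    return {
        gpu: geometric_mean(scores[test] / base[test] for test in base)
        for gpu, scores in results.items()
    }


scores = relative_scores(results, "GeForce GTX 1050")
for gpu, score in sorted(scores.items(), key=lambda item: item[1]):
    print(f"{gpu:<20} {score:.2f}x")
```

The geometric mean is used here because the tests report different units and magnitudes, so ratios are averaged rather than raw values. A different baseline or test selection would change the figures.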

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better

Result                 Avg       SE      N    Min       Max
GeForce GTX 1050       248.78    8.24    6    207.77    258.96
GeForce GTX 1080 Ti    988.42    2.56    3    984.7     993.33
GeForce GTX 1050 Ti    223.70    4.57    6    218.24    246.52
GeForce GTX 1070 Ti    557.46    0.16    3    557.25    557.77
GeForce GTX 1070       516.41    7.97    3    500.47    524.47
GeForce GTX 1060       350.64    4.21    3    342.23    355.2

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: MD5 Hash
GHash/s, More Is Better

Result                 Avg      SE      N    Min      Max
GeForce GTX 1050       3.24     0.00    3    3.24     3.24
GeForce GTX 1080 Ti    20.20    0.04    3    20.15    20.28
GeForce GTX 1050 Ti    4.12     0.00    3    4.12     4.12
GeForce GTX 1070 Ti    13.83    0.00    3    13.83    13.84
GeForce GTX 1070       10.74    0.00    3    10.74    10.74
GeForce GTX 1060       7.38     0.00    3    7.38     7.39

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
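The MD5 Hash averages can be read against the GPU compute core counts listed in the OpenCL details. The following is a rough sketch of that per-core normalization; it ignores clock-speed differences between cards, and `per_core_mhash` is a name invented for this example.

```python
# GPU compute core counts from the OpenCL details of this result file.
compute_cores = {
    "GeForce GTX 1050": 640,
    "GeForce GTX 1050 Ti": 768,
    "GeForce GTX 1060": 1280,
    "GeForce GTX 1070": 1920,
    "GeForce GTX 1070 Ti": 2432,
    "GeForce GTX 1080 Ti": 3584,
}

# SHOC MD5 Hash averages in GHash/s.
md5_ghash = {
    "GeForce GTX 1050": 3.24,
    "GeForce GTX 1050 Ti": 4.12,
    "GeForce GTX 1060": 7.38,
    "GeForce GTX 1070": 10.74,
    "GeForce GTX 1070 Ti": 13.83,
    "GeForce GTX 1080 Ti": 20.20,
}


def per_core_mhash(ghash_per_s, cores):
    """Convert a GHash/s figure to MHash/s per compute core."""
    return ghash_per_s * 1000.0 / cores


per_core = {
    gpu: per_core_mhash(md5_ghash[gpu], compute_cores[gpu]) for gpu in md5_ghash
}

for gpu, value in per_core.items():
    print(f"{gpu:<20} {value:.2f} MHash/s per core")
```

Because core clocks differ between the cards, the per-core figures are only a coarse indication of how closely this test tracks core count.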

SHOC Scalable HeterOgeneous Computing 2015-11-10
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s, More Is Better

Result                 Avg       SE      N    Min       Max
GeForce GTX 1050       308.56    1.01    3    306.82    310.3
GeForce GTX 1080 Ti    606.73    3.78    3    599.87    612.9
GeForce GTX 1050 Ti    336.67    1.20    3    334.94    338.98
GeForce GTX 1070 Ti    510.18    0.86    3    508.86    511.79
GeForce GTX 1070       458.99    0.13    3    458.75    459.2
GeForce GTX 1060       417.80    2.05    3    414.79    421.73

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile uses ViennaCL OpenCL support and runs the included computational benchmark. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.4.2
OpenCL LU Factorization
GFLOPS, More Is Better

Result                 Avg      SE      N    Min      Max
GeForce GTX 1050       45.05    0.01    3    45.03    45.06
GeForce GTX 1080 Ti    69.22    0.37    3    68.49    69.67
GeForce GTX 1050 Ti    49.55    0.01    3    49.54    49.56
GeForce GTX 1070 Ti    66.49    0.02    3    66.46    66.53
GeForce GTX 1070       64.27    0.03    3    64.21    64.32
GeForce GTX 1060       59.01    0.01    3    58.99    59.03

1. (CXX) g++ options: -rdynamic -lOpenCL

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13
Benchmark: Copy
GB/s, More Is Better

Result                 Avg       SE      N    Min      Max
GeForce GTX 1050       88.70     0.30    3    88.1     89
GeForce GTX 1080 Ti    319.20    0.00    3    319.2    319.2
GeForce GTX 1050 Ti    88.17     0.03    3    88.1     88.2
GeForce GTX 1070 Ti    188.20    0.00    3    188.2    188.2
GeForce GTX 1070       188.23    0.07    3    188.1    188.3
GeForce GTX 1060       140.57    0.07    3    140.5    140.7

1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13
Benchmark: Read
GB/s, More Is Better

Result                 Avg       SE      N    Min      Max
GeForce GTX 1050       96.00     -       -    -        -
GeForce GTX 1080 Ti    339.23    0.78    3    337.7    340.2
GeForce GTX 1050 Ti    95.17     0.03    3    95.1     95.2
GeForce GTX 1070 Ti    206.50    0.15    3    206.3    206.8
GeForce GTX 1070       206.30    0.12    3    206.1    206.5
GeForce GTX 1060       154.47    0.07    3    154.4    154.6

Only five SE and Min/Max entries are present for this test; matched by their Avg values, the GeForce GTX 1050 entry is the one missing.

1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13
Benchmark: Write
GB/s, More Is Better

Result                 Avg       SE      N    Min      Max
GeForce GTX 1050       90.20     0.00    3    90.2     90.2
GeForce GTX 1080 Ti    344.10    0.21    3    343.8    344.5
GeForce GTX 1050 Ti    89.10     0.00    3    89.1     89.1
GeForce GTX 1070 Ti    196.53    0.03    3    196.5    196.6
GeForce GTX 1070       197.63    0.03    3    197.6    197.7
GeForce GTX 1060       145.60    0.00    3    145.6    145.6

1. (CC) gcc options: -O2 -flto -lOpenCL

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.0.64
Scene: Bedroom
M samples/s, More Is Better

Result                 Avg     SE      N    Min     Max
GeForce GTX 1050       1.42    0.00    3    1.42    1.43
GeForce GTX 1080 Ti    5.32    0.00    3    5.32    5.32
GeForce GTX 1050 Ti    1.65    0.00    3    1.65    1.65
GeForce GTX 1070 Ti    4.15    0.00    3    4.15    4.15
GeForce GTX 1070       3.77    0.00    3    3.77    3.78
GeForce GTX 1060       2.65    0.00    3    2.65    2.65

IndigoBench 4.0.64
Scene: Supercar
M samples/s, More Is Better

Result                 Avg      SE      N    Min      Max
GeForce GTX 1050       4.64     0.00    3    4.64     4.65
GeForce GTX 1080 Ti    17.55    0.00    3    17.54    17.56
GeForce GTX 1050 Ti    5.48     0.00    3    5.48     5.48
GeForce GTX 1070 Ti    13.87    0.00    3    13.87    13.88
GeForce GTX 1070       12.41    0.01    3    12.39    12.43
GeForce GTX 1060       8.74     0.00    3    8.74     8.75

JuliaGPU

JuliaGPU is an OpenCL benchmark with this version containing various PTS-specific enhancements. Learn more via the OpenBenchmarking.org test page.

JuliaGPU 1.2pts1
OpenCL Device: GPU
Samples/sec, More Is Better

Result                 Avg             SE           N    Min            Max
GeForce GTX 1050       68020911.20     85767.34     3    67875552.7     68172465.6
GeForce GTX 1080 Ti    209807323.37    588976.70    3    208631664.3    210458792.4
GeForce GTX 1050 Ti    82228551.07     158080.14    3    81921565.4     82447525.5
GeForce GTX 1070 Ti    177590252.53    280197.95    3    177087112      178055527.9
GeForce GTX 1070       155907964.80    396739.11    3    155115056.4    156330457
GeForce GTX 1060       123277207.17    228331.44    3    122820627.7    123513055.9

1. (CC) gcc options: -O3 -march=native -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL -lm

MandelGPU

MandelGPU is an OpenCL benchmark and this test runs with the OpenCL rendering float4 kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1
OpenCL Device: GPU
Samples/sec, More Is Better

Result                 Avg             SE           N    Min            Max
GeForce GTX 1050       48526868.77     25451.70     3    48488728.4     48575134
GeForce GTX 1080 Ti    264945183.70    328049.57    3    264569716.8    265598876.4
GeForce GTX 1050 Ti    61788066.47     121552.05    3    61545306.9     61920651.1
GeForce GTX 1070 Ti    189306428.90    264378.94    3    188781723.1    189625363.9
GeForce GTX 1070       153001211.87    152368.68    3    152696486.5    153155915.5
GeForce GTX 1060       106989159.30    36638.63     3    106929842.7    107056077.8

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.1
OpenCL Device: GPU - Scene: Hotel
Score, More Is Better

Result                 Avg     SE       N    Min     Max
test                   3927    16.15    3    3906    3959
GeForce GTX 1050       1385    1.67     3    1382    1387
GeForce GTX 1080 Ti    5707    25.01    3    5660    5745
GeForce GTX 1050 Ti    1634    1.33     3    1631    1635
GeForce GTX 1070 Ti    4461    -        -    -       -
GeForce GTX 1070       3846    20.67    3    3805    3867
GeForce GTX 1060       2630    0.33     3    2630    2631

Only six SE and Min/Max entries are present for this test; matched by their Avg values, the GeForce GTX 1070 Ti entry is the one missing.
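The SE figures reported with each result appear consistent with the standard error of the mean, that is, the sample standard deviation divided by the square root of N. The sketch below checks this for the GeForce GTX 1060 Hotel result. The three individual runs are not published on this page; `[2630, 2630, 2631]` is a reconstruction that assumes integer scores and is the only such triple matching Min 2630, Avg 2630.33 and Max 2631.

```python
from math import sqrt
from statistics import mean, stdev


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return stdev(samples) / sqrt(len(samples))


# Reconstructed runs for GeForce GTX 1060, LuxMark Hotel (an assumption,
# see the note above; the page only reports Min, Avg, Max, SE and N).
runs = [2630, 2630, 2631]

avg = mean(runs)
se = standard_error(runs)
print(f"avg={avg:.2f} se={se:.2f} n={len(runs)}")
```

Whether the Phoronix Test Suite computes SE in exactly this way is not stated on the page; the formula is simply the usual definition and reproduces the reported value for this case.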

LuxMark 3.1
OpenCL Device: GPU - Scene: Microphone
Score, More Is Better

Result                 Avg      SE       N    Min      Max
test                   10061    27.20    3    10031    10115
GeForce GTX 1050       4122     16.17    3    4105     4154
GeForce GTX 1080 Ti    13781    34.18    3    13745    13849
GeForce GTX 1050 Ti    4703     8.17     3    4687     4712
GeForce GTX 1070 Ti    10620    0.88     3    10619    10622
GeForce GTX 1070       9982     0.88     3    9981     9984
GeForce GTX 1060       6965     1.33     3    6964     6968

13 Results Shown

SHOC Scalable HeterOgeneous Computing:
  OpenCL - FFT SP
  OpenCL - MD5 Hash
  OpenCL - Texture Read Bandwidth
ViennaCL
cl-mem:
  Copy
  Read
  Write
IndigoBench:
  Bedroom
  Supercar
JuliaGPU
MandelGPU
LuxMark:
  GPU - Hotel
  GPU - Microphone