OpenCL August

Fresh NVIDIA vs. Radeon OpenCL Linux benchmarks. Tests by Michael Larabel for a future article on Phoronix.com.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1812246-SK-1808234PT49
Run Management

| Result Identifier | Date Run | Test Duration |
|---|---|---|
| Radeon RX Vega 56 | August 23 2018 | 15 Minutes |
| Radeon RX Vega 64 | August 23 2018 | 14 Minutes |
| GeForce GTX 1070 | August 23 2018 | 14 Minutes |
| GeForce GTX 1070 Ti | August 23 2018 | 14 Minutes |
| GeForce GTX 1080 | August 23 2018 | 15 Minutes |
| GeForce GTX 1080 Ti | August 23 2018 | 14 Minutes |
| Mobile GeForce 1050 | December 19 2018 | 4 Minutes |
| NVIDIA GeForce GTX 1050 | December 24 2018 | 18 Minutes |



OpenCL August: System Details

Radeon RX Vega 56, Radeon RX Vega 64:
  Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
  Motherboard: ASUS ROG ZENITH EXTREME (1402 BIOS)
  Chipset: AMD Family 17h
  Memory: 32768MB
  Disk: Samsung SSD 970 EVO 500GB
  Graphics: AMD Radeon RX Vega 8176MB
  Audio: Realtek ALC1220
  Monitor: ASUS VP28U
  Network: Intel I211 Gigabit Connection + Qualcomm Atheros QCA6174 802.11ac Wireless
  OS: Ubuntu 18.04
  Kernel: 4.15.0-33-generic (x86_64)
  Desktop: GNOME Shell 3.28.3
  Display Server: X Server 1.19.6
  Display Driver: amdgpu 18.0.99
  OpenGL: 4.6.13536
  OpenCL: OpenCL 2.1 AMD-APP (2671.3)
  Compiler: GCC 7.3.0
  File-System: ext4
  Screen Resolution: 3840x2160

For the remaining systems, the result file lists only the fields that change from the system before them:

GeForce GTX 1070:
  Graphics: NVIDIA GeForce GTX 1070 8192MB (1506/4006MHz)
  Display Driver: NVIDIA 396.54
  OpenGL: 4.6.0
  OpenCL: OpenCL 1.2 CUDA 9.2.210
GeForce GTX 1070 Ti:
  Graphics: Zotac NVIDIA GeForce GTX 1070 Ti 8192MB (1607/4006MHz)
GeForce GTX 1080:
  Graphics: NVIDIA GeForce GTX 1080 8192MB (1607/5005MHz)
GeForce GTX 1080 Ti:
  Graphics: NVIDIA GeForce GTX 1080 Ti 11264MB (1480/5508MHz)
Mobile GeForce 1050:
  Processor: Intel Core i7-7700HQ @ 3.80GHz (4 Cores / 8 Threads)
  Motherboard: Dell 0YH90J (1.12.1 BIOS)
  Chipset: Intel Xeon E3-1200 v6/7th
  Memory: 16384MB
  Disk: PM961 NVMe SAMSUNG 512GB
  Graphics: NVIDIA GeForce GTX 1050 4GB (151/405MHz)
  Audio: Realtek ALC3266
  Network: Intel Wireless 8265 / 8275
  Kernel: 4.15.0-42-generic (x86_64)
  Display Driver: NVIDIA 390.77
  Compiler: GCC 7.3.0 + CUDA 9.1
NVIDIA GeForce GTX 1050:
  Graphics: NVIDIA GeForce GTX 1050 4GB (1759/3504MHz)
  Kernel: 4.15.0-43-generic (x86_64)
  OpenGL: 4.6.0

Compiler Details
- Radeon RX Vega 56, Radeon RX Vega 64, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1080, GeForce GTX 1080 Ti: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-as=/usr/bin/x86_64-linux-gnu-as --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-ld=/usr/bin/x86_64-linux-gnu-ld --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
- Mobile GeForce 1050, NVIDIA GeForce GTX 1050: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v

Processor Details
- Radeon RX Vega 56, Radeon RX Vega 64, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1080, GeForce GTX 1080 Ti: Scaling Governor: acpi-cpufreq ondemand
- Mobile GeForce 1050, NVIDIA GeForce GTX 1050: Scaling Governor: intel_pstate powersave

Graphics Details
- Radeon RX Vega 56, Radeon RX Vega 64: GLAMOR

Python Details
- Radeon RX Vega 56: Python 2.7.15rc1 + Python 3.6.5

Security Details
- Radeon RX Vega 56, Radeon RX Vega 64, GeForce GTX 1070, GeForce GTX 1070 Ti, GeForce GTX 1080, GeForce GTX 1080 Ti: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp Protection
- Mobile GeForce 1050, NVIDIA GeForce GTX 1050: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

OpenCL Details
- GeForce GTX 1070: GPU Compute Cores: 1920
- GeForce GTX 1070 Ti: GPU Compute Cores: 2432
- GeForce GTX 1080: GPU Compute Cores: 2560
- GeForce GTX 1080 Ti: GPU Compute Cores: 3584
- Mobile GeForce 1050: GPU Compute Cores: 640
- NVIDIA GeForce GTX 1050: GPU Compute Cores: 640

Result Overview (Phoronix Test Suite chart, relative scale from 100% to 599%). Tests shown: SHOC Scalable HeterOgeneous Computing (OpenCL - MD5 Hash, OpenCL - FFT SP, OpenCL - T.R.B) and cl-mem (Read, Write, Copy).

OpenCL August: Results Summary

| System | shoc: OpenCL - FFT SP | shoc: OpenCL - MD5 Hash | shoc: OpenCL - Texture Read Bandwidth | cl-mem: Copy | cl-mem: Read | cl-mem: Write | fahbench | mandelgpu: GPU | luxmark: GPU - Hotel |
|---|---|---|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 831 | 13.10 | 363 | 313.30 | 346.47 | 333.20 | 92.48 | 165896206 | 5083 |
| Radeon RX Vega 64 | 883 | 17.12 | 428 | 369.93 | 399.00 | 388.57 | 92.46 | 192917451 | 5907 |
| GeForce GTX 1070 | 529 | 10.70 | 456 | 186.87 | 205.50 | 192.30 | 131.18 | 148507151 | 3820 |
| GeForce GTX 1070 Ti | 552 | 13.80 | 501 | 186.80 | 205.63 | 191 | 145.38 | 184334944 | 4405 |
| GeForce GTX 1080 | 651 | 14.40 | 524 | 209.33 | 228.40 | 216.70 | 141.38 | 188560527 | 3883 |
| GeForce GTX 1080 Ti | 985 | 19.72 | 593 | 317.37 | 337.73 | 336.30 | 179.34 | 250678651 | 5662 |
| Mobile GeForce 1050 | 224 | 3.31 | 286 | 85.37 | 91.33 | 89.50 | | | |
| NVIDIA GeForce GTX 1050 | 220 | 3.29 | 288 | 88.30 | 94.67 | 93.17 | 45.08 | 53151810 | 1318 |
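The Result Overview scale of 100% to 599% is consistent with each test being normalized to its slowest system, although the page does not state the method. The following is a minimal Python sketch under that assumption, using the MD5 Hash figures from the results above; the names `md5_ghash` and `relative_percent` are illustrative and not part of the Phoronix Test Suite.

```python
# MD5 Hash results in GHash/s, copied from the results above.
md5_ghash = {
    "Radeon RX Vega 56": 13.10,
    "Radeon RX Vega 64": 17.12,
    "GeForce GTX 1070": 10.70,
    "GeForce GTX 1070 Ti": 13.80,
    "GeForce GTX 1080": 14.40,
    "GeForce GTX 1080 Ti": 19.72,
    "Mobile GeForce 1050": 3.31,
    "NVIDIA GeForce GTX 1050": 3.29,
}


def relative_percent(results):
    """Scale every result so that the slowest system reads 100%.

    Assumption: "more is better" results normalized to the minimum value.
    """
    baseline = min(results.values())
    return {name: round(100 * value / baseline) for name, value in results.items()}


relative = relative_percent(md5_ghash)
# Print systems from fastest to slowest.
for name, pct in sorted(relative.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:26s} {pct:4d}%")
```

Under this assumption the GeForce GTX 1080 Ti lands at 599% of the NVIDIA GeForce GTX 1050, matching the top of the overview scale.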

SHOC Scalable HeterOgeneous Computing

SHOC is Vetter's Scalable HeterOgeneous Computing benchmark suite, available in CUDA and OpenCL versions. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: FFT SP (GFLOPS, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 831 | 31.13 | 12 | 581.54 | 830.77 | 905.22 |
| Radeon RX Vega 64 | 883 | 14.14 | 12 | 779.11 | 882.96 | 917.19 |
| GeForce GTX 1070 | 529 | 2.10 | 3 | 524.65 | 528.71 | 531.68 |
| GeForce GTX 1070 Ti | 552 | 1.30 | 3 | 550.15 | 551.95 | 554.46 |
| GeForce GTX 1080 | 651 | 1.65 | 3 | 647.7 | 650.78 | 653.36 |
| GeForce GTX 1080 Ti | 985 | 1.72 | 3 | 981.2 | 984.61 | 986.7 |
| Mobile GeForce 1050 | 224 | 1.29 | 3 | 221.75 | 223.79 | 226.19 |
| NVIDIA GeForce GTX 1050 | 220 | 1.84 | 3 | 216.78 | 220.45 | 222.29 |

Additional options, Radeon RX Vega 56 through GeForce GTX 1080 Ti: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional options, Mobile GeForce 1050 and NVIDIA GeForce GTX 1050: -std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft
1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 13.10 | 0.01 | 3 | 13.09 | 13.1 | 13.12 |
| Radeon RX Vega 64 | 17.12 | 0.00 | 3 | 17.11 | 17.12 | 17.12 |
| GeForce GTX 1070 | 10.70 | 0.02 | 3 | 10.68 | 10.7 | 10.74 |
| GeForce GTX 1070 Ti | 13.80 | 0.00 | 3 | 13.8 | 13.8 | 13.8 |
| GeForce GTX 1080 | 14.40 | 0.02 | 3 | 14.37 | 14.4 | 14.43 |
| GeForce GTX 1080 Ti | 19.72 | 0.02 | 3 | 19.7 | 19.72 | 19.75 |
| Mobile GeForce 1050 | 3.31 | 0.00 | 3 | 3.31 | 3.31 | 3.32 |
| NVIDIA GeForce GTX 1050 | 3.29 | 0.01 | 3 | 3.27 | 3.29 | 3.32 |

Additional options, Radeon RX Vega 56 through GeForce GTX 1080 Ti: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional options, Mobile GeForce 1050 and NVIDIA GeForce GTX 1050: -std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft
1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 363 | 1.35 | 3 | 360.38 | 362.97 | 364.92 |
| Radeon RX Vega 64 | 428 | 0.13 | 3 | 427.48 | 427.74 | 427.91 |
| GeForce GTX 1070 | 456 | 0.38 | 3 | 455.35 | 456.1 | 456.51 |
| GeForce GTX 1070 Ti | 501 | 1.71 | 3 | 498.07 | 501.46 | 503.51 |
| GeForce GTX 1080 | 524 | 2.76 | 3 | 518.17 | 523.65 | 527.03 |
| GeForce GTX 1080 Ti | 593 | 0.88 | 3 | 591.8 | 593.14 | 594.79 |
| Mobile GeForce 1050 | 286 | 4.16 | 3 | 278.17 | 286.22 | 292.05 |
| NVIDIA GeForce GTX 1050 | 288 | 0.67 | 3 | 286.94 | 287.61 | 288.96 |

Additional options, Radeon RX Vega 56 through GeForce GTX 1080 Ti: -lSHOCCommonMPI -lSHOCCommonOpenCL -lOpenCL -pthread -lmpi_cxx -lmpi
Additional options, Mobile GeForce 1050 and NVIDIA GeForce GTX 1050: -std=c++14 -lcudadevrt -lcudart_static -lpthread -ldl -lcufft
1. (CXX) g++ options: -O2 -lSHOCCommon -lrt

cl-mem

A basic OpenCL memory benchmark. Learn more via the OpenBenchmarking.org test page.

cl-mem 2017-01-13, Benchmark: Copy (GB/s, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 313.30 | 0.30 | 3 | 313 | 313.3 | 313.9 |
| Radeon RX Vega 64 | 369.93 | 0.03 | 3 | 369.9 | 369.93 | 370 |
| GeForce GTX 1070 | 186.87 | 0.03 | 3 | 186.8 | 186.87 | 186.9 |
| GeForce GTX 1070 Ti | 186.80 | 0.00 | 3 | 186.8 | 186.8 | 186.8 |
| GeForce GTX 1080 | 209.33 | 0.03 | 3 | 209.3 | 209.33 | 209.4 |
| GeForce GTX 1080 Ti | 317.37 | 0.15 | 3 | 317.1 | 317.37 | 317.6 |
| Mobile GeForce 1050 | 85.37 | 1.27 | 3 | 83.5 | 85.37 | 87.8 |
| NVIDIA GeForce GTX 1050 | 88.30 | 0.21 | 3 | 87.9 | 88.3 | 88.6 |

1. (CC) gcc options: -O2 -flto -lOpenCL
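The "SE +/-" figures can be cross-checked against the Min / Avg / Max values when N = 3, because the middle sample follows from the average. The sketch below does this for the Mobile GeForce 1050 Copy run (Min 83.5, Avg 85.37, Max 87.8). It assumes that SE means the standard error of the mean (sample standard deviation divided by the square root of N) and that individual samples are reported to one decimal place; the page states neither, so treat both as assumptions.

```python
import math
import statistics

# Reported for the Mobile GeForce 1050 cl-mem Copy run (GB/s).
reported_min, reported_avg, reported_max = 83.5, 85.37, 87.8
n_runs = 3

# With three runs, the middle sample follows from the average.
# Assumption: samples are reported to one decimal place.
middle = round(n_runs * reported_avg - reported_min - reported_max, 1)
samples = [reported_min, middle, reported_max]

# Assumed definition: standard error of the mean =
# sample standard deviation / sqrt(N).
standard_error = statistics.stdev(samples) / math.sqrt(n_runs)
print(f"samples={samples} SE +/- {standard_error:.2f}")
```

With these inputs the reconstructed middle sample is 84.8 GB/s and the computed standard error rounds to 1.27, which agrees with the reported "SE +/- 1.27, N = 3".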

cl-mem 2017-01-13, Benchmark: Read (GB/s, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 346.47 | 0.52 | 3 | 345.9 | 346.47 | 347.5 |
| Radeon RX Vega 64 | 399.00 | 0.10 | 3 | 398.8 | 399 | 399.1 |
| GeForce GTX 1070 | 205.50 | 0.10 | 3 | 205.3 | 205.5 | 205.6 |
| GeForce GTX 1070 Ti | 205.63 | 0.09 | 3 | 205.5 | 205.63 | 205.8 |
| GeForce GTX 1080 | 228.40 | 0.20 | 3 | 228.2 | 228.4 | 228.8 |
| GeForce GTX 1080 Ti | 337.73 | 0.30 | 3 | 337.3 | 337.73 | 338.3 |
| Mobile GeForce 1050 | 91.33 | 0.43 | 3 | 90.9 | 91.33 | 92.2 |
| NVIDIA GeForce GTX 1050 | 94.67 | 0.12 | 3 | 94.5 | 94.67 | 94.9 |

1. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem 2017-01-13, Benchmark: Write (GB/s, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 333.20 | 0.40 | 3 | 332.4 | 333.2 | 333.7 |
| Radeon RX Vega 64 | 388.57 | 0.71 | 3 | 387.2 | 388.57 | 389.6 |
| GeForce GTX 1070 | 192.30 | 0.00 | 3 | 192.3 | 192.3 | 192.3 |
| GeForce GTX 1070 Ti | 191.00 | | | | | |
| GeForce GTX 1080 | 216.70 | 0.06 | 3 | 216.6 | 216.7 | 216.8 |
| GeForce GTX 1080 Ti | 336.30 | 0.10 | 3 | 336.1 | 336.3 | 336.4 |
| Mobile GeForce 1050 | 89.50 | 0.80 | 3 | 88.6 | 89.5 | 91.1 |
| NVIDIA GeForce GTX 1050 | 93.17 | 0.12 | 3 | 93 | 93.17 | 93.4 |

1. (CC) gcc options: -O2 -flto -lOpenCL

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

FAHBench 2.3.2 (Ns Per Day, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 92.48 | 0.14 | 3 | 92.21 | 92.48 | 92.66 |
| Radeon RX Vega 64 | 92.46 | 0.82 | 3 | 91.6 | 92.46 | 94.11 |
| GeForce GTX 1070 | 131.18 | 0.18 | 3 | 130.85 | 131.18 | 131.45 |
| GeForce GTX 1070 Ti | 145.38 | 0.17 | 3 | 145.04 | 145.38 | 145.61 |
| GeForce GTX 1080 | 141.38 | 0.05 | 3 | 141.29 | 141.38 | 141.47 |
| GeForce GTX 1080 Ti | 179.34 | 0.34 | 3 | 178.71 | 179.34 | 179.89 |
| NVIDIA GeForce GTX 1050 | 45.08 | 0.01 | 3 | 45.07 | 45.08 | 45.09 |

MandelGPU

MandelGPU is an OpenCL benchmark; this test runs its float4 rendering kernel with a maximum of 4096 iterations. Learn more via the OpenBenchmarking.org test page.

MandelGPU 1.3pts1, OpenCL Device: GPU (Samples/sec, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 165896206 | 144327.89 | 3 | 165610148.9 | 165896206.17 | 166072701.9 |
| Radeon RX Vega 64 | 192917451 | 234631.71 | 3 | 192543113.9 | 192917451.23 | 193349690.9 |
| GeForce GTX 1070 | 148507151 | 483103.56 | 3 | 147696346.1 | 148507150.87 | 149367650.9 |
| GeForce GTX 1070 Ti | 184334944 | 572304.04 | 3 | 183208207.8 | 184334944.1 | 185072796.4 |
| GeForce GTX 1080 | 188560527 | 430789.92 | 3 | 187932275.9 | 188560526.97 | 189385255.4 |
| GeForce GTX 1080 Ti | 250678651 | 727895.77 | 3 | 249279724.8 | 250678650.87 | 251727042.5 |
| NVIDIA GeForce GTX 1050 | 53151810 | 130218.42 | 3 | 52906321.9 | 53151810.1 | 53349868 |

1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.1, OpenCL Device: GPU - Scene: Hotel (Score, More Is Better)

| System | Result | SE +/- | N | Min | Avg | Max |
|---|---|---|---|---|---|---|
| Radeon RX Vega 56 | 5083 | 4.18 | 3 | 5078 | 5082.67 | 5091 |
| Radeon RX Vega 64 | 5907 | 32.69 | 3 | 5872 | 5906.67 | 5972 |
| GeForce GTX 1070 | 3820 | 1.33 | 3 | 3819 | 3820.33 | 3823 |
| GeForce GTX 1070 Ti | 4405 | | | | | |
| GeForce GTX 1080 | 3883 | | | | | |
| GeForce GTX 1080 Ti | 5662 | 12.60 | 3 | 5643 | 5662.33 | 5686 |
| NVIDIA GeForce GTX 1050 | 1318 | 1.15 | 3 | 1316 | 1318 | 1320 |

9 Results Shown

SHOC Scalable HeterOgeneous Computing:
  OpenCL - FFT SP
  OpenCL - MD5 Hash
  OpenCL - Texture Read Bandwidth
cl-mem:
  Copy
  Read
  Write
FAHBench
MandelGPU
LuxMark