RTX 2060 OpenCL AMD Ryzen Threadripper 2950X 16-Core testing with a MSI MEG X399 CREATION (MS-7B92) v1.0 (1.10 BIOS) and NVIDIA GeForce RTX 2060 6GB on Ubuntu 18.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/1901210-SP-RTX2060OP70 .
RTX 2060 OpenCL Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Vulkan Compiler File-System Screen Resolution RTX 2060 AMD Ryzen Threadripper 2950X 16-Core @ 3.50GHz (16 Cores / 32 Threads) MSI MEG X399 CREATION (MS-7B92) v1.0 (1.10 BIOS) AMD Family 17h 32768MB 16GB Voyager 3.0 + Samsung SSD 970 EVO 250GB NVIDIA GeForce RTX 2060 6GB (1365/7000MHz) Realtek ALC1220 ASUS PB278 2 x Intel I211 + Intel-AC 9260 Ubuntu 18.10 4.18.0-13-generic (x86_64) GNOME Shell 3.30.1 X Server 1.20.1 NVIDIA 415.27 4.6.0 1.1.84 GCC 8.2.0 ext4 2560x1440 OpenBenchmarking.org - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand - GPU Compute Cores: 1920 - Python 2.7.15+ + Python 3.6.7 - __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp
RTX 2060 OpenCL shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - MD5 Hash shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Texture Read Bandwidth cl-mem: Copy cl-mem: Read cl-mem: Write plaidml: No - Training - VGG16 - OpenCL plaidml: No - Training - VGG19 - OpenCL plaidml: No - Inference - VGG16 - OpenCL plaidml: No - Inference - VGG19 - OpenCL plaidml: Yes - Inference - VGG16 - OpenCL plaidml: Yes - Inference - VGG19 - OpenCL plaidml: No - Training - IMDB LSTM - OpenCL plaidml: No - Training - Mobilenet - OpenCL plaidml: No - Training - ResNet 50 - OpenCL plaidml: No - Inference - IMDB LSTM - OpenCL plaidml: No - Inference - Mobilenet - OpenCL plaidml: No - Inference - ResNet 50 - OpenCL lczero: BLAS lczero: OpenCL luxmark: GPU - Hotel luxmark: GPU - Microphone luxmark: GPU - Luxball HDR clpeak: Kernel Latency clpeak: Integer Compute INT clpeak: Single-Precision Float clpeak: Double-Precision Double clpeak: Global Memory Bandwidth clpeak: Transfer Bandwidth enqueueReadBuffer clpeak: Transfer Bandwidth enqueueWriteBuffer RTX 2060 9.83 823 16.23 7301 13.12 7.07 1002 250.20 297.23 257.10 2033.22 2021.69 149.56 126.01 146.47 125.26 147 104.66 1091.97 315 719 299 138 1258.09 4825 13753 21599 5.33 6932.70 7062.37 227.82 276.33 6.79 12.73 OpenBenchmarking.org
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Triad RTX 2060 3 6 9 12 15 SE +/- 0.02, N = 3 9.83 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: FFT SP RTX 2060 200 400 600 800 1000 SE +/- 0.51, N = 3 823 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: MD5 Hash OpenBenchmarking.org GHash/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: MD5 Hash RTX 2060 4 8 12 16 20 SE +/- 0.03, N = 3 16.23 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Max SP Flops RTX 2060 1600 3200 4800 6400 8000 SE +/- 57.25, N = 3 7301 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Download RTX 2060 3 6 9 12 15 SE +/- 0.00, N = 3 13.12 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Bus Speed Readback RTX 2060 2 4 6 8 10 SE +/- 0.04, N = 3 7.07 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Texture Read Bandwidth OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2015-11-10 Target: OpenCL - Benchmark: Texture Read Bandwidth RTX 2060 200 400 600 800 1000 SE +/- 3.42, N = 3 1002 1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
cl-mem Benchmark: Copy OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Copy RTX 2060 50 100 150 200 250 SE +/- 0.20, N = 3 250.20 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Read OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Read RTX 2060 60 120 180 240 300 SE +/- 0.09, N = 3 297.23 1. (CC) gcc options: -O2 -flto -lOpenCL
cl-mem Benchmark: Write OpenBenchmarking.org GB/s, More Is Better cl-mem 2017-01-13 Benchmark: Write RTX 2060 60 120 180 240 300 SE +/- 1.25, N = 3 257.10 1. (CC) gcc options: -O2 -flto -lOpenCL
PlaidML FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Training - Network: VGG16 - Device: OpenCL RTX 2060 400 800 1200 1600 2000 SE +/- 21.70, N = 3 2033.22
PlaidML FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Training - Network: VGG19 - Device: OpenCL RTX 2060 400 800 1200 1600 2000 SE +/- 6.81, N = 3 2021.69
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: OpenCL RTX 2060 30 60 90 120 150 SE +/- 0.14, N = 3 149.56
PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: OpenCL RTX 2060 30 60 90 120 150 SE +/- 0.11, N = 3 126.01
PlaidML FP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: VGG16 - Device: OpenCL RTX 2060 30 60 90 120 150 SE +/- 0.12, N = 3 146.47
PlaidML FP16: Yes - Mode: Inference - Network: VGG19 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: Yes - Mode: Inference - Network: VGG19 - Device: OpenCL RTX 2060 30 60 90 120 150 SE +/- 0.08, N = 3 125.26
PlaidML FP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Training - Network: IMDB LSTM - Device: OpenCL RTX 2060 30 60 90 120 150 SE +/- 0.88, N = 3 147
PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Training - Network: Mobilenet - Device: OpenCL RTX 2060 20 40 60 80 100 SE +/- 0.10, N = 3 104.66
PlaidML FP16: No - Mode: Training - Network: ResNet 50 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Training - Network: ResNet 50 - Device: OpenCL RTX 2060 200 400 600 800 1000 1091.97
PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Inference - Network: IMDB LSTM - Device: OpenCL RTX 2060 70 140 210 280 350 SE +/- 0.79, N = 3 315
PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Inference - Network: Mobilenet - Device: OpenCL RTX 2060 160 320 480 640 800 SE +/- 8.83, N = 3 719
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL OpenBenchmarking.org Examples Per Second, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: OpenCL RTX 2060 70 140 210 280 350 SE +/- 0.17, N = 3 299
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.20.1 Backend: BLAS RTX 2060 30 60 90 120 150 SE +/- 2.18, N = 9 138 1. (CXX) g++ options: -lpthread
LeelaChessZero Backend: OpenCL OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.20.1 Backend: OpenCL RTX 2060 300 600 900 1200 1500 SE +/- 8.89, N = 3 1258.09 1. (CXX) g++ options: -lpthread
LuxMark OpenCL Device: GPU - Scene: Hotel OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Hotel RTX 2060 1000 2000 3000 4000 5000 SE +/- 19.88, N = 3 4825
LuxMark OpenCL Device: GPU - Scene: Microphone OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Microphone RTX 2060 3K 6K 9K 12K 15K SE +/- 11.79, N = 3 13753
LuxMark OpenCL Device: GPU - Scene: Luxball HDR OpenBenchmarking.org Score, More Is Better LuxMark 3.1 OpenCL Device: GPU - Scene: Luxball HDR RTX 2060 5K 10K 15K 20K 25K SE +/- 68.17, N = 3 21599
clpeak OpenCL Test: Kernel Latency OpenBenchmarking.org us, Fewer Is Better clpeak OpenCL Test: Kernel Latency RTX 2060 1.1993 2.3986 3.5979 4.7972 5.9965 SE +/- 0.10, N = 3 5.33 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Integer Compute INT OpenBenchmarking.org GIOPS, More Is Better clpeak OpenCL Test: Integer Compute INT RTX 2060 1500 3000 4500 6000 7500 SE +/- 132.84, N = 11 6932.70 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Single-Precision Float OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Single-Precision Float RTX 2060 1500 3000 4500 6000 7500 SE +/- 136.37, N = 11 7062.37 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Double-Precision Double OpenBenchmarking.org GFLOPS, More Is Better clpeak OpenCL Test: Double-Precision Double RTX 2060 50 100 150 200 250 SE +/- 0.02, N = 3 227.82 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Global Memory Bandwidth OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Global Memory Bandwidth RTX 2060 60 120 180 240 300 SE +/- 0.14, N = 3 276.33 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueReadBuffer RTX 2060 2 4 6 8 10 SE +/- 0.00, N = 3 6.79 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer OpenBenchmarking.org GBPS, More Is Better clpeak OpenCL Test: Transfer Bandwidth enqueueWriteBuffer RTX 2060 3 6 9 12 15 SE +/- 0.00, N = 3 12.73 1. (CXX) g++ options: -O3 -rdynamic -lOpenCL
Phoronix Test Suite v10.8.4