NVIDIA GeForce RTX 2070

Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1810180-SK-NVIDIAGEF35.

NVIDIA GeForce RTX 2070ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionNVIDIA GeForce RTX 2070AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1402 BIOS)AMD Family 17h32768MBSamsung SSD 970 EVO 500GBeVGA NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Realtek ALC1220ASUS VP28UIntel I211 Gigabit ConnectionUbuntu 18.044.18.0-041800-generic (x86_64)GNOME Shell 3.28.3X Server 1.19.6NVIDIA 410.664.6.0OpenCL 1.2 CUDA 10.0.175GCC 7.3.0 + CUDA 10.0ext43840x2160OpenBenchmarking.org- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq ondemand- GPU Compute Cores: 2304- Python 2.7.15rc1 + Python 3.6.6- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp

NVIDIA GeForce RTX 2070financebench: Monte-Carlo OpenCLfinancebench: Black-Scholes OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthaskap: Griddingaskap: Degriddingcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerongc-tensorflow: VGG-16, FP16ngc-tensorflow: VGG-16, FP32ngc-tensorflow: AlexNet, FP16ngc-tensorflow: AlexNet, FP32ngc-tensorflow: Googlenet, FP16ngc-tensorflow: ResNet-50, FP16ngc-tensorflow: ResNet-50, FP32ngc-tensorflow: Inception v4, FP16parboil: OpenCL BFSparboil: OpenCL LBMparboil: OpenCL TPACFrodinia: OpenCL Myocyterodinia: OpenCL Particle Filterdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferv-ray: CUDA GPUNVIDIA GeForce RTX 207034710.406.50104218.8787586.646.611091131022581917.7913.9214.1617.5217.34138.1799.672713210964229718285.831.418.881.0137.217.619.255.520.152.54634518701301815.5684038535275.723686.496.5566.43OpenBenchmarking.org

FinanceBench

Benchmark: Monte-Carlo OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Monte-Carlo OpenCLNVIDIA GeForce RTX 207080160240320400SE +/- 0.25, N = 33471. (CXX) g++ options: -O3 -lOpenCL

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLNVIDIA GeForce RTX 20703691215SE +/- 0.00, N = 310.401. (CXX) g++ options: -O3 -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.501. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPNVIDIA GeForce RTX 20702004006008001000SE +/- 11.37, N = 1210421. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashNVIDIA GeForce RTX 2070510152025SE +/- 0.00, N = 318.871. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 0.22, N = 387581. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.641. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.611. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthNVIDIA GeForce RTX 20702004006008001000SE +/- 1.22, N = 310911. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

ASKAP tConvolveCuda

Processing: Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingNVIDIA GeForce RTX 20703K6K9K12K15KSE +/- 211.30, N = 3131021. (CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda

Processing: Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingNVIDIA GeForce RTX 20706K12K18K24K30KSE +/- 806.83, N = 3258191. (CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalNVIDIA GeForce RTX 207048121620SE +/- 0.05, N = 317.79

CUDA Mini-Nbody

Test: Cache Blocking

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingNVIDIA GeForce RTX 207048121620SE +/- 0.00, N = 313.92

CUDA Mini-Nbody

Test: Loop Unrolling

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingNVIDIA GeForce RTX 207048121620SE +/- 0.03, N = 314.16

CUDA Mini-Nbody

Test: SOA Data Layout

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutNVIDIA GeForce RTX 207048121620SE +/- 0.02, N = 317.52

CUDA Mini-Nbody

Test: Flush Denormals To Zero

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroNVIDIA GeForce RTX 207048121620SE +/- 0.03, N = 317.34

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: VGG-16, FP16NVIDIA GeForce RTX 2070306090120150SE +/- 0.03, N = 3138.17

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: VGG-16, FP32NVIDIA GeForce RTX 207020406080100SE +/- 0.09, N = 399.67

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: AlexNet, FP16NVIDIA GeForce RTX 20706001200180024003000SE +/- 2.30, N = 32713

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: AlexNet, FP32NVIDIA GeForce RTX 20705001000150020002500SE +/- 2.58, N = 32109

NVIDIA GPU Cloud TensorFlow

Test: Googlenet, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: Googlenet, FP16NVIDIA GeForce RTX 2070140280420560700SE +/- 0.25, N = 3642

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: ResNet-50, FP16NVIDIA GeForce RTX 207060120180240300SE +/- 0.52, N = 3297

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: ResNet-50, FP32NVIDIA GeForce RTX 20704080120160200SE +/- 0.22, N = 3182

NVIDIA GPU Cloud TensorFlow

Test: Inception v4, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: Inception v4, FP16NVIDIA GeForce RTX 207020406080100SE +/- 0.15, N = 385.83

Parboil

Test: OpenCL BFS

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFSNVIDIA GeForce RTX 20700.31730.63460.95191.26921.5865SE +/- 0.01, N = 31.411. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBMNVIDIA GeForce RTX 2070246810SE +/- 0.01, N = 38.881. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL TPACF

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL TPACFNVIDIA GeForce RTX 20700.22730.45460.68190.90921.1365SE +/- 0.01, N = 121.011. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteNVIDIA GeForce RTX 2070918273645SE +/- 0.06, N = 337.211. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterNVIDIA GeForce RTX 2070246810SE +/- 0.03, N = 37.611. (CXX) g++ options: -O2 -lOpenCL

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLNVIDIA GeForce RTX 20703691215SE +/- 0.38, N = 129.25

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLNVIDIA GeForce RTX 20701.2422.4843.7264.9686.21SE +/- 0.06, N = 125.52

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLNVIDIA GeForce RTX 20700.03380.06760.10140.13520.169SE +/- 0.00, N = 120.15

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLNVIDIA GeForce RTX 20700.57151.1431.71452.2862.8575SE +/- 0.03, N = 32.54

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelNVIDIA GeForce RTX 207014002800420056007000SE +/- 11.17, N = 36345

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneNVIDIA GeForce RTX 20704K8K12K16K20KSE +/- 8.35, N = 318701

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRNVIDIA GeForce RTX 20706K12K18K24K30KSE +/- 64.91, N = 330181

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyNVIDIA GeForce RTX 20701.2512.5023.7535.0046.255SE +/- 0.03, N = 35.56

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 111.04, N = 128403

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 172.39, N = 118535

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleNVIDIA GeForce RTX 207060120180240300SE +/- 0.16, N = 3275.72

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthNVIDIA GeForce RTX 207080160240320400SE +/- 0.52, N = 3368

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.49

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.55

Chaos Group V-RAY

Mode: CUDA GPU

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUNVIDIA GeForce RTX 20701530456075SE +/- 0.10, N = 366.43


Phoronix Test Suite v10.8.4