NVIDIA GeForce RTX 2070

Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1810180-SK-NVIDIAGEF35&grr.

NVIDIA GeForce RTX 2070

Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
Motherboard: ASUS ROG ZENITH EXTREME (1402 BIOS)
Chipset: AMD Family 17h
Memory: 32768MB
Disk: Samsung SSD 970 EVO 500GB
Graphics: eVGA NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)
Audio: Realtek ALC1220
Monitor: ASUS VP28U
Network: Intel I211 Gigabit Connection
OS: Ubuntu 18.04
Kernel: 4.18.0-041800-generic (x86_64)
Desktop: GNOME Shell 3.28.3
Display Server: X Server 1.19.6
Display Driver: NVIDIA 410.66
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 10.0.175
Compiler: GCC 7.3.0 + CUDA 10.0
File-System: ext4
Screen Resolution: 3840x2160

- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand
- GPU Compute Cores: 2304
- Python 2.7.15rc1 + Python 3.6.6
- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp

Test                                           NVIDIA GeForce RTX 2070
shoc: OpenCL - Max SP Flops                    8758
luxmark: GPU - Hotel                           6345
luxmark: GPU - Microphone                      18701
luxmark: GPU - Luxball HDR                     30181
ngc-tensorflow: Inception v4, FP16             85.83
ngc-tensorflow: Googlenet, FP16                642
v-ray: CUDA GPU                                66.43
ngc-tensorflow: ResNet-50, FP32                182
ngc-tensorflow: VGG-16, FP32                   99.67
ngc-tensorflow: VGG-16, FP16                   138.17
ngc-tensorflow: ResNet-50, FP16                297
darktable: Boat - OpenCL                       9.25
clpeak: Double-Precision Double                275.72
rodinia: OpenCL Myocyte                        37.21
ngc-tensorflow: AlexNet, FP32                  2109
ngc-tensorflow: AlexNet, FP16                  2713
darktable: Masskrug - OpenCL                   5.52
shoc: OpenCL - Texture Read Bandwidth          1091
cuda-mini-nbody: Original                      17.79
cuda-mini-nbody: SOA Data Layout               17.52
cuda-mini-nbody: Flush Denormals To Zero       17.34
clpeak: Transfer Bandwidth enqueueWriteBuffer  6.55
clpeak: Transfer Bandwidth enqueueReadBuffer   6.49
askap: Gridding                                13102
cuda-mini-nbody: Loop Unrolling                14.16
cuda-mini-nbody: Cache Blocking                13.92
parboil: OpenCL LBM                            8.88
shoc: OpenCL - FFT SP                          1042
rodinia: OpenCL Particle Filter                7.61
clpeak: Integer Compute INT                    8403
clpeak: Single-Precision Float                 8535
financebench: Monte-Carlo OpenCL               347
parboil: OpenCL TPACF                          1.01
darktable: Server Rack - OpenCL                0.15
darktable: Server Room - OpenCL                2.54
parboil: OpenCL BFS                            1.41
shoc: OpenCL - Bus Speed Readback              6.61
shoc: OpenCL - Triad                           6.50
shoc: OpenCL - Bus Speed Download              6.64
shoc: OpenCL - MD5 Hash                        18.87
clpeak: Global Memory Bandwidth                368
financebench: Black-Scholes OpenCL             10.40
clpeak: Kernel Latency                         5.56
askap: Degridding                              25819

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2015-11-10
GFLOPS, More Is Better
NVIDIA GeForce RTX 2070: 8758 (SE +/- 0.22, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

LuxMark

OpenCL Device: GPU - Scene: Hotel

LuxMark 3.1
Score, More Is Better
NVIDIA GeForce RTX 2070: 6345 (SE +/- 11.17, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Microphone

LuxMark 3.1
Score, More Is Better
NVIDIA GeForce RTX 2070: 18701 (SE +/- 8.35, N = 3)

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

LuxMark 3.1
Score, More Is Better
NVIDIA GeForce RTX 2070: 30181 (SE +/- 64.91, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: Inception v4, FP16

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 85.83 (SE +/- 0.15, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: Googlenet, FP16

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 642 (SE +/- 0.25, N = 3)

Chaos Group V-RAY

Mode: CUDA GPU

Chaos Group V-RAY 1.1.0
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 66.43 (SE +/- 0.10, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP32

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 182 (SE +/- 0.22, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP32

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 99.67 (SE +/- 0.09, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP16

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 138.17 (SE +/- 0.03, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP16

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 297 (SE +/- 0.52, N = 3)

Darktable

Test: Boat - Acceleration: OpenCL

Darktable 2.4.2
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 9.25 (SE +/- 0.38, N = 12)

clpeak

OpenCL Test: Double-Precision Double

GFLOPS, More Is Better
NVIDIA GeForce RTX 2070: 275.72 (SE +/- 0.16, N = 3)

Rodinia

Test: OpenCL Myocyte

Rodinia 2.4
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 37.21 (SE +/- 0.06, N = 3)
(CXX) g++ options: -O2 -lOpenCL

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP32

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 2109 (SE +/- 2.58, N = 3)

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP16

NVIDIA GPU Cloud TensorFlow 18.09
Images Per Second, More Is Better
NVIDIA GeForce RTX 2070: 2713 (SE +/- 2.30, N = 3)
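
The NVIDIA GPU Cloud TensorFlow results include three models measured at both FP32 and FP16 (ResNet-50, VGG-16, AlexNet). As a rough illustration, the sketch below computes the FP16-to-FP32 throughput ratio from the images-per-second figures reported in this result file. The helper name `fp16_speedup` and the dictionary layout are assumptions made for this example, and since the reported figures are rounded, the ratios are approximate.

```python
# Images per second reported in this result file for NVIDIA GPU Cloud
# TensorFlow 18.09 on the GeForce RTX 2070, as (FP32, FP16) pairs.
# Inception v4 and Googlenet are omitted: only FP16 results were recorded.
results = {
    "ResNet-50": (182.0, 297.0),
    "VGG-16": (99.67, 138.17),
    "AlexNet": (2109.0, 2713.0),
}


def fp16_speedup(fp32: float, fp16: float) -> float:
    """Return FP16 throughput as a multiple of FP32 throughput (hypothetical helper)."""
    return fp16 / fp32


speedups = {model: fp16_speedup(*pair) for model, pair in results.items()}

for model, ratio in speedups.items():
    print(f"{model}: {ratio:.2f}x")
```

A ratio above 1.0 means the FP16 run processed more images per second than the FP32 run of the same model.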

Darktable

Test: Masskrug - Acceleration: OpenCL

Darktable 2.4.2
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 5.52 (SE +/- 0.06, N = 12)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better
NVIDIA GeForce RTX 2070: 1091 (SE +/- 1.22, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

CUDA Mini-Nbody

Test: Original

CUDA Mini-Nbody 2015-11-10
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 17.79 (SE +/- 0.05, N = 3)

CUDA Mini-Nbody

Test: SOA Data Layout

CUDA Mini-Nbody 2015-11-10
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 17.52 (SE +/- 0.02, N = 3)

CUDA Mini-Nbody

Test: Flush Denormals To Zero

CUDA Mini-Nbody 2015-11-10
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 17.34 (SE +/- 0.03, N = 3)

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

GBPS, More Is Better
NVIDIA GeForce RTX 2070: 6.55 (SE +/- 0.00, N = 3)

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

GBPS, More Is Better
NVIDIA GeForce RTX 2070: 6.49 (SE +/- 0.00, N = 3)

ASKAP tConvolveCuda

Processing: Gridding

ASKAP tConvolveCuda 2015-11-10
Million Grid Points Per Second, More Is Better
NVIDIA GeForce RTX 2070: 13102 (SE +/- 211.30, N = 3)
(CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

Test: Loop Unrolling

CUDA Mini-Nbody 2015-11-10
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 14.16 (SE +/- 0.03, N = 3)

CUDA Mini-Nbody

Test: Cache Blocking

CUDA Mini-Nbody 2015-11-10
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 13.92 (SE +/- 0.00, N = 3)
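
The five CUDA Mini-Nbody variants report wall-clock seconds, so they can be compared directly. The sketch below expresses each variant as a speedup over the Original test, using only the times reported in this result file. The variable names are assumptions for this example, and nothing here implies how the variants relate to one another beyond their measured times.

```python
# Seconds reported in this result file for CUDA Mini-Nbody 2015-11-10
# on the GeForce RTX 2070 (fewer is better).
times = {
    "Original": 17.79,
    "SOA Data Layout": 17.52,
    "Flush Denormals To Zero": 17.34,
    "Loop Unrolling": 14.16,
    "Cache Blocking": 13.92,
}

baseline = times["Original"]

# Speedup over the Original test: baseline time divided by variant time.
speedup = {name: baseline / seconds for name, seconds in times.items()}

for name, ratio in speedup.items():
    print(f"{name}: {ratio:.3f}x vs Original")
```

Because lower times are better, a value above 1.0 means the variant finished faster than the Original test.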

Parboil

Test: OpenCL LBM

Parboil 2.5
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 8.88 (SE +/- 0.01, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2015-11-10
GFLOPS, More Is Better
NVIDIA GeForce RTX 2070: 1042 (SE +/- 11.37, N = 12)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

Rodinia

Test: OpenCL Particle Filter

Rodinia 2.4
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 7.61 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O2 -lOpenCL

clpeak

OpenCL Test: Integer Compute INT

GIOPS, More Is Better
NVIDIA GeForce RTX 2070: 8403 (SE +/- 111.04, N = 12)

clpeak

OpenCL Test: Single-Precision Float

GFLOPS, More Is Better
NVIDIA GeForce RTX 2070: 8535 (SE +/- 172.39, N = 11)

FinanceBench

Benchmark: Monte-Carlo OpenCL

FinanceBench 2016-06-06
ms, Fewer Is Better
NVIDIA GeForce RTX 2070: 347 (SE +/- 0.25, N = 3)
(CXX) g++ options: -O3 -lOpenCL

Parboil

Test: OpenCL TPACF

Parboil 2.5
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 1.01 (SE +/- 0.01, N = 12)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Darktable

Test: Server Rack - Acceleration: OpenCL

Darktable 2.4.2
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 0.15 (SE +/- 0.00, N = 12)

Darktable

Test: Server Room - Acceleration: OpenCL

Darktable 2.4.2
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 2.54 (SE +/- 0.03, N = 3)

Parboil

Test: OpenCL BFS

Parboil 2.5
Seconds, Fewer Is Better
NVIDIA GeForce RTX 2070: 1.41 (SE +/- 0.01, N = 3)
(CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better
NVIDIA GeForce RTX 2070: 6.61 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better
NVIDIA GeForce RTX 2070: 6.50 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2015-11-10
GB/s, More Is Better
NVIDIA GeForce RTX 2070: 6.64 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2015-11-10
GHash/s, More Is Better
NVIDIA GeForce RTX 2070: 18.87 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

clpeak

OpenCL Test: Global Memory Bandwidth

GBPS, More Is Better
NVIDIA GeForce RTX 2070: 368 (SE +/- 0.52, N = 3)

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-06-06
ms, Fewer Is Better
NVIDIA GeForce RTX 2070: 10.40 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -lOpenCL

clpeak

OpenCL Test: Kernel Latency

us, Fewer Is Better
NVIDIA GeForce RTX 2070: 5.56 (SE +/- 0.03, N = 3)

ASKAP tConvolveCuda

Processing: Degridding

ASKAP tConvolveCuda 2015-11-10
Million Grid Points Per Second, More Is Better
NVIDIA GeForce RTX 2070: 25819 (SE +/- 806.83, N = 3)
(CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl


Phoronix Test Suite v10.8.4