NVIDIA GeForce RTX 2070

Benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1810203-SKEE-181018060.

NVIDIA GeForce RTX 2070ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionNVIDIA GeForce RTX 2070ZotacRTX2080AmpAMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1402 BIOS)AMD Family 17h32768MBSamsung SSD 970 EVO 500GBeVGA NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)Realtek ALC1220ASUS VP28UIntel I211 Gigabit ConnectionUbuntu 18.044.18.0-041800-generic (x86_64)GNOME Shell 3.28.3X Server 1.19.6NVIDIA 410.664.6.0OpenCL 1.2 CUDA 10.0.175GCC 7.3.0 + CUDA 10.0ext43840x2160Intel Core i7-8086K @ 5.10GHz (6 Cores / 12 Threads)ASRock Z370 Extreme4 (P3.10 BIOS)Intel 8th Gen Core16384MB4001GB Western Digital WD40EMRX-82U + 240GB Force MP300 + 1000GB Samsung SSD 970 EVO 1TBZotac NVIDIA GeForce RTX 2080 8192MB (1515/8000MHz)VX2439wmIntel ConnectionLinuxMint 194.19.0-999-lowlatency (x86_64)Cinnamon 3.8.9GCC 8.2.01920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- NVIDIA GeForce RTX 2070: Scaling Governor: acpi-cpufreq ondemand- ZotacRTX2080Amp: Scaling Governor: intel_pstate performanceOpenCL Details- NVIDIA GeForce RTX 2070: GPU Compute Cores: 2304- ZotacRTX2080Amp: GPU Compute Cores: 2944Python Details- Python 2.7.15rc1 + Python 3.6.6Security Details- NVIDIA GeForce RTX 2070: __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp- ZotacRTX2080Amp: KPTI + __user pointer sanitization + Full generic retpoline IBPB IBRS_FW + SSB disabled via prctl and seccomp + PTE Inversion; VMX: conditional cache flushes SMT vulnerable

NVIDIA GeForce RTX 2070financebench: Monte-Carlo OpenCLfinancebench: Black-Scholes OpenCLshoc: OpenCL - Triadshoc: OpenCL - FFT SPshoc: OpenCL - MD5 Hashshoc: OpenCL - Max SP Flopsshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthaskap: Griddingaskap: Degriddingcuda-mini-nbody: Originalcuda-mini-nbody: Cache Blockingcuda-mini-nbody: Loop Unrollingcuda-mini-nbody: SOA Data Layoutcuda-mini-nbody: Flush Denormals To Zerongc-tensorflow: VGG-16, FP16ngc-tensorflow: VGG-16, FP32ngc-tensorflow: AlexNet, FP16ngc-tensorflow: AlexNet, FP32ngc-tensorflow: Googlenet, FP16ngc-tensorflow: ResNet-50, FP16ngc-tensorflow: ResNet-50, FP32ngc-tensorflow: Inception v4, FP16parboil: OpenCL BFSparboil: OpenCL LBMparboil: OpenCL TPACFrodinia: OpenCL Myocyterodinia: OpenCL Particle Filterdarktable: Boat - OpenCLdarktable: Masskrug - OpenCLdarktable: Server Rack - OpenCLdarktable: Server Room - OpenCLluxmark: GPU - Hotelluxmark: GPU - Microphoneluxmark: GPU - Luxball HDRclpeak: Kernel Latencyclpeak: Integer Compute INTclpeak: Single-Precision Floatclpeak: Double-Precision Doubleclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueReadBufferclpeak: Transfer Bandwidth enqueueWriteBufferv-ray: CUDA GPUNVIDIA GeForce RTX 2070ZotacRTX2080Amp34710.406.50104218.8787586.646.611091131022581917.7913.9214.1617.5217.34138.1799.672713210964229718285.831.418.881.0137.217.619.255.520.152.54634518701301815.5684038535275.723686.496.5566.433367.6212.52123026.4212.9612.80122530.345.651.813.890.110.777639236783446660.35OpenBenchmarking.org

FinanceBench

Benchmark: Monte-Carlo OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Monte-Carlo OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp80160240320400SE +/- 0.25, N = 3SE +/- 0.52, N = 33473361. (CXX) g++ options: -O3 -lOpenCL

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-06-06Benchmark: Black-Scholes OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 310.407.621. (CXX) g++ options: -O3 -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: TriadNVIDIA GeForce RTX 2070ZotacRTX2080Amp3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 36.5012.521. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: FFT SPNVIDIA GeForce RTX 2070ZotacRTX2080Amp30060090012001500SE +/- 11.37, N = 12SE +/- 9.23, N = 3104212301. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: MD5 HashNVIDIA GeForce RTX 2070ZotacRTX2080Amp612182430SE +/- 0.00, N = 3SE +/- 0.02, N = 318.8726.421. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Max SP FlopsNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 0.22, N = 387581. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed DownloadNVIDIA GeForce RTX 2070ZotacRTX2080Amp3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 36.6412.961. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Bus Speed ReadbackNVIDIA GeForce RTX 2070ZotacRTX2080Amp3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 36.6112.801. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2015-11-10Target: OpenCL - Benchmark: Texture Read BandwidthNVIDIA GeForce RTX 2070ZotacRTX2080Amp30060090012001500SE +/- 1.22, N = 3SE +/- 3.01, N = 3109112251. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi

ASKAP tConvolveCuda

Processing: Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: GriddingNVIDIA GeForce RTX 20703K6K9K12K15KSE +/- 211.30, N = 3131021. (CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

ASKAP tConvolveCuda

Processing: Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP tConvolveCuda 2015-11-10Processing: DegriddingNVIDIA GeForce RTX 20706K12K18K24K30KSE +/- 806.83, N = 3258191. (CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

Test: Original

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: OriginalNVIDIA GeForce RTX 207048121620SE +/- 0.05, N = 317.79

CUDA Mini-Nbody

Test: Cache Blocking

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Cache BlockingNVIDIA GeForce RTX 207048121620SE +/- 0.00, N = 313.92

CUDA Mini-Nbody

Test: Loop Unrolling

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Loop UnrollingNVIDIA GeForce RTX 207048121620SE +/- 0.03, N = 314.16

CUDA Mini-Nbody

Test: SOA Data Layout

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: SOA Data LayoutNVIDIA GeForce RTX 207048121620SE +/- 0.02, N = 317.52

CUDA Mini-Nbody

Test: Flush Denormals To Zero

OpenBenchmarking.orgSeconds, Fewer Is BetterCUDA Mini-Nbody 2015-11-10Test: Flush Denormals To ZeroNVIDIA GeForce RTX 207048121620SE +/- 0.03, N = 317.34

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: VGG-16, FP16NVIDIA GeForce RTX 2070306090120150SE +/- 0.03, N = 3138.17

NVIDIA GPU Cloud TensorFlow

Test: VGG-16, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: VGG-16, FP32NVIDIA GeForce RTX 207020406080100SE +/- 0.09, N = 399.67

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: AlexNet, FP16NVIDIA GeForce RTX 20706001200180024003000SE +/- 2.30, N = 32713

NVIDIA GPU Cloud TensorFlow

Test: AlexNet, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: AlexNet, FP32NVIDIA GeForce RTX 20705001000150020002500SE +/- 2.58, N = 32109

NVIDIA GPU Cloud TensorFlow

Test: Googlenet, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: Googlenet, FP16NVIDIA GeForce RTX 2070140280420560700SE +/- 0.25, N = 3642

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: ResNet-50, FP16NVIDIA GeForce RTX 207060120180240300SE +/- 0.52, N = 3297

NVIDIA GPU Cloud TensorFlow

Test: ResNet-50, FP32

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: ResNet-50, FP32NVIDIA GeForce RTX 20704080120160200SE +/- 0.22, N = 3182

NVIDIA GPU Cloud TensorFlow

Test: Inception v4, FP16

OpenBenchmarking.orgImages Per Second, More Is BetterNVIDIA GPU Cloud TensorFlow 18.09Test: Inception v4, FP16NVIDIA GeForce RTX 207020406080100SE +/- 0.15, N = 385.83

Parboil

Test: OpenCL BFS

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL BFSNVIDIA GeForce RTX 20700.31730.63460.95191.26921.5865SE +/- 0.01, N = 31.411. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL LBMNVIDIA GeForce RTX 2070246810SE +/- 0.01, N = 38.881. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenCL TPACF

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenCL TPACFNVIDIA GeForce RTX 20700.22730.45460.68190.90921.1365SE +/- 0.01, N = 121.011. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Test: OpenCL Myocyte

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL MyocyteNVIDIA GeForce RTX 2070ZotacRTX2080Amp918273645SE +/- 0.06, N = 3SE +/- 0.12, N = 337.2130.341. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenCL Particle FilterNVIDIA GeForce RTX 2070ZotacRTX2080Amp246810SE +/- 0.03, N = 3SE +/- 0.07, N = 37.615.651. (CXX) g++ options: -O2 -lOpenCL

Darktable

Test: Boat - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp3691215SE +/- 0.38, N = 12SE +/- 0.01, N = 39.251.81

Darktable

Test: Masskrug - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp1.2422.4843.7264.9686.21SE +/- 0.06, N = 12SE +/- 0.00, N = 35.523.89

Darktable

Test: Server Rack - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp0.03380.06760.10140.13520.169SE +/- 0.00, N = 12SE +/- 0.00, N = 30.150.11

Darktable

Test: Server Room - Acceleration: OpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: OpenCLNVIDIA GeForce RTX 2070ZotacRTX2080Amp0.57151.1431.71452.2862.8575SE +/- 0.03, N = 3SE +/- 0.00, N = 32.540.77

LuxMark

OpenCL Device: GPU - Scene: Hotel

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: HotelNVIDIA GeForce RTX 2070ZotacRTX2080Amp16003200480064008000SE +/- 11.17, N = 3SE +/- 37.30, N = 363457639

LuxMark

OpenCL Device: GPU - Scene: Microphone

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: MicrophoneNVIDIA GeForce RTX 2070ZotacRTX2080Amp5K10K15K20K25KSE +/- 8.35, N = 3SE +/- 32.83, N = 31870123678

LuxMark

OpenCL Device: GPU - Scene: Luxball HDR

OpenBenchmarking.orgScore, More Is BetterLuxMark 3.1OpenCL Device: GPU - Scene: Luxball HDRNVIDIA GeForce RTX 2070ZotacRTX2080Amp7K14K21K28K35KSE +/- 64.91, N = 3SE +/- 106.21, N = 33018134466

clpeak

OpenCL Test: Kernel Latency

OpenBenchmarking.orgus, Fewer Is BetterclpeakOpenCL Test: Kernel LatencyNVIDIA GeForce RTX 20701.2512.5023.7535.0046.255SE +/- 0.03, N = 35.56

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 111.04, N = 128403

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatNVIDIA GeForce RTX 20702K4K6K8K10KSE +/- 172.39, N = 118535

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleNVIDIA GeForce RTX 207060120180240300SE +/- 0.16, N = 3275.72

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthNVIDIA GeForce RTX 207080160240320400SE +/- 0.52, N = 3368

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueReadBufferNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.49

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Transfer Bandwidth enqueueWriteBufferNVIDIA GeForce RTX 2070246810SE +/- 0.00, N = 36.55

Chaos Group V-RAY

Mode: CUDA GPU

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CUDA GPUNVIDIA GeForce RTX 2070ZotacRTX2080Amp1530456075SE +/- 0.10, N = 3SE +/- 0.01, N = 366.4360.35


Phoronix Test Suite v10.8.4