NVIDIA GeForce RTX 2070

Benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1810180-SK-NVIDIAGEF35
Result identifier: NVIDIA GeForce RTX 2070
Run date: October 18 2018
Test duration: 1 Hour, 53 Minutes


NVIDIA GeForce RTX 2070 Benchmarks

Processor: AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)
Motherboard: ASUS ROG ZENITH EXTREME (1402 BIOS)
Chipset: AMD Family 17h
Memory: 32768MB
Disk: Samsung SSD 970 EVO 500GB
Graphics: eVGA NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)
Audio: Realtek ALC1220
Monitor: ASUS VP28U
Network: Intel I211 Gigabit Connection
OS: Ubuntu 18.04
Kernel: 4.18.0-041800-generic (x86_64)
Desktop: GNOME Shell 3.28.3
Display Server: X Server 1.19.6
Display Driver: NVIDIA 410.66
OpenGL: 4.6.0
OpenCL: OpenCL 1.2 CUDA 10.0.175
Compiler: GCC 7.3.0 + CUDA 10.0
File-System: ext4
Screen Resolution: 3840x2160

System Logs

- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand
- GPU Compute Cores: 2304
- Python 2.7.15rc1 + Python 3.6.6
- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp

Test | NVIDIA GeForce RTX 2070
financebench: Monte-Carlo OpenCL | 347
financebench: Black-Scholes OpenCL | 10.40
shoc: OpenCL - Triad | 6.50
shoc: OpenCL - FFT SP | 1042
shoc: OpenCL - MD5 Hash | 18.87
shoc: OpenCL - Max SP Flops | 8758
shoc: OpenCL - Bus Speed Download | 6.64
shoc: OpenCL - Bus Speed Readback | 6.61
shoc: OpenCL - Texture Read Bandwidth | 1091
askap: Gridding | 13102
askap: Degridding | 25819
cuda-mini-nbody: Original | 17.79
cuda-mini-nbody: Cache Blocking | 13.92
cuda-mini-nbody: Loop Unrolling | 14.16
cuda-mini-nbody: SOA Data Layout | 17.52
cuda-mini-nbody: Flush Denormals To Zero | 17.34
ngc-tensorflow: VGG-16, FP16 | 138.17
ngc-tensorflow: VGG-16, FP32 | 99.67
ngc-tensorflow: AlexNet, FP16 | 2713
ngc-tensorflow: AlexNet, FP32 | 2109
ngc-tensorflow: Googlenet, FP16 | 642
ngc-tensorflow: ResNet-50, FP16 | 297
ngc-tensorflow: ResNet-50, FP32 | 182
ngc-tensorflow: Inception v4, FP16 | 85.83
parboil: OpenCL BFS | 1.41
parboil: OpenCL LBM | 8.88
parboil: OpenCL TPACF | 1.01
rodinia: OpenCL Myocyte | 37.21
rodinia: OpenCL Particle Filter | 7.61
darktable: Boat - OpenCL | 9.25
darktable: Masskrug - OpenCL | 5.52
darktable: Server Rack - OpenCL | 0.15
darktable: Server Room - OpenCL | 2.54
luxmark: GPU - Hotel | 6345
luxmark: GPU - Microphone | 18701
luxmark: GPU - Luxball HDR | 30181
clpeak: Kernel Latency | 5.56
clpeak: Integer Compute INT | 8403
clpeak: Single-Precision Float | 8535
clpeak: Double-Precision Double | 275.72
clpeak: Global Memory Bandwidth | 368
clpeak: Transfer Bandwidth enqueueReadBuffer | 6.49
clpeak: Transfer Bandwidth enqueueWriteBuffer | 6.55
v-ray: CUDA GPU | 66.43

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-06-06, Benchmark: Monte-Carlo OpenCL (ms, fewer is better)
NVIDIA GeForce RTX 2070: 347 (SE +/- 0.25, N = 3)

FinanceBench 2016-06-06, Benchmark: Black-Scholes OpenCL (ms, fewer is better)
NVIDIA GeForce RTX 2070: 10.40 (SE +/- 0.00, N = 3)

1. (CXX) g++ options: -O3 -lOpenCL

SHOC Scalable HeterOgeneous Computing

These are the CUDA and OpenCL versions of Vetter's Scalable HeterOgeneous Computing benchmark suite. Learn more via the OpenBenchmarking.org test page.

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Triad (GB/s, more is better)
NVIDIA GeForce RTX 2070: 6.50 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: FFT SP (GFLOPS, more is better)
NVIDIA GeForce RTX 2070: 1042 (SE +/- 11.37, N = 12)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: MD5 Hash (GHash/s, more is better)
NVIDIA GeForce RTX 2070: 18.87 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, more is better)
NVIDIA GeForce RTX 2070: 8758 (SE +/- 0.22, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Download (GB/s, more is better)
NVIDIA GeForce RTX 2070: 6.64 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Bus Speed Readback (GB/s, more is better)
NVIDIA GeForce RTX 2070: 6.61 (SE +/- 0.00, N = 3)

SHOC Scalable HeterOgeneous Computing 2015-11-10, Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s, more is better)
NVIDIA GeForce RTX 2070: 1091 (SE +/- 1.22, N = 3)

1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -pthread -lmpi_cxx -lmpi
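As a rough sanity check on the Max SP Flops figure, the sketch below compares it with a theoretical single-precision peak built from the core count (2304) and core clock (1410 MHz) listed in the system details. It assumes 2 floating-point operations per core per cycle (one fused multiply-add); that factor is an assumption and is not stated anywhere in this result file.

```python
# Values copied from this result file.
CUDA_CORES = 2304        # "GPU Compute Cores: 2304" from the system logs
BASE_CLOCK_MHZ = 1410    # core clock reported in the graphics entry
MEASURED_GFLOPS = 8758   # SHOC OpenCL Max SP Flops result

# Assumption (not stated in the result file): each core retires one fused
# multiply-add per cycle, counted as 2 floating-point operations.
FLOPS_PER_CORE_PER_CYCLE = 2


def peak_gflops(cores: int, clock_mhz: float) -> float:
    """Theoretical single-precision peak in GFLOPS at a given clock."""
    return cores * FLOPS_PER_CORE_PER_CYCLE * clock_mhz / 1000.0


def implied_clock_mhz(cores: int, gflops: float) -> float:
    """Clock needed to reach `gflops` if the card ran at theoretical peak."""
    return gflops * 1000.0 / (cores * FLOPS_PER_CORE_PER_CYCLE)


base_peak = peak_gflops(CUDA_CORES, BASE_CLOCK_MHZ)
clock = implied_clock_mhz(CUDA_CORES, MEASURED_GFLOPS)
print(f"Peak at {BASE_CLOCK_MHZ} MHz: {base_peak:.0f} GFLOPS")
print(f"Clock implied by {MEASURED_GFLOPS} GFLOPS: {clock:.0f} MHz")
```

Under that assumption the measured result is higher than the peak at the listed 1410 MHz clock, which would be consistent with the card running well above that clock during the test. The result file does not record the actual clock, so this remains an estimate.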

ASKAP tConvolveCuda

This is a CUDA benchmark from ATNF's ASKAP Benchmark, currently using the tConvolveCuda sub-test. Learn more via the OpenBenchmarking.org test page.

ASKAP tConvolveCuda 2015-11-10, Processing: Gridding (Million Grid Points Per Second, more is better)
NVIDIA GeForce RTX 2070: 13102 (SE +/- 211.30, N = 3)

ASKAP tConvolveCuda 2015-11-10, Processing: Degridding (Million Grid Points Per Second, more is better)
NVIDIA GeForce RTX 2070: 25819 (SE +/- 806.83, N = 3)

1. (CXX) g++ options: -fPIC -O3 -m64 -std=c++14 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

CUDA Mini-Nbody

CUDA Mini-Nbody 2015-11-10, Test: Original (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 17.79 (SE +/- 0.05, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Cache Blocking (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 13.92 (SE +/- 0.00, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Loop Unrolling (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 14.16 (SE +/- 0.03, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: SOA Data Layout (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 17.52 (SE +/- 0.02, N = 3)

CUDA Mini-Nbody 2015-11-10, Test: Flush Denormals To Zero (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 17.34 (SE +/- 0.03, N = 3)
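The five Mini-Nbody run times are easier to compare as speedups over the Original variant. The sketch below does only that arithmetic on the numbers reported above; it makes no claim about how the variants relate to each other in the benchmark's source.

```python
# Run times in seconds, copied from the CUDA Mini-Nbody results above.
times = {
    "Original": 17.79,
    "Cache Blocking": 13.92,
    "Loop Unrolling": 14.16,
    "SOA Data Layout": 17.52,
    "Flush Denormals To Zero": 17.34,
}

# Speedup is baseline time divided by variant time (higher means faster).
baseline = times["Original"]
speedups = {name: baseline / t for name, t in times.items()}

for name, s in speedups.items():
    print(f"{name:<24} {times[name]:>6.2f} s  {s:.2f}x")
```

On these numbers Cache Blocking and Loop Unrolling are roughly a quarter faster than Original, while the other two variants land within a few percent of it.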

NVIDIA GPU Cloud TensorFlow

This test profile uses the NVIDIA GPU Cloud (NGC/nvcr.io) for running the TensorFlow image inside Docker for benchmarking. You must have already signed into NGC for this test profile to work. Learn more via the OpenBenchmarking.org test page.

NVIDIA GPU Cloud TensorFlow 18.09, Test: VGG-16, FP16 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 138.17 (SE +/- 0.03, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: VGG-16, FP32 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 99.67 (SE +/- 0.09, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: AlexNet, FP16 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 2713 (SE +/- 2.30, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: AlexNet, FP32 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 2109 (SE +/- 2.58, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: Googlenet, FP16 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 642 (SE +/- 0.25, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: ResNet-50, FP16 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 297 (SE +/- 0.52, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: ResNet-50, FP32 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 182 (SE +/- 0.22, N = 3)

NVIDIA GPU Cloud TensorFlow 18.09, Test: Inception v4, FP16 (Images Per Second, more is better)
NVIDIA GeForce RTX 2070: 85.83 (SE +/- 0.15, N = 3)
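Three of the models above were run at both precisions, so the FP16 gain can be read off as a throughput ratio. The sketch below computes it from the reported images-per-second figures; Googlenet and Inception v4 are left out because this result file only has FP16 numbers for them.

```python
# Images per second, copied from the NGC TensorFlow results above.
fp16 = {"VGG-16": 138.17, "AlexNet": 2713, "ResNet-50": 297}
fp32 = {"VGG-16": 99.67, "AlexNet": 2109, "ResNet-50": 182}

# Ratio above 1.0 means the FP16 run processed more images per second.
ratios = {model: fp16[model] / fp32[model] for model in fp32}

for model, ratio in ratios.items():
    print(f"{model:<10} FP16/FP32 = {ratio:.2f}x")
```

The ratios range from about 1.29x (AlexNet) to about 1.63x (ResNet-50) on this system.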

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Parboil 2.5, Test: OpenCL BFS (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 1.41 (SE +/- 0.01, N = 3)

Parboil 2.5, Test: OpenCL LBM (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 8.88 (SE +/- 0.01, N = 3)

Parboil 2.5, Test: OpenCL TPACF (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 1.01 (SE +/- 0.01, N = 12)

1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 2.4, Test: OpenCL Myocyte (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 37.21 (SE +/- 0.06, N = 3)

Rodinia 2.4, Test: OpenCL Particle Filter (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 7.61 (SE +/- 0.03, N = 3)

1. (CXX) g++ options: -O2 -lOpenCL

Darktable

Darktable is an open-source photography / workflow application. This test profile uses any system-installed Darktable program or, on Windows, automatically downloads the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.

Darktable 2.4.2, Test: Boat - Acceleration: OpenCL (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 9.25 (SE +/- 0.38, N = 12)

Darktable 2.4.2, Test: Masskrug - Acceleration: OpenCL (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 5.52 (SE +/- 0.06, N = 12)

Darktable 2.4.2, Test: Server Rack - Acceleration: OpenCL (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 0.15 (SE +/- 0.00, N = 12)

Darktable 2.4.2, Test: Server Room - Acceleration: OpenCL (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 2.54 (SE +/- 0.03, N = 3)

LuxMark

LuxMark is a multi-platform OpenCL benchmark using LuxRender. LuxMark supports targeting different OpenCL devices and has multiple scenes available for rendering. LuxMark is a fully open-source OpenCL program with real-world rendering examples. Learn more via the OpenBenchmarking.org test page.

LuxMark 3.1, OpenCL Device: GPU - Scene: Hotel (score, more is better)
NVIDIA GeForce RTX 2070: 6345 (SE +/- 11.17, N = 3)

LuxMark 3.1, OpenCL Device: GPU - Scene: Microphone (score, more is better)
NVIDIA GeForce RTX 2070: 18701 (SE +/- 8.35, N = 3)

LuxMark 3.1, OpenCL Device: GPU - Scene: Luxball HDR (score, more is better)
NVIDIA GeForce RTX 2070: 30181 (SE +/- 64.91, N = 3)

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

clpeak, OpenCL Test: Kernel Latency (us, fewer is better)
NVIDIA GeForce RTX 2070: 5.56 (SE +/- 0.03, N = 3)

clpeak, OpenCL Test: Integer Compute INT (GIOPS, more is better)
NVIDIA GeForce RTX 2070: 8403 (SE +/- 111.04, N = 12)

clpeak, OpenCL Test: Single-Precision Float (GFLOPS, more is better)
NVIDIA GeForce RTX 2070: 8535 (SE +/- 172.39, N = 11)

clpeak, OpenCL Test: Double-Precision Double (GFLOPS, more is better)
NVIDIA GeForce RTX 2070: 275.72 (SE +/- 0.16, N = 3)

clpeak, OpenCL Test: Global Memory Bandwidth (GBPS, more is better)
NVIDIA GeForce RTX 2070: 368 (SE +/- 0.52, N = 3)

clpeak, OpenCL Test: Transfer Bandwidth enqueueReadBuffer (GBPS, more is better)
NVIDIA GeForce RTX 2070: 6.49 (SE +/- 0.00, N = 3)

clpeak, OpenCL Test: Transfer Bandwidth enqueueWriteBuffer (GBPS, more is better)
NVIDIA GeForce RTX 2070: 6.55 (SE +/- 0.00, N = 3)
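The "SE +/-" values are easier to judge relative to the result they belong to. The sketch below expresses the reported SE as a percentage of the mean for several clpeak results, and estimates the run-to-run standard deviation. The second step assumes SE is the standard error of the mean (standard deviation divided by the square root of N), which this page does not spell out.

```python
import math

# (mean, reported SE, N) copied from the clpeak results above.
results = {
    "Kernel Latency": (5.56, 0.03, 3),
    "Integer Compute INT": (8403, 111.04, 12),
    "Single-Precision Float": (8535, 172.39, 11),
    "Double-Precision Double": (275.72, 0.16, 3),
    "Global Memory Bandwidth": (368, 0.52, 3),
}


def relative_se_percent(mean: float, se: float) -> float:
    """Reported SE as a percentage of the mean."""
    return 100.0 * se / mean


def implied_stddev(se: float, n: int) -> float:
    """Run-to-run standard deviation, assuming SE = stddev / sqrt(N)."""
    return se * math.sqrt(n)


rel = {name: relative_se_percent(m, se) for name, (m, se, n) in results.items()}
sd = {name: implied_stddev(se, n) for name, (m, se, n) in results.items()}

for name in results:
    print(f"{name:<26} SE = {rel[name]:.2f}% of mean, stddev ~ {sd[name]:.2f}")
```

The two compute tests with the largest run counts (N = 12 and N = 11) are also the ones with the largest relative SE, between 1% and about 2%; the rest stay below 1%.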

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 1.1.0, Mode: CUDA GPU (seconds, fewer is better)
NVIDIA GeForce RTX 2070: 66.43 (SE +/- 0.10, N = 3)