NVIDIA CUDA OpenCL Compute Tests Pre-Ampere

NVIDIA GeForce compute benchmarks of GTX 1000 and RTX 2000 series. Benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2008319-FI-NVIDIACOM78
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GTX 1070
August 30 2020
  2 Hours, 17 Minutes
GTX 1080
August 29 2020
  2 Hours, 24 Minutes
GTX 1650 SUPER
August 30 2020
  2 Hours, 53 Minutes
GTX 1660
August 30 2020
  2 Hours, 22 Minutes
GTX 1660 SUPER
August 29 2020
  2 Hours, 15 Minutes
RTX 2060
August 27 2020
  4 Hours
RTX 2060 SUPER
August 28 2020
  3 Hours, 54 Minutes
RTX 2070
August 28 2020
  3 Hours, 59 Minutes
RTX 2070 SUPER
August 28 2020
  2 Hours, 30 Minutes
RTX 2080
August 29 2020
  2 Hours, 27 Minutes
RTX 2080 SUPER
August 28 2020
  2 Hours, 21 Minutes
RTX 2080 Ti
August 28 2020
  2 Hours, 14 Minutes
TITAN RTX
August 27 2020
  2 Hours, 49 Minutes
Invert Behavior (Only Show Selected Data)
  2 Hours, 48 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA CUDA OpenCL Compute Tests Pre-AmpereOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)ASUS NVIDIA GeForce GTX 1650 SUPER 4GB (1530/6000MHz)ASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)NVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (1410/7000MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TITAN RTX 24GB (1350/7000MHz)NVIDIA GP104 HD AudioNVIDIA TU116 HD AudioNVIDIA TU106 HD AudioNVIDIA TU104 HD AudioNVIDIA TU102 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-42-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160ProcessorMotherboardChipsetMemoryDiskGraphicsAudiosMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionNVIDIA CUDA OpenCL Compute Tests Pre-Ampere BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013- GTX 1070: GPU Compute Cores: 1920- GTX 1080: GPU Compute Cores: 2560- GTX 1650 SUPER: GPU Compute Cores: 1280- GTX 1660: GPU Compute Cores: 1408- GTX 1660 SUPER: GPU Compute Cores: 1408- RTX 2060: GPU Compute Cores: 1920- RTX 2060 SUPER: GPU Compute Cores: 2176- RTX 2070: GPU Compute Cores: 2304- RTX 2070 SUPER: GPU Compute Cores: 2560- RTX 2080: GPU Compute Cores: 2944- RTX 2080 SUPER: GPU Compute Cores: 3072- RTX 2080 Ti: GPU Compute Cores: 4352- TITAN RTX: GPU Compute Cores: 4608- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - TITAN RTX: Python 3.8.2

GTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTXResult OverviewPhoronix Test Suite100%163%226%288%351%OctaneBenchclpeakLuxCoreRender OpenCLBlenderRodiniaArrayFireNAMD CUDADarmstadt Automotive Parallel Heterogeneous Suite

NVIDIA CUDA OpenCL Compute Tests Pre-Ampereblender: Barbershop - NVIDIA OptiXblender: Barbershop - CUDAblender: Pabellon Barcelona - CUDAblender: Classroom - CUDAblender: Pabellon Barcelona - NVIDIA OptiXoctanebench: Total Scoreblender: Fishy Cat - CUDAblender: Classroom - NVIDIA OptiXluxcorerender-cl: Foodluxcorerender-cl: LuxCore Benchmarkluxcorerender-cl: DLSCblender: BMW27 - CUDAblender: Fishy Cat - NVIDIA OptiXblender: BMW27 - NVIDIA OptiXdaphne: OpenCL - NDT Mappingclpeak: Double-Precision Doubledaphne: NVIDIA CUDA - NDT Mappingnamd-cuda: ATPase Simulation - 327,506 Atomsrodinia: OpenCL Particle Filterclpeak: Single-Precision Floatclpeak: Integer Compute INTarrayfire: Conjugate Gradient OpenCLclpeak: Global Memory BandwidthGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX991.22666.87327.21132.918217208.790.912.142.46104.22349.28223.960.230138.2786223.211670.534.075197.35765.18556.32256.83148.579893173.560.962.082.4085.82351.67297.590.203626.5838114.572409.963.386223.031201.48951.26415.2892.509207251.450.811.361.62135.98312.38158.07560.440.2456712.5804107.404222.584.445156.911083.24673.64319.05119.405787175.431.002.202.5690.98317.14167.30571.120.2291911.9924561.344422.504.540158.251039.36630.86303.12136.619341161.391.072.272.6484.82320.58173.47570.140.2204511.4884605.334748.952.643276.621758.681038.23565.38287.55211.07165.87791133.75145.011.192.813.3472.3267.5137.90330.31231.69578.190.193088.8355296.985279.952.653276.421736.001013.67530.35282.77195.62206.370933117.51139.841.303.564.2061.7560.3431.22337.40262.17591.340.188637.9317047.526903.902.083368.731785.391042.63541.45291.51199.91208.175753119.30143.481.303.524.1862.7760.7331.44343.97267.78594.850.187347.7877221.347047.122.098369.07990.71552.03346.63162.41137.37222.37487490.2788.571.893.714.4649.7646.8427.03341.15308.88612.400.183856.8618597.678493.382.071369.70963.51550.41345.00160.87137.52223.95792490.2685.661.873.594.3150.4244.5627.73348.74344.26611.080.182126.2188852.629030.212.080369.26905.99522.49336.59153.35131.93233.8025388.9181.161.943.634.3749.6542.1527.43352.15376.35604.180.181015.76910336.7610369.131.903406.14890.52516.75277.75143.60105.42311.11795368.6072.252.204.755.6737.5833.9820.36353.79522.05625.350.178484.42313408.5313256.971.676506.53898.32520.40273.74143.94104.08324.96206366.3672.322.204.935.8735.9633.1623.29349.64544.30625.620.179484.29514044.5913207.171.654530.38OpenBenchmarking.org

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: NVIDIA OptiXRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX400800120016002000SE +/- 0.59, N = 3SE +/- 1.45, N = 3SE +/- 1.45, N = 3SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.61, N = 3SE +/- 1.03, N = 3SE +/- 1.39, N = 31758.681736.001785.39990.71963.51905.99890.52898.32

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Barbershop - Compute: CUDAGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX30060090012001500SE +/- 0.10, N = 3SE +/- 0.23, N = 3SE +/- 0.25, N = 3SE +/- 0.05, N = 3SE +/- 2.44, N = 3SE +/- 0.18, N = 3SE +/- 0.27, N = 3SE +/- 0.29, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3991.22765.181201.481083.241039.361038.231013.671042.63552.03550.41522.49516.75520.40

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: CUDAGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX2004006008001000SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.73, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.02, N = 3SE +/- 0.39, N = 3SE +/- 0.23, N = 3666.87556.32951.26673.64630.86565.38530.35541.45346.63345.00336.59277.75273.74

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: CUDAGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX90180270360450SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.49, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 3SE +/- 0.37, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 3327.21256.83415.28319.05303.12287.55282.77291.51162.41160.87153.35143.60143.94

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX50100150200250SE +/- 0.20, N = 3SE +/- 0.41, N = 3SE +/- 0.40, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3211.07195.62199.91137.37137.52131.93105.42104.08

OctaneBench

OctaneBench is a test of the OctaneRender on the GPU and requires the use of NVIDIA CUDA. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterOctaneBench 4.00cTotal ScoreGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX70140210280350132.92148.5892.51119.41136.62165.88206.37208.18222.37223.96233.80311.12324.96

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: CUDAGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX50100150200250SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.39, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.25, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3208.79173.56251.45175.43161.39133.75117.51119.3090.2790.2688.9168.6066.36

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Classroom - Compute: NVIDIA OptiXRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX306090120150SE +/- 0.15, N = 3SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.08, N = 3SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.47, N = 3145.01139.84143.4888.5785.6681.1672.2572.32

LuxCoreRender OpenCL

LuxCoreRender is an open-source physically based renderer. This test profile is focused on running LuxCoreRender on OpenCL accelerators/GPUs. The alternative luxcorerender test profile is for CPU execution due to a difference in tests, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: FoodGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX0.4950.991.4851.982.475SE +/- 0.01, N = 3SE +/- 0.03, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 120.910.960.811.001.071.191.301.301.891.871.942.202.20MIN: 0.23 / MAX: 1.06MIN: 0.13 / MAX: 1.15MIN: 0.24 / MAX: 0.93MIN: 0.24 / MAX: 1.16MIN: 0.25 / MAX: 1.26MIN: 0.25 / MAX: 1.39MIN: 0.26 / MAX: 1.53MIN: 0.29 / MAX: 1.5MIN: 0.29 / MAX: 2.24MIN: 0.25 / MAX: 2.24MIN: 0.25 / MAX: 2.32MIN: 0.29 / MAX: 2.63MIN: 0.13 / MAX: 2.7

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: LuxCore BenchmarkGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX1.10932.21863.32794.43725.5465SE +/- 0.00, N = 3SE +/- 0.04, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.10, N = 122.142.081.362.202.272.813.563.523.713.593.634.754.93MIN: 0.32 / MAX: 2.39MIN: 0.14 / MAX: 2.4MIN: 0.27 / MAX: 1.51MIN: 0.32 / MAX: 2.48MIN: 0.27 / MAX: 2.55MIN: 0.32 / MAX: 3.18MIN: 0.27 / MAX: 4.03MIN: 0.38 / MAX: 4MIN: 0.27 / MAX: 4.24MIN: 0.27 / MAX: 4.08MIN: 0.27 / MAX: 4.15MIN: 0.32 / MAX: 5.38MIN: 0.17 / MAX: 5.74

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender OpenCL 2.3Scene: DLSCGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX1.32082.64163.96245.28326.604SE +/- 0.00, N = 3SE +/- 0.05, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 122.462.401.622.562.643.344.204.184.464.314.375.675.87MIN: 2.3 / MAX: 2.51MIN: 0.66 / MAX: 2.49MIN: 1.43 / MAX: 1.65MIN: 2.31 / MAX: 2.63MIN: 2.53 / MAX: 2.71MIN: 3.15 / MAX: 3.41MIN: 4.11 / MAX: 4.37MIN: 4.09 / MAX: 4.28MIN: 4.11 / MAX: 4.56MIN: 4.11 / MAX: 4.39MIN: 4.11 / MAX: 4.5MIN: 3.79 / MAX: 5.74MIN: 1.56 / MAX: 6.11

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: CUDAGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX306090120150SE +/- 0.23, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.22, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3104.2285.82135.9890.9884.8272.3261.7562.7749.7650.4249.6537.5835.96

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX1530456075SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 367.5160.3460.7346.8444.5642.1533.9833.16

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.82Blend File: BMW27 - Compute: NVIDIA OptiXRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX918273645SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 3.53, N = 1537.9031.2231.4427.0327.7327.4320.3623.29

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenCL - Kernel: NDT MappingGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX80160240320400SE +/- 2.39, N = 3SE +/- 0.47, N = 3SE +/- 0.11, N = 3SE +/- 0.28, N = 3SE +/- 2.04, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 1.78, N = 3SE +/- 0.22, N = 3SE +/- 1.02, N = 3SE +/- 0.12, N = 3SE +/- 0.42, N = 3SE +/- 1.50, N = 3349.28351.67312.38317.14320.58330.31337.40343.97341.15348.74352.15353.79349.641. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Double-Precision DoubleGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX120240360480600SE +/- 0.94, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.36, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.67, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 1.15, N = 3SE +/- 1.45, N = 3223.96297.59158.07167.30173.47231.69262.17267.78308.88344.26376.35522.05544.301. (CXX) g++ options: -O3 -rdynamic -lOpenCL

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: NVIDIA CUDA - Kernel: NDT MappingGTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX140280420560700SE +/- 0.75, N = 3SE +/- 0.87, N = 3SE +/- 1.18, N = 3SE +/- 1.50, N = 3SE +/- 1.15, N = 3SE +/- 1.38, N = 3SE +/- 1.48, N = 3SE +/- 0.67, N = 3SE +/- 1.57, N = 3SE +/- 2.17, N = 3SE +/- 2.67, N = 3560.44571.12570.14578.19591.34594.85612.40611.08604.18625.35625.621. (CXX) g++ options: -O3 -m64 -std=c++11 -lcudadevrt -lcudart_static -lrt -lpthread -ldl

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX0.05530.11060.16590.22120.2765SE +/- 0.00394, N = 3SE +/- 0.00124, N = 3SE +/- 0.00031, N = 3SE +/- 0.00052, N = 3SE +/- 0.00036, N = 3SE +/- 0.00034, N = 3SE +/- 0.00047, N = 3SE +/- 0.00023, N = 3SE +/- 0.00031, N = 3SE +/- 0.00006, N = 3SE +/- 0.00010, N = 3SE +/- 0.00006, N = 3SE +/- 0.00081, N = 30.230130.203620.245670.229190.220450.193080.188630.187340.183850.182120.181010.178480.17948

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX3691215SE +/- 0.018, N = 3SE +/- 0.079, N = 3SE +/- 0.013, N = 3SE +/- 0.023, N = 3SE +/- 0.025, N = 3SE +/- 0.020, N = 3SE +/- 0.004, N = 3SE +/- 0.017, N = 3SE +/- 0.005, N = 3SE +/- 0.012, N = 3SE +/- 0.006, N = 3SE +/- 0.006, N = 3SE +/- 0.064, N = 38.2786.58312.58011.99211.4888.8357.9317.7876.8616.2185.7694.4234.2951. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterclpeakOpenCL Test: Single-Precision FloatGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX3K6K9K12K15KSE +/- 0.24, N = 3SE +/- 112.13, N = 15SE +/- 55.18, N = 15SE +/- 46.17, N = 15SE +/- 17.25, N = 3SE +/- 83.16, N = 3SE +/- 91.57, N = 5SE +/- 96.05, N = 15SE +/- 77.32, N = 15SE +/- 10.28, N = 3SE +/- 101.58, N = 9SE +/- 163.05, N = 15SE +/- 194.88, N = 156223.218114.574107.404561.344605.335296.987047.527221.348597.678852.6210336.7613408.5314044.591. (CXX) g++ options: -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGIOPS, More Is BetterclpeakOpenCL Test: Integer Compute INTGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX3K6K9K12K15KSE +/- 21.96, N = 3SE +/- 15.07, N = 3SE +/- 52.82, N = 15SE +/- 53.01, N = 3SE +/- 26.64, N = 3SE +/- 49.62, N = 3SE +/- 70.16, N = 15SE +/- 70.45, N = 15SE +/- 75.60, N = 15SE +/- 32.74, N = 3SE +/- 146.75, N = 4SE +/- 130.38, N = 15SE +/- 112.69, N = 31670.532409.964222.584422.504748.955279.956903.907047.128493.389030.2110369.1313256.9713207.171. (CXX) g++ options: -O3 -rdynamic -lOpenCL

ArrayFire

ArrayFire is an GPU and CPU numeric processing library, this test uses the built-in CPU and OpenCL ArrayFire benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX1.02152.0433.06454.0865.1075SE +/- 0.002, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.007, N = 34.0753.3864.4454.5402.6432.6532.0832.0982.0712.0801.9031.6761.6541. (CXX) g++ options: -rdynamic

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is BetterclpeakOpenCL Test: Global Memory BandwidthGTX 1070GTX 1080GTX 1650 SUPERGTX 1660GTX 1660 SUPERRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiTITAN RTX110220330440550SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.20, N = 3SE +/- 0.23, N = 3SE +/- 0.10, N = 3SE +/- 1.07, N = 3SE +/- 0.12, N = 3197.35223.03156.91158.25276.62276.42368.73369.07369.70369.26406.14506.53530.381. (CXX) g++ options: -O3 -rdynamic -lOpenCL