NVIDIA RTX 40 Series Compute

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2401305-NE-NVIDIACOM43
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 2060 SUPER
January 23
  1 Hour, 58 Minutes
RTX 2070
January 27
  2 Hours
RTX 2070 SUPER
January 23
  1 Hour, 45 Minutes
RTX 2080
January 24
  1 Hour, 39 Minutes
RTX 2080 SUPER
January 24
  1 Hour, 33 Minutes
TITAN RTX
January 26
  1 Hour, 19 Minutes
RTX 3060
January 22
  2 Hours, 9 Minutes
RTX 3060 Ti
January 21
  1 Hour, 38 Minutes
RTX 3070
January 20
  1 Hour, 33 Minutes
RTX 3070 Ti
January 21
  1 Hour, 26 Minutes
RTX 3080
January 22
  1 Hour, 13 Minutes
RTX 3080 Ti
January 21
  1 Hour, 10 Minutes
RTX 3090
January 24
  1 Hour, 6 Minutes
RTX 4060
January 20
  1 Hour, 56 Minutes
RTX 4070
January 27
  1 Hour, 22 Minutes
RTX 4070 SUPER
January 28
  1 Hour, 17 Minutes
RTX 4070 Ti SUPER
January 28
  1 Hour, 7 Minutes
RTX 4080
January 22
  1 Hour, 5 Minutes
RTX 4080 SUPER
January 29
  1 Hour, 1 Minute
RTX 4090
January 21
  53 Minutes
Invert Behavior (Only Show Selected Data)
  1 Hour, 28 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA RTX 40 Series ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionRTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERTITAN RTXRTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 3080 TiRTX 3090RTX 4060RTX 4070RTX 4070 SUPERRTX 4070 Ti SUPERRTX 4080RTX 4080 SUPERRTX 4090AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS)AMD Device 14d82 x 16 GB DRAM-6000MT/s G Skill F5-6000J3038F16G2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GBNVIDIA GeForce RTX 2060 SUPER 8GBNVIDIA TU106 HD AudioDELL U2723QEIntel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Ubuntu 23.106.7.0-060700-generic (x86_64)GNOME Shell 45.2X Server 1.21.1.7NVIDIA 545.29.064.6.0OpenCL 3.0 CUDA 12.3.99GCC 13.2.0 + LLVM 16.0.6ext43840x2160ASUS NVIDIA GeForce RTX 2070 8GBNVIDIA GeForce RTX 2070 SUPER 8GBNVIDIA TU104 HD AudioZotac NVIDIA GeForce RTX 2080 8GBNVIDIA GeForce RTX 2080 SUPER 8GB4001GB Western Digital WD_BLACK SN850X 4000GB + 2000GB Samsung SSD 980 PRO 2TBNVIDIA TITAN RTX 24GBNVIDIA TU102 HD Audio2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GBeVGA NVIDIA GeForce RTX 3060 12GBNVIDIA GA106 HD AudioNVIDIA GeForce RTX 3060 Ti 8GBNVIDIA GA104 HD Audio4001GB Western Digital WD_BLACK SN850X 4000GB + 2000GB Samsung SSD 980 PRO 2TBNVIDIA GeForce RTX 3070 8GBNVIDIA GeForce RTX 3070 Ti 8GB2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GBNVIDIA GeForce RTX 3080 10GBNVIDIA GA102 HD AudioNVIDIA GeForce RTX 3080 Ti 12GBNVIDIA GeForce RTX 3090 24GBMSI NVIDIA GeForce RTX 4060 8GBNVIDIA Device 22beNVIDIA GeForce RTX 4070 12GBNVIDIA Device 22bcNVIDIA GeForce RTX 4070 SUPER 12GBNVIDIA 550.40.07OpenCL 3.0 CUDA 12.4.74ASUS NVIDIA GeForce RTX 4070 Ti SUPER 16GBNVIDIA Device 22bbNVIDIA GeForce RTX 4080 16GBNVIDIA 545.29.06OpenCL 3.0 CUDA 12.3.99NVIDIA GeForce RTX 4080 SUPER 16GBNVIDIA 550.40.07OpenCL 3.0 CUDA 12.4.74NVIDIA GeForce RTX 4090 24GBNVIDIA AD102 HD AudioNVIDIA 545.29.06OpenCL 3.0 CUDA 12.3.99OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Details- Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Graphics Details- RTX 2060 SUPER: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.44.00.01- RTX 2070: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.06.0b.40.83- RTX 2070 SUPER: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.04.76.00.01- RTX 2080: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.04.0d.00.1e- RTX 2080 SUPER: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.04.79.00.01- TITAN RTX: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.23.00.01- RTX 3060: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.06.14.40.46- RTX 3060 Ti: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2c- RTX 3070: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.25.00.2b- RTX 3070 Ti: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 94.04.5b.00.02- RTX 3080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.20.00.07- RTX 3080 Ti: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 94.02.71.00.01- RTX 3090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.27.00.02- RTX 4060: BAR1 / Visible vRAM Size: 8192 MiB - vBIOS Version: 95.07.31.00.e3- RTX 4070: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.49.00.03- RTX 4070 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.04.69.00.01- RTX 4070 Ti SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.45.00.9c- RTX 4080: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.04- RTX 4080 SUPER: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.44.00.01- RTX 4090: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.01OpenCL Details- RTX 2060 SUPER: GPU Compute Cores: 2176- RTX 2070: GPU Compute Cores: 2304- RTX 2070 SUPER: GPU Compute Cores: 2560- RTX 2080: GPU Compute Cores: 2944- RTX 2080 SUPER: GPU Compute Cores: 3072- TITAN RTX: GPU Compute Cores: 4608- RTX 3060: GPU Compute Cores: 3584- RTX 3060 Ti: GPU Compute Cores: 4864- RTX 3070: GPU Compute Cores: 5888- RTX 3070 Ti: GPU Compute Cores: 6144- RTX 3080: GPU Compute Cores: 8704- RTX 3080 Ti: GPU Compute Cores: 10240- RTX 3090: GPU Compute Cores: 10496- RTX 4060: GPU Compute Cores: 3072- RTX 4070: GPU Compute Cores: 5888- RTX 4070 SUPER: GPU Compute Cores: 7168- RTX 4070 Ti SUPER: GPU Compute Cores: 8448- RTX 4080: GPU Compute Cores: 9728- RTX 4080 SUPER: GPU Compute Cores: 10240- RTX 4090: GPU Compute Cores: 16384Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERTITAN RTXRTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 3080 TiRTX 3090RTX 4060RTX 4070RTX 4070 SUPERRTX 4070 Ti SUPERRTX 4080RTX 4080 SUPERRTX 4090Result OverviewPhoronix Test Suite100%290%480%670%859%Chaos Group V-RAYGpuOwlLuxCoreRenderBlenderProjectPhysX OpenCL-BenchmarkIndigoBenchFluidX3DPyTorch

NVIDIA RTX 40 Series Computev-ray: NVIDIA CUDA GPUv-ray: NVIDIA RTX GPUluxcorerender: DLSC - GPUluxcorerender: Danish Mood - GPUopencl-benchmark: FP64 Computegpuowl: 332220523gpuowl: 77936867gpuowl: 57885161opencl-benchmark: INT32 Computeopencl-benchmark: INT8 Computeopencl-benchmark: INT16 Computeluxcorerender: LuxCore Benchmark - GPUblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA OptiXluxcorerender: Orange Juice - GPUindigobench: OpenCL GPU - Bedroomblender: Classroom - NVIDIA OptiXblender: Barbershop - NVIDIA OptiXblender: BMW27 - NVIDIA OptiXindigobench: OpenCL GPU - Supercarfluidx3d: FP32-FP16Copencl-benchmark: Memory Bandwidth Coalesced Readluxcorerender: Rainbow Colors and Prism - GPUfluidx3d: FP32-FP32opencl-benchmark: Memory Bandwidth Coalesced Writefluidx3d: FP32-FP16Sopencl-benchmark: INT64 Computepytorch: NVIDIA CUDA GPU - 512 - ResNet-50pytorch: NVIDIA CUDA GPU - 512 - ResNet-152opencl-benchmark: FP32 ComputeRTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERTITAN RTXRTX 3060RTX 3060 TiRTX 3070RTX 3070 TiRTX 3080RTX 3080 TiRTX 3090RTX 4060RTX 4070RTX 4070 SUPERRTX 4070 Ti SUPERRTX 4080RTX 4080 SUPERRTX 40905118393.272.730.25556.17263.99355.8718861218.2145.4577.0493.5927.6843.413.667.56935.98141.7413.2823.1854442388.0012.352437418.5749822.185191.5380.278.2724507362.932.490.25651.78244.74330.628.3775.6577.2603.2429.7847.353.366.94639.50155.7014.4021.1954223388.4911.372270421.1645892.126177.7373.528.48479410684.113.550.30166.04310.72419.649.7076.4988.4074.2124.8141.824.477.98534.63131.4012.8524.4754955388.3012.142321409.5049483.067200.5184.389.7857829984.223.650.33573.30344.12463.1810.8077.0989.2644.3323.2941.654.588.06234.88130.9812.7425.0154966387.2511.802322417.5549573.310197.6282.8610.89880410134.443.810.36580.43377.03508.8211.8317.89710.4034.4921.8240.564.738.31333.57126.6012.3525.8875351426.0111.732504452.0054233.630208.6887.6011.89197014525.915.100.530115.73531.73711.0717.08110.94313.8996.3716.2528.296.0512.06124.4592.689.2934.9247157556.6016.373343603.9971083.576263.23106.9617.17781111474.253.340.20144.28209.13284.506.7945.1175.9174.1028.3742.744.828.28538.45135.4814.4824.3843386335.7513.572039339.6739321.925204.4187.1913.080125016855.954.780.29965.93313.15421.239.6587.1878.4015.7020.1831.356.1811.68128.1498.8411.0033.3864698414.4519.08263742251363.090267.12117.4018.693141618746.935.570.35678.55369.14497.0211.4528.4549.9876.6617.6328.197.0413.17425.3688.759.9836.8914884414.4321.522597422.0150793.270287.04126.5522.272148720317.185.780.36179.25372.02503.9511.5948.73010.1156.9616.7026.237.1813.86723.4883.719.2238.5065913562.7822.203507578.0969153.185305.64130.8322.553177724499.847.890.530116.07533.43725.6916.95612.16214.5269.6312.4921.019.0817.76518.3865.487.5245.6127999702.7727.224338721.7784603.266388.94143.6432.9242045291811.319.130.606131.09615.01817.4419.32913.58416.36811.1411.1617.8510.0820.38515.9757.276.6451.0189144842.8832.005199866.26101223.155387.25145.3637.2982108299511.879.690.643139.80656.60882.3620.41714.33317.26011.6010.6717.5410.4921.04815.5655.516.4552.7989610864.4932.525328885.87104913.170389.05146.2239.661123918605.374.560.26458.24275.74374.538.4916.2507.3915.2219.0326.225.489.68623.9598.269.2228.6903112253.0313.151608258.2330462.091288.08117.1116.502172025059.257.540.508103.41486.70656.7516.31212.35414.1668.9112.2117.988.3616.77316.4565.096.8544.6995252465.3620.212689459.2746793.428369.73140.0431.6482250309111.599.650.614133.96631.18850.8319.67214.36316.87011.349.9214.689.7819.46213.1953.455.8052.1725670464.7624.782778455.2755473.989386.65141.9738.1752697368113.7611.240.725159.72744.601003.0123.20117.22720.20113.088.7312.8811.5424.19411.7046.445.2560.4537285619.5828.283830612.0264474.304399.92146.2945.0313070418215.2912.810.830182.69861.821138.9526.62320.15023.08114.527.7611.2712.4825.67910.1041.124.6864.8227807653.2832.423820611.3176834.521387.42147.4451.5693073411315.8213.240.864188.70889.941187.64845605727.65420.91624.02414.847.5711.0912.8126.19410.1240.644.6364.9258106681.1132.533967627.3377784.246399.58147.5953.6424333564522.2718.521.389302.791426.531909.6144.40333.32538.43919.255.598.9017.5035.6227.9632.543.8079.19611542927.8841.035726905.50102374.351401.36147.3185.843OpenBenchmarking.org

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 5.02Mode: NVIDIA CUDA GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER9001800270036004500SE +/- 1.00, N = 3SE +/- 2.31, N = 3SE +/- 1.67, N = 3SE +/- 1.45, N = 3SE +/- 5.61, N = 3SE +/- 1.20, N = 3SE +/- 1.53, N = 3SE +/- 0.67, N = 3SE +/- 1.86, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 1.86, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.20, N = 39704333307330702697225017201239210820451777148714161250811804782794450511

OpenBenchmarking.orgvrays, More Is BetterChaos Group V-RAY 5.02Mode: NVIDIA RTX GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER12002400360048006000SE +/- 1.33, N = 3SE +/- 16.17, N = 3SE +/- 6.93, N = 3SE +/- 6.01, N = 3SE +/- 3.51, N = 3SE +/- 3.61, N = 3SE +/- 0.67, N = 3SE +/- 2.08, N = 3SE +/- 1.76, N = 3SE +/- 4.67, N = 3SE +/- 3.18, N = 3SE +/- 0.33, N = 3SE +/- 3.21, N = 3SE +/- 2.52, N = 3SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 4.37, N = 3SE +/- 1.00, N = 3SE +/- 1.33, N = 314525645411341823681309125051860299529182449203118741685114710139981068736839

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.9122.2715.8215.2913.7611.599.255.3711.8711.319.847.186.935.954.254.444.224.112.933.27

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER510152025SE +/- 0.04, N = 3SE +/- 0.22, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.1018.5213.2412.8111.249.657.544.569.699.137.895.785.574.783.343.813.653.552.492.73

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP64 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER0.31250.6250.93751.251.5625SE +/- 0.001, N = 3SE +/- 0.000, N = 5SE +/- 0.000, N = 5SE +/- 0.000, N = 4SE +/- 0.000, N = 4SE +/- 0.001, N = 4SE +/- 0.000, N = 4SE +/- 0.000, N = 3SE +/- 0.000, N = 4SE +/- 0.000, N = 4SE +/- 0.001, N = 4SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.5301.3890.8640.8300.7250.6140.5080.2640.6430.6060.5300.3610.3560.2990.2010.3650.3350.3010.2560.2551. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.2.1Exponent: 332220523TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER70140210280350SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3115.73302.79188.70182.69159.72133.96103.4158.24139.80131.09116.0779.2578.5565.9344.2880.4373.3066.0451.7856.17

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.2.1Exponent: 77936867TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER30060090012001500SE +/- 0.38, N = 3SE +/- 0.00, N = 3SE +/- 0.53, N = 3SE +/- 0.50, N = 3SE +/- 0.00, N = 3SE +/- 0.87, N = 3SE +/- 0.80, N = 3SE +/- 0.03, N = 3SE +/- 0.25, N = 3SE +/- 0.38, N = 3SE +/- 0.38, N = 3SE +/- 0.16, N = 3SE +/- 0.00, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.00, N = 3531.731426.53889.94861.82744.60631.18486.70275.74656.60615.01533.43372.02369.14313.15209.13377.03344.12310.72244.74263.99

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.2.1Exponent: 57885161TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER400800120016002000SE +/- 1.18, N = 3SE +/- 1.22, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.05, N = 3SE +/- 1.01, N = 3SE +/- 0.14, N = 3SE +/- 2.08, N = 3SE +/- 1.56, N = 3SE +/- 0.53, N = 3SE +/- 0.08, N = 3SE +/- 0.00, N = 3SE +/- 0.18, N = 3SE +/- 0.24, N = 3SE +/- 1.12, N = 3SE +/- 0.12, N = 3SE +/- 0.00, N = 3SE +/- 0.13, N = 3SE +/- 0.00, N = 3711.071909.611187.651138.951003.01850.83656.75374.53882.36817.44725.69503.95497.02421.23284.50508.82463.18419.64330.62355.87

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT32 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER1020304050SE +/- 0.046, N = 3SE +/- 0.001, N = 5SE +/- 0.009, N = 5SE +/- 0.022, N = 4SE +/- 0.001, N = 4SE +/- 0.000, N = 4SE +/- 0.000, N = 4SE +/- 0.000, N = 3SE +/- 0.046, N = 4SE +/- 0.037, N = 4SE +/- 0.033, N = 4SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.014, N = 3SE +/- 0.001, N = 3SE +/- 0.030, N = 3SE +/- 0.014, N = 3SE +/- 0.023, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 317.08144.40327.65426.62323.20119.67216.3128.49120.41719.32916.95611.59411.4529.6586.79411.83110.8079.7078.3778.2141. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT8 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER816243240SE +/- 0.069, N = 3SE +/- 0.061, N = 5SE +/- 0.011, N = 5SE +/- 0.006, N = 4SE +/- 0.001, N = 4SE +/- 0.038, N = 4SE +/- 0.008, N = 4SE +/- 0.034, N = 3SE +/- 0.001, N = 4SE +/- 0.057, N = 4SE +/- 0.072, N = 4SE +/- 0.011, N = 3SE +/- 0.000, N = 3SE +/- 0.036, N = 3SE +/- 0.012, N = 3SE +/- 0.005, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.015, N = 3SE +/- 0.001, N = 310.94333.32520.91620.15017.22714.36312.3546.25014.33313.58412.1628.7308.4547.1875.1177.8977.0986.4985.6575.4571. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT16 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER918273645SE +/- 0.038, N = 3SE +/- 0.051, N = 5SE +/- 0.010, N = 5SE +/- 0.006, N = 4SE +/- 0.007, N = 4SE +/- 0.004, N = 4SE +/- 0.008, N = 4SE +/- 0.003, N = 3SE +/- 0.033, N = 4SE +/- 0.006, N = 4SE +/- 0.045, N = 4SE +/- 0.000, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.014, N = 3SE +/- 0.027, N = 3SE +/- 0.031, N = 3SE +/- 0.027, N = 3SE +/- 0.019, N = 3SE +/- 0.028, N = 313.89938.43924.02423.08120.20116.87014.1667.39117.26016.36814.52610.1159.9878.4015.91710.4039.2648.4077.2607.0491. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 36.3719.2514.8414.5213.0811.348.915.2211.6011.149.636.966.665.704.104.494.334.213.243.59

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Fishy Cat - Compute: NVIDIA OptiXTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER714212835SE +/- 0.16, N = 5SE +/- 0.06, N = 15SE +/- 0.01, N = 6SE +/- 0.06, N = 15SE +/- 0.05, N = 15SE +/- 0.05, N = 15SE +/- 0.10, N = 8SE +/- 0.21, N = 4SE +/- 0.01, N = 5SE +/- 0.09, N = 9SE +/- 0.01, N = 4SE +/- 0.01, N = 3SE +/- 0.21, N = 4SE +/- 0.01, N = 3SE +/- 0.30, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.27, N = 3SE +/- 0.06, N = 3SE +/- 0.27, N = 316.255.597.577.768.739.9212.2119.0310.6711.1612.4916.7017.6320.1828.3721.8223.2924.8129.7827.68

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER1122334455SE +/- 0.05, N = 3SE +/- 0.01, N = 5SE +/- 0.00, N = 4SE +/- 0.02, N = 4SE +/- 0.01, N = 4SE +/- 0.02, N = 4SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 328.298.9011.0911.2712.8814.6817.9826.2217.5417.8521.0126.2328.1931.3542.7440.5641.6541.8247.3543.41

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER48121620SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.0517.5012.8112.4811.549.788.365.4810.4910.089.087.187.046.184.824.734.584.473.363.66

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER816243240SE +/- 0.027, N = 3SE +/- 0.105, N = 3SE +/- 0.037, N = 3SE +/- 0.068, N = 3SE +/- 0.003, N = 3SE +/- 0.013, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.007, N = 3SE +/- 0.022, N = 3SE +/- 0.004, N = 3SE +/- 0.015, N = 3SE +/- 0.003, N = 3SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.007, N = 3SE +/- 0.006, N = 3SE +/- 0.009, N = 3SE +/- 0.007, N = 3SE +/- 0.002, N = 312.06135.62226.19425.67924.19419.46216.7739.68621.04820.38517.76513.86713.17411.6818.2858.3138.0627.9856.9467.569

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Classroom - Compute: NVIDIA OptiXTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER918273645SE +/- 0.05, N = 3SE +/- 0.01, N = 6SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.03, N = 4SE +/- 0.05, N = 4SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 4SE +/- 0.01, N = 4SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 324.457.9610.1210.1011.7013.1916.4523.9515.5615.9718.3823.4825.3628.1438.4533.5734.8834.6339.5035.98

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Barbershop - Compute: NVIDIA OptiXTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER306090120150SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.21, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 392.6832.5440.6441.1246.4453.4565.0998.2655.5157.2765.4883.7188.7598.84135.48126.60130.98131.40155.70141.74

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: BMW27 - Compute: NVIDIA OptiXTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER48121620SE +/- 0.05, N = 15SE +/- 0.06, N = 15SE +/- 0.01, N = 7SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.02, N = 6SE +/- 0.06, N = 15SE +/- 0.01, N = 6SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 5SE +/- 0.12, N = 8SE +/- 0.04, N = 4SE +/- 0.02, N = 4SE +/- 0.10, N = 10SE +/- 0.01, N = 4SE +/- 0.12, N = 79.293.804.634.685.255.806.859.226.456.647.529.229.9811.0014.4812.3512.7412.8514.4013.28

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER20406080100SE +/- 0.03, N = 3SE +/- 0.57, N = 3SE +/- 0.34, N = 3SE +/- 0.37, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.31, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 334.9279.2064.9364.8260.4552.1744.7028.6952.8051.0245.6138.5136.8933.3924.3825.8925.0224.4821.2023.19

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16CTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER2K4K6K8K10KSE +/- 1.76, N = 3SE +/- 0.25, N = 4SE +/- 0.33, N = 3SE +/- 9.45, N = 3SE +/- 1.20, N = 3SE +/- 8.84, N = 3SE +/- 1.86, N = 3SE +/- 0.67, N = 3SE +/- 6.69, N = 3SE +/- 19.94, N = 3SE +/- 12.24, N = 3SE +/- 8.69, N = 3SE +/- 2.33, N = 3SE +/- 5.49, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 5.93, N = 3SE +/- 0.00, N = 3SE +/- 16.42, N = 3SE +/- 10.41, N = 3715711542810678077285567052523112961091447999591348844698338653514966495542234442

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced ReadTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER2004006008001000SE +/- 0.16, N = 3SE +/- 0.02, N = 5SE +/- 0.03, N = 5SE +/- 0.02, N = 4SE +/- 0.02, N = 4SE +/- 0.03, N = 4SE +/- 0.03, N = 4SE +/- 0.00, N = 3SE +/- 0.02, N = 4SE +/- 0.03, N = 4SE +/- 0.02, N = 4SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3556.60927.88681.11653.28619.58464.76465.36253.03864.49842.88702.77562.78414.43414.45335.75426.01387.25388.30388.49388.001. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPUTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER918273645SE +/- 0.04, N = 5SE +/- 0.20, N = 7SE +/- 0.17, N = 6SE +/- 0.16, N = 6SE +/- 0.03, N = 6SE +/- 0.24, N = 6SE +/- 0.07, N = 5SE +/- 0.07, N = 4SE +/- 0.07, N = 6SE +/- 0.09, N = 6SE +/- 0.21, N = 9SE +/- 0.06, N = 5SE +/- 0.12, N = 5SE +/- 0.05, N = 5SE +/- 0.03, N = 4SE +/- 0.06, N = 4SE +/- 0.05, N = 4SE +/- 0.02, N = 4SE +/- 0.06, N = 4SE +/- 0.06, N = 416.3741.0332.5332.4228.2824.7820.2113.1532.5232.0027.2222.2021.5219.0813.5711.7311.8012.1411.3712.35

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP32TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER12002400360048006000SE +/- 3.18, N = 3SE +/- 0.33, N = 3SE +/- 4.04, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 3.51, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 2.08, N = 3SE +/- 0.33, N = 333435726396738203830277826891608532851994338350725972637203925042322232122702437

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced WriteTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER2004006008001000SE +/- 0.30, N = 3SE +/- 0.29, N = 5SE +/- 0.43, N = 5SE +/- 0.28, N = 4SE +/- 0.23, N = 4SE +/- 0.23, N = 4SE +/- 0.18, N = 4SE +/- 0.01, N = 3SE +/- 0.04, N = 4SE +/- 0.01, N = 4SE +/- 0.02, N = 4SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.51, N = 3SE +/- 0.64, N = 3SE +/- 0.64, N = 3SE +/- 0.04, N = 3SE +/- 0.15, N = 3603.99905.50627.33611.31612.02455.27459.27258.23885.87866.26721.77578.09422.01422.00339.67452.00417.55409.50421.16418.571. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

FluidX3D

FluidX3D is a speedy and memory efficient Boltzmann CFD (Computational Fluid Dynamics) software package implemented using OpenCL and intended for GPU acceleration. FluidX3D is developed by Moritz Lehmann and written free for non-commercial use. This is a test profile measuring the system OpenCL performance using the FluidX3D benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.9Test: FP32-FP16STITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER2K4K6K8K10KSE +/- 0.33, N = 3SE +/- 4.36, N = 3SE +/- 0.88, N = 3SE +/- 0.88, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 2.73, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 0.00, N = 371081023777787683644755474679304610491101228460691550795136393254234957494845894982

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT64 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER1.01722.03443.05164.06885.086SE +/- 0.010, N = 3SE +/- 0.007, N = 5SE +/- 0.003, N = 5SE +/- 0.007, N = 4SE +/- 0.002, N = 4SE +/- 0.006, N = 4SE +/- 0.001, N = 4SE +/- 0.003, N = 3SE +/- 0.054, N = 4SE +/- 0.014, N = 4SE +/- 0.020, N = 4SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.009, N = 3SE +/- 0.008, N = 3SE +/- 0.005, N = 3SE +/- 0.010, N = 3SE +/- 0.025, N = 3SE +/- 0.003, N = 3SE +/- 0.000, N = 33.5764.3514.2464.5214.3043.9893.4282.0913.1703.1553.2663.1853.2703.0901.9253.6303.3103.0672.1262.1851. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

PyTorch

This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-50TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER90180270360450SE +/- 0.38, N = 5SE +/- 2.01, N = 6SE +/- 1.09, N = 6SE +/- 1.24, N = 6SE +/- 1.81, N = 6SE +/- 0.86, N = 6SE +/- 1.00, N = 5SE +/- 0.56, N = 5SE +/- 1.32, N = 6SE +/- 1.61, N = 6SE +/- 1.06, N = 6SE +/- 0.48, N = 5SE +/- 0.67, N = 5SE +/- 0.60, N = 5SE +/- 0.37, N = 4SE +/- 0.39, N = 4SE +/- 0.22, N = 4SE +/- 0.10, N = 4SE +/- 0.43, N = 4SE +/- 0.16, N = 4263.23401.36399.58387.42399.92386.65369.73288.08389.05387.25388.94305.64287.04267.12204.41208.68197.62200.51177.73191.53

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: NVIDIA CUDA GPU - Batch Size: 512 - Model: ResNet-152TITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER306090120150SE +/- 0.17, N = 3SE +/- 0.83, N = 3SE +/- 0.91, N = 3SE +/- 1.10, N = 3SE +/- 0.72, N = 3SE +/- 0.98, N = 3SE +/- 0.31, N = 3SE +/- 0.04, N = 3SE +/- 0.33, N = 3SE +/- 0.46, N = 3SE +/- 1.03, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 3SE +/- 0.19, N = 3SE +/- 0.18, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.16, N = 3SE +/- 0.15, N = 3106.96147.31147.59147.44146.29141.97140.04117.11146.22145.36143.64130.83126.55117.4087.1987.6082.8684.3873.5280.27

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP32 ComputeTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER20406080100SE +/- 0.000, N = 3SE +/- 0.014, N = 5SE +/- 0.007, N = 5SE +/- 0.032, N = 4SE +/- 0.012, N = 4SE +/- 0.001, N = 4SE +/- 0.025, N = 4SE +/- 0.000, N = 3SE +/- 0.001, N = 4SE +/- 0.019, N = 4SE +/- 0.026, N = 4SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.049, N = 3SE +/- 0.030, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.023, N = 3SE +/- 0.018, N = 317.17785.84353.64251.56945.03138.17531.64816.50239.66137.29832.92422.55322.27218.69313.08011.89110.8989.7858.4848.2721. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 4060RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER1632486480Min: 40 / Avg: 71.04 / Max: 78Min: 33 / Avg: 42.46 / Max: 59Min: 33 / Avg: 45.64 / Max: 57Min: 32 / Avg: 45.66 / Max: 60Min: 36 / Avg: 49.56 / Max: 60Min: 34 / Avg: 53.31 / Max: 68Min: 33 / Avg: 48.29 / Max: 62Min: 34 / Avg: 52.86 / Max: 65Min: 45 / Avg: 64.33 / Max: 71Min: 36 / Avg: 62.58 / Max: 78Min: 49 / Avg: 70.32 / Max: 78Min: 48 / Avg: 68.94 / Max: 80Min: 35 / Avg: 58.6 / Max: 73Min: 40 / Avg: 61.62 / Max: 72Min: 43 / Avg: 54.51 / Max: 64Min: 29 / Avg: 60.19 / Max: 73Min: 51 / Avg: 73.87 / Max: 83Min: 28 / Avg: 59.08 / Max: 71Min: 33 / Avg: 63.8 / Max: 76Min: 32 / Avg: 63.7 / Max: 74

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringTITAN RTXRTX 4090RTX 4080 SUPERRTX 4080RTX 4070 Ti SUPERRTX 4070 SUPERRTX 4070RTX 3090RTX 3080 TiRTX 3080RTX 3070 TiRTX 3070RTX 3060 TiRTX 3060RTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPER60120180240300Min: 9.65 / Avg: 194.86 / Max: 286.86Min: 13.54 / Avg: 34.16 / Max: 322.66Min: 9.23 / Avg: 123.83 / Max: 251.41Min: 9.6 / Avg: 131.42 / Max: 259.62Min: 8.14 / Avg: 141.6 / Max: 272.56Min: 3.4 / Avg: 106.14 / Max: 213.55Min: 7.95 / Avg: 93.57 / Max: 184.25Min: 16.24 / Avg: 253.87 / Max: 348.11Min: 20.08 / Avg: 179.98 / Max: 349.71Min: 16.53 / Avg: 220.58 / Max: 319.93Min: 23.19 / Avg: 167.28 / Max: 289.92Min: 14.88 / Avg: 107.51 / Max: 219.28Min: 19.64 / Avg: 119.17 / Max: 199.96Min: 19.36 / Avg: 87.09 / Max: 151.4Min: 10.27 / Avg: 129.96 / Max: 255.32Min: 14.33 / Avg: 141.11 / Max: 223.41Min: 14.9 / Avg: 124.93 / Max: 220.48Min: 6.75 / Avg: 109.77 / Max: 188.99Min: 17.39 / Avg: 114.08 / Max: 198.52