NVIDIA RTX 6000 Ada Generation

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409138-PTS-NVIDIART87
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 6000 Ada Generation
September 11
  4 Hours, 51 Minutes
4
September 12
  1 Hour, 34 Minutes
5
September 12
  1 Hour, 34 Minutes
RTX 4000 Ada Generation
September 12
  5 Hours, 30 Minutes
NVIDIA RTX 4000 Ada Generation
September 12
  1 Hour, 48 Minutes
RTX 2000 Ada Generation
September 12
  6 Hours, 42 Minutes
2
September 12
  6 Hours, 12 Minutes
2a
September 13
  6 Hours, 40 Minutes
Invert Behavior (Only Show Selected Data)
  4 Hours, 21 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA RTX 6000 Ada GenerationProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionRTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation22aAMD Ryzen 9 9950X 16-Core @ 8.18GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2308 BIOS)AMD Device 14d82 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32Western Digital WD_BLACK SN850X 2000GBNVIDIA RTX 6000 Ada Generation 48GBNVIDIA AD102 HD AudioDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.8.0-41-generic (x86_64)GNOME Shell 46.0X Server 1.21.1.11NVIDIA 560.35.034.6.0OpenCL 3.0 CUDA 12.6.65GCC 13.2.0ext43840x2160NVIDIA RTX 4000 Ada Generation 20GBNVIDIA Device 22bcNVIDIA RTX 2000 Ada Generation 16GBNVIDIA Device 22beOpenBenchmarking.orgKernel Details- nouveau.modeset=0 - Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xb40401cGraphics Details- RTX 6000 Ada Generation: BAR1 / Visible vRAM Size: 65536 MiB - vBIOS Version: 95.02.3a.00.01- 4: BAR1 / Visible vRAM Size: 65536 MiB - vBIOS Version: 95.02.3a.00.01- 5: BAR1 / Visible vRAM Size: 65536 MiB - vBIOS Version: 95.02.3a.00.01- RTX 4000 Ada Generation: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.04.5c.00.0d- NVIDIA RTX 4000 Ada Generation: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.04.5c.00.0d- RTX 2000 Ada Generation: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.07.47.00.05- 2: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.07.47.00.05- 2a: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.07.47.00.05OpenCL Details- RTX 6000 Ada Generation: GPU Compute Cores: 18176- 4: GPU Compute Cores: 18176- 5: GPU Compute Cores: 18176- RTX 4000 Ada Generation: GPU Compute Cores: 6144- NVIDIA RTX 4000 Ada Generation: GPU Compute Cores: 6144- RTX 2000 Ada Generation: GPU Compute Cores: 2816- 2: GPU Compute Cores: 2816- 2a: GPU Compute Cores: 2816Python Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation22aResult OverviewPhoronix Test Suite100%245%391%536%682%vkpeakGpuOwlProjectPhysX OpenCL-BenchmarkFluidX3DFinanceBenchBlenderParaViewclpeakSPECViewPerf 2020FAHBenchRodinia

NVIDIA RTX 6000 Ada Generationspecviewperf2020: 2560 x 1440 - CATIA-06specviewperf2020: 2560 x 1440 - CREO-03specviewperf2020: 2560 x 1440 - ENERGY-03specviewperf2020: 2560 x 1440 - MAYA-06specviewperf2020: 2560 x 1440 - MEDICAL-O3specviewperf2020: 2560 x 1440 - SNX-04specviewperf2020: 2560 x 1440 - SOLIDWORKS-07paraview: Many Spheres - 3000 - 2560 x 1440paraview: Many Spheres - 3000 - 3840 x 2160paraview: Wavelet Contour - 3000 - 2560 x 1440paraview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 2560 x 1440paraview: Wavelet Volume - 3000 - 3840 x 2160opencl-benchmark: Memory Bandwidth Coalesced Readopencl-benchmark: Memory Bandwidth Coalesced Writeclpeak: Global Memory Bandwidthclpeak: Transfer Bandwidth enqueueWriteBufferclpeak: Transfer Bandwidth enqueueReadBuffervkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp16-scalarvkpeak: fp16-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4clpeak: Single-Precision Computeclpeak: Double-Precision Computevkpeak: int32-scalarvkpeak: int32-vec4vkpeak: int16-scalarvkpeak: int16-vec4clpeak: Integer Computeclpeak: Integer 24-bit Computegpuowl: 77936867gpuowl: 332220523gpuowl: 57885161indigobench: OpenCL GPU - Supercarindigobench: OpenCL GPU - Bedroomluxcorerender: DLSC - GPUluxcorerender: Rainbow Colors and Prism - GPUluxcorerender: LuxCore Benchmark - GPUluxcorerender: Orange Juice - GPUluxcorerender: Danish Mood - GPUparaview: Many Spheres - 3000 - 2560 x 1440paraview: Many Spheres - 3000 - 3840 x 2160paraview: Wavelet Contour - 3000 - 2560 x 1440paraview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 2560 x 1440paraview: Wavelet Volume - 3000 - 3840 x 2160fluidx3d: FP32-FP32fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sfahbench: opencl-benchmark: FP64 Computeopencl-benchmark: FP32 Computeopencl-benchmark: INT64 Computeopencl-benchmark: INT32 Computeopencl-benchmark: INT16 Computeopencl-benchmark: INT8 Computev-ray: NVIDIA CUDA GPUv-ray: NVIDIA RTX GPUfinancebench: Black-Scholes OpenCLfinancebench: Monte-Carlo OpenCLrodinia: OpenCL Myocyterodinia: OpenCL Particle Filterblender: BMW27 - NVIDIA CUDAblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA CUDAblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA CUDAblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA CUDAblender: Pabellon Barcelona - NVIDIA OptiXblender: Barbershop - NVIDIA CUDAblender: Barbershop - NVIDIA OptiXblender: Junkshop - NVIDIA CUDAblender: Junkshop - NVIDIA OptiXclpeak: Kernel LatencyRTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation22a236.96256.46156.17813.18211.981026.56578.10167.61158.47811.19625.76853.68639.67865.57850.04815.9122.1122.8748174.3362508.5446258.4393562.471538.941533.6483188.791534.7048838.8348137.8531027.6638953.7140360.7540310.581490.31322.422043.6073.90529.28218.9535.6517.7815.8617.7216804.13815887.6678453.5676521.15213658.90510234.76252521058510208462.31311.5190.8053.77444.16132.29721.792598481742.71099.23928720.1642.0915.573.6110.447.9210.675.4821.308.8645.6532.9111.087.343.79237.35256.36156.36812.16212.241029.44579.49165.37157.73806.74622.92846.86634866.19860.25816.1221.8922.6648751.2563072.9746494.2994400.521539.941533.5782190.361526.6248805.5747938.730661.8938068.1538762.5942249.871490.31322.582044.9973.80829.3021935.7717.7915.5517.7816578.78515813.2588407.1816491.52613549.79610144.05352671054510175462.97851.50290.1793.69343.50431.16820.844594881592.70299.32199920.3062.1155.553.6210.387.8510.615.421.138.8845.7732.8311.077.373.78236.74255.97155.91812.5211.551029.37579.41165.54157.74805.17625.14863.24638.96866.47860.03815.8822.9823.8648735.8163065.0546558.2893801.481539.991535.4883007.711532.8148814.9148068.4430694.2937707.0938304.8939686.791490.31322.482044.9973.78229.2581935.7317.8115.8917.5416596.14115813.7858390.8266514.70513811.89110223.37652681058410209467.31081.50290.1723.81243.86630.94620.846595981592.69299.34720.272.0835.553.6110.347.8210.645.4121.138.8245.6632.9411.127.343.79141.84170.6162.79430.4897.36423.87308.7696.7788.93345.92267.58474.47269.93326.93333.72307.3122.2022.6914015.6218522.5314009.3627661.65440.80439.9125836.20439.0713996.8513937.839337.3312377.7913179.7113239.24451.0695.65610.8736.88612.8647.9617.598.047.696.909701.1748915.5883604.9082788.5457591.5894318.907204640643805282.15940.43326.9402.87513.87511.3889.706194629277.660199.82800321.6844.98513.267.2426.6317.9126.1913.0259.1919.44108.1472.3922.9114.543.73143.77170.7962.9430.297.38423.85308.6896.6188.57344.93266.78468.3268.98326.92333.67307.4721.921.7514015.818566.914070.2127782.79440.86439.9325728439.7613995.613936.619344.5512378.8913209.7513222.86451.6795.74611.2536.94412.897.9717.578.057.717.079685.6998879.4663594.5392780.1177492.8444303.65204740663804282.15420.43326.9212.82413.87211.389.707196029377.641200.21099921.5094.98413.257.2226.5217.8226.1812.9259.1319.38108.0872.122.9914.593.783.87113.9331.69276.2565.49285.10184.6047.1243.16193.51145.58286.61151.41207.71211.82195.1113.112.926944.909185.796967.8413744.12218.85218.8411704.57217.306959.066926.574635.636166.935978.975987.65226.5547.86308.3624.3908.1504.611.964.635.214.094723.5874326.6322016.5811517.1154585.8192422.539133125292434207.95290.21513.4391.7506.8095.9244.7761422203016.228000265.87800120.7609.17222.5210.5245.8928.6147.2823.02108.9631.18188.78114.4338.3522.353.7783.90114.1431.74276.5865.60285.57184.6247.1943.20193.97145.84287.09151.28207.71211.84195.1013.1012.936969.229223.626995.3713799.41218.76218.7011608.16218.656959.026926.814643.866166.585936.095951.11226.6047.86308.264731.2574331.2672021.3691519.7534593.5342420.426133225302434208.01540.21513.4381.7326.8095.9204.77616.225334266.16300520.8179.19022.4910.4645.8628.5947.2422.76108.9831.20189.15114.6438.4622.193.7683.57114.1431.79277.1465.73286.31184.9947.3143.28194.28145.99287.49151.09207.71211.82195.0513.112.936941.309184.816967.8713744.29218.84218.7611616.09218.016959.086926.874625.436166.885939.245954.25226.6047.86308.2624.4028.1574.6111.954.635.214.064743.2354338.5482024.6101521.4084599.8972417.426133125302434207.66300.21513.4391.7286.8235.9224.7761415202316.227000265.86166420.6949.18722.5510.4445.8728.5547.2422.79108.9731.14189.04114.6738.2922.113.78OpenBenchmarking.org

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRTX 6000 Ada Generation1632486480Min: 41 / Avg: 72.97 / Max: 87

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRTX 6000 Ada Generation50100150200250Min: 21.72 / Avg: 191 / Max: 303.57

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CATIA-064RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation2a50100150200250SE +/- 1.01, N = 3SE +/- 0.04, N = 3SE +/- 0.30, N = 3SE +/- 0.49, N = 3SE +/- 0.04, N = 3237.35236.96236.74143.77141.8483.9083.8783.57

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CREO-03RTX 6000 Ada Generation45NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation60120180240300SE +/- 0.47, N = 3SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3256.46256.36255.97170.79170.61114.14114.14113.93

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: ENERGY-034RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation306090120150SE +/- 0.57, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3156.36156.17155.9162.9062.7931.7931.7431.69

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MAYA-06RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2004006008001000SE +/- 0.66, N = 3SE +/- 0.06, N = 3SE +/- 0.16, N = 3SE +/- 0.16, N = 3SE +/- 0.08, N = 3813.18812.50812.16430.48430.20277.14276.58276.25

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MEDICAL-O34RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation50100150200250SE +/- 0.21, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3212.24211.98211.5597.3897.3665.7365.6065.49

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SNX-0445RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2004006008001000SE +/- 5.90, N = 3SE +/- 0.12, N = 3SE +/- 0.24, N = 3SE +/- 0.16, N = 3SE +/- 0.21, N = 31029.441029.371026.56423.87423.85286.31285.57285.10

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SOLIDWORKS-0745RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation130260390520650SE +/- 0.20, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3579.49579.41578.10308.76308.68184.99184.62184.60

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 2560 x 1440RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation4080120160200SE +/- 0.74, N = 3SE +/- 0.05, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 3SE +/- 0.18, N = 3167.61165.54165.3796.7796.6147.3147.1947.12

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 3840 x 2160RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation4080120160200SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3158.47157.74157.7388.9388.5743.2843.2043.16

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 2560 x 1440RTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2004006008001000SE +/- 0.63, N = 3SE +/- 0.71, N = 3SE +/- 0.41, N = 3SE +/- 0.37, N = 3SE +/- 0.43, N = 3811.19806.74805.17345.92344.93194.28193.97193.51

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 3840 x 2160RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation140280420560700SE +/- 0.45, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3625.76625.14622.92267.58266.78145.99145.84145.58

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 2560 x 14405RTX 6000 Ada Generation4RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2004006008001000SE +/- 2.56, N = 3SE +/- 1.78, N = 3SE +/- 0.37, N = 3SE +/- 0.97, N = 3SE +/- 0.33, N = 3863.24853.68846.86474.47468.30287.49287.09286.61

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 3840 x 2160RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation22a140280420560700SE +/- 6.56, N = 3SE +/- 0.17, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.15, N = 3639.67638.96634.00269.93268.98151.41151.28151.09

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced Read54RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2004006008001000SE +/- 0.06, N = 6SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3866.47866.19865.57326.93326.92207.71207.71207.711. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgGB/s Per Watt, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced WriteRTX 6000 Ada Generation2468106.228

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced Write45RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation22aRTX 2000 Ada Generation2004006008001000SE +/- 0.26, N = 6SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3860.25860.03850.04333.72333.67211.84211.82211.821. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

clpeak

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRTX 6000 Ada Generation2468107.894

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBufferRTX 6000 Ada Generation0.0560.1120.1680.2240.280.249

OpenBenchmarking.orgGBPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBufferRTX 6000 Ada Generation0.05830.11660.17490.23320.29150.259

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory Bandwidth4RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation22a2004006008001000SE +/- 0.03, N = 11SE +/- 0.19, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3816.12815.91815.88307.47307.31195.11195.10195.051. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueWriteBuffer5RTX 4000 Ada GenerationRTX 6000 Ada GenerationNVIDIA RTX 4000 Ada Generation42a2RTX 2000 Ada Generation612182430SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 322.9822.2022.1121.9021.8913.1013.1013.101. (CXX) g++ options: -O3

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Transfer Bandwidth enqueueReadBuffer5RTX 6000 Ada GenerationRTX 4000 Ada Generation4NVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation612182430SE +/- 0.01, N = 3SE +/- 0.26, N = 15SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 323.8622.8722.6922.6621.7512.9312.9312.921. (CXX) g++ options: -O3

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalar45RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation2a10K20K30K40K50KSE +/- 370.02, N = 3SE +/- 2.02, N = 3SE +/- 0.00, N = 3SE +/- 12.38, N = 3SE +/- 14.03, N = 348751.2548735.8148174.3314015.8014015.626969.226944.906941.30

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec445RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation2a14K28K42K56K70KSE +/- 485.03, N = 3SE +/- 38.83, N = 3SE +/- 0.25, N = 3SE +/- 18.82, N = 3SE +/- 19.42, N = 363072.9763065.0562508.5418566.9018522.539223.629185.799184.81

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalar54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation10K20K30K40K50KSE +/- 153.56, N = 3SE +/- 28.84, N = 3SE +/- 0.54, N = 3SE +/- 14.03, N = 3SE +/- 14.04, N = 346558.2846494.2946258.4314070.2114009.366995.376967.876967.84

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec445RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation20K40K60K80K100KSE +/- 692.18, N = 3SE +/- 60.23, N = 3SE +/- 0.07, N = 3SE +/- 27.86, N = 3SE +/- 27.72, N = 394400.5293801.4893562.4727782.7927661.6513799.4113744.2913744.12

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-scalar54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a230060090012001500SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 31539.991539.941538.94440.86440.80218.85218.84218.76

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-vec45RTX 6000 Ada Generation4NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a230060090012001500SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 31535.481533.641533.57439.93439.91218.84218.76218.70

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation2a220K40K60K80K100KSE +/- 359.84, N = 13SE +/- 33.21, N = 3SE +/- 122.39, N = 3SE +/- 50.14, N = 3SE +/- 15.71, N = 383188.7983007.7182190.3625836.2025728.0011704.5711616.0911608.161. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation30060090012001500SE +/- 2.12, N = 6SE +/- 0.63, N = 3SE +/- 0.13, N = 3SE +/- 0.24, N = 3SE +/- 0.28, N = 31534.701532.811526.62439.76439.07218.65218.01217.301. (CXX) g++ options: -O3

vkpeak

OpenBenchmarking.orgGIOPS Per Watt, More Is Bettervkpeak 20230730int16-vec4RTX 6000 Ada Generation4080120160200162.79

clpeak

OpenBenchmarking.orgGIOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRTX 6000 Ada Generation90180270360450432.48

OpenBenchmarking.orgGIOPS Per Watt, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRTX 6000 Ada Generation90180270360450429.34

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-scalarRTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2aRTX 2000 Ada Generation210K20K30K40K50KSE +/- 4.83, N = 3SE +/- 1.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 348838.8348814.9148805.5713996.8513995.606959.086959.066959.02

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int32-vec4RTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation10K20K30K40K50KSE +/- 34.51, N = 3SE +/- 0.20, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.25, N = 348137.8548068.4447938.7013937.8313936.616926.876926.816926.57

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-scalarRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation2a7K14K21K28K35KSE +/- 5.73, N = 3SE +/- 0.96, N = 3SE +/- 0.29, N = 3SE +/- 8.47, N = 3SE +/- 9.35, N = 331027.6630694.2930661.899344.559337.334643.864635.634625.43

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-vec4RTX 6000 Ada Generation45NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a28K16K24K32K40KSE +/- 89.71, N = 3SE +/- 0.88, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.28, N = 338953.7138068.1537707.0912378.8912377.796166.936166.886166.58

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer ComputeRTX 6000 Ada Generation45NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a29K18K27K36K45KSE +/- 462.74, N = 15SE +/- 19.73, N = 3SE +/- 39.53, N = 3SE +/- 0.10, N = 3SE +/- 3.27, N = 340360.7538762.5938304.8913209.7513179.715978.975939.245936.091. (CXX) g++ options: -O3

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit Compute4RTX 6000 Ada Generation5RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation2a29K18K27K36K45KSE +/- 289.03, N = 15SE +/- 7.41, N = 3SE +/- 36.62, N = 3SE +/- 3.51, N = 3SE +/- 0.00, N = 342249.8740310.5839686.7913239.2413222.865987.655954.255951.111. (CXX) g++ options: -O3

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 7793686754RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation30060090012001500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31490.311490.311490.31451.67451.06226.60226.60226.551. (CXX) g++ options: -O3 -lgmp -lOpenCL

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 33222052345RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation70140210280350SE +/- 1.45, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3322.58322.48322.4295.7495.6547.8647.8647.861. (CXX) g++ options: -O3 -lgmp -lOpenCL

OpenBenchmarking.orgIterations / Second Per Watt, More Is BetterGpuOwl 7.5Exponent: 57885161RTX 6000 Ada Generation2468108.762

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 5788516154RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a2400800120016002000SE +/- 1.39, N = 3SE +/- 0.22, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32044.992044.992043.60611.25610.87308.36308.26308.261. (CXX) g++ options: -O3 -lgmp -lOpenCL

IndigoBench

OpenBenchmarking.orgM samples/s Per Watt, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 6000 Ada Generation0.02570.05140.07710.10280.12850.114

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 6000 Ada Generation45NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation1632486480SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 373.9173.8173.7836.9436.8924.4024.39

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: Bedroom4RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation714212835SE +/- 0.029, N = 3SE +/- 0.006, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 329.30229.28229.25812.89012.8648.1578.150

LuxCoreRender

OpenBenchmarking.orgM samples/sec Per Watt, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURTX 6000 Ada Generation0.01510.03020.04530.06040.07550.067

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPU54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation510152025SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 319.0019.0018.957.977.964.614.60MIN: 18.27 / MAX: 19.15MIN: 18.23 / MAX: 19.15MIN: 18.15 / MAX: 19.18MIN: 7.76 / MAX: 8.16MIN: 7.65 / MAX: 8.16MIN: 4.41 / MAX: 4.79MIN: 4.38 / MAX: 4.75

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPU45RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation2a816243240SE +/- 0.09, N = 7SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 335.7735.7335.6517.5917.5711.9611.95MIN: 31.96 / MAX: 37.45MIN: 31.93 / MAX: 37.41MIN: 31.82 / MAX: 39.14MIN: 16.25 / MAX: 18.22MIN: 16.25 / MAX: 18.2MIN: 11.2 / MAX: 12.57MIN: 11.18 / MAX: 12.57

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPU54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation48121620SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 317.8117.7917.788.058.044.634.63MIN: 8.01 / MAX: 20.74MIN: 8.01 / MAX: 20.68MIN: 7.98 / MAX: 20.82MIN: 3.52 / MAX: 9.09MIN: 3.16 / MAX: 9.09MIN: 1.91 / MAX: 5.26MIN: 1.91 / MAX: 5.25

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPU5RTX 6000 Ada Generation4NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation48121620SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 315.8915.8615.557.717.695.215.21MIN: 13.71 / MAX: 21.62MIN: 13.68 / MAX: 21.62MIN: 13.62 / MAX: 21.43MIN: 6.04 / MAX: 9.86MIN: 6 / MAX: 9.96MIN: 4.19 / MAX: 6.52MIN: 4.17 / MAX: 6.54

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPU4RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a48121620SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 317.7817.7217.547.076.904.094.06MIN: 7.14 / MAX: 20.22MIN: 6.81 / MAX: 20.39MIN: 6.24 / MAX: 20.16MIN: 3.54 / MAX: 7.94MIN: 3.19 / MAX: 7.9MIN: 1.57 / MAX: 4.63MIN: 1.82 / MAX: 4.59

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

Test: tConvolve OpenCL

RTX 6000 Ada Generation: The test run did not produce a result.

4: The test run did not produce a result.

5: The test run did not produce a result.

RTX 4000 Ada Generation: The test run did not produce a result.

NVIDIA RTX 4000 Ada Generation: The test run did not produce a result.

RTX 2000 Ada Generation: The test run did not produce a result.

2: The test run did not produce a result.

2a: The test run did not produce a result.

FluidX3D

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.17Test: FP32-FP32RTX 6000 Ada Generation51015202522.69

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP3254RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation11002200330044005500SE +/- 16.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 352685267525220472046133213311331

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16CRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation2K4K6K8K10KSE +/- 3.50, N = 4SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 1.20, N = 3SE +/- 1.45, N = 310585105841054540664064253025302529

OpenBenchmarking.orgMLUPs/s Per Watt, More Is BetterFluidX3D 2.17Test: FP32-FP16SRTX 6000 Ada Generation102030405041.67

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16S5RTX 6000 Ada Generation4RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation2K4K6K8K10KSE +/- 2.00, N = 3SE +/- 0.88, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 310209102081017538053804243424342434

FAHBench

OpenBenchmarking.orgNs Per Day Per Watt, More Is BetterFAHBench 2.3.2RTX 6000 Ada Generation0.67031.34062.01092.68123.35152.979

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.254RTX 6000 Ada GenerationRTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2RTX 2000 Ada Generation2a100200300400500SE +/- 0.94, N = 3SE +/- 0.67, N = 3SE +/- 0.12, N = 3SE +/- 0.29, N = 3SE +/- 0.51, N = 3467.31462.98462.31282.16282.15208.02207.95207.66

Meta Performance Per Watts

OpenBenchmarking.orgPerformance Per Watts, More Is BetterMeta Performance Per WattsPerformance Per WattsRTX 6000 Ada Generation2004006008001000983.03

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP64 ComputeRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation0.33980.67961.01941.35921.699SE +/- 0.000, N = 6SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 31.5101.5021.5020.4330.4330.2150.2150.2151. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP32 ComputeRTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2aRTX 2000 Ada Generation220406080100SE +/- 0.08, N = 6SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 390.8190.1890.1726.9426.9213.4413.4413.441. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT64 Compute5RTX 6000 Ada Generation4RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation22a0.85771.71542.57313.43084.2885SE +/- 0.021, N = 6SE +/- 0.088, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 33.8123.7743.6932.8752.8241.7501.7321.7281. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT32 ComputeRTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation1020304050SE +/- 0.291, N = 6SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.014, N = 3SE +/- 0.014, N = 344.16143.86643.50413.87513.8726.8236.8096.8091. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT16 ComputeRTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 2000 Ada Generation2a2816243240SE +/- 0.127, N = 6SE +/- 0.012, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 332.29731.16830.94611.38811.3805.9245.9225.9201. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT8 ComputeRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation510152025SE +/- 0.068, N = 6SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 321.79220.84620.8449.7079.7064.7764.7764.7761. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

Chaos Group V-RAY

OpenBenchmarking.orgvpaths Per Watt, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA CUDA GPURTX 6000 Ada Generation61218243026.17

OpenBenchmarking.orgvpaths Per Watt, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA RTX GPURTX 6000 Ada Generation81624324033.33

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA CUDA GPURTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a13002600390052006500SE +/- 24.01, N = 3SE +/- 3.67, N = 3SE +/- 3.67, N = 3SE +/- 6.06, N = 35984595959481960194614221415

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA RTX GPURTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a2K4K6K8K10KSE +/- 8.95, N = 3SE +/- 10.33, N = 3SE +/- 3.67, N = 3SE +/- 6.06, N = 38174815981592937292720302023

ArrayFire

Test: Neural Network OpenCL FP16

RTX 6000 Ada Generation: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

4: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

5: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

RTX 4000 Ada Generation: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

NVIDIA RTX 4000 Ada Generation: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

RTX 2000 Ada Generation: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

2: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

2a: The test run did not produce a result. E: ./arrayfire: 7: ./neural_network_opencl: not found

FinanceBench

OpenBenchmarking.orgCelsius, Fewer Is BetterFinanceBench 2016-07-25GPU Temperature MonitorRTX 6000 Ada Generation1122334455Min: 53 / Avg: 54.41 / Max: 57

OpenBenchmarking.orgCelsius, Fewer Is BetterFinanceBench 2016-07-25GPU Temperature MonitorRTX 6000 Ada Generation1122334455Min: 52 / Avg: 53.05 / Max: 55

Rodinia

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 3.1GPU Temperature MonitorRTX 6000 Ada Generation1224364860Min: 60 / Avg: 62 / Max: 63

OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 3.1GPU Temperature MonitorRTX 6000 Ada Generation1326395265Min: 57 / Avg: 59.94 / Max: 65

clpeak

OpenBenchmarking.orgCelsius, Fewer Is Betterclpeak 1.1.2GPU Temperature MonitorRTX 6000 Ada Generation1224364860Min: 56 / Avg: 57.85 / Max: 60

Blender

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1428425670Min: 55 / Avg: 66.18 / Max: 74

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1428425670Min: 61 / Avg: 66.52 / Max: 72

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 61 / Avg: 73.3 / Max: 80

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 65 / Avg: 73 / Max: 79

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 65 / Avg: 72.25 / Max: 78

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 63 / Avg: 70.68 / Max: 77

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1632486480Min: 63 / Avg: 76.81 / Max: 83

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 66 / Avg: 72.85 / Max: 77

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1632486480Min: 64 / Avg: 77.47 / Max: 83

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1632486480Min: 67 / Avg: 77.91 / Max: 84

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1530456075Min: 67 / Avg: 71.29 / Max: 76

OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 4.2GPU Temperature MonitorRTX 6000 Ada Generation1428425670Min: 63 / Avg: 68.6 / Max: 74

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCL54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation48121620SE +/- 0.003806, N = 15SE +/- 0.008950, N = 3SE +/- 0.002186, N = 3SE +/- 0.001155, N = 3SE +/- 0.002646, N = 32.6920002.7020002.7100007.6410007.66000016.22533416.22700016.2280001. (CXX) g++ options: -O3 -march=native -fopenmp

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Monte-Carlo OpenCLRTX 6000 Ada Generation45RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2aRTX 2000 Ada Generation260120180240300SE +/- 0.04, N = 7SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.13, N = 3SE +/- 0.29, N = 399.2499.3299.35199.83200.21265.86265.88266.161. (CXX) g++ options: -O3 -march=native -fopenmp

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL MyocyteRTX 6000 Ada Generation542aRTX 2000 Ada Generation2NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation510152025SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 320.1620.2720.3120.6920.7620.8221.5121.681. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle Filter5RTX 6000 Ada Generation4NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a23691215SE +/- 0.004, N = 11SE +/- 0.008, N = 3SE +/- 0.012, N = 3SE +/- 0.022, N = 3SE +/- 0.021, N = 32.0832.0912.1154.9844.9859.1729.1879.1901. (CXX) g++ options: -O2 -lOpenCL

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA CUDA45RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation2a510152025SE +/- 0.01, N = 7SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.555.555.5713.2513.2622.4922.5222.55

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA OptiXRTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation3691215SE +/- 0.00, N = 8SE +/- 0.06, N = 14SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 103.613.613.627.227.2410.4410.4610.52

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA CUDA54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation1020304050SE +/- 0.02, N = 5SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 310.3410.3810.4426.5226.6345.8645.8745.89

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA OptiX54RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2a2RTX 2000 Ada Generation714212835SE +/- 0.01, N = 6SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 37.827.857.9217.8217.9128.5528.5928.61

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA CUDA45RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation1122334455SE +/- 0.01, N = 5SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 310.6110.6410.6726.1826.1947.2447.2447.28

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA OptiX45RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation22aRTX 2000 Ada Generation612182430SE +/- 0.05, N = 15SE +/- 0.11, N = 7SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.27, N = 35.405.415.4812.9213.0222.7622.7923.02

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA45RTX 6000 Ada GenerationNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a220406080100SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 321.1321.1321.3059.1359.19108.96108.97108.98

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX5RTX 6000 Ada Generation4NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2aRTX 2000 Ada Generation2714212835SE +/- 0.01, N = 5SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 38.828.868.8819.3819.4431.1431.1831.20

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA CUDARTX 6000 Ada Generation54NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation2a24080120160200SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.27, N = 3SE +/- 0.15, N = 345.6545.6645.77108.08108.14188.78189.04189.15

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA OptiX4RTX 6000 Ada Generation5NVIDIA RTX 4000 Ada GenerationRTX 4000 Ada GenerationRTX 2000 Ada Generation22a306090120150SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 332.8332.9132.9472.1072.39114.43114.64114.67

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA CUDA4RTX 6000 Ada Generation5RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2aRTX 2000 Ada Generation2918273645SE +/- 0.01, N = 4SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 311.0711.0811.1222.9122.9938.2938.3538.46

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA OptiXRTX 6000 Ada Generation54RTX 4000 Ada GenerationNVIDIA RTX 4000 Ada Generation2a2RTX 2000 Ada Generation510152025SE +/- 0.01, N = 6SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 37.347.347.3714.5414.5922.1122.1922.35

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgus, Fewer Is Betterclpeak 1.1.2OpenCL Test: Kernel LatencyNVIDIA RTX 4000 Ada GenerationRTX 4000 Ada Generation2RTX 2000 Ada Generation42aRTX 6000 Ada Generation50.85281.70562.55843.41124.264SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 143.703.733.763.773.783.783.793.791. (CXX) g++ options: -O3

106 Results Shown

GPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts
SPECViewPerf 2020:
  2560 x 1440 - CATIA-06
  2560 x 1440 - CREO-03
  2560 x 1440 - ENERGY-03
  2560 x 1440 - MAYA-06
  2560 x 1440 - MEDICAL-O3
  2560 x 1440 - SNX-04
  2560 x 1440 - SOLIDWORKS-07
ParaView:
  Many Spheres - 3000 - 2560 x 1440
  Many Spheres - 3000 - 3840 x 2160
  Wavelet Contour - 3000 - 2560 x 1440
  Wavelet Contour - 3000 - 3840 x 2160
  Wavelet Volume - 3000 - 2560 x 1440
  Wavelet Volume - 3000 - 3840 x 2160
ProjectPhysX OpenCL-Benchmark
ProjectPhysX OpenCL-Benchmark
ProjectPhysX OpenCL-Benchmark
clpeak:
  Global Memory Bandwidth
  Transfer Bandwidth enqueueWriteBuffer
  Transfer Bandwidth enqueueReadBuffer
clpeak:
  Global Memory Bandwidth
  Transfer Bandwidth enqueueWriteBuffer
  Transfer Bandwidth enqueueReadBuffer
vkpeak:
  fp32-scalar
  fp32-vec4
  fp16-scalar
  fp16-vec4
  fp64-scalar
  fp64-vec4
clpeak:
  Single-Precision Compute
  Double-Precision Compute
vkpeak:
  int16-vec4
  Integer Compute
  Integer 24-bit Compute
vkpeak:
  int32-scalar
  int32-vec4
  int16-scalar
  int16-vec4
clpeak:
  Integer Compute
  Integer 24-bit Compute
GpuOwl:
  77936867
  332220523
GpuOwl
GpuOwl
IndigoBench
IndigoBench:
  OpenCL GPU - Supercar
  OpenCL GPU - Bedroom
LuxCoreRender
LuxCoreRender:
  DLSC - GPU
  Rainbow Colors and Prism - GPU
  LuxCore Benchmark - GPU
  Orange Juice - GPU
  Danish Mood - GPU
FluidX3D
FluidX3D:
  FP32-FP32
  FP32-FP16C
FluidX3D
FluidX3D
FAHBench
FAHBench
Meta Performance Per Watts
ProjectPhysX OpenCL-Benchmark:
  FP64 Compute
  FP32 Compute
  INT64 Compute
  INT32 Compute
  INT16 Compute
  INT8 Compute
Chaos Group V-RAY:
  NVIDIA CUDA GPU
  NVIDIA RTX GPU
Chaos Group V-RAY:
  NVIDIA CUDA GPU
  NVIDIA RTX GPU
FinanceBench:
  GPU Temp Monitor:
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
FinanceBench:
  Black-Scholes OpenCL
  Monte-Carlo OpenCL
Rodinia:
  OpenCL Myocyte
  OpenCL Particle Filter
Blender:
  BMW27 - NVIDIA CUDA
  BMW27 - NVIDIA OptiX
  Classroom - NVIDIA CUDA
  Classroom - NVIDIA OptiX
  Fishy Cat - NVIDIA CUDA
  Fishy Cat - NVIDIA OptiX
  Pabellon Barcelona - NVIDIA CUDA
  Pabellon Barcelona - NVIDIA OptiX
  Barbershop - NVIDIA CUDA
  Barbershop - NVIDIA OptiX
  Junkshop - NVIDIA CUDA
  Junkshop - NVIDIA OptiX
clpeak