NVIDIA RTX 2000 / 4000 / 6000 Ada Generation

Benchmarks for a future article by Michael Larabel of NVIDIA RTX 6000 Ada Generation and other cards against Radeon PRO W7000 series on Linux.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2409147-PTS-NVIDIART65
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
RTX 2000 Ada Generation
September 13
  6 Hours, 12 Minutes
RTX 4000 Ada Generation
September 13
  5 Hours, 1 Minute
RTX 6000 Ada Generation
September 11
  4 Hours, 22 Minutes
Radeon PRO W7500
September 14
  9 Hours, 55 Minutes
Radeon PRO W7600
September 14
  6 Hours, 2 Minutes
Radeon PRO W7700
September 13
  5 Hours, 2 Minutes
Radeon PRO W7900
September 14
  5 Hours, 37 Minutes
Invert Behavior (Only Show Selected Data)
  6 Hours, 2 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


NVIDIA RTX 2000 / 4000 / 6000 Ada GenerationProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen ResolutionRTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationRadeon PRO W7700Radeon PRO W7500Radeon PRO W7600Radeon PRO W7900AMD Ryzen 9 9950X 16-Core @ 8.18GHz (16 Cores / 32 Threads)ASUS ROG STRIX X670E-E GAMING WIFI (2308 BIOS)AMD Device 14d82 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32Western Digital WD_BLACK SN850X 2000GBNVIDIA RTX 6000 Ada Generation 48GBNVIDIA AD102 HD AudioDELL U2723QEIntel I225-V + Intel Wi-Fi 6EUbuntu 24.046.8.0-41-generic (x86_64)GNOME Shell 46.0X Server 1.21.1.11NVIDIA 560.35.034.6.0OpenCL 3.0 CUDA 12.6.65GCC 13.2.0ext43840x2160NVIDIA RTX 2000 Ada Generation 16GBNVIDIA Device 22beNVIDIA RTX 4000 Ada Generation 20GBNVIDIA Device 22bcAMD Radeon PRO W7700 15GBAMD Navi 31 HDMI/DPX Server 1.21.1.11 + Wayland4.6 Mesa 24.2.0-devel (LLVM 18.1.7 DRM 3.58)OpenCL 2.1 AMD-APP (3625.0)AMD Radeon PRO W7500 8GBAMD Radeon PRO W7600 8GBAMD Radeon PRO W7900 45GBOpenBenchmarking.orgKernel Details- RTX 6000 Ada Generation: nouveau.modeset=0 - Transparent Huge Pages: madvise- RTX 2000 Ada Generation: nouveau.modeset=0 - Transparent Huge Pages: madvise- RTX 4000 Ada Generation: nouveau.modeset=0 - Transparent Huge Pages: madvise- Radeon PRO W7700: Transparent Huge Pages: madvise- Radeon PRO W7500: Transparent Huge Pages: madvise- Radeon PRO W7600: Transparent Huge Pages: madvise- Radeon PRO W7900: Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xb40401cGraphics Details- RTX 6000 Ada Generation: BAR1 / Visible vRAM Size: 65536 MiB - vBIOS Version: 95.02.3a.00.01- RTX 2000 Ada Generation: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.07.47.00.05- RTX 4000 Ada Generation: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.04.5c.00.0d- Radeon PRO W7700: BAR1 / Visible vRAM Size: 15344 MB- Radeon PRO W7500: BAR1 / Visible vRAM Size: 8176 MB- Radeon PRO W7600: BAR1 / Visible vRAM Size: 8176 MB- Radeon PRO W7900: BAR1 / Visible vRAM Size: 46064 MBOpenCL Details- RTX 6000 Ada Generation: GPU Compute Cores: 18176- RTX 2000 Ada Generation: GPU Compute Cores: 2816- RTX 4000 Ada Generation: GPU Compute Cores: 6144Python Details- Python 3.12.3Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationRadeon PRO W7700Radeon PRO W7500Radeon PRO W7600Radeon PRO W7900Result OverviewPhoronix Test Suite100%299%499%698%897%LuxCoreRendervkpeakclpeakProjectPhysX OpenCL-BenchmarkGpuOwlFluidX3DSPECViewPerf 2020IndigoBenchFinanceBenchParaView

RTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationRadeon PRO W7700Radeon PRO W7500Radeon PRO W7600Radeon PRO W7900Per Watt Result OverviewPhoronix Test Suite100%150%200%249%299%ParaViewLuxCoreRenderProjectPhysX OpenCL-BenchmarkFluidX3DGpuOwlSPECViewPerf 2020vkpeakIndigoBenchclpeakP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.MP.W.G.M

NVIDIA RTX 2000 / 4000 / 6000 Ada Generationvkpeak: fp32-scalarluxcorerender: Danish Mood - GPUvkpeak: fp16-scalaropencl-benchmark: INT8 Computevkpeak: int16-vec4clpeak: Integer 24-bit Computeclpeak: Double-Precision Computevkpeak: fp64-scalaropencl-benchmark: FP64 Computevkpeak: fp64-vec4gpuowl: 332220523vkpeak: int16-scalaropencl-benchmark: INT16 Computegpuowl: 57885161gpuowl: 77936867fluidx3d: FP32-FP16Cfluidx3d: FP32-FP16Sfluidx3d: FP32-FP32financebench: Black-Scholes OpenCLopencl-benchmark: Memory Bandwidth Coalesced Readclpeak: Global Memory Bandwidthopencl-benchmark: Memory Bandwidth Coalesced Writeluxcorerender: Rainbow Colors and Prism - GPUblender: Pabellon Barcelona - NVIDIA CUDAspecviewperf2020: 2560 x 1440 - SNX-04indigobench: OpenCL GPU - Bedroomblender: Fishy Cat - NVIDIA CUDArodinia: OpenCL Particle Filterblender: Classroom - NVIDIA CUDAparaview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 3840 x 2160v-ray: NVIDIA CUDA GPUparaview: Wavelet Contour - 3000 - 2560 x 1440blender: Fishy Cat - NVIDIA OptiXblender: Barbershop - NVIDIA CUDAblender: BMW27 - NVIDIA CUDAv-ray: NVIDIA RTX GPUopencl-benchmark: INT64 Computeindigobench: OpenCL GPU - Supercarblender: Pabellon Barcelona - Radeon HIPblender: Barbershop - Radeon HIPblender: Classroom - Radeon HIPblender: Fishy Cat - Radeon HIPparaview: Many Spheres - 3000 - 3840 x 2160blender: Classroom - NVIDIA OptiXspecviewperf2020: 2560 x 1440 - SOLIDWORKS-07paraview: Many Spheres - 3000 - 2560 x 1440specviewperf2020: 2560 x 1440 - MAYA-06blender: Pabellon Barcelona - NVIDIA OptiXspecviewperf2020: 2560 x 1440 - CATIA-06blender: Barbershop - NVIDIA OptiXblender: Junkshop - NVIDIA CUDAblender: Junkshop - Radeon HIPblender: BMW27 - Radeon HIPfinancebench: Monte-Carlo OpenCLblender: Junkshop - NVIDIA OptiXparaview: Wavelet Volume - 3000 - 2560 x 1440blender: BMW27 - NVIDIA OptiXspecviewperf2020: 2560 x 1440 - CREO-03fahbench: luxcorerender: Orange Juice - GPUvkpeak: fp32-vec4opencl-benchmark: FP32 Computeclpeak: Single-Precision Computespecviewperf2020: 2560 x 1440 - MEDICAL-O3vkpeak: fp16-vec4specviewperf2020: 2560 x 1440 - ENERGY-03luxcorerender: LuxCore Benchmark - GPUluxcorerender: DLSC - GPUparaview: Wavelet Volume - 3000 - 3840 x 2160paraview: Wavelet Volume - 3000 - 2560 x 1440paraview: Wavelet Contour - 3000 - 3840 x 2160paraview: Wavelet Contour - 3000 - 2560 x 1440paraview: Many Spheres - 3000 - 3840 x 2160paraview: Many Spheres - 3000 - 2560 x 1440RTX 6000 Ada GenerationRTX 2000 Ada GenerationRTX 4000 Ada GenerationRadeon PRO W7700Radeon PRO W7500Radeon PRO W7600Radeon PRO W790048174.3317.7246258.4321.79238953.7140310.581534.701538.941.511533.64322.4231027.6632.2972043.601490.31105851020852522.710865.57815.91850.0435.6521.301026.5629.28210.672.09110.44625.76639.675984811.195.4845.655.5781743.77473.905158.477.92578.10167.61813.188.86236.9632.9111.0899.2392877.34853.683.61256.46462.313115.8662508.5490.80583188.79211.9893562.47156.1717.7818.9510234.76213658.9056521.1528453.56715887.66716804.1386944.944.096968.174.7876167.155965.80217.61218.820.215218.8247.854634.075.947308.39226.6425312434133116.228143207.71195.00211.8111.92108.95285.228.16147.269.20645.87145.70150.831419194.0122.74188.7622.5320231.73524.39943.2928.55184.7347.27276.4431.0783.69114.4538.40266.03514322.30285.7310.45113.82207.85005.219186.8113.43911596.8265.6013745.3031.734.634.612413.3354571.6351518.3562021.8214339.6574739.46613990.276.9213985.479.69212381.1113209.06439.32440.870.433440.0195.309328.0111.483612.37449.514039377020477.693326.85307.54333.7817.6259.20423.6612.87826.134.99126.56267.98269.031960346.7712.90107.8513.2629272.82436.91589.0417.86308.7296.58430.1019.40142.9572.3622.96199.92686014.62466.707.19170.61281.64647.7018498.6426.96125785.6897.3527650.7062.748.037.964304.5147467.0902792.6543613.7118926.8069682.32112385.234.1612392.956.11525011.6911408.66462.25484.510.469484.53102.1012377.1410.693735.11545.262865291116326.779403.72391.21308.9314.98360.2312.033320.57340.94400.302.36436.45290.92164.7340.4539.9496.28336.76109.72501.38137.5631.3622.45152.220712564.16208.572.3110997.0714.35312304.2433.2625066.8536.674.904.095455.0819026.5703340.7254171.5749652.08210999.5404887.921.905005.482.7278451.355462.75238.56235.120.237229.9249.084897.024.855315.99226.911682163085612.201146.92139.39149.236.92209.926.370160.57151.97218.561.04718.503181.37348.2684.3979.1249.08162.1755.01229.6567.8463.3845.14327.081569291.53113.671.054345.636.4806072.1616.028701.8714.532.191.922431.5364664.4031673.2592277.7054920.5485514.7727784.732.557839.683.82015006.126985.95332.83332.940.324329.5266.187740.886.731467.65345.112317219411779.724237.72220.39213.469.27263.698.740212.95229.33274.721.46325.080135.28258.0663.0158.4568.77216.5275.07316.2587.8747.9734.40261.737998398.03151.461.296876.758.9197710.9525.2715112.5323.033.012.543669.2946368.4912219.2022862.8776894.4587526.37225619.416.6225838.0812.56648689.6822802.741096.591118.441.0571117.47206.3025473.8922.4591460.581088.535503465027963.382590.24496.42459.2220.69582.3019.348582.89589.04734.054.22654.96947.2191.4422.2821.42147.76530.89168.48770.35203.2218.6713.52100.719712653.03276.102.9322828.3731.03125936.6553.7149210.0967.467.696.289424.65110448.3916074.4267649.73914813.52516890.756OpenBenchmarking.org

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-scalarRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 289.96, N = 4SE +/- 53.71, N = 3SE +/- 42.52, N = 8SE +/- 10.33, N = 3SE +/- 16.39, N = 3SE +/- 12.68, N = 3SE +/- 370.02, N = 325619.417784.734887.9212385.2313990.276944.9448174.33

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.17, N = 36.622.551.904.166.924.0917.72MIN: 2.89 / MAX: 7.51MIN: 1.07 / MAX: 2.84MIN: 0.73 / MAX: 2.13MIN: 1.91 / MAX: 4.68MIN: 3.18 / MAX: 7.9MIN: 1.83 / MAX: 4.63MIN: 6.81 / MAX: 20.39

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-scalarRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 158.36, N = 4SE +/- 11.58, N = 3SE +/- 32.58, N = 8SE +/- 3.75, N = 3SE +/- 9.85, N = 3SE +/- 14.03, N = 3SE +/- 153.56, N = 325838.087839.685005.4812392.9513985.476968.1746258.43

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT8 ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.050, N = 5SE +/- 0.070, N = 3SE +/- 0.035, N = 3SE +/- 0.012, N = 3SE +/- 0.017, N = 4SE +/- 0.011, N = 3SE +/- 0.068, N = 612.5663.8202.7276.1159.6924.78721.7921. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-vec4Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation10K20K30K40K50KSE +/- 18.90, N = 4SE +/- 2.93, N = 3SE +/- 28.67, N = 8SE +/- 5.00, N = 3SE +/- 0.51, N = 3SE +/- 0.02, N = 3SE +/- 89.71, N = 348689.6815006.128451.3525011.6912381.116167.1538953.71

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer 24-bit ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation9K18K27K36K45KSE +/- 99.46, N = 13SE +/- 31.15, N = 13SE +/- 25.04, N = 12SE +/- 27.65, N = 13SE +/- 6.66, N = 13SE +/- 17.53, N = 13SE +/- 289.03, N = 1522802.746985.955462.7511408.6613209.065965.8040310.581. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.78, N = 8SE +/- 0.86, N = 7SE +/- 0.17, N = 7SE +/- 1.47, N = 7SE +/- 0.36, N = 5SE +/- 0.22, N = 5SE +/- 2.12, N = 61096.59332.83238.56462.25439.32217.611534.701. (CXX) g++ options: -O3

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-scalarRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.27, N = 4SE +/- 0.01, N = 3SE +/- 0.81, N = 8SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 31118.44332.94235.12484.51440.87218.821538.94

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP64 ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.33980.67961.01941.35921.699SE +/- 0.002, N = 5SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 4SE +/- 0.000, N = 3SE +/- 0.000, N = 61.0570.3240.2370.4690.4330.2151.5101. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp64-vec4Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.18, N = 4SE +/- 0.04, N = 3SE +/- 0.73, N = 8SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 31117.47329.52229.92484.53440.01218.821533.64

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 332220523Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation70140210280350SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.45, N = 3206.3066.1849.08102.1095.3047.85322.421. (CXX) g++ options: -O3 -lgmp -lOpenCL

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20230730int16-scalarRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation7K14K21K28K35KSE +/- 13.76, N = 4SE +/- 0.62, N = 3SE +/- 15.50, N = 8SE +/- 0.51, N = 3SE +/- 11.44, N = 3SE +/- 8.93, N = 3SE +/- 5.73, N = 325473.897740.884897.0212377.149328.014634.0731027.66

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT16 ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation816243240SE +/- 0.144, N = 5SE +/- 0.221, N = 3SE +/- 0.078, N = 3SE +/- 0.062, N = 3SE +/- 0.004, N = 4SE +/- 0.011, N = 3SE +/- 0.127, N = 622.4596.7314.85510.69311.4835.94732.2971. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

GpuOwl

GpuOwl is a Mersenne primality tester leveraging OpenCL for cross-vendor GPU acceleration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 57885161Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation400800120016002000SE +/- 2.85, N = 3SE +/- 0.07, N = 3SE +/- 0.23, N = 3SE +/- 0.18, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 1.39, N = 31460.58467.65315.99735.11612.37308.392043.601. (CXX) g++ options: -O3 -lgmp -lOpenCL

OpenBenchmarking.orgIterations / Second, More Is BetterGpuOwl 7.5Exponent: 77936867Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation30060090012001500SE +/- 0.40, N = 3SE +/- 1.00, N = 3SE +/- 0.22, N = 3SE +/- 0.60, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 31088.53345.11226.91545.26449.51226.641490.311. (CXX) g++ options: -O3 -lgmp -lOpenCL

FluidX3D

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16CRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 73.37, N = 3SE +/- 3.84, N = 3SE +/- 0.00, N = 3SE +/- 2.73, N = 3SE +/- 2.03, N = 3SE +/- 1.67, N = 3SE +/- 3.50, N = 455032317168228654039253110585

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP16SRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 12.73, N = 3SE +/- 0.88, N = 3SE +/- 2.31, N = 3SE +/- 1.45, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 2.00, N = 346502194163029113770243410208

OpenBenchmarking.orgMLUPs/s, More Is BetterFluidX3D 2.17Test: FP32-FP32Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation11002200330044005500SE +/- 7.22, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 1.15, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 16.33, N = 3279611778561632204713315252

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.017790, N = 15SE +/- 0.061700, N = 15SE +/- 0.061748, N = 14SE +/- 0.015787, N = 15SE +/- 0.041867, N = 15SE +/- 0.001644, N = 14SE +/- 0.003806, N = 153.3820009.72400012.2010006.7790007.69300016.2281432.7100001. (CXX) g++ options: -O3 -march=native -fopenmp

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced ReadRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 2.98, N = 5SE +/- 2.74, N = 3SE +/- 0.32, N = 3SE +/- 10.49, N = 3SE +/- 0.00, N = 4SE +/- 0.00, N = 3SE +/- 0.06, N = 6590.24237.72146.92403.72326.85207.71865.571. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 1.00, N = 9SE +/- 0.18, N = 7SE +/- 0.08, N = 5SE +/- 0.98, N = 8SE +/- 0.05, N = 8SE +/- 0.06, N = 7SE +/- 0.03, N = 11496.42220.39139.39391.21307.54195.00815.911. (CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: Memory Bandwidth Coalesced WriteRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 1.25, N = 5SE +/- 2.65, N = 3SE +/- 1.37, N = 3SE +/- 3.49, N = 3SE +/- 0.02, N = 4SE +/- 0.01, N = 3SE +/- 0.26, N = 6459.22213.46149.23308.93333.78211.81850.041. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation816243240SE +/- 0.08, N = 5SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 5SE +/- 0.04, N = 5SE +/- 0.02, N = 4SE +/- 0.09, N = 720.699.276.9214.9817.6211.9235.65MIN: 19.26 / MAX: 21.65MIN: 8.58 / MAX: 9.52MIN: 6.36 / MAX: 7.09MIN: 13.88 / MAX: 15.48MIN: 16.21 / MAX: 18.4MIN: 10.89 / MAX: 12.56MIN: 31.82 / MAX: 39.14

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 359.20108.9521.30

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SNX-04Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 0.42, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.27, N = 3SE +/- 0.62, N = 3SE +/- 5.90, N = 3582.30263.69209.92360.23423.66285.221026.56

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.007, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.009, N = 3SE +/- 0.002, N = 3SE +/- 0.029, N = 319.3488.7406.37012.03312.8788.16129.282

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1122334455SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 526.1347.2610.67

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3691215SE +/- 0.005, N = 7SE +/- 0.009, N = 5SE +/- 0.004, N = 114.9919.2062.0911. (CXX) g++ options: -O2 -lOpenCL

Test: OpenCL Particle Filter

Radeon PRO W7700: The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Radeon PRO W7500: The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Radeon PRO W7600: The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Radeon PRO W7900: The test run did not produce a result. E: ERROR: clEnqueueNDRangeKernel(kernel_likelihood)=>-54 failed

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 526.5645.8710.44

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 3840 x 2160Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation140280420560700SE +/- 0.90, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.57, N = 3SE +/- 0.16, N = 3SE +/- 0.11, N = 3SE +/- 0.45, N = 3582.89212.95160.57320.57267.98145.70625.76

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 3840 x 2160Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation140280420560700SE +/- 2.09, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.42, N = 3SE +/- 0.11, N = 3SE +/- 0.22, N = 3SE +/- 6.56, N = 3589.04229.33151.97340.94269.03150.83639.67

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA CUDA GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation13002600390052006500SE +/- 0.00, N = 3SE +/- 3.67, N = 3SE +/- 24.01, N = 3196014195984

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Contour - Frames: 3000 - Resolution: 2560 x 1440Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 1.91, N = 3SE +/- 0.38, N = 3SE +/- 0.14, N = 3SE +/- 0.87, N = 3SE +/- 0.38, N = 3SE +/- 0.41, N = 3SE +/- 0.63, N = 3734.05274.72218.56400.30346.77194.01811.19

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.01, N = 4SE +/- 0.01, N = 3SE +/- 0.05, N = 1512.9022.745.48

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.06, N = 3107.85188.7645.65

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.02, N = 4SE +/- 0.02, N = 3SE +/- 0.01, N = 713.2622.535.57

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 6.0Mode: NVIDIA RTX GPURTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2K4K6K8K10KSE +/- 13.86, N = 3SE +/- 0.00, N = 3SE +/- 8.95, N = 3292720238174

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTIOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: INT64 ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.95091.90182.85273.80364.7545SE +/- 0.016, N = 5SE +/- 0.004, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.060, N = 4SE +/- 0.009, N = 3SE +/- 0.021, N = 64.2261.4631.0472.3642.8241.7353.7741. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1632486480SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 354.9725.0818.5036.4536.9224.4073.91

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W77004080120160200SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.25, N = 3SE +/- 0.04, N = 347.21135.28181.3790.92

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W770080160240320400SE +/- 0.06, N = 3SE +/- 0.71, N = 3SE +/- 0.08, N = 3SE +/- 0.16, N = 391.44258.06348.26164.73

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W770020406080100SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 322.2863.0184.3940.45

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Fishy Cat - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W770020406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 321.4258.4579.1239.94

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 3840 x 2160Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3147.7668.7749.0896.2889.0443.29158.47

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Classroom - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 617.8628.557.92

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: SOLIDWORKS-07Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation120240360480600SE +/- 0.42, N = 3SE +/- 0.09, N = 3SE +/- 0.25, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 3530.89216.52162.17336.76308.72184.73578.10

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Many Spheres - Frames: 3000 - Resolution: 2560 x 1440Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation4080120160200SE +/- 0.41, N = 3SE +/- 0.25, N = 3SE +/- 0.05, N = 3SE +/- 0.18, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.74, N = 3168.4875.0755.01109.7296.5847.27167.61

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MAYA-06Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 4.21, N = 3SE +/- 0.09, N = 3SE +/- 0.58, N = 3SE +/- 1.26, N = 3SE +/- 0.09, N = 3SE +/- 0.26, N = 3SE +/- 0.66, N = 3770.35316.25229.65501.38430.10276.44813.18

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation714212835SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 519.4031.078.86

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CATIA-06Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250SE +/- 1.26, N = 3SE +/- 0.37, N = 3SE +/- 0.57, N = 3SE +/- 0.33, N = 3SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 1.01, N = 3203.2287.8767.84137.56142.9583.69236.96

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Barbershop - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation306090120150SE +/- 0.15, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 372.36114.4532.91

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA CUDARTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation918273645SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 422.9638.4011.08

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W77001428425670SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 318.6747.9763.3831.36

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: Radeon HIPRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W77001020304050SE +/- 0.02, N = 4SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 313.5234.4045.1422.45

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Monte-Carlo OpenCLRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation70140210280350SE +/- 0.25, N = 7SE +/- 0.99, N = 7SE +/- 0.35, N = 7SE +/- 0.22, N = 7SE +/- 0.04, N = 7SE +/- 0.11, N = 7SE +/- 0.04, N = 7100.72261.74327.08152.22199.93266.0499.241. (CXX) g++ options: -O3 -march=native -fopenmp

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: Junkshop - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.02, N = 4SE +/- 0.04, N = 3SE +/- 0.01, N = 614.6222.307.34

ParaView

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.13Test: Wavelet Volume - Frames: 3000 - Resolution: 2560 x 1440Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation2004006008001000SE +/- 2.07, N = 3SE +/- 1.21, N = 3SE +/- 0.21, N = 3SE +/- 1.43, N = 3SE +/- 0.99, N = 3SE +/- 0.37, N = 3SE +/- 2.56, N = 3653.03398.03291.53564.16466.70285.73853.68

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.2Blend File: BMW27 - Compute: NVIDIA OptiXRTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation3691215SE +/- 0.01, N = 6SE +/- 0.01, N = 5SE +/- 0.00, N = 87.1910.453.61

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: CREO-03Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation60120180240300SE +/- 0.53, N = 3SE +/- 0.08, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.47, N = 3276.10151.46113.67208.57170.61113.82256.46

FAHBench

FAHBench is a Folding@Home benchmark on the GPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation100200300400500SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.94, N = 3281.65207.85462.31

Radeon PRO W7700: The test run did not produce a result.

Radeon PRO W7500: The test run did not produce a result.

Radeon PRO W7600: The test run did not produce a result.

Radeon PRO W7900: The test run did not produce a result.

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 32.931.291.052.317.705.2115.86MIN: 2.69 / MAX: 3.71MIN: 2.09 / MAX: 3.15MIN: 6.01 / MAX: 9.92MIN: 4.19 / MAX: 6.51MIN: 13.68 / MAX: 21.62

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp32-vec4Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation13K26K39K52K65KSE +/- 342.60, N = 4SE +/- 16.67, N = 3SE +/- 32.25, N = 8SE +/- 6.54, N = 3SE +/- 31.57, N = 3SE +/- 18.50, N = 3SE +/- 485.03, N = 322828.376876.754345.6310997.0718498.649186.8162508.54

ProjectPhysX OpenCL-Benchmark

ProjectPhysX OpenCL-Benchmark provides various OpenCL compute and memory bandwidth micro-benchmarks Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTFLOPs/s, More Is BetterProjectPhysX OpenCL-Benchmark 1.2Operation: FP32 ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20406080100SE +/- 0.188, N = 5SE +/- 0.135, N = 3SE +/- 0.102, N = 3SE +/- 0.040, N = 3SE +/- 0.002, N = 4SE +/- 0.000, N = 3SE +/- 0.081, N = 631.0318.9196.48014.35326.96113.43990.8051. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision ComputeRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20K40K60K80K100KSE +/- 73.46, N = 12SE +/- 14.09, N = 12SE +/- 9.29, N = 12SE +/- 27.07, N = 12SE +/- 26.99, N = 13SE +/- 14.33, N = 15SE +/- 359.84, N = 1325936.657710.956072.1612304.2425785.6811596.8283188.791. (CXX) g++ options: -O3

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: MEDICAL-O3Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.21, N = 353.7125.2716.0233.2697.3565.60211.98

vkpeak

Vkpeak is a Vulkan compute benchmark inspired by OpenCL's clpeak. Vkpeak provides Vulkan compute performance measurements for FP16 / FP32 / FP64 / INT16 / INT32 scalar and vec4 performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20230730fp16-vec4Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation20K40K60K80K100KSE +/- 135.83, N = 4SE +/- 2.46, N = 3SE +/- 53.19, N = 8SE +/- 8.54, N = 3SE +/- 16.63, N = 3SE +/- 27.78, N = 3SE +/- 692.18, N = 349210.0915112.538701.8725066.8527650.7013745.3093562.47

SPECViewPerf 2020

This test runs SPECViewPerf 2020 if available on your system. SPECViewPerf is made up of real-world OpenGL workstation tests such as CATIA and SolidWorks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgComposite Score, More Is BetterSPECViewPerf 2020 3.0Resolution: 2560 x 1440 - Viewset: ENERGY-03Radeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation306090120150SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.57, N = 367.4623.0314.5336.6762.7431.73156.17

GPU Temperature Monitor

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation1632486480Min: 27 / Avg: 70.28 / Max: 80Min: 27 / Avg: 76.2 / Max: 86Min: 30 / Avg: 74.13 / Max: 83Min: 31 / Avg: 68.93 / Max: 83Min: 29 / Avg: 63.59 / Max: 80Min: 36 / Avg: 66.43 / Max: 79Min: 41 / Avg: 72.97 / Max: 87

GPU Power Consumption Monitor

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation50100150200250Min: 8 / Avg: 189.77 / Max: 249Min: 1 / Avg: 79.44 / Max: 101Min: 1 / Avg: 43.46 / Max: 54Min: 5 / Avg: 111.28 / Max: 154Min: 6.46 / Avg: 78.35 / Max: 130.8Min: 7.09 / Avg: 47.88 / Max: 70Min: 21.72 / Avg: 191 / Max: 303.57

LuxCoreRender

MinAvgMaxRadeon PRO W790053.073.077.0Radeon PRO W760064.080.183.0Radeon PRO W750060.075.279.0Radeon PRO W770064.077.483.0RTX 4000 Ada Generation53.067.372.0RTX 2000 Ada Generation56.069.173.0RTX 6000 Ada Generation53.068.183.0OpenBenchmarking.orgCelsius, Fewer Is BetterLuxCoreRender 2.6GPU Temperature Monitor20406080100

MinAvgMaxRadeon PRO W790012.0209.3245.0Radeon PRO W76001.084.7100.0Radeon PRO W75001.045.653.0Radeon PRO W77008.0130.9154.0RTX 4000 Ada Generation14.393.3105.9RTX 2000 Ada Generation7.754.862.5RTX 6000 Ada Generation27.1169.0299.9OpenBenchmarking.orgWatts, Fewer Is BetterLuxCoreRender 2.6GPU Power Consumption Monitor80160240320400

OpenBenchmarking.orgM samples/sec Per Watt, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.02360.04720.07080.09440.1180.0370.0360.0480.0370.0860.0840.105

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation48121620SE +/- 0.18, N = 12SE +/- 0.07, N = 12SE +/- 0.05, N = 12SE +/- 0.12, N = 12SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 37.693.012.194.908.034.6317.78MIN: 1.31 / MAX: 8.88MIN: 0.48 / MAX: 3.47MIN: 0.35 / MAX: 2.52MIN: 0.86 / MAX: 5.64MIN: 3.12 / MAX: 9.08MIN: 1.89 / MAX: 5.26MIN: 7.98 / MAX: 20.82

MinAvgMaxRadeon PRO W790046.074.077.0Radeon PRO W760040.078.984.0Radeon PRO W750042.074.680.0Radeon PRO W770045.074.881.0RTX 4000 Ada Generation54.070.174.0RTX 2000 Ada Generation56.071.776.0RTX 6000 Ada Generation63.081.485.0OpenBenchmarking.orgCelsius, Fewer Is BetterLuxCoreRender 2.6GPU Temperature Monitor20406080100

MinAvgMaxRadeon PRO W790014.0222.0249.0Radeon PRO W76001.090.7100.0Radeon PRO W75001.048.654.0Radeon PRO W77007.0138.0154.0RTX 4000 Ada Generation14.6102.3111.7RTX 2000 Ada Generation7.760.765.8RTX 6000 Ada Generation32.7279.1300.3OpenBenchmarking.orgWatts, Fewer Is BetterLuxCoreRender 2.6GPU Power Consumption Monitor80160240320400

OpenBenchmarking.orgM samples/sec Per Watt, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation0.01760.03520.05280.07040.0880.0280.0280.0390.0300.0780.0760.068

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: GPURadeon PRO W7900Radeon PRO W7600Radeon PRO W7500Radeon PRO W7700RTX 4000 Ada GenerationRTX 2000 Ada GenerationRTX 6000 Ada Generation510152025SE +/- 0.17, N = 12SE +/- 0.07, N = 12SE +/- 0.05, N = 12SE +/- 0.11, N = 12SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 36.282.541.924.097.964.6118.95MIN: 1.27 / MAX: 6.75MIN: 0.52 / MAX: 2.75MIN: 0.84 / MAX: 4.42MIN: 7.71 / MAX: 8.17MIN: 4.41 / MAX: 4.76MIN: 18.15 / MAX: 19.18

77 Results Shown

vkpeak
LuxCoreRender
vkpeak
ProjectPhysX OpenCL-Benchmark
vkpeak
clpeak:
  Integer 24-bit Compute
  Double-Precision Compute
vkpeak
ProjectPhysX OpenCL-Benchmark
vkpeak
GpuOwl
vkpeak
ProjectPhysX OpenCL-Benchmark
GpuOwl:
  57885161
  77936867
FluidX3D:
  FP32-FP16C
  FP32-FP16S
  FP32-FP32
FinanceBench
ProjectPhysX OpenCL-Benchmark
clpeak
ProjectPhysX OpenCL-Benchmark
LuxCoreRender
Blender
SPECViewPerf 2020
IndigoBench
Blender
Rodinia
Blender
ParaView:
  Wavelet Contour - 3000 - 3840 x 2160
  Wavelet Volume - 3000 - 3840 x 2160
Chaos Group V-RAY
ParaView
Blender:
  Fishy Cat - NVIDIA OptiX
  Barbershop - NVIDIA CUDA
  BMW27 - NVIDIA CUDA
Chaos Group V-RAY
ProjectPhysX OpenCL-Benchmark
IndigoBench
Blender:
  Pabellon Barcelona - Radeon HIP
  Barbershop - Radeon HIP
  Classroom - Radeon HIP
  Fishy Cat - Radeon HIP
ParaView
Blender
SPECViewPerf 2020
ParaView
SPECViewPerf 2020
Blender
SPECViewPerf 2020
Blender:
  Barbershop - NVIDIA OptiX
  Junkshop - NVIDIA CUDA
  Junkshop - Radeon HIP
  BMW27 - Radeon HIP
FinanceBench
Blender
ParaView
Blender
SPECViewPerf 2020
FAHBench
LuxCoreRender
vkpeak
ProjectPhysX OpenCL-Benchmark
clpeak
SPECViewPerf 2020
vkpeak
SPECViewPerf 2020
GPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts
  GPU Temp Monitor:
    Celsius
  GPU Power Consumption Monitor:
    Watts
  LuxCore Benchmark - GPU:
    M samples/sec Per Watt
LuxCoreRender
LuxCoreRender:
  GPU Temp Monitor
  GPU Power Consumption Monitor
  DLSC - GPU
LuxCoreRender