NVIDIA RTX 5080 and RTX 5090 compute benchmarks

Benchmarks for a future article.

RTX 5090

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: ASUS ROG MAXIMUS Z890 HERO (1203 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB, Graphics: ASUS NVIDIA GeForce RTX 5090 32GB, Audio: Intel Device 7f50, Monitor: ASUS VP28U, Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.11.0-13-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server 1.21.1.13, Display Driver: NVIDIA 570.86.10, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.8.51 + OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 3840x2160

Kernel Notes: nouveau.modeset=0 - Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x114 - Thermald 2.5.8
Graphics Notes: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 98.02.2e.00.03
OpenCL Notes: GPU Compute Cores: 21760
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

RTX 5080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x114 - Thermald 2.5.8
Graphics Notes: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 98.03.3b.00.01
OpenCL Notes: GPU Compute Cores: 10752
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
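
The OpenCL Notes and Graphics Notes above give each card's reported compute-core count and visible vRAM, so the relative size of the two GPUs can be worked out directly. The following is a minimal Python sketch that uses only those four reported figures; the dictionary layout and the `ratio` helper are illustrative names, not part of the benchmark tooling, and the ratios describe reported hardware resources only, not expected benchmark scaling.

```python
# Figures copied from the OpenCL Notes and Graphics Notes above.
specs = {
    "RTX 5090": {"compute_cores": 21760, "visible_vram_mib": 32768},
    "RTX 5080": {"compute_cores": 10752, "visible_vram_mib": 16384},
}


def ratio(field: str) -> float:
    """Return the RTX 5090 : RTX 5080 ratio for one reported field."""
    return specs["RTX 5090"][field] / specs["RTX 5080"][field]


core_ratio = ratio("compute_cores")
vram_ratio = ratio("visible_vram_mib")

print(f"compute cores: {core_ratio:.2f}x")  # prints "compute cores: 2.02x"
print(f"visible vRAM:  {vram_ratio:.2f}x")  # prints "visible vRAM:  2.00x"
```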

NVIDIA RTX 5080

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores), Motherboard: ASUS ROG MAXIMUS Z890 HERO (1203 BIOS), Chipset: Intel Device ae7f, Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1, Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB, Graphics: ASUS NVIDIA GeForce RTX 5080 16GB, Audio: Intel Device 7f50, Monitor: ASUS VP28U, Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7

OS: Ubuntu 24.10, Kernel: 6.11.0-14-generic (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server 1.21.1.13, Display Driver: NVIDIA 570.86.10, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.8.51, Compiler: GCC 14.2.0 + CUDA 12.8, File-System: ext4, Screen Resolution: 3840x2160
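
The software line for the NVIDIA RTX 5080 run is nearly identical to the one listed for the RTX 5090 run, so the differences are easy to miss by eye. As a rough sketch, the two lines can be split into fields and compared in Python; `parse_fields` is a hypothetical helper written for this comma-separated "Key: value" layout and is not part of the Phoronix Test Suite.

```python
def parse_fields(line: str) -> dict[str, str]:
    """Split a 'Key: value, Key: value' system line into a dictionary."""
    fields = {}
    for item in line.split(", "):
        key, _, value = item.partition(": ")
        fields[key] = value
    return fields


# Software lines copied verbatim from the two system descriptions above.
rtx5090_sw = (
    "OS: Ubuntu 24.10, Kernel: 6.11.0-13-generic (x86_64), "
    "Desktop: GNOME Shell 47.0, Display Server: X Server 1.21.1.13, "
    "Display Driver: NVIDIA 570.86.10, OpenGL: 4.6.0, "
    "OpenCL: OpenCL 3.0 CUDA 12.8.51 + OpenCL 3.0, Compiler: GCC 14.2.0, "
    "File-System: ext4, Screen Resolution: 3840x2160"
)
nvidia_rtx5080_sw = (
    "OS: Ubuntu 24.10, Kernel: 6.11.0-14-generic (x86_64), "
    "Desktop: GNOME Shell 47.0, Display Server: X Server 1.21.1.13, "
    "Display Driver: NVIDIA 570.86.10, OpenGL: 4.6.0, "
    "OpenCL: OpenCL 3.0 CUDA 12.8.51, Compiler: GCC 14.2.0 + CUDA 12.8, "
    "File-System: ext4, Screen Resolution: 3840x2160"
)

a = parse_fields(rtx5090_sw)
b = parse_fields(nvidia_rtx5080_sw)

# Keep only the fields whose values differ between the two runs.
changed = {k: (a[k], b.get(k)) for k in a if a[k] != b.get(k)}

for key, (old, new) in changed.items():
    print(f"{key}: {old!r} -> {new!r}")
```

With the lines as listed, the fields that differ are Kernel, OpenCL, and Compiler; the display driver is the same in both runs.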

NCNN

Llama.cpp

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

NAMD CUDA

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. This version of the NAMD test profile uses CUDA GPU acceleration. Learn more via the OpenBenchmarking.org test page.

ATPase Simulation - 327,506 Atoms

RTX 5080: The test run did not produce a result. E: FATAL ERROR: No simulation config file specified on command line.

NVIDIA RTX 5080: The test run did not produce a result. E: FATAL ERROR: No simulation config file specified on command line.

Blender

VkResample

VkResample is a Vulkan-based image upscaling library based on VkFFT. The sample input file is upscaling a 4K image to 8K using Vulkan-based GPU acceleration. Learn more via the OpenBenchmarking.org test page.

FluidX3D

clpeak

Clpeak is designed to test the peak capabilities of OpenCL devices. Learn more via the OpenBenchmarking.org test page.

RealSR-NCNN

RealSR-NCNN is an NCNN neural network implementation of the RealSR project and accelerated using the Vulkan API. RealSR is the Real-World Super Resolution via Kernel Estimation and Noise Injection. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image by a scale of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

Hashcat

Hashcat is an open-source, advanced password recovery tool supporting GPU acceleration with OpenCL, NVIDIA CUDA, and Radeon ROCm. Learn more via the OpenBenchmarking.org test page.

ProjectPhysX OpenCL-Benchmark

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

88 Results Shown

NCNN:
Vulkan GPU - FastestDet
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - yolov4-tiny
Vulkan GPU - mobilenetv2-yolov3
Vulkan GPU - resnet50
Vulkan GPU - alexnet
Vulkan GPU - resnet18
Vulkan GPU - vgg16
Vulkan GPU - googlenet
Vulkan GPU - blazeface
Vulkan GPU - efficientnet-b0
Vulkan GPU - mnasnet
Vulkan GPU - shufflenet-v2
Vulkan GPU - mobilenet-v3
Vulkan GPU - mobilenet-v2
Vulkan GPU - mobilenet
Llama.cpp:
NVIDIA CUDA - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128
NVIDIA CUDA - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128
NVIDIA CUDA - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128
Chaos Group V-RAY:
NVIDIA CUDA GPU
NVIDIA RTX GPU
IndigoBench:
OpenCL GPU - Bedroom
OpenCL GPU - Supercar
NAMD CUDA
Blender
Llama.cpp
VkResample
Blender
FluidX3D
Blender
Llama.cpp
clpeak:
Transfer Bandwidth enqueueReadBuffer
Transfer Bandwidth enqueueWriteBuffer
Llama.cpp:
NVIDIA CUDA - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048
NVIDIA CUDA - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024
RealSR-NCNN
NAMD CUDA
FluidX3D:
FP32-FP16S
FP32-FP16C
Blender
Llama.cpp
Blender:
Junkshop - NVIDIA CUDA
Classroom - NVIDIA CUDA
Llama.cpp
Hashcat
Llama.cpp
Blender
Hashcat:
SHA-512
SHA1
Blender
ProjectPhysX OpenCL-Benchmark:
Memory Bandwidth Coalesced Write
Memory Bandwidth Coalesced Read
INT8 Compute
INT16 Compute
INT32 Compute
INT64 Compute
FP16 Compute
FP32 Compute
FP64 Compute
VkResample
clpeak
Blender
Hashcat
Blender:
BMW27 - NVIDIA CUDA
Fishy Cat - NVIDIA OptiX
Llama.cpp:
NVIDIA CUDA - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 512
NVIDIA CUDA - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512
SHOC Scalable HeterOgeneous Computing
Hashcat
NAMD CUDA
RealSR-NCNN
Blender
Waifu2x-NCNN Vulkan
clpeak
SHOC Scalable HeterOgeneous Computing:
OpenCL - FFT SP
OpenCL - GEMM SGEMM_N
OpenCL - Triad
OpenCL - Bus Speed Readback
OpenCL - Bus Speed Download
OpenCL - S3D
OpenCL - Reduction
clpeak:
Single-Precision Compute
Integer Compute
Integer 24-bit Compute
SHOC Scalable HeterOgeneous Computing
clpeak

RTX 5090

Testing initiated at 24 January 2025 16:55 by user pts.

RTX 5080

Testing initiated at 28 January 2025 18:27 by user pts.

NVIDIA RTX 5080

Testing initiated at 28 January 2025 19:13 by user pts.
