NVIDIA RTX 5080 / RTX 5090 Compute Benchmarks
Benchmarks for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2501297-PTS-NVIDIART00&sor&grs.
NCNN
Target: Vulkan GPU - Model: mobilenet-v2
NCNN
Target: Vulkan GPU - Model: FastestDet
NCNN
Target: Vulkan GPU - Model: mnasnet
NCNN
Target: Vulkan GPU - Model: regnety_400m
NCNN
Target: Vulkan GPU - Model: squeezenet_ssd
NCNN
Target: Vulkan GPU - Model: googlenet
NCNN
Target: Vulkan GPU - Model: blazeface
NCNN
Target: Vulkan GPU - Model: efficientnet-b0
NCNN
Target: Vulkan GPU - Model: shufflenet-v2
Hashcat
Benchmark: TrueCrypt RIPEMD160 + XTS
NCNN
Target: Vulkan GPU - Model: alexnet
Hashcat
Benchmark: SHA1
Hashcat
Benchmark: SHA-512
ProjectPhysX OpenCL-Benchmark
Operation: INT8 Compute
ProjectPhysX OpenCL-Benchmark
Operation: FP16 Compute
clpeak
OpenCL Test: Integer Compute
clpeak
OpenCL Test: Integer 24-bit Compute
clpeak
OpenCL Test: Double-Precision Compute
ProjectPhysX OpenCL-Benchmark
Operation: FP64 Compute
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: MD5 Hash
VkResample
Upscale: 2x - Precision: Double
clpeak
OpenCL Test: Single-Precision Compute
ProjectPhysX OpenCL-Benchmark
Operation: INT32 Compute
ProjectPhysX OpenCL-Benchmark
Operation: FP32 Compute
NCNN
Target: Vulkan GPU - Model: resnet50
ProjectPhysX OpenCL-Benchmark
Operation: INT16 Compute
Hashcat
Benchmark: 7-Zip
Chaos Group V-RAY
Mode: NVIDIA CUDA GPU
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: GEMM SGEMM_N
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: S3D
FluidX3D
Test: FP32-FP16C
ProjectPhysX OpenCL-Benchmark
Operation: Memory Bandwidth Coalesced Write
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: FFT SP
FluidX3D
Test: FP32-FP32
clpeak
OpenCL Test: Global Memory Bandwidth
VkResample
Upscale: 2x - Precision: Single
Blender
Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA
FluidX3D
Test: FP32-FP16S
ProjectPhysX OpenCL-Benchmark
Operation: Memory Bandwidth Coalesced Read
NCNN
Target: Vulkan GPU - Model: vision_transformer
Blender
Blend File: Classroom - Compute: NVIDIA CUDA
NCNN
Target: Vulkan GPU - Model: mobilenetv2-yolov3
NCNN
Target: Vulkan GPU - Model: mobilenet
Blender
Blend File: Barbershop - Compute: NVIDIA CUDA
IndigoBench
Acceleration: OpenCL GPU - Scene: Bedroom
Blender
Blend File: Barbershop - Compute: NVIDIA OptiX
Blender
Blend File: Fishy Cat - Compute: NVIDIA CUDA
Chaos Group V-RAY
Mode: NVIDIA RTX GPU
NCNN
Target: Vulkan GPU - Model: resnet18
Blender
Blend File: Fishy Cat - Compute: NVIDIA OptiX
Blender
Blend File: BMW27 - Compute: NVIDIA CUDA
Blender
Blend File: Classroom - Compute: NVIDIA OptiX
Blender
Blend File: Junkshop - Compute: NVIDIA CUDA
Blender
Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX
NCNN
Target: Vulkan GPU - Model: mobilenet-v3
Blender
Blend File: BMW27 - Compute: NVIDIA OptiX
Blender
Blend File: Junkshop - Compute: NVIDIA OptiX
RealSR-NCNN
Scale: 4x - TAA: Yes
IndigoBench
Acceleration: OpenCL GPU - Scene: Supercar
Waifu2x-NCNN Vulkan
Scale: 2x - Denoise: 3 - TAA: Yes
NCNN
Target: Vulkan GPU - Model: vgg16
NCNN
Target: Vulkan GPU - Model: yolov4-tiny
RealSR-NCNN
Scale: 4x - TAA: No
Hashcat
Benchmark: MD5
clpeak
OpenCL Test: Kernel Latency
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Texture Read Bandwidth
ProjectPhysX OpenCL-Benchmark
Operation: INT64 Compute
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Reduction
clpeak
OpenCL Test: Transfer Bandwidth enqueueReadBuffer
clpeak
OpenCL Test: Transfer Bandwidth enqueueWriteBuffer
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Triad
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Bus Speed Readback
NAMD CUDA
Input: ATPase with 327,506 Atoms
Llama.cpp
Backend: NVIDIA CUDA - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512
Llama.cpp
Backend: NVIDIA CUDA - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048
NAMD CUDA
Input: STMV with 1,066,628 Atoms
Llama.cpp
Backend: NVIDIA CUDA - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048
Llama.cpp
Backend: NVIDIA CUDA - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024
Llama.cpp
Backend: NVIDIA CUDA - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128
Llama.cpp
Backend: NVIDIA CUDA - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128
Llama.cpp
Backend: NVIDIA CUDA - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512
Llama.cpp
Backend: NVIDIA CUDA - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024
Llama.cpp
Backend: NVIDIA CUDA - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048
Llama.cpp
Backend: NVIDIA CUDA - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128
Llama.cpp
Backend: NVIDIA CUDA - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024
Llama.cpp
Backend: NVIDIA CUDA - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512
SHOC Scalable HeterOgeneous Computing
Target: OpenCL - Benchmark: Bus Speed Download
NAMD CUDA
ATPase Simulation - 327,506 Atoms
Phoronix Test Suite v10.8.5