nvidia rtx 5090 compute benchmarks

Tests for a future article. Intel Core Ultra 9 285K testing with an ASUS ROG MAXIMUS Z890 HERO (1203 BIOS) and ASUS NVIDIA GeForce RTX 5090 32GB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2501242-PTS-NVIDIART00&grs&sor.

System configuration

Runs: rtx 5090, NVIDIA 5090, GeForce RTX 5090 (one configuration listed for all three)

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores)
Motherboard: ASUS ROG MAXIMUS Z890 HERO (1203 BIOS)
Chipset: Intel Device ae7f
Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
Graphics: ASUS NVIDIA GeForce RTX 5090 32GB
Audio: Intel Device 7f50
Monitor: ASUS VP28U
Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7
OS: Ubuntu 24.10
Kernel: 6.11.0-13-generic (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server 1.21.1.13
Display Driver: NVIDIA 570.86.10
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.8.51 + OpenCL 3.0
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- nouveau.modeset=0
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate performance (EPP: default)
- CPU Microcode: 0x114
- Thermald 2.5.8

Graphics Details
- BAR1 / Visible vRAM Size: 32768 MiB
- vBIOS Version: 98.02.2e.00.03

OpenCL Details
- GPU Compute Cores: 21760

Security Details
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Results overview

Overview table with one row per test and one column per run (rtx 5090, NVIDIA 5090, GeForce RTX 5090). Every value in it is repeated in the per-test results that follow. The overview also lists waifu2x-ncnn: 2x - 3 - No, which has no value in any run.

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.77 (MIN: 3.69 / MAX: 54.04)
GeForce RTX 5090: 6.55 (MIN: 3.69 / MAX: 59.21)
rtx 5090: 9.80 (MIN: 3.75 / MAX: 63.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
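
The system table lists a single configuration for all three runs, so the gap between the best and worst mnasnet averages reads as run-to-run variance rather than a hardware or driver difference. A minimal Python sketch of that comparison, using the averages from the result above; the helper name `relative_spread` is illustrative and not part of the Phoronix Test Suite:

```python
def relative_spread(results):
    """Return (max - min) / min for a mapping of run name -> average."""
    values = list(results.values())
    return (max(values) - min(values)) / min(values)


# Average latency in ms (fewer is better), copied from the mnasnet result.
mnasnet_ms = {"NVIDIA 5090": 4.77, "GeForce RTX 5090": 6.55, "rtx 5090": 9.80}

spread = relative_spread(mnasnet_ms)
print(f"slowest run has {spread:.0%} higher average latency than the fastest")
```

The same helper can be applied to any other test in this result file to see which ones are stable across runs and which are not.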

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 36.20 (MIN: 21.98 / MAX: 458.49)
rtx 5090: 47.90 (MIN: 21.96 / MAX: 421.33)
NVIDIA 5090: 52.59 (MIN: 21.91 / MAX: 425.58)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 12.78 (MIN: 7.62 / MAX: 95.87)
rtx 5090: 13.92 (MIN: 7.49 / MAX: 98.37)
GeForce RTX 5090: 18.06 (MIN: 7.48 / MAX: 98.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 19.79 (MIN: 6.3 / MAX: 110.05)
NVIDIA 5090: 20.78 (MIN: 6.36 / MAX: 109.98)
rtx 5090: 27.44 (MIN: 6.34 / MAX: 109.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v2

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 8.37 (MIN: 3.84 / MAX: 63.9)
rtx 5090: 10.21 (MIN: 3.85 / MAX: 64.73)
NVIDIA 5090: 11.25 (MIN: 3.82 / MAX: 63.94)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 8.68 (MIN: 5.07 / MAX: 85.5)
GeForce RTX 5090: 9.29 (MIN: 5.01 / MAX: 89.47)
rtx 5090: 11.01 (MIN: 5.09 / MAX: 92.36)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.04 (MIN: 3.9 / MAX: 5.77)
rtx 5090: 4.53 (MIN: 3.88 / MAX: 57.59)
GeForce RTX 5090: 5.12 (MIN: 3.91 / MAX: 67.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 22.02 (MIN: 7.41 / MAX: 92.66)
GeForce RTX 5090: 25.57 (MIN: 7.22 / MAX: 94.51)
NVIDIA 5090: 26.39 (MIN: 7.39 / MAX: 95.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 9.41 (MIN: 4.47 / MAX: 43.1)
rtx 5090: 10.93 (MIN: 4.48 / MAX: 44.35)
GeForce RTX 5090: 11.01 (MIN: 4.51 / MAX: 42.89)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 2.67 (MIN: 2.4 / MAX: 41.21)
GeForce RTX 5090: 2.67 (MIN: 2.38 / MAX: 25.86)
NVIDIA 5090: 3.05 (MIN: 2.38 / MAX: 49.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v3

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 4.28 (MIN: 4.05 / MAX: 5.15)
GeForce RTX 5090: 4.29 (MIN: 4.06 / MAX: 5.93)
rtx 5090: 4.68 (MIN: 4.08 / MAX: 57.94)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenetv2-yolov3

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20241226 - ms, Fewer Is Better
GeForce RTX 5090: 36.86 (MIN: 11.12 / MAX: 47.83)
NVIDIA 5090: 38.91 (MIN: 15.12 / MAX: 48.75)
rtx 5090: 39.34 (MIN: 15.92 / MAX: 49.04)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 28.58 (MIN: 10.02 / MAX: 89.48)
GeForce RTX 5090: 29.47 (MIN: 10.07 / MAX: 90.59)
NVIDIA 5090: 30.10 (MIN: 10.13 / MAX: 90.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
NVIDIA 5090: 4.400
GeForce RTX 5090: 4.446
rtx 5090: 4.630

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20241226 - ms, Fewer Is Better
NVIDIA 5090: 8.55 (MIN: 3.18 / MAX: 21.75)
rtx 5090: 8.74 (MIN: 3.21 / MAX: 22.23)
GeForce RTX 5090: 8.99 (MIN: 3.19 / MAX: 21.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 37.05 (MIN: 22.53 / MAX: 46.22)
GeForce RTX 5090: 37.49 (MIN: 23.61 / MAX: 45.51)
NVIDIA 5090: 38.73 (MIN: 20.99 / MAX: 46.14)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NAMD CUDA

ATPase Simulation - 327,506 Atoms

NAMD CUDA 2.14 - days/ns, Fewer Is Better
rtx 5090: 0.05810
NVIDIA 5090: 0.05851
GeForce RTX 5090: 0.05943
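
NAMD throughput is often quoted as ns/day rather than days/ns, and the two are reciprocals. A short Python sketch of the conversion, using the three values from the result above; the function name is illustrative:

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Invert a days/ns figure (fewer is better) into ns/day (more is better)."""
    return 1.0 / days_per_ns


# days/ns values copied from the ATPase result.
namd_days_per_ns = {
    "rtx 5090": 0.05810,
    "NVIDIA 5090": 0.05851,
    "GeForce RTX 5090": 0.05943,
}

namd_ns_per_day = {
    run: days_per_ns_to_ns_per_day(value) for run, value in namd_days_per_ns.items()
}

for run, value in namd_ns_per_day.items():
    print(f"{run}: {value:.2f} ns/day")
```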

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 2.92
NVIDIA 5090: 2.97
GeForce RTX 5090: 2.98

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 28.69
NVIDIA 5090: 28.59
GeForce RTX 5090: 28.14
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
NVIDIA 5090: 305671
rtx 5090: 302221
GeForce RTX 5090: 300737
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
GeForce RTX 5090: 239575
rtx 5090: 237717
NVIDIA 5090: 235884
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 63738
GeForce RTX 5090: 63637
NVIDIA 5090: 62773
1. (CXX) g++ options: -O3

Blender

Blend File: BMW27 - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
GeForce RTX 5090: 4.65
rtx 5090: 4.72
NVIDIA 5090: 4.72

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818 - Seconds, Fewer Is Better
NVIDIA 5090: 2.267
GeForce RTX 5090: 2.278
rtx 5090: 2.299

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 24.33
GeForce RTX 5090: 24.50
NVIDIA 5090: 24.60

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 7.00
GeForce RTX 5090: 7.04
NVIDIA 5090: 7.06

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 243937
NVIDIA 5090: 243913
GeForce RTX 5090: 241913
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

clpeak 1.1.2 - GBPS, More Is Better
NVIDIA 5090: 13.89
rtx 5090: 13.83
GeForce RTX 5090: 13.78
1. (CXX) g++ options: -O3

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
GeForce RTX 5090: 4882
NVIDIA 5090: 4882
rtx 5090: 4851

Hashcat

Benchmark: MD5

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 106848250000 (SE +/- 102551750000.00, N = 2)
GeForce RTX 5090: 106544000000 (SE +/- 102256000000.00, N = 2)
NVIDIA 5090: 106216550000 (SE +/- 101883450000.00, N = 2)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 17.34
rtx 5090: 17.35
GeForce RTX 5090: 17.44

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
NVIDIA 5090: 4400.85
rtx 5090: 4398.39
GeForce RTX 5090: 4375.95
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blend File: Barbershop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
GeForce RTX 5090: 35.10
rtx 5090: 35.14
NVIDIA 5090: 35.28

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak 1.1.2 - GBPS, More Is Better
NVIDIA 5090: 18.42
rtx 5090: 18.41
GeForce RTX 5090: 18.33
1. (CXX) g++ options: -O3

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 6.14
rtx 5090: 6.16
GeForce RTX 5090: 6.17

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
NVIDIA 5090: 1603.93
GeForce RTX 5090: 1596.88
rtx 5090: 1596.24
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
rtx 5090: 1687.49
GeForce RTX 5090: 1680.23
NVIDIA 5090: 1679.44
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

IndigoBench 4.4 - M samples/s, More Is Better
GeForce RTX 5090: 42.92
NVIDIA 5090: 42.85
rtx 5090: 42.76

Hashcat

Benchmark: 7-Zip

Hashcat 6.2.4 - H/s, More Is Better
NVIDIA 5090: 3276600
rtx 5090: 3272300
GeForce RTX 5090: 3264300

Hashcat

Benchmark: SHA1

Hashcat 6.2.4 - H/s, More Is Better
GeForce RTX 5090: 69104300000
NVIDIA 5090: 69072700000
rtx 5090: 68852500000

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 27.83
NVIDIA 5090: 27.81
GeForce RTX 5090: 27.75
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
GeForce RTX 5090: 1120.88
NVIDIA 5090: 1120.51
rtx 5090: 1117.54
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 36054
NVIDIA 5090: 36026
GeForce RTX 5090: 35954
1. (CXX) g++ options: -O3

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.2.4 - H/s, More Is Better
GeForce RTX 5090: 2777600
rtx 5090: 2776000
NVIDIA 5090: 2770700

Blender

Blend File: Junkshop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
NVIDIA 5090: 8.98
rtx 5090: 8.99
GeForce RTX 5090: 9.00

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
NVIDIA 5090: 36016.5
GeForce RTX 5090: 35961.3
rtx 5090: 35937.2
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 4.55
GeForce RTX 5090: 4.55
NVIDIA 5090: 4.56

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

IndigoBench 4.4 - M samples/s, More Is Better
GeForce RTX 5090: 92.71
rtx 5090: 92.70
NVIDIA 5090: 92.52

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GHash/s, More Is Better
GeForce RTX 5090: 142.53
rtx 5090: 142.41
NVIDIA 5090: 142.24
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Kernel Latency

clpeak 1.1.2 - us, Fewer Is Better
rtx 5090: 5.15
GeForce RTX 5090: 5.15
NVIDIA 5090: 5.16
1. (CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
NVIDIA 5090: 4.400
rtx 5090: 4.396
GeForce RTX 5090: 4.392
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

Blender

Blend File: Junkshop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 5.66
GeForce RTX 5090: 5.66
NVIDIA 5090: 5.67

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 41.80
GeForce RTX 5090: 41.76
NVIDIA 5090: 41.72
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 2875.50
NVIDIA 5090: 2872.53
rtx 5090: 2870.68
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 62.76 (MIN: 40.3 / MAX: 105.61)
GeForce RTX 5090: 62.83 (MIN: 41.21 / MAX: 109.15)
NVIDIA 5090: 62.86 (MIN: 42.12 / MAX: 106.46)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
NVIDIA 5090: 54.04
rtx 5090: 54.02
GeForce RTX 5090: 53.95
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
rtx 5090: 13.48
GeForce RTX 5090: 13.49
NVIDIA 5090: 13.50

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2 - GBPS, More Is Better
GeForce RTX 5090: 1564.61
NVIDIA 5090: 1564.49
rtx 5090: 1562.97
1. (CXX) g++ options: -O3

FluidX3D

Test: FP32-FP16C

FluidX3D 3.0 - MLUPs/s, More Is Better
rtx 5090: 19140
GeForce RTX 5090: 19135
NVIDIA 5090: 19121

clpeak

OpenCL Test: Integer 24-bit Compute

clpeak 1.1.2 - GIOPS, More Is Better
GeForce RTX 5090: 61903.93
NVIDIA 5090: 61866.86
rtx 5090: 61843.11
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

clpeak 1.1.2 - GIOPS, More Is Better
GeForce RTX 5090: 62178.05
rtx 5090: 62151.94
NVIDIA 5090: 62119.51
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
NVIDIA 5090: 9940
GeForce RTX 5090: 9934
rtx 5090: 9931
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT R2C / C2R

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 164933
GeForce RTX 5090: 164873
NVIDIA 5090: 164801
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
GeForce RTX 5090: 124646
rtx 5090: 124615
NVIDIA 5090: 124556
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
GeForce RTX 5090: 144695
rtx 5090: 144624
NVIDIA 5090: 144600
1. (CXX) g++ options: -O3

Hashcat

Benchmark: SHA-512

Hashcat 6.2.4 - H/s, More Is Better
NVIDIA 5090: 8901200000
rtx 5090: 8900400000
GeForce RTX 5090: 8895900000
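
The Hashcat results are reported in raw H/s, which makes the long integers hard to compare at a glance. A small Python sketch that rescales them with SI prefixes, applied to the rtx 5090 run; `format_rate` is a hypothetical helper and not part of Hashcat or the Phoronix Test Suite:

```python
def format_rate(hashes_per_second):
    """Format a hash rate in H/s using the largest fitting SI prefix."""
    for threshold, prefix in ((1e12, "T"), (1e9, "G"), (1e6, "M"), (1e3, "k")):
        if hashes_per_second >= threshold:
            return f"{hashes_per_second / threshold:.2f} {prefix}H/s"
    return f"{hashes_per_second:.0f} H/s"


# H/s values for the "rtx 5090" run, copied from the Hashcat results.
hashcat_rtx_5090 = {
    "MD5": 106848250000,
    "SHA1": 68852500000,
    "SHA-512": 8900400000,
    "7-Zip": 3272300,
    "TrueCrypt RIPEMD160 + XTS": 2776000,
}

for benchmark, rate in hashcat_rtx_5090.items():
    print(f"{benchmark}: {format_rate(rate)}")
```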

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
GeForce RTX 5090: 1.951
NVIDIA 5090: 1.950
rtx 5090: 1.950
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0 - ms, Fewer Is Better
GeForce RTX 5090: 103.45
NVIDIA 5090: 103.48
rtx 5090: 103.51
1. (CXX) g++ options: -O3

vkpeak

fp32-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 83296.84
NVIDIA 5090: 83290.22
GeForce RTX 5090: 83257.59

vkpeak

int32-vec4

vkpeak 20240505 - GIOPS, More Is Better
GeForce RTX 5090: 61914.51
NVIDIA 5090: 61894.95
rtx 5090: 61885.37

vkpeak

int16-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 40006.60
GeForce RTX 5090: 39998.42
NVIDIA 5090: 39989.70

vkpeak

fp32-scalar

vkpeak 20240505 - GFLOPS, More Is Better
GeForce RTX 5090: 63035.62
NVIDIA 5090: 63035.62
rtx 5090: 63013.32

FluidX3D

Test: FP32-FP32

FluidX3D 3.0 - MLUPs/s, More Is Better
NVIDIA 5090: 9527
GeForce RTX 5090: 9525
rtx 5090: 9524

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 837.24
rtx 5090: 837.21
NVIDIA 5090: 836.98
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
GeForce RTX 5090: 117.88
NVIDIA 5090: 117.86
rtx 5090: 117.85
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL
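
The FP32 and FP64 figures from the ProjectPhysX OpenCL-Benchmark differ by well over an order of magnitude on this card. A minimal Python sketch of the ratio, using the GeForce RTX 5090 run's measured values from this result file; it describes measured throughput only and makes no claim about the hardware's nominal rates:

```python
# TFLOPs/s from the ProjectPhysX OpenCL-Benchmark results, GeForce RTX 5090 run.
fp32_tflops = 117.88
fp64_tflops = 1.951

# How many times higher the measured FP32 throughput is than FP64.
fp32_to_fp64 = fp32_tflops / fp64_tflops
print(f"FP32 is about {fp32_to_fp64:.1f}x the measured FP64 throughput")
```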

ProjectPhysX OpenCL-Benchmark

Operation: FP16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
NVIDIA 5090: 122.94
GeForce RTX 5090: 122.94
rtx 5090: 122.91
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

clpeak

OpenCL Test: Double-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
NVIDIA 5090: 1977.26
rtx 5090: 1976.90
GeForce RTX 5090: 1976.78
1. (CXX) g++ options: -O3

vkpeak

fp16-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 72592.93
NVIDIA 5090: 72578.62
GeForce RTX 5090: 72575.33

vkpeak

fp16-scalar

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 62611.78
NVIDIA 5090: 62611.17
GeForce RTX 5090: 62597.51

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
GeForce RTX 5090: 61.77
NVIDIA 5090: 61.76
rtx 5090: 61.76
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

FluidX3D

Test: FP32-FP16S

FluidX3D 3.0 - MLUPs/s, More Is Better
GeForce RTX 5090: 18500
rtx 5090: 18499
NVIDIA 5090: 18496

vkpeak

fp64-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 1965.70
NVIDIA 5090: 1965.39
GeForce RTX 5090: 1965.32

clpeak

OpenCL Test: Single-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
NVIDIA 5090: 121438.41
GeForce RTX 5090: 121419.57
rtx 5090: 121415.53
1. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0 - ms, Fewer Is Better
rtx 5090: 5.648
GeForce RTX 5090: 5.648
NVIDIA 5090: 5.649
1. (CXX) g++ options: -O3

vkpeak

int16-vec4

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 43806.13
GeForce RTX 5090: 43803.60
NVIDIA 5090: 43799.96

vkpeak

fp64-scalar

vkpeak 20240505 - GFLOPS, More Is Better
NVIDIA 5090: 1967.49
GeForce RTX 5090: 1967.43
rtx 5090: 1967.37

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
GeForce RTX 5090: 28.79
NVIDIA 5090: 28.79
rtx 5090: 28.79
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

vkpeak

int32-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 62142.56
NVIDIA 5090: 62142.07
GeForce RTX 5090: 62141.02

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
GeForce RTX 5090: 11923
NVIDIA 5090: 11923
rtx 5090: 11923

Blender

Blend File: Fishy Cat - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.92
NVIDIA 5090: 8.92
GeForce RTX 5090: 8.92

Blender

Blend File: Classroom - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.38
NVIDIA 5090: 8.38
GeForce RTX 5090: 8.38
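
Each Blender scene in this result file was rendered with both the OptiX and the CUDA back-end, so the two times can be compared per scene. A short Python sketch of that comparison for the rtx 5090 run, with render times copied from the Blender results; the variable names are illustrative:

```python
# Render times in seconds for the "rtx 5090" run: (OptiX, CUDA) per blend file.
render_seconds = {
    "BMW27": (2.92, 4.72),
    "Barbershop": (24.33, 35.14),
    "Pabellon Barcelona": (7.00, 17.35),
    "Classroom": (6.16, 8.38),
    "Junkshop": (5.66, 8.99),
    "Fishy Cat": (4.55, 8.92),
}

# CUDA time divided by OptiX time: values above 1 mean OptiX finished sooner.
optix_speedup = {
    scene: cuda / optix for scene, (optix, cuda) in render_seconds.items()
}

for scene, factor in sorted(optix_speedup.items(), key=lambda item: item[1]):
    print(f"{scene}: CUDA takes {factor:.2f}x as long as OptiX")
```

Sorting by the ratio makes it easy to see which scenes gain the most from OptiX in this run.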


Phoronix Test Suite v10.8.5