nvidia rtx 5090 compute benchmarks

Tests for a future article. Intel Core Ultra 9 285K testing with an ASUS ROG MAXIMUS Z890 HERO (1203 BIOS) and ASUS NVIDIA GeForce RTX 5090 32GB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2501242-PTS-NVIDIART00&grr.

System details (identical for the three runs: rtx 5090, NVIDIA 5090, GeForce RTX 5090)

Processor: Intel Core Ultra 9 285K @ 5.10GHz (24 Cores)
Motherboard: ASUS ROG MAXIMUS Z890 HERO (1203 BIOS)
Chipset: Intel Device ae7f
Memory: 2 x 16GB DDR5-6400MT/s Micron CP16G64C38U5B.M8D1
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0 + 4001GB Western Digital WD_BLACK SN850X 4000GB
Graphics: ASUS NVIDIA GeForce RTX 5090 32GB
Audio: Intel Device 7f50
Monitor: ASUS VP28U
Network: Realtek Device 8126 + Intel I226-V + Intel Wi-Fi 7
OS: Ubuntu 24.10
Kernel: 6.11.0-13-generic (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server 1.21.1.13
Display Driver: NVIDIA 570.86.10
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.8.51 + OpenCL 3.0
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details: nouveau.modeset=0 - Transparent Huge Pages: madvise
Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_pstate performance (EPP: default) - CPU Microcode: 0x114 - Thermald 2.5.8
Graphics Details: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 98.02.2e.00.03
OpenCL Details: GPU Compute Cores: 21760
Security Details: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result overview: one entry per test for each of the three runs (rtx 5090, NVIDIA 5090, GeForce RTX 5090). The individual results are listed per test below.

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
rtx 5090: 124615
NVIDIA 5090: 124556
GeForce RTX 5090: 124646
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 11.01 (MIN: 5.09 / MAX: 92.36)
NVIDIA 5090: 8.68 (MIN: 5.07 / MAX: 85.5)
GeForce RTX 5090: 9.29 (MIN: 5.01 / MAX: 89.47)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 62.76 (MIN: 40.3 / MAX: 105.61)
NVIDIA 5090: 62.86 (MIN: 42.12 / MAX: 106.46)
GeForce RTX 5090: 62.83 (MIN: 41.21 / MAX: 109.15)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 47.90 (MIN: 21.96 / MAX: 421.33)
NVIDIA 5090: 52.59 (MIN: 21.91 / MAX: 425.58)
GeForce RTX 5090: 36.20 (MIN: 21.98 / MAX: 458.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 22.02 (MIN: 7.41 / MAX: 92.66)
NVIDIA 5090: 26.39 (MIN: 7.39 / MAX: 95.48)
GeForce RTX 5090: 25.57 (MIN: 7.22 / MAX: 94.51)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 39.34 (MIN: 15.92 / MAX: 49.04)
NVIDIA 5090: 38.91 (MIN: 15.12 / MAX: 48.75)
GeForce RTX 5090: 36.86 (MIN: 11.12 / MAX: 47.83)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenetv2-yolov3

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 28.58 (MIN: 10.02 / MAX: 89.48)
NVIDIA 5090: 30.10 (MIN: 10.13 / MAX: 90.77)
GeForce RTX 5090: 29.47 (MIN: 10.07 / MAX: 90.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 8.74 (MIN: 3.21 / MAX: 22.23)
NVIDIA 5090: 8.55 (MIN: 3.18 / MAX: 21.75)
GeForce RTX 5090: 8.99 (MIN: 3.19 / MAX: 21.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 10.93 (MIN: 4.48 / MAX: 44.35)
NVIDIA 5090: 9.41 (MIN: 4.47 / MAX: 43.1)
GeForce RTX 5090: 11.01 (MIN: 4.51 / MAX: 42.89)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 37.05 (MIN: 22.53 / MAX: 46.22)
NVIDIA 5090: 38.73 (MIN: 20.99 / MAX: 46.14)
GeForce RTX 5090: 37.49 (MIN: 23.61 / MAX: 45.51)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 13.92 (MIN: 7.49 / MAX: 98.37)
NVIDIA 5090: 12.78 (MIN: 7.62 / MAX: 95.87)
GeForce RTX 5090: 18.06 (MIN: 7.48 / MAX: 98.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 2.67 (MIN: 2.4 / MAX: 41.21)
NVIDIA 5090: 3.05 (MIN: 2.38 / MAX: 49.95)
GeForce RTX 5090: 2.67 (MIN: 2.38 / MAX: 25.86)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 27.44 (MIN: 6.34 / MAX: 109.98)
NVIDIA 5090: 20.78 (MIN: 6.36 / MAX: 109.98)
GeForce RTX 5090: 19.79 (MIN: 6.3 / MAX: 110.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 9.80 (MIN: 3.75 / MAX: 63.5)
NVIDIA 5090: 4.77 (MIN: 3.69 / MAX: 54.04)
GeForce RTX 5090: 6.55 (MIN: 3.69 / MAX: 59.21)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 4.53 (MIN: 3.88 / MAX: 57.59)
NVIDIA 5090: 4.04 (MIN: 3.9 / MAX: 5.77)
GeForce RTX 5090: 5.12 (MIN: 3.91 / MAX: 67.5)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v3

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 4.68 (MIN: 4.08 / MAX: 57.94)
NVIDIA 5090: 4.28 (MIN: 4.05 / MAX: 5.15)
GeForce RTX 5090: 4.29 (MIN: 4.06 / MAX: 5.93)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet-v2

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 10.21 (MIN: 3.85 / MAX: 64.73)
NVIDIA 5090: 11.25 (MIN: 3.82 / MAX: 63.94)
GeForce RTX 5090: 8.37 (MIN: 3.84 / MAX: 63.9)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20241226 - ms, Fewer Is Better
rtx 5090: 42.44 (MIN: 8.34 / MAX: 76.17)
NVIDIA 5090: 39.74 (MIN: 8.16 / MAX: 76.72)
GeForce RTX 5090: 38.91 (MIN: 8.9 / MAX: 75.48)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
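
Several NCNN latencies above differ noticeably between the three runs even though the hardware is the same. A minimal sketch, assuming Python and a hypothetical helper named relative_spread (not part of the Phoronix Test Suite), of how that run-to-run variation could be quantified for a few of the reported values:

```python
# Reported NCNN latencies in ms, copied from the results above, in run order:
# rtx 5090, NVIDIA 5090, GeForce RTX 5090.
ncnn_ms = {
    "vision_transformer": (62.76, 62.86, 62.83),
    "regnety_400m": (47.90, 52.59, 36.20),
    "efficientnet-b0": (27.44, 20.78, 19.79),
    "mnasnet": (9.80, 4.77, 6.55),
}


def relative_spread(values):
    """Hypothetical helper: (max - min) / min across the runs."""
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest


spread = {model: relative_spread(v) for model, v in ncnn_ms.items()}

# Print the models from most to least variable.
for model, s in sorted(spread.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model}: {s:.1%}")
```

A large spread here suggests that short NCNN inferences on this GPU are sensitive to noise between runs, so small differences between the three labels should be read with caution.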

vkpeak

int16-vec4

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 43806.13
NVIDIA 5090: 43799.96
GeForce RTX 5090: 43803.60

vkpeak

int16-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 40006.60
NVIDIA 5090: 39989.70
GeForce RTX 5090: 39998.42

vkpeak

int32-vec4

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 61885.37
NVIDIA 5090: 61894.95
GeForce RTX 5090: 61914.51

vkpeak

int32-scalar

vkpeak 20240505 - GIOPS, More Is Better
rtx 5090: 62142.56
NVIDIA 5090: 62142.07
GeForce RTX 5090: 62141.02

vkpeak

fp64-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 1965.70
NVIDIA 5090: 1965.39
GeForce RTX 5090: 1965.32

vkpeak

fp64-scalar

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 1967.37
NVIDIA 5090: 1967.49
GeForce RTX 5090: 1967.43

vkpeak

fp16-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 72592.93
NVIDIA 5090: 72578.62
GeForce RTX 5090: 72575.33

vkpeak

fp16-scalar

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 62611.78
NVIDIA 5090: 62611.17
GeForce RTX 5090: 62597.51

vkpeak

fp32-vec4

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 83296.84
NVIDIA 5090: 83290.22
GeForce RTX 5090: 83257.59

vkpeak

fp32-scalar

vkpeak 20240505 - GFLOPS, More Is Better
rtx 5090: 63013.32
NVIDIA 5090: 63035.62
GeForce RTX 5090: 63035.62

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
rtx 5090: 4851
NVIDIA 5090: 4882
GeForce RTX 5090: 4882

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

Chaos Group V-RAY 6.0 - vpaths, More Is Better
rtx 5090: 11923
NVIDIA 5090: 11923
GeForce RTX 5090: 11923

VkFFT

Test: FFT + iFFT C2C 1D batched in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 63738
NVIDIA 5090: 62773
GeForce RTX 5090: 63637
1. (CXX) g++ options: -O3

NAMD CUDA

ATPase Simulation - 327,506 Atoms

NAMD CUDA 2.14 - days/ns, Fewer Is Better
rtx 5090: 0.05810
NVIDIA 5090: 0.05851
GeForce RTX 5090: 0.05943
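
NAMD reports days/ns, where fewer is better. Some readers find the inverse, simulated nanoseconds per day, easier to compare. A minimal sketch of that conversion, assuming Python and a hypothetical helper named to_ns_per_day, using the three values reported above:

```python
# NAMD CUDA results in days/ns (fewer is better), copied from the results above.
days_per_ns = {
    "rtx 5090": 0.05810,
    "NVIDIA 5090": 0.05851,
    "GeForce RTX 5090": 0.05943,
}


def to_ns_per_day(value: float) -> float:
    """Hypothetical helper: invert days/ns into ns/day (more is better)."""
    if value <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / value


ns_per_day = {run: to_ns_per_day(v) for run, v in days_per_ns.items()}

for run, value in ns_per_day.items():
    print(f"{run}: {value:.2f} ns/day")
```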

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

IndigoBench 4.4 - M samples/s, More Is Better
rtx 5090: 42.76
NVIDIA 5090: 42.85
GeForce RTX 5090: 42.92

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

IndigoBench 4.4 - M samples/s, More Is Better
rtx 5090: 92.70
NVIDIA 5090: 92.52
GeForce RTX 5090: 92.71

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 237717
NVIDIA 5090: 235884
GeForce RTX 5090: 239575
1. (CXX) g++ options: -O3

VkFFT

Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 243937
NVIDIA 5090: 243913
GeForce RTX 5090: 241913
1. (CXX) g++ options: -O3

Blender

Blend File: Barbershop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 35.14
NVIDIA 5090: 35.28
GeForce RTX 5090: 35.10

VkFFT

Test: FFT + iFFT C2C Bluestein benchmark in double precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 9931
NVIDIA 5090: 9940
GeForce RTX 5090: 9934
1. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0 - ms, Fewer Is Better
rtx 5090: 103.51
NVIDIA 5090: 103.48
GeForce RTX 5090: 103.45
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueWriteBuffer

clpeak 1.1.2 - GBPS, More Is Better
rtx 5090: 18.41
NVIDIA 5090: 18.42
GeForce RTX 5090: 18.33
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Transfer Bandwidth enqueueReadBuffer

clpeak 1.1.2 - GBPS, More Is Better
rtx 5090: 13.83
NVIDIA 5090: 13.89
GeForce RTX 5090: 13.78
1. (CXX) g++ options: -O3

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 24.33
NVIDIA 5090: 24.60
GeForce RTX 5090: 24.50

VkFFT

Test: FFT + iFFT C2C 1D batched in half precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 302221
NVIDIA 5090: 305671
GeForce RTX 5090: 300737
1. (CXX) g++ options: -O3

FluidX3D

Test: FP32-FP32

FluidX3D 3.0 - MLUPs/s, More Is Better
rtx 5090: 9524
NVIDIA 5090: 9527
GeForce RTX 5090: 9525

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 17.35
NVIDIA 5090: 17.34
GeForce RTX 5090: 17.44

Hashcat

Benchmark: MD5

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 106848250000 (SE +/- 102551750000.00, N = 2)
NVIDIA 5090: 106216550000 (SE +/- 101883450000.00, N = 2)
GeForce RTX 5090: 106544000000 (SE +/- 102256000000.00, N = 2)

Hashcat

Benchmark: 7-Zip

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 3272300
NVIDIA 5090: 3276600
GeForce RTX 5090: 3264300

Hashcat

Benchmark: SHA1

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 68852500000
NVIDIA 5090: 69072700000
GeForce RTX 5090: 69104300000

VkFFT

Test: FFT + iFFT C2C Bluestein in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 36054
NVIDIA 5090: 36026
GeForce RTX 5090: 35954
1. (CXX) g++ options: -O3

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
rtx 5090: 13.48
NVIDIA 5090: 13.50
GeForce RTX 5090: 13.49

Hashcat

Benchmark: SHA-512

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 8900400000
NVIDIA 5090: 8901200000
GeForce RTX 5090: 8895900000

VkFFT

Test: FFT + iFFT C2C multidimensional in single precision

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 144624
NVIDIA 5090: 144600
GeForce RTX 5090: 144695
1. (CXX) g++ options: -O3

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.2.4 - H/s, More Is Better
rtx 5090: 2776000
NVIDIA 5090: 2770700
GeForce RTX 5090: 2777600

FluidX3D

Test: FP32-FP16S

FluidX3D 3.0 - MLUPs/s, More Is Better
rtx 5090: 18499
NVIDIA 5090: 18496
GeForce RTX 5090: 18500

Blender

Blend File: Junkshop - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.99
NVIDIA 5090: 8.98
GeForce RTX 5090: 9.00

Blender

Blend File: Fishy Cat - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.92
NVIDIA 5090: 8.92
GeForce RTX 5090: 8.92

FluidX3D

Test: FP32-FP16C

FluidX3D 3.0 - MLUPs/s, More Is Better
rtx 5090: 19140
NVIDIA 5090: 19121
GeForce RTX 5090: 19135

Blender

Blend File: Classroom - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 8.38
NVIDIA 5090: 8.38
GeForce RTX 5090: 8.38

clpeak

OpenCL Test: Double-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
rtx 5090: 1976.90
NVIDIA 5090: 1977.26
GeForce RTX 5090: 1976.78
1. (CXX) g++ options: -O3

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 7.00
NVIDIA 5090: 7.06
GeForce RTX 5090: 7.04

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0 - ms, Fewer Is Better
rtx 5090: 5.648
NVIDIA 5090: 5.649
GeForce RTX 5090: 5.648
1. (CXX) g++ options: -O3

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Write

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
rtx 5090: 1687.49
NVIDIA 5090: 1679.44
GeForce RTX 5090: 1680.23
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: Memory Bandwidth Coalesced Read

ProjectPhysX OpenCL-Benchmark 1.6 - GB/s, More Is Better
rtx 5090: 1596.24
NVIDIA 5090: 1603.93
GeForce RTX 5090: 1596.88
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT8 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 41.80
NVIDIA 5090: 41.72
GeForce RTX 5090: 41.76
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 54.02
NVIDIA 5090: 54.04
GeForce RTX 5090: 53.95
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 61.76
NVIDIA 5090: 61.76
GeForce RTX 5090: 61.77
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: INT64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TIOPs/s, More Is Better
rtx 5090: 4.396
NVIDIA 5090: 4.400
GeForce RTX 5090: 4.392
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP16 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
rtx 5090: 122.91
NVIDIA 5090: 122.94
GeForce RTX 5090: 122.94
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP32 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
rtx 5090: 117.85
NVIDIA 5090: 117.86
GeForce RTX 5090: 117.88
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

ProjectPhysX OpenCL-Benchmark

Operation: FP64 Compute

ProjectPhysX OpenCL-Benchmark 1.6 - TFLOPs/s, More Is Better
rtx 5090: 1.950
NVIDIA 5090: 1.950
GeForce RTX 5090: 1.951
1. (CXX) g++ options: -std=c++17 -pthread -lOpenCL

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 6.16
NVIDIA 5090: 6.14
GeForce RTX 5090: 6.17

VkFFT

Test: FFT + iFFT R2C / C2R

VkFFT 1.3.4 - Benchmark Score, More Is Better
rtx 5090: 164933
NVIDIA 5090: 164801
GeForce RTX 5090: 164873
1. (CXX) g++ options: -O3

Blender

Blend File: Junkshop - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 5.66
NVIDIA 5090: 5.67
GeForce RTX 5090: 5.66

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 2870.68
NVIDIA 5090: 2872.53
GeForce RTX 5090: 2875.50
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

Blender

Blend File: BMW27 - Compute: NVIDIA CUDA

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 4.72
NVIDIA 5090: 4.72
GeForce RTX 5090: 4.65

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 4.55
NVIDIA 5090: 4.56
GeForce RTX 5090: 4.55

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
rtx 5090: 4.630
NVIDIA 5090: 4.400
GeForce RTX 5090: 4.446

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 4.3 - Seconds, Fewer Is Better
rtx 5090: 2.92
NVIDIA 5090: 2.97
GeForce RTX 5090: 2.98
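
Each Blender scene in this result file was rendered with both the CUDA and OptiX back ends. A minimal sketch, assuming Python and using only the render times of the "rtx 5090" run reported in this file, of how the OptiX speedup over CUDA could be computed per scene:

```python
# Blender 4.3 render times in seconds for the "rtx 5090" run,
# copied from the results in this file as (CUDA, OptiX) pairs.
blender_seconds = {
    "Barbershop": (35.14, 24.33),
    "Pabellon Barcelona": (17.35, 7.00),
    "Junkshop": (8.99, 5.66),
    "Fishy Cat": (8.92, 4.55),
    "Classroom": (8.38, 6.16),
    "BMW27": (4.72, 2.92),
}

# Speedup > 1 means OptiX finished the render faster than CUDA.
speedup = {
    scene: cuda / optix for scene, (cuda, optix) in blender_seconds.items()
}

for scene, factor in speedup.items():
    print(f"{scene}: OptiX is {factor:.2f}x faster than CUDA")
```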

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818 - Seconds, Fewer Is Better
rtx 5090: 2.299
NVIDIA 5090: 2.267
GeForce RTX 5090: 2.278

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2 - GBPS, More Is Better
rtx 5090: 1562.97
NVIDIA 5090: 1564.49
GeForce RTX 5090: 1564.61
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
rtx 5090: 4398.39
NVIDIA 5090: 4400.85
GeForce RTX 5090: 4375.95
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 27.83
NVIDIA 5090: 27.81
GeForce RTX 5090: 27.75
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 28.69
NVIDIA 5090: 28.59
GeForce RTX 5090: 28.14
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
rtx 5090: 35937.2
NVIDIA 5090: 36016.5
GeForce RTX 5090: 35961.3
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
rtx 5090: 1117.54
NVIDIA 5090: 1120.51
GeForce RTX 5090: 1120.88
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 28.79
NVIDIA 5090: 28.79
GeForce RTX 5090: 28.79
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
rtx 5090: 837.21
NVIDIA 5090: 836.98
GeForce RTX 5090: 837.24
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Single-Precision Compute

clpeak 1.1.2 - GFLOPS, More Is Better
rtx 5090: 121415.53
NVIDIA 5090: 121438.41
GeForce RTX 5090: 121419.57
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer 24-bit Compute

clpeak 1.1.2 - GIOPS, More Is Better
rtx 5090: 61843.11
NVIDIA 5090: 61866.86
GeForce RTX 5090: 61903.93
1. (CXX) g++ options: -O3

clpeak

OpenCL Test: Integer Compute

clpeak 1.1.2 - GIOPS, More Is Better
rtx 5090: 62151.94
NVIDIA 5090: 62119.51
GeForce RTX 5090: 62178.05
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GHash/s, More Is Better
rtx 5090: 142.41
NVIDIA 5090: 142.24
GeForce RTX 5090: 142.53
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Kernel Latency

clpeak 1.1.2 - us, Fewer Is Better
rtx 5090: 5.15
NVIDIA 5090: 5.16
GeForce RTX 5090: 5.15
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5