2307020-PTS-GPUREVIEW1

RTX 4080 16GB PNY REVIEW

HTML result view exported from: https://openbenchmarking.org/result/2307077-NE-2307020PT71&sor&gru.

2307020-PTS-GPUREVIEW1ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyIntel Xeon w9-3495X @ 4.80GHz (56 Cores / 112 Threads)ASUS Pro WS W790E-SAGE SE (0506 BIOS)Intel Device 7aa78 x 32 GB DDR5-4812MT/s Hynix HMCG88AEBRA115N6401GB Micron_9300_MTFDHAL6T4TDR + 0GB Virtual HDisk0NVIDIA GeForce RTX 3090 24GBRealtek ALC1220BenQ PD2720U2 x Intel X710 for 10GBASE-TUbuntu 22.046.3.0-060300-generic (x86_64)GNOME Shell 42.5X Server 1.21.1.4NVIDIA 530.41.034.6.0OpenCL 3.0 CUDA 12.1.981.3.236GCC 11.3.0 + CUDA 12.1ext43840x2160NVIDIA GeForce RTX 2080 Ti 22GBNVIDIA GeForce RTX 4090 24GBNVIDIA GeForce RTX 4080 16GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave (EPP: performance) - CPU Microcode: 0x2b000390Graphics Details- RTX 3090 24GB -Zotac: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.26.48.65- RTX 2080 Ti 22GB -Dell: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.30.40.4d- RTX 4090 24GB -Nvidia: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.03- RTX 4080 16GB -Pny: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.67OpenCL Details- RTX 3090 24GB -Zotac: GPU Compute Cores: 10496- RTX 2080 Ti 22GB -Dell: GPU Compute Cores: 4352- RTX 4090 24GB -Nvidia: GPU Compute Cores: 16384- RTX 4080 16GB -Pny: GPU Compute Cores: 9728Python Details- Python 3.10.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

2307020-PTS-GPUREVIEW1vkfft: neatbench: GPUcl-mem: Readcl-mem: Writecl-mem: Copyviennacl: OpenCL BLAS - sCOPYviennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dAXPYviennacl: OpenCL BLAS - dDOTviennacl: OpenCL BLAS - dGEMV-Nviennacl: OpenCL BLAS - dGEMV-Tviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tshoc: OpenCL - Bus Speed Downloadshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Texture Read Bandwidthshoc: OpenCL - Reductionshoc: OpenCL - Triadclpeak: Global Memory Bandwidthclpeak: Single-Precision Floatclpeak: Double-Precision Doubleshoc: OpenCL - Max SP Flopsshoc: OpenCL - FFT SPshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - S3Dvkpeak: fp32-scalarvkpeak: fp32-vec4vkpeak: fp16-scalarvkpeak: fp16-vec4vkpeak: fp64-scalarvkpeak: fp64-vec4viennacl: OpenCL BLAS - dGEMM-NNviennacl: OpenCL BLAS - dGEMM-NTviennacl: OpenCL BLAS - dGEMM-TNviennacl: OpenCL BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTshoc: OpenCL - MD5 Hashclpeak: Integer Compute INTvkpeak: int32-scalarvkpeak: int32-vec4vkpeak: int16-scalarvkpeak: int16-vec4hashcat: MD5hashcat: SHA1hashcat: SHA-512hashcat: 7-Ziphashcat: TrueCrypt RIPEMD160 + XTSindigobench: OpenCL GPU - Supercarindigobench: OpenCL GPU - Bedroomlczero: OpenCLfahbench: gromacs: NVIDIA CUDA GPU - water_GMX50_baremandelgpu: GPUoctanebench: Total Scorev-ray: NVIDIA CUDA GPUv-ray: NVIDIA RTX GPUnamd-cuda: ATPase Simulation - 327,506 Atomscaffe: AlexNet - NVIDIA CUDA - 100caffe: AlexNet - NVIDIA CUDA - 200caffe: AlexNet - NVIDIA CUDA - 1000caffe: GoogleNet - NVIDIA CUDA - 100caffe: GoogleNet - NVIDIA CUDA - 200caffe: GoogleNet - NVIDIA CUDA - 1000arrayfire: Conjugate Gradient OpenCLfinancebench: Black-Scholes OpenCLvkresample: 2x - Singlevkresample: 2x - Doublencnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - FastestDetrodinia: OpenCL Particle Filterblender: BMW27 - NVIDIA OptiXblender: Classroom - NVIDIA OptiXblender: Fishy Cat - NVIDIA OptiXblender: Pabellon Barcelona - NVIDIA OptiXblender: Barbershop - NVIDIA OptiXrealsr-ncnn: 4x - Yesrealsr-ncnn: 4x - Nowaifu2x-ncnn: 2x - 3 - YesRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4090 24GB -NvidiaRTX 4080 16GB -Pny415473090823.3734.3359.23634953726007166581853711842227276483498460617274925.193026.39752148.86392.80424.5486814.9235000.44640.1537738.42349.397912.59427.49120403.3526392.0920161.5940002.93645.24645.3158558858758410712014114341.332417864.5420305.7520074.1013295.7216205.9863700926667206340500002595250000109595073132751.70120.77214316316.572123.743568293355.5669.371026205228780.06742664.5971321.366584.812418.864830.5924142.81.5835.7969.359121.35337.9318.2518.7817.5719.1623.975.3828.444.4021.4521.316.3549.1057.2725.04324.8813.523.7686.1314.2510.8416.0752.2831.1706.5093.536334042080543.8452.2318.130639230547551852429737017552261747923105865817974212.853013.20461147.26366.05812.6064504.6614076.71506.0116106.41504.874858.13270.80115974.2215859.1215521.1230649.13500.91503.53459461460458106.811914214532.518111826.7615963.5915752.9810270.2612975.655150325000016300900000204718333384027558026032.59811.15213228292.694015.377442866316.9349.49018595013060.095151.6798.84014.790153.14638.206.3518.776.117.0326.095.4928.227.0722.1321.505.8552.4861.7423.13363.756.804.5278.9322.6916.8927.4292.8752.7559.2394.307594274090886.0785.8410.344456844666177271922044318742360784946109867118877125.131126.37242980.15967.14225.0717871.5679463.361391.5188606.02779.4827175.9644.60344315.3558607.4644269.7987698.391394.441396.26115012801293134012011513714493.901940697.7344276.7244062.5229507.1739280.821543333333334939336666762977666672746938185780078.44535.29615097423.206943.470951815274.21326.237674426554780.04850447.341883.9534379.701660.013304.1516499.30.88032.8907.79955.9259.214.424.974.854.396.423.955.7725.144.505.087.2835.238.995.46289.045.082.0853.707.475.738.4230.5520.1875.1092.490482014080623.2567.7382.038548741754060859722443519012422783956108567818778725.100726.39353065.761020.23524.7590611.6747802.87869.2555455.81831.8616952.6423.57827771.8836655.6927669.3754827.66873.52873.67774795831849103.512213513760.558124524.3427758.9327595.5118441.4624616.84968919666673088153333339363500001747350114656066.03026.08015316424.349232.044772392077.3977.52814312640800.05427537.4911053.885228.001649.293279.1716310.51.1224.40411.54389.2379.364.325.024.784.376.303.895.717.904.7817.575.1635.879.135.32290.155.182.6854.439.327.5910.4137.9724.3745.6302.761OpenBenchmarking.org

GPU Temperature Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1632486480Min: 32 / Avg: 43.13 / Max: 66Min: 31 / Avg: 44.05 / Max: 73Min: 41 / Avg: 65.62 / Max: 84Min: 44 / Avg: 69.38 / Max: 83

GPU Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac80160240320400Min: 4 / Avg: 92.31 / Max: 331.21Min: 6 / Avg: 110.92 / Max: 456.05Min: 18.41 / Avg: 148.08 / Max: 347.3Min: 18.42 / Avg: 170.36 / Max: 367.21

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell13K26K39K52K65KSE +/- 72.97, N = 3SE +/- 213.00, N = 3SE +/- 475.08, N = 9SE +/- 163.05, N = 3594274820141547334041. (CXX) g++ options: -O3

NeatBench

Acceleration: GPU

OpenBenchmarking.orgFPS, More Is BetterNeatBench 5Acceleration: GPURTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9001800270036004500SE +/- 0.00, N = 14SE +/- 0.00, N = 14SE +/- 0.00, N = 13SE +/- 0.00, N = 134090408030902080

cl-mem

Benchmark: Read

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: ReadRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 0.31, N = 10SE +/- 0.81, N = 9SE +/- 0.71, N = 9SE +/- 0.15, N = 8886.0823.3623.2543.81. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Write

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: WriteRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 0.36, N = 10SE +/- 0.49, N = 10SE +/- 0.54, N = 9SE +/- 0.85, N = 8785.8734.3567.7452.21. (CC) gcc options: -O2 -flto -lOpenCL

cl-mem

Benchmark: Copy

OpenBenchmarking.orgGB/s, More Is Bettercl-mem 2017-01-13Benchmark: CopyRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell90180270360450SE +/- 0.05, N = 10SE +/- 0.03, N = 9SE +/- 0.21, N = 10SE +/- 0.27, N = 8410.3382.0359.2318.11. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sCOPYRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell100200300400500SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 1.73, N = 34443853633061. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sAXPYRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell120240360480600SE +/- 0.00, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 35684954873921. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - sDOTRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell100200300400500SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 34464173723051. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dCOPYRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell140280420560700SE +/- 0.33, N = 3SE +/- 1.20, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 36616005404751. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dAXPYRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell170340510680850SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 37727166085181. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dDOTRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell160320480640800SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 37196585975241. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-NRTX 2080 Ti 22GB -DellRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -Zotac60120180240300SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 32972242201851. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMV-TRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell100200300400500SE +/- 0.00, N = 2SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 34434353713701. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell400800120016002000SE +/- 13.45, N = 12SE +/- 19.65, N = 5SE +/- 12.27, N = 15SE +/- 17.80, N = 1519011874184217551. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell5001000150020002500SE +/- 31.16, N = 12SE +/- 13.04, N = 5SE +/- 18.29, N = 15SE +/- 16.08, N = 1524222360227222611. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 7.38, N = 5SE +/- 5.06, N = 12SE +/- 6.38, N = 15SE +/- 4.44, N = 157847837647471. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac2004006008001000SE +/- 9.35, N = 12SE +/- 11.59, N = 5SE +/- 8.86, N = 15SE +/- 7.74, N = 159569469238341. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac2004006008001000SE +/- 22.23, N = 5SE +/- 7.44, N = 12SE +/- 5.45, N = 15SE +/- 6.89, N = 151098108510589841. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac150300450600750SE +/- 8.14, N = 12SE +/- 6.71, N = 5SE +/- 4.70, N = 15SE +/- 1.68, N = 156786716586061. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac4080120160200SE +/- 1.74, N = 5SE +/- 0.91, N = 12SE +/- 1.84, N = 15SE +/- 8.48, N = 151881871791721. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 7.65, N = 12SE +/- 2.58, N = 5SE +/- 4.88, N = 15SE +/- 6.17, N = 147877717497421. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadRTX 3090 24GB -ZotacRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell612182430SE +/- 0.02, N = 13SE +/- 0.02, N = 13SE +/- 0.02, N = 13SE +/- 0.00, N = 1225.1925.1325.1012.851. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -Dell612182430SE +/- 0.00, N = 13SE +/- 0.00, N = 14SE +/- 0.00, N = 13SE +/- 0.00, N = 1226.4026.3926.3713.201. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Texture Read BandwidthRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell7001400210028003500SE +/- 2.39, N = 7SE +/- 0.95, N = 6SE +/- 0.38, N = 3SE +/- 0.54, N = 33065.762980.152148.861147.261. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 13.90, N = 15SE +/- 9.75, N = 15SE +/- 0.11, N = 13SE +/- 0.06, N = 121020.24967.14392.80366.061. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell612182430SE +/- 0.01, N = 12SE +/- 0.01, N = 12SE +/- 0.00, N = 12SE +/- 0.00, N = 1225.0724.7624.5512.611. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Global Memory Bandwidth

OpenBenchmarking.orgGBPS, More Is Betterclpeak 1.1.2OpenCL Test: Global Memory BandwidthRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 0.04, N = 10SE +/- 0.01, N = 10SE +/- 0.17, N = 10SE +/- 0.36, N = 9871.56814.92611.67504.661. (CXX) g++ options: -O3

clpeak

OpenCL Test: Single-Precision Float

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Single-Precision FloatRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20K40K60K80K100KSE +/- 103.68, N = 14SE +/- 50.90, N = 14SE +/- 10.34, N = 15SE +/- 80.60, N = 1179463.3647802.8735000.4414076.711. (CXX) g++ options: -O3

clpeak

OpenCL Test: Double-Precision Double

OpenBenchmarking.orgGFLOPS, More Is Betterclpeak 1.1.2OpenCL Test: Double-Precision DoubleRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 1.85, N = 6SE +/- 1.29, N = 6SE +/- 0.46, N = 5SE +/- 1.50, N = 41391.51869.25640.15506.011. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20K40K60K80K100KSE +/- 19.25, N = 3SE +/- 73.47, N = 3SE +/- 292.47, N = 3SE +/- 102.77, N = 388606.055455.837738.416106.41. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell6001200180024003000SE +/- 1.36, N = 12SE +/- 0.59, N = 12SE +/- 2.58, N = 12SE +/- 0.70, N = 112779.482349.391831.861504.871. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell6K12K18K24K30KSE +/- 164.66, N = 13SE +/- 224.54, N = 15SE +/- 16.79, N = 11SE +/- 33.05, N = 1027175.9016952.607912.594858.131. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell140280420560700SE +/- 0.22, N = 12SE +/- 0.35, N = 12SE +/- 0.38, N = 13SE +/- 0.15, N = 12644.60427.49423.58270.801. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9K18K27K36K45KSE +/- 10.09, N = 3SE +/- 35.38, N = 3SE +/- 34.37, N = 3SE +/- 136.44, N = 344315.3527771.8820403.3515974.22

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell13K26K39K52K65KSE +/- 13.83, N = 3SE +/- 13.71, N = 3SE +/- 43.15, N = 3SE +/- 77.45, N = 358607.4636655.6926392.0915859.12

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-scalarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9K18K27K36K45KSE +/- 7.21, N = 3SE +/- 0.86, N = 3SE +/- 30.30, N = 3SE +/- 61.99, N = 344269.7927669.3720161.5915521.12

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-vec4RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20K40K60K80K100KSE +/- 98.17, N = 3SE +/- 0.37, N = 3SE +/- 64.52, N = 3SE +/- 26.46, N = 387698.3954827.6640002.9330649.13

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 0.73, N = 3SE +/- 0.08, N = 3SE +/- 1.62, N = 3SE +/- 1.28, N = 31394.44873.52645.24500.91

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 1.25, N = 3SE +/- 0.01, N = 3SE +/- 1.70, N = 3SE +/- 0.03, N = 31396.26873.67645.31503.53

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NNRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell2004006008001000SE +/- 0.00, N = 3SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 0.88, N = 311507745854591. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-NTRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 0.00, N = 3SE +/- 1.00, N = 3SE +/- 1.67, N = 3SE +/- 0.58, N = 312807955884611. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TNRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 3.33, N = 3SE +/- 0.67, N = 3SE +/- 1.67, N = 3SE +/- 0.50, N = 212938315874601. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: OpenCL BLAS - dGEMM-TTRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30060090012001500SE +/- 0.00, N = 3SE +/- 1.20, N = 3SE +/- 1.45, N = 313408495844581. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4080 16GB -Pny306090120150SE +/- 10.59, N = 5SE +/- 1.05, N = 15SE +/- 1.59, N = 15SE +/- 1.29, N = 12120.0107.0106.8103.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4090 24GB -Nvidia306090120150SE +/- 4.18, N = 12SE +/- 4.37, N = 15SE +/- 2.19, N = 15SE +/- 0.48, N = 41221201191151. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNRTX 2080 Ti 22GB -DellRTX 3090 24GB -ZotacRTX 4090 24GB -NvidiaRTX 4080 16GB -Pny306090120150SE +/- 4.47, N = 15SE +/- 5.51, N = 15SE +/- 9.50, N = 5SE +/- 4.72, N = 121421411371351. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTRTX 2080 Ti 22GB -DellRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -Pny306090120150SE +/- 4.93, N = 15SE +/- 9.73, N = 5SE +/- 5.83, N = 15SE +/- 6.46, N = 111451441431371. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

OpenBenchmarking.orgGHash/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: MD5 HashRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20406080100SE +/- 0.95, N = 15SE +/- 0.65, N = 15SE +/- 0.05, N = 14SE +/- 0.06, N = 1493.9060.5641.3332.521. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

clpeak

OpenCL Test: Integer Compute INT

OpenBenchmarking.orgGIOPS, More Is Betterclpeak 1.1.2OpenCL Test: Integer Compute INTRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9K18K27K36K45KSE +/- 49.03, N = 14SE +/- 20.64, N = 14SE +/- 36.29, N = 15SE +/- 86.60, N = 1540697.7324524.3417864.5411826.761. (CXX) g++ options: -O3

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9K18K27K36K45KSE +/- 25.10, N = 3SE +/- 2.25, N = 3SE +/- 10.90, N = 3SE +/- 39.60, N = 344276.7227758.9320305.7515963.59

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9K18K27K36K45KSE +/- 3.11, N = 3SE +/- 37.47, N = 3SE +/- 27.49, N = 3SE +/- 15.45, N = 344062.5227595.5120074.1015752.98

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-scalarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell6K12K18K24K30KSE +/- 4.42, N = 3SE +/- 0.34, N = 3SE +/- 0.25, N = 3SE +/- 0.89, N = 329507.1718441.4613295.7210270.26

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-vec4RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell8K16K24K32K40KSE +/- 1.77, N = 3SE +/- 0.85, N = 3SE +/- 4.87, N = 3SE +/- 4.25, N = 339280.8224616.8416205.9812975.65

Hashcat

Benchmark: MD5

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: MD5RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell30000M60000M90000M120000M150000MSE +/- 243127767.05, N = 6SE +/- 34207656.71, N = 6SE +/- 474267340.07, N = 15SE +/- 232076903.56, N = 6154333333333968919666676370092666751503250000

Hashcat

Benchmark: SHA1

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA1RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell11000M22000M33000M44000M55000MSE +/- 18148896.51, N = 6SE +/- 19028429.02, N = 6SE +/- 15661518.66, N = 6SE +/- 39682498.24, N = 649393366667308815333332063405000016300900000

Hashcat

Benchmark: SHA-512

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: SHA-512RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1300M2600M3900M5200M6500MSE +/- 2101057.93, N = 6SE +/- 1634982.16, N = 6SE +/- 2013247.79, N = 6SE +/- 6223365.47, N = 66297766667393635000025952500002047183333

Hashcat

Benchmark: 7-Zip

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: 7-ZipRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell600K1200K1800K2400K3000KSE +/- 1870.54, N = 8SE +/- 1869.68, N = 8SE +/- 1997.77, N = 8SE +/- 1608.10, N = 8274693817473501095950840275

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

OpenBenchmarking.orgH/s, More Is BetterHashcat 6.2.4Benchmark: TrueCrypt RIPEMD160 + XTSRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell400K800K1200K1600K2000KSE +/- 13557.46, N = 7SE +/- 8051.58, N = 15SE +/- 5945.49, N = 15SE +/- 4392.30, N = 1018578001146560731327580260

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: SupercarRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20406080100SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 378.4566.0351.7032.60

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: OpenCL GPU - Scene: BedroomRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell816243240SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 335.3026.0820.7711.15

LeelaChessZero

Backend: OpenCL

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: OpenCLRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell3K6K9K12K15KSE +/- 126.87, N = 3SE +/- 198.64, N = 3SE +/- 52.37, N = 3SE +/- 135.58, N = 4153161509714316132281. (CXX) g++ options: -flto -pthread

FAHBench

OpenBenchmarking.orgNs Per Day, More Is BetterFAHBench 2.3.2RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell90180270360450SE +/- 0.63, N = 3SE +/- 0.24, N = 3SE +/- 0.41, N = 3SE +/- 0.79, N = 3424.35423.21316.57292.69

GROMACS

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bareRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1020304050SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 343.4732.0423.7415.381. (CXX) g++ options: -O3

MandelGPU

OpenCL Device: GPU

OpenBenchmarking.orgSamples/sec, More Is BetterMandelGPU 1.3pts1OpenCL Device: GPURTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell200M400M600M800M1000MSE +/- 3825360.26, N = 12SE +/- 1340958.21, N = 12SE +/- 959338.71, N = 11SE +/- 1540493.49, N = 10951815274.2772392077.3568293355.5442866316.91. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

OctaneBench

Total Score

OpenBenchmarking.orgScore, More Is BetterOctaneBench 2020.1Total ScoreRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell300600900120015001326.24977.53669.37349.49

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

OpenBenchmarking.orgvpaths, More Is BetterChaos Group V-RAY 5.02Mode: NVIDIA CUDA GPURTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell9001800270036004500SE +/- 2.33, N = 3SE +/- 2.73, N = 3SE +/- 2.85, N = 3SE +/- 0.67, N = 3426531262052950

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

OpenBenchmarking.orgvrays, More Is BetterChaos Group V-RAY 5.02Mode: NVIDIA RTX GPURTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell12002400360048006000SE +/- 63.00, N = 3SE +/- 30.20, N = 3SE +/- 18.12, N = 3SE +/- 10.17, N = 35478408028781306

Hashcat

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny31.045.763.0RTX 4090 24GB -Nvidia36.049.165.0RTX 2080 Ti 22GB -Dell50.065.074.0RTX 3090 24GB -Zotac50.067.582.0OpenBenchmarking.orgCelsius, Fewer Is BetterHashcat 6.2.4GPU Temperature Monitor20406080100

Hashcat

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny34.047.562.0RTX 4090 24GB -Nvidia40.050.864.0RTX 3090 24GB -Zotac46.065.478.0RTX 2080 Ti 22GB -Dell58.068.876.0OpenBenchmarking.orgCelsius, Fewer Is BetterHashcat 6.2.4GPU Temperature Monitor20406080100

Hashcat

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny35.048.964.0RTX 4090 24GB -Nvidia41.052.565.0RTX 3090 24GB -Zotac45.065.278.0RTX 2080 Ti 22GB -Dell58.069.276.0OpenBenchmarking.orgCelsius, Fewer Is BetterHashcat 6.2.4GPU Temperature Monitor20406080100

Hashcat

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny36.043.063.0RTX 4090 24GB -Nvidia41.046.662.0RTX 3090 24GB -Zotac45.060.476.0RTX 2080 Ti 22GB -Dell59.065.673.0OpenBenchmarking.orgCelsius, Fewer Is BetterHashcat 6.2.4GPU Temperature Monitor20406080100

Hashcat

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny35.042.863.0RTX 4090 24GB -Nvidia39.044.862.0RTX 3090 24GB -Zotac47.060.777.0RTX 2080 Ti 22GB -Dell59.065.373.0OpenBenchmarking.orgCelsius, Fewer Is BetterHashcat 6.2.4GPU Temperature Monitor20406080100

FAHBench

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny33.039.746.0RTX 4090 24GB -Nvidia35.040.044.0RTX 3090 24GB -Zotac46.066.880.0RTX 2080 Ti 22GB -Dell55.069.581.0OpenBenchmarking.orgCelsius, Fewer Is BetterFAHBench 2.3.2GPU Temperature Monitor20406080100

GROMACS

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.045.657.0RTX 4080 16GB -Pny35.048.962.0RTX 3090 24GB -Zotac48.071.182.0RTX 2080 Ti 22GB -Dell59.073.680.0OpenBenchmarking.orgCelsius, Fewer Is BetterGROMACS 2023GPU Temperature Monitor20406080100

NAMD CUDA

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia34.037.344.0RTX 4080 16GB -Pny36.039.552.0RTX 3090 24GB -Zotac47.054.378.0RTX 2080 Ti 22GB -Dell60.067.374.0OpenBenchmarking.orgCelsius, Fewer Is BetterNAMD CUDA 2.14GPU Temperature Monitor20406080100

OctaneBench

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny35.054.759.0RTX 4090 24GB -Nvidia35.055.060.0RTX 3090 24GB -Zotac50.079.382.0RTX 2080 Ti 22GB -Dell58.080.083.0OpenBenchmarking.orgCelsius, Fewer Is BetterOctaneBench 2020.1GPU Temperature Monitor20406080100

Rodinia

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny39.043.849.0RTX 4090 24GB -Nvidia41.044.650.0RTX 3090 24GB -Zotac52.064.977.0RTX 2080 Ti 22GB -Dell62.069.274.0OpenBenchmarking.orgCelsius, Fewer Is BetterRodinia 3.1GPU Temperature Monitor20406080100

ArrayFire

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.038.541.0RTX 4080 16GB -Pny37.039.442.0RTX 3090 24GB -Zotac49.061.972.0RTX 2080 Ti 22GB -Dell59.064.471.0OpenBenchmarking.orgCelsius, Fewer Is BetterArrayFire 3.7GPU Temperature Monitor20406080100

clpeak

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia35.037.440.0RTX 4080 16GB -Pny36.038.443.0RTX 3090 24GB -Zotac50.062.075.0RTX 2080 Ti 22GB -Dell59.066.474.0OpenBenchmarking.orgCelsius, Fewer Is Betterclpeak 1.1.2GPU Temperature Monitor20406080100

clpeak

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny36.038.052.0RTX 4090 24GB -Nvidia36.038.852.0RTX 3090 24GB -Zotac50.061.173.0RTX 2080 Ti 22GB -Dell60.065.973.0OpenBenchmarking.orgCelsius, Fewer Is Betterclpeak 1.1.2GPU Temperature Monitor20406080100

clpeak

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny35.039.241.0RTX 4090 24GB -Nvidia36.039.541.0RTX 3090 24GB -Zotac52.067.173.0RTX 2080 Ti 22GB -Dell59.070.075.0OpenBenchmarking.orgCelsius, Fewer Is Betterclpeak 1.1.2GPU Temperature Monitor20406080100

clpeak

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.038.555.0RTX 4080 16GB -Pny35.038.556.0RTX 3090 24GB -Zotac50.061.672.0RTX 2080 Ti 22GB -Dell59.062.568.0OpenBenchmarking.orgCelsius, Fewer Is Betterclpeak 1.1.2GPU Temperature Monitor20406080100

NeatBench

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia33.034.436.0RTX 4080 16GB -Pny34.034.937.0RTX 3090 24GB -Zotac50.057.363.0RTX 2080 Ti 22GB -Dell56.058.361.0OpenBenchmarking.orgCelsius, Fewer Is BetterNeatBench 5GPU Temperature Monitor20406080100

FinanceBench

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia33.034.736.0RTX 4080 16GB -Pny35.036.438.0RTX 3090 24GB -Zotac52.055.860.0RTX 2080 Ti 22GB -Dell55.056.858.0OpenBenchmarking.orgCelsius, Fewer Is BetterFinanceBench 2016-07-25GPU Temperature Monitor1632486480

LeelaChessZero

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.043.646.0RTX 4080 16GB -Pny38.047.050.0RTX 3090 24GB -Zotac53.075.483.0RTX 2080 Ti 22GB -Dell55.079.482.0OpenBenchmarking.orgCelsius, Fewer Is BetterLeelaChessZero 0.28GPU Temperature Monitor20406080100

cl-mem

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia38.039.642.0RTX 4080 16GB -Pny39.041.644.0RTX 3090 24GB -Zotac48.063.973.0RTX 2080 Ti 22GB -Dell59.067.873.0OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature Monitor20406080100

cl-mem

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.037.940.0RTX 4080 16GB -Pny38.040.743.0RTX 3090 24GB -Zotac49.064.574.0RTX 2080 Ti 22GB -Dell60.067.773.0OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature Monitor20406080100

cl-mem

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia35.038.040.0RTX 4080 16GB -Pny37.040.343.0RTX 3090 24GB -Zotac50.064.474.0RTX 2080 Ti 22GB -Dell60.067.673.0OpenBenchmarking.orgCelsius, Fewer Is Bettercl-mem 2017-01-13GPU Temperature Monitor20406080100

MandelGPU

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.038.846.0RTX 4080 16GB -Pny37.040.952.0RTX 3090 24GB -Zotac50.062.979.0RTX 2080 Ti 22GB -Dell60.067.674.0OpenBenchmarking.orgCelsius, Fewer Is BetterMandelGPU 1.3pts1GPU Temperature Monitor20406080100

ViennaCL

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.041.548.0RTX 4080 16GB -Pny37.044.054.0RTX 3090 24GB -Zotac49.070.883.0RTX 2080 Ti 22GB -Dell59.072.278.0OpenBenchmarking.orgCelsius, Fewer Is BetterViennaCL 1.7.1GPU Temperature Monitor20406080100

ViennaCL

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia32.033.937.0RTX 4080 16GB -Pny34.035.738.0RTX 2080 Ti 22GB -Dell44.049.258.0RTX 3090 24GB -Zotac48.053.357.0OpenBenchmarking.orgCelsius, Fewer Is BetterViennaCL 1.7.1GPU Temperature Monitor1632486480

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia34.036.138.0RTX 4080 16GB -Pny38.040.142.0RTX 2080 Ti 22GB -Dell44.049.754.0RTX 3090 24GB -Zotac55.061.168.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia37.039.141.0RTX 4080 16GB -Pny41.042.745.0RTX 2080 Ti 22GB -Dell51.055.159.0RTX 3090 24GB -Zotac52.058.865.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature MonitorRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1632486480Min: 38 / Avg: 43.44 / Max: 73Min: 39 / Avg: 43.63 / Max: 66Min: 52 / Avg: 70.48 / Max: 82Min: 55 / Avg: 73.85 / Max: 82

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia37.041.847.0RTX 4080 16GB -Pny37.043.149.0RTX 3090 24GB -Zotac50.072.279.0RTX 2080 Ti 22GB -Dell59.074.078.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia37.038.039.0RTX 4080 16GB -Pny37.038.140.0RTX 3090 24GB -Zotac49.058.065.0RTX 2080 Ti 22GB -Dell58.060.764.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.037.142.0RTX 4080 16GB -Pny36.038.546.0RTX 3090 24GB -Zotac53.063.874.0RTX 2080 Ti 22GB -Dell56.064.471.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia35.035.637.0RTX 4080 16GB -Pny36.038.655.0RTX 3090 24GB -Zotac50.060.273.0RTX 2080 Ti 22GB -Dell59.062.468.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia34.036.440.0RTX 4080 16GB -Pny36.036.739.0RTX 2080 Ti 22GB -Dell57.062.968.0RTX 3090 24GB -Zotac52.062.972.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny35.037.038.0RTX 4090 24GB -Nvidia38.039.240.0RTX 3090 24GB -Zotac52.056.260.0RTX 2080 Ti 22GB -Dell57.059.462.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny38.039.541.0RTX 4090 24GB -Nvidia40.041.543.0RTX 2080 Ti 22GB -Dell56.058.060.0RTX 3090 24GB -Zotac52.058.162.0OpenBenchmarking.orgCelsius, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Temperature Monitor20406080100

IndigoBench

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia42.049.452.0RTX 4080 16GB -Pny40.051.956.0RTX 2080 Ti 22GB -Dell57.076.081.0RTX 3090 24GB -Zotac52.077.683.0OpenBenchmarking.orgCelsius, Fewer Is BetterIndigoBench 4.4GPU Temperature Monitor20406080100

IndigoBench

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia41.051.757.0RTX 4080 16GB -Pny42.054.059.0RTX 2080 Ti 22GB -Dell61.076.682.0RTX 3090 24GB -Zotac50.077.784.0OpenBenchmarking.orgCelsius, Fewer Is BetterIndigoBench 4.4GPU Temperature Monitor20406080100

Chaos Group V-RAY

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia42.050.857.0RTX 4080 16GB -Pny42.051.857.0RTX 3090 24GB -Zotac48.071.582.0RTX 2080 Ti 22GB -Dell57.073.582.0OpenBenchmarking.orgCelsius, Fewer Is BetterChaos Group V-RAY 5.02GPU Temperature Monitor20406080100

Chaos Group V-RAY

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny39.048.054.0RTX 4090 24GB -Nvidia41.048.654.0RTX 3090 24GB -Zotac47.068.380.0RTX 2080 Ti 22GB -Dell55.071.782.0OpenBenchmarking.orgCelsius, Fewer Is BetterChaos Group V-RAY 5.02GPU Temperature Monitor20406080100

Blender

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia42.044.850.0RTX 4080 16GB -Pny41.045.351.0RTX 3090 24GB -Zotac46.067.379.0RTX 2080 Ti 22GB -Dell60.072.678.0OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 3.6GPU Temperature Monitor20406080100

Blender

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia40.046.552.0RTX 4080 16GB -Pny39.048.355.0RTX 3090 24GB -Zotac48.073.081.0RTX 2080 Ti 22GB -Dell59.075.481.0OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 3.6GPU Temperature Monitor20406080100

Blender

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia41.045.854.0RTX 4080 16GB -Pny40.048.257.0RTX 3090 24GB -Zotac48.070.782.0RTX 2080 Ti 22GB -Dell60.074.480.0OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 3.6GPU Temperature Monitor20406080100

Blender

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia39.045.350.0RTX 4080 16GB -Pny39.048.054.0RTX 3090 24GB -Zotac47.071.980.0RTX 2080 Ti 22GB -Dell60.075.880.0OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 3.6GPU Temperature Monitor20406080100

Blender

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia40.048.554.0RTX 4080 16GB -Pny40.051.056.0RTX 3090 24GB -Zotac47.075.682.0RTX 2080 Ti 22GB -Dell59.078.782.0OpenBenchmarking.orgCelsius, Fewer Is BetterBlender 3.6GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4080 16GB -Pny39.042.150.0RTX 4090 24GB -Nvidia40.042.449.0RTX 3090 24GB -Zotac47.060.175.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia38.039.544.0RTX 4080 16GB -Pny38.040.348.0RTX 3090 24GB -Zotac50.061.477.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.040.745.0RTX 4080 16GB -Pny37.043.950.0RTX 3090 24GB -Zotac48.068.980.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia37.038.542.0RTX 4080 16GB -Pny38.040.847.0RTX 3090 24GB -Zotac47.061.374.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia36.039.343.0RTX 4080 16GB -Pny37.041.848.0RTX 3090 24GB -Zotac47.064.676.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

Caffe

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia37.042.345.0RTX 4080 16GB -Pny37.046.150.0RTX 3090 24GB -Zotac47.072.878.0OpenBenchmarking.orgCelsius, Fewer Is BetterCaffe 2020-02-13GPU Temperature Monitor20406080100

VkFFT

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia35.036.943.0RTX 4080 16GB -Pny37.039.646.0RTX 3090 24GB -Zotac47.062.874.0RTX 2080 Ti 22GB -Dell54.068.677.0OpenBenchmarking.orgCelsius, Fewer Is BetterVkFFT 1.1.1GPU Temperature Monitor20406080100

VkResample

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia35.037.043.0RTX 4080 16GB -Pny36.038.146.0RTX 3090 24GB -Zotac48.053.573.0RTX 2080 Ti 22GB -Dell57.059.769.0OpenBenchmarking.orgCelsius, Fewer Is BetterVkResample 1.0GPU Temperature Monitor20406080100

VkResample

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia38.041.647.0RTX 4080 16GB -Pny38.043.650.0RTX 3090 24GB -Zotac47.059.274.0RTX 2080 Ti 22GB -Dell55.065.473.0OpenBenchmarking.orgCelsius, Fewer Is BetterVkResample 1.0GPU Temperature Monitor20406080100

vkpeak

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia42.053.965.0RTX 4080 16GB -Pny43.057.366.0RTX 3090 24GB -Zotac42.071.379.0RTX 2080 Ti 22GB -Dell56.077.383.0OpenBenchmarking.orgCelsius, Fewer Is Bettervkpeak 20210424GPU Temperature Monitor20406080100

NCNN

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia33.037.445.0RTX 4080 16GB -Pny35.038.243.0RTX 2080 Ti 22GB -Dell46.049.861.0RTX 3090 24GB -Zotac42.051.664.0OpenBenchmarking.orgCelsius, Fewer Is BetterNCNN 20220729GPU Temperature Monitor20406080100

RealSR-NCNN

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia41.051.656.0RTX 4080 16GB -Pny42.057.364.0RTX 2080 Ti 22GB -Dell46.074.280.0RTX 3090 24GB -Zotac52.074.984.0OpenBenchmarking.orgCelsius, Fewer Is BetterRealSR-NCNN 20200818GPU Temperature Monitor20406080100

RealSR-NCNN

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia42.045.153.0RTX 4080 16GB -Pny42.047.658.0RTX 3090 24GB -Zotac48.061.777.0RTX 2080 Ti 22GB -Dell60.069.376.0OpenBenchmarking.orgCelsius, Fewer Is BetterRealSR-NCNN 20200818GPU Temperature Monitor20406080100

Waifu2x-NCNN Vulkan

GPU Temperature Monitor

MinAvgMaxRTX 4090 24GB -Nvidia39.041.646.0RTX 4080 16GB -Pny40.042.951.0RTX 3090 24GB -Zotac47.061.273.0RTX 2080 Ti 22GB -Dell59.068.275.0OpenBenchmarking.orgCelsius, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818GPU Temperature Monitor20406080100

NAMD CUDA

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD CUDA 2.14ATPase Simulation - 327,506 AtomsRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell0.02140.04280.06420.08560.107SE +/- 0.00119, N = 15SE +/- 0.00113, N = 15SE +/- 0.00094, N = 3SE +/- 0.00074, N = 150.048500.054270.067420.09515

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -Zotac140280420560700SE +/- 1.11, N = 11SE +/- 1.06, N = 11SE +/- 1.37, N = 11447.34537.49664.601. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -Zotac30060090012001500SE +/- 0.87, N = 11SE +/- 1.43, N = 10SE +/- 1.00, N = 10883.951053.881321.361. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -Zotac14002800420056007000SE +/- 2.74, N = 7SE +/- 3.07, N = 6SE +/- 2.76, N = 64379.705228.006584.811. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -Zotac5001000150020002500SE +/- 2.33, N = 10SE +/- 1.34, N = 10SE +/- 1.69, N = 91649.291660.012418.861. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -Zotac10002000300040005000SE +/- 3.25, N = 8SE +/- 3.64, N = 8SE +/- 2.81, N = 73279.173304.154830.591. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -Zotac5K10K15K20K25KSE +/- 18.97, N = 3SE +/- 24.30, N = 3SE +/- 30.41, N = 316310.516499.324142.81. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

ArrayFire

Test: Conjugate Gradient OpenCL

OpenBenchmarking.orgms, Fewer Is BetterArrayFire 3.7Test: Conjugate Gradient OpenCLRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell0.37780.75561.13341.51121.889SE +/- 0.0014, N = 9SE +/- 0.0013, N = 10SE +/- 0.0020, N = 9SE +/- 0.0055, N = 90.88031.12201.58301.67901. (CXX) g++ options: -rdynamic

FinanceBench

Benchmark: Black-Scholes OpenCL

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Black-Scholes OpenCLRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell246810SE +/- 0.005, N = 15SE +/- 0.013, N = 15SE +/- 0.002, N = 14SE +/- 0.110, N = 152.8904.4045.7968.8401. (CXX) g++ options: -O3 -march=native -fopenmp

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 4080 16GB -PnyRTX 2080 Ti 22GB -Dell48121620SE +/- 0.003, N = 5SE +/- 0.008, N = 5SE +/- 0.003, N = 5SE +/- 0.023, N = 57.7999.35911.54314.7901. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: DoubleRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell306090120150SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.26, N = 355.9389.24121.35153.151. (CXX) g++ options: -O3

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: mobilenetRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell918273645SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.21, N = 3SE +/- 0.18, N = 39.219.3637.9338.20MIN: 8.39 / MAX: 47.45MIN: 8.28 / MAX: 49.63MIN: 14.41 / MAX: 61.76MIN: 10.75 / MAX: 66.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac48121620SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.39, N = 3SE +/- 0.28, N = 34.324.426.3518.25MIN: 3.68 / MAX: 23.77MIN: 3.68 / MAX: 20.27MIN: 4.5 / MAX: 27.11MIN: 7.4 / MAX: 39.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac510152025SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.83, N = 3SE +/- 0.18, N = 34.975.0218.7718.78MIN: 4.23 / MAX: 23.03MIN: 4.12 / MAX: 23.74MIN: 7.68 / MAX: 35.81MIN: 8.92 / MAX: 34.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: shufflenet-v2RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac48121620SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 34.784.856.1117.57MIN: 4.03 / MAX: 23.06MIN: 4.28 / MAX: 25.38MIN: 4.77 / MAX: 33.2MIN: 8.24 / MAX: 39.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: mnasnetRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac510152025SE +/- 0.02, N = 3SE +/- 0.10, N = 2SE +/- 0.73, N = 3SE +/- 0.13, N = 34.374.397.0319.16MIN: 3.71 / MAX: 23.07MIN: 3.79 / MAX: 22MIN: 4.55 / MAX: 25.62MIN: 7.92 / MAX: 33.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: efficientnet-b0RTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell612182430SE +/- 0.10, N = 3SE +/- 0.36, N = 3SE +/- 0.63, N = 3SE +/- 0.20, N = 36.306.4223.9726.09MIN: 5.32 / MAX: 26.26MIN: 5.35 / MAX: 28.33MIN: 10.15 / MAX: 37.01MIN: 12.37 / MAX: 40.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: blazefaceRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1.23532.47063.70594.94126.1765SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.22, N = 3SE +/- 0.07, N = 33.893.955.385.49MIN: 3.33 / MAX: 36.65MIN: 3.22 / MAX: 30.17MIN: 4.06 / MAX: 22.14MIN: 3.72 / MAX: 33.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: googlenetRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac714212835SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.49, N = 3SE +/- 0.13, N = 35.715.7728.2228.44MIN: 5.03 / MAX: 22.85MIN: 4.99 / MAX: 22.76MIN: 15.18 / MAX: 38.54MIN: 14.74 / MAX: 44.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: vgg16RTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4080 16GB -PnyRTX 4090 24GB -Nvidia612182430SE +/- 0.11, N = 3SE +/- 1.00, N = 3SE +/- 3.46, N = 3SE +/- 0.10, N = 34.407.077.9025.14MIN: 3.72 / MAX: 24.39MIN: 4.46 / MAX: 48.35MIN: 3.84 / MAX: 36.49MIN: 12.85 / MAX: 33.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: resnet18RTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell510152025SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.21, N = 3SE +/- 0.55, N = 34.504.7821.4522.13MIN: 3.89 / MAX: 21.9MIN: 4.18 / MAX: 28.49MIN: 8.94 / MAX: 40.36MIN: 9.01 / MAX: 43.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: alexnetRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell510152025SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.22, N = 3SE +/- 0.30, N = 35.0817.5721.3121.50MIN: 3.13 / MAX: 27.91MIN: 7.8 / MAX: 33.45MIN: 7.31 / MAX: 35.87MIN: 3.33 / MAX: 40.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: resnet50RTX 4080 16GB -PnyRTX 2080 Ti 22GB -DellRTX 3090 24GB -ZotacRTX 4090 24GB -Nvidia246810SE +/- 0.04, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.77, N = 35.165.856.357.28MIN: 4.78 / MAX: 21.85MIN: 4.75 / MAX: 26.01MIN: 5 / MAX: 21.95MIN: 3.88 / MAX: 30.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: yolov4-tinyRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1224364860SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.40, N = 3SE +/- 0.13, N = 335.2335.8749.1052.48MIN: 11.4 / MAX: 59.2MIN: 11.56 / MAX: 61.96MIN: 17.59 / MAX: 76.07MIN: 20.11 / MAX: 75.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: squeezenet_ssdRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1428425670SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.43, N = 3SE +/- 0.50, N = 38.999.1357.2761.74MIN: 8.18 / MAX: 62.47MIN: 7.71 / MAX: 66.82MIN: 16.86 / MAX: 94.95MIN: 23.2 / MAX: 85.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: regnety_400mRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac612182430SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 2.01, N = 3SE +/- 0.25, N = 35.325.4623.1325.04MIN: 4.68 / MAX: 22.84MIN: 4.78 / MAX: 28.36MIN: 8.74 / MAX: 39.68MIN: 11.25 / MAX: 44.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: vision_transformerRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell80160240320400SE +/- 1.34, N = 3SE +/- 3.05, N = 3SE +/- 0.55, N = 3SE +/- 0.57, N = 3289.04290.15324.88363.75MIN: 260.04 / MAX: 661.43MIN: 247.6 / MAX: 837.91MIN: 283.26 / MAX: 423.43MIN: 316.04 / MAX: 471.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: Vulkan GPU - Model: FastestDetRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac3691215SE +/- 0.04, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 3SE +/- 3.64, N = 35.085.186.8013.52MIN: 4.3 / MAX: 23.08MIN: 4.28 / MAX: 25.38MIN: 5.02 / MAX: 32.78MIN: 5.33 / MAX: 35.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Rodinia

Test: OpenCL Particle Filter

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenCL Particle FilterRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1.01862.03723.05584.07445.093SE +/- 0.080, N = 3SE +/- 0.005, N = 10SE +/- 0.029, N = 10SE +/- 0.031, N = 82.0852.6853.7684.5271. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: NVIDIA OptiXRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell246810SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 153.704.436.138.93

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: NVIDIA OptiXRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell510152025SE +/- 0.01, N = 6SE +/- 0.02, N = 5SE +/- 0.01, N = 4SE +/- 0.03, N = 37.479.3214.2522.69

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: NVIDIA OptiXRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell48121620SE +/- 0.07, N = 15SE +/- 0.07, N = 15SE +/- 0.07, N = 15SE +/- 0.21, N = 45.737.5910.8416.89

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: NVIDIA OptiXRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell612182430SE +/- 0.02, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 3SE +/- 0.04, N = 38.4210.4116.0727.42

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: NVIDIA OptiXRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell20406080100SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 330.5537.9752.2892.87

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell1224364860SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.18, N = 3SE +/- 0.07, N = 320.1924.3731.1752.76

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell3691215SE +/- 0.018, N = 7SE +/- 0.015, N = 7SE +/- 0.015, N = 6SE +/- 0.025, N = 55.1095.6306.5099.239

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesRTX 4090 24GB -NvidiaRTX 4080 16GB -PnyRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -Dell0.96911.93822.90733.87644.8455SE +/- 0.003, N = 10SE +/- 0.004, N = 10SE +/- 0.005, N = 9SE +/- 0.007, N = 82.4902.7613.5364.307

Hashcat

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny6.8142.4304.5RTX 2080 Ti 22GB -Dell21.1147.0255.1RTX 3090 24GB -Zotac20.4191.9326.0RTX 4090 24GB -Nvidia6.7197.8444.1OpenBenchmarking.orgWatts, Fewer Is BetterHashcat 6.2.4GPU Power Consumption Monitor120240360480600

Hashcat

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.7148.9281.6RTX 2080 Ti 22GB -Dell22.5154.3261.7RTX 4090 24GB -Nvidia6.7190.1401.1RTX 3090 24GB -Zotac19.1192.7330.0OpenBenchmarking.orgWatts, Fewer Is BetterHashcat 6.2.4GPU Power Consumption Monitor110220330440550

Hashcat

GPU Power Consumption Monitor

MinAvgMaxRTX 2080 Ti 22GB -Dell22.3156.5321.2RTX 4080 16GB -Pny12.9157.1290.4RTX 3090 24GB -Zotac20.5192.6326.5RTX 4090 24GB -Nvidia6.8231.7417.6OpenBenchmarking.orgWatts, Fewer Is BetterHashcat 6.2.4GPU Power Consumption Monitor110220330440550

Hashcat

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.2101.3323.7RTX 2080 Ti 22GB -Dell22.9121.2333.4RTX 4090 24GB -Nvidia6.8144.6456.1RTX 3090 24GB -Zotac19.1155.4349.1OpenBenchmarking.orgWatts, Fewer Is BetterHashcat 6.2.4GPU Power Consumption Monitor120240360480600

Hashcat

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.494.7308.4RTX 2080 Ti 22GB -Dell22.8115.4347.3RTX 4090 24GB -Nvidia6.6136.8444.0RTX 3090 24GB -Zotac19.1138.5326.4OpenBenchmarking.orgWatts, Fewer Is BetterHashcat 6.2.4GPU Power Consumption Monitor120240360480600

FAHBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny10.488.1141.6RTX 4090 24GB -Nvidia6.596.4148.8RTX 2080 Ti 22GB -Dell21.7160.0255.5RTX 3090 24GB -Zotac19.0192.6299.0OpenBenchmarking.orgWatts, Fewer Is BetterFAHBench 2.3.2GPU Power Consumption Monitor80160240320400

GROMACS

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.5176.8317.1RTX 4090 24GB -Nvidia6.7182.8404.4RTX 2080 Ti 22GB -Dell22.7194.8276.3RTX 3090 24GB -Zotac28.0243.2365.1OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2023GPU Power Consumption Monitor110220330440550

NAMD CUDA

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia6.454.4281.7RTX 3090 24GB -Zotac19.068.9327.9RTX 4080 16GB -Pny13.469.1263.4RTX 2080 Ti 22GB -Dell23.6127.2297.8OpenBenchmarking.orgWatts, Fewer Is BetterNAMD CUDA 2.14GPU Power Consumption Monitor80160240320400

OctaneBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.7220.4259.6RTX 2080 Ti 22GB -Dell22.6242.0259.3RTX 4090 24GB -Nvidia6.8279.5322.5RTX 3090 24GB -Zotac28.6337.3362.8OpenBenchmarking.orgWatts, Fewer Is BetterOctaneBench 2020.1GPU Power Consumption Monitor100200300400500

Rodinia

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.563.1123.6RTX 4090 24GB -Nvidia6.867.5167.3RTX 2080 Ti 22GB -Dell23.3133.0219.9RTX 3090 24GB -Zotac19.9148.1254.9OpenBenchmarking.orgWatts, Fewer Is BetterRodinia 3.1GPU Power Consumption Monitor70140210280350

ArrayFire

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia6.747.6113.6RTX 4080 16GB -Pny14.758.6132.1RTX 2080 Ti 22GB -Dell22.4103.9212.7RTX 3090 24GB -Zotac24.2149.2279.0OpenBenchmarking.orgWatts, Fewer Is BetterArrayFire 3.7GPU Power Consumption Monitor70140210280350

clpeak

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.668.3177.2RTX 4090 24GB -Nvidia6.879.8231.0RTX 2080 Ti 22GB -Dell22.0120.7246.3RTX 3090 24GB -Zotac20.3147.9355.6OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor100200300400500

clpeak

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.356.1255.4RTX 4090 24GB -Nvidia6.970.4356.0RTX 2080 Ti 22GB -Dell21.8109.9254.5RTX 3090 24GB -Zotac22.4114.3268.7OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor100200300400500

clpeak

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.870.593.0RTX 4090 24GB -Nvidia7.090.3122.6RTX 2080 Ti 22GB -Dell22.4145.6183.3RTX 3090 24GB -Zotac20.3170.7209.4OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor60120180240300

clpeak

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.858.3297.3RTX 4090 24GB -Nvidia7.183.9421.2RTX 2080 Ti 22GB -Dell22.390.1254.1RTX 3090 24GB -Zotac23.7127.5323.8OpenBenchmarking.orgWatts, Fewer Is Betterclpeak 1.1.2GPU Power Consumption Monitor110220330440550

NeatBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny4.627.989.5RTX 4090 24GB -Nvidia6.936.373.8RTX 2080 Ti 22GB -Dell22.168.0119.6RTX 3090 24GB -Zotac23.685.4168.2OpenBenchmarking.orgWatts, Fewer Is BetterNeatBench 5GPU Power Consumption Monitor50100150200250

FinanceBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.026.639.8RTX 4090 24GB -Nvidia6.429.351.3RTX 2080 Ti 22GB -Dell21.345.473.1RTX 3090 24GB -Zotac19.566.6119.2OpenBenchmarking.orgWatts, Fewer Is BetterFinanceBench 2016-07-25GPU Power Consumption Monitor4080120160200

LeelaChessZero

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.0117.3160.6RTX 4090 24GB -Nvidia6.5122.6165.3RTX 2080 Ti 22GB -Dell21.9229.3264.6RTX 3090 24GB -Zotac28.5263.6349.6OpenBenchmarking.orgWatts, Fewer Is BetterLeelaChessZero 0.28GPU Power Consumption Monitor100200300400500

cl-mem

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.978.5155.7RTX 4090 24GB -Nvidia6.892.1205.2RTX 2080 Ti 22GB -Dell22.9130.3212.4RTX 3090 24GB -Zotac19.8166.2329.4OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13GPU Power Consumption Monitor80160240320400

cl-mem

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.877.9155.5RTX 4090 24GB -Nvidia6.887.4204.4RTX 2080 Ti 22GB -Dell23.2126.9212.8RTX 3090 24GB -Zotac19.7171.2327.8OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13GPU Power Consumption Monitor80160240320400

cl-mem

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny10.780.4155.4RTX 4090 24GB -Nvidia6.890.7205.9RTX 2080 Ti 22GB -Dell23.0127.1210.3RTX 3090 24GB -Zotac19.4168.8328.7OpenBenchmarking.orgWatts, Fewer Is Bettercl-mem 2017-01-13GPU Power Consumption Monitor80160240320400

MandelGPU

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.265.5184.4RTX 4090 24GB -Nvidia6.968.6215.0RTX 2080 Ti 22GB -Dell22.6126.4254.8RTX 3090 24GB -Zotac19.7149.1321.4OpenBenchmarking.orgWatts, Fewer Is BetterMandelGPU 1.3pts1GPU Power Consumption Monitor80160240320400

ViennaCL

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.4109.2223.4RTX 4090 24GB -Nvidia6.9128.0256.4RTX 2080 Ti 22GB -Dell22.4173.6255.0RTX 3090 24GB -Zotac28.9232.9357.9OpenBenchmarking.orgWatts, Fewer Is BetterViennaCL 1.7.1GPU Power Consumption Monitor100200300400500

ViennaCL

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia6.08.015.8RTX 4080 16GB -Pny4.012.118.8RTX 2080 Ti 22GB -Dell18.421.643.4RTX 3090 24GB -Zotac19.026.237.2OpenBenchmarking.orgWatts, Fewer Is BetterViennaCL 1.7.1GPU Power Consumption Monitor1224364860

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.737.060.9RTX 4090 24GB -Nvidia6.443.472.8RTX 2080 Ti 22GB -Dell20.163.6102.9RTX 3090 24GB -Zotac20.393.5158.5OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor50100150200250

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.938.262.5RTX 4090 24GB -Nvidia6.643.872.7RTX 2080 Ti 22GB -Dell20.666.6107.8RTX 3090 24GB -Zotac19.786.6154.5OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor4080120160200

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption MonitorRTX 4080 16GB -PnyRTX 4090 24GB -NvidiaRTX 2080 Ti 22GB -DellRTX 3090 24GB -Zotac80160240320400Min: 13.84 / Avg: 90.17 / Max: 331.21Min: 6.77 / Avg: 116.7 / Max: 447.83Min: 21.69 / Avg: 175.84 / Max: 257.38Min: 20.35 / Avg: 201.61 / Max: 329.19

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.598.3224.9RTX 4090 24GB -Nvidia7.0125.0250.8RTX 2080 Ti 22GB -Dell22.2184.6246.0RTX 3090 24GB -Zotac22.8253.9357.5OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor100200300400500

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.238.772.2RTX 4090 24GB -Nvidia7.045.5106.3RTX 2080 Ti 22GB -Dell23.273.4184.7RTX 3090 24GB -Zotac24.998.4223.7OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor60120180240300

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny11.951.2195.6RTX 4090 24GB -Nvidia7.059.0207.2RTX 2080 Ti 22GB -Dell22.3107.8225.4RTX 3090 24GB -Zotac25.8138.9317.8OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor80160240320400

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia7.041.7150.7RTX 4080 16GB -Pny12.844.7278.8RTX 2080 Ti 22GB -Dell22.184.4261.2RTX 3090 24GB -Zotac19.5105.9323.3OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor80160240320400

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.339.398.6RTX 4090 24GB -Nvidia6.441.6108.0RTX 2080 Ti 22GB -Dell21.793.4198.2RTX 3090 24GB -Zotac20.0125.2273.7OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor70140210280350

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.128.540.3RTX 4090 24GB -Nvidia6.533.855.0RTX 2080 Ti 22GB -Dell22.053.476.5RTX 3090 24GB -Zotac19.872.1117.2OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor4080120160200

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.230.647.0RTX 4090 24GB -Nvidia6.639.858.8RTX 2080 Ti 22GB -Dell21.356.580.7RTX 3090 24GB -Zotac19.885.0125.6OpenBenchmarking.orgWatts, Fewer Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17GPU Power Consumption Monitor4080120160200

IndigoBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.0159.4207.9RTX 4090 24GB -Nvidia6.7172.9252.7RTX 2080 Ti 22GB -Dell21.3211.7259.5RTX 3090 24GB -Zotac20.9282.9356.8OpenBenchmarking.orgWatts, Fewer Is BetterIndigoBench 4.4GPU Power Consumption Monitor100200300400500

IndigoBench

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.0180.5230.3RTX 4090 24GB -Nvidia6.7211.2296.7RTX 2080 Ti 22GB -Dell23.1213.7254.5RTX 3090 24GB -Zotac20.2287.2353.5OpenBenchmarking.orgWatts, Fewer Is BetterIndigoBench 4.4GPU Power Consumption Monitor100200300400500

Chaos Group V-RAY

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.4159.5227.6RTX 2080 Ti 22GB -Dell22.7181.0256.4RTX 4090 24GB -Nvidia7.2199.1296.3RTX 3090 24GB -Zotac19.2250.0356.2OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 5.02GPU Power Consumption Monitor100200300400500

Chaos Group V-RAY

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny4.5130.0211.7RTX 2080 Ti 22GB -Dell21.6167.9253.1RTX 4090 24GB -Nvidia6.8169.0272.7RTX 3090 24GB -Zotac18.9235.8358.0OpenBenchmarking.orgWatts, Fewer Is BetterChaos Group V-RAY 5.02GPU Power Consumption Monitor100200300400500

Blender

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.395.0198.6RTX 4090 24GB -Nvidia7.0107.7242.2RTX 2080 Ti 22GB -Dell23.3173.8254.3RTX 3090 24GB -Zotac19.2215.9349.9OpenBenchmarking.orgWatts, Fewer Is BetterBlender 3.6GPU Power Consumption Monitor100200300400500

Blender

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny5.3142.2230.1RTX 4090 24GB -Nvidia7.1162.1291.8RTX 2080 Ti 22GB -Dell22.5212.6255.0RTX 3090 24GB -Zotac19.2267.4353.6OpenBenchmarking.orgWatts, Fewer Is BetterBlender 3.6GPU Power Consumption Monitor100200300400500

Blender

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.4116.8223.6RTX 4090 24GB -Nvidia6.9136.7303.2RTX 2080 Ti 22GB -Dell22.9195.0256.9RTX 3090 24GB -Zotac19.4234.9344.1OpenBenchmarking.orgWatts, Fewer Is BetterBlender 3.6GPU Power Consumption Monitor80160240320400

Blender

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.8140.8239.5RTX 4090 24GB -Nvidia6.9172.2302.0RTX 2080 Ti 22GB -Dell22.7212.9257.2RTX 3090 24GB -Zotac20.5264.0361.3OpenBenchmarking.orgWatts, Fewer Is BetterBlender 3.6GPU Power Consumption Monitor100200300400500

Blender

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.7161.8215.2RTX 4090 24GB -Nvidia6.8189.4276.9RTX 2080 Ti 22GB -Dell22.8226.2257.6RTX 3090 24GB -Zotac18.6292.2358.2OpenBenchmarking.orgWatts, Fewer Is BetterBlender 3.6GPU Power Consumption Monitor100200300400500

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.048.0178.0RTX 4090 24GB -Nvidia7.056.9202.9RTX 3090 24GB -Zotac18.7125.3335.1OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor80160240320400

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.460.0179.5RTX 4090 24GB -Nvidia6.869.2202.4RTX 3090 24GB -Zotac19.3144.4336.9OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor80160240320400

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny10.7104.4178.5RTX 4090 24GB -Nvidia7.0114.8203.5RTX 3090 24GB -Zotac19.1224.3341.6OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor80160240320400

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.462.2154.8RTX 4090 24GB -Nvidia7.071.0160.9RTX 3090 24GB -Zotac19.3154.8289.3OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor70140210280350

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny12.678.5154.8RTX 4090 24GB -Nvidia7.087.3162.9RTX 3090 24GB -Zotac25.4184.4290.0OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor70140210280350

Caffe

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.4121.4156.0RTX 4090 24GB -Nvidia6.9129.1162.7RTX 3090 24GB -Zotac18.5252.8299.7OpenBenchmarking.orgWatts, Fewer Is BetterCaffe 2020-02-13GPU Power Consumption Monitor80160240320400

VkFFT

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny11.253.2215.9RTX 4090 24GB -Nvidia6.961.1277.8RTX 2080 Ti 22GB -Dell21.5119.5264.1RTX 3090 24GB -Zotac19.1140.5367.2OpenBenchmarking.orgWatts, Fewer Is BetterVkFFT 1.1.1GPU Power Consumption Monitor100200300400500

VkResample

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny4.335.1203.1RTX 4090 24GB -Nvidia6.537.5270.4RTX 2080 Ti 22GB -Dell21.460.8255.9RTX 3090 24GB -Zotac19.278.0357.0OpenBenchmarking.orgWatts, Fewer Is BetterVkResample 1.0GPU Power Consumption Monitor100200300400500

VkResample

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia6.758.1182.3RTX 4080 16GB -Pny4.259.1135.5RTX 2080 Ti 22GB -Dell21.1114.5208.0RTX 3090 24GB -Zotac19.1124.7254.8OpenBenchmarking.orgWatts, Fewer Is BetterVkResample 1.0GPU Power Consumption Monitor70140210280350

vkpeak

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.6171.1245.8RTX 4090 24GB -Nvidia6.6203.4347.2RTX 2080 Ti 22GB -Dell21.4217.8258.8RTX 3090 24GB -Zotac24.9265.1330.5OpenBenchmarking.orgWatts, Fewer Is Bettervkpeak 20210424GPU Power Consumption Monitor100200300400500

NCNN

GPU Power Consumption Monitor

MinAvgMaxRTX 4090 24GB -Nvidia6.413.659.6RTX 4080 16GB -Pny4.218.566.4RTX 2080 Ti 22GB -Dell19.231.7108.8RTX 3090 24GB -Zotac18.836.1148.3OpenBenchmarking.orgWatts, Fewer Is BetterNCNN 20220729GPU Power Consumption Monitor4080120160200

RealSR-NCNN

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny13.9221.3300.8RTX 2080 Ti 22GB -Dell20.4225.5257.1RTX 4090 24GB -Nvidia6.6234.7332.3RTX 3090 24GB -Zotac33.0292.5356.8OpenBenchmarking.orgWatts, Fewer Is BetterRealSR-NCNN 20200818GPU Power Consumption Monitor100200300400500

RealSR-NCNN

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.1102.2294.6RTX 4090 24GB -Nvidia7.1105.0327.6RTX 2080 Ti 22GB -Dell23.3150.8256.7RTX 3090 24GB -Zotac28.7166.5354.3OpenBenchmarking.orgWatts, Fewer Is BetterRealSR-NCNN 20200818GPU Power Consumption Monitor100200300400500

Waifu2x-NCNN Vulkan

GPU Power Consumption Monitor

MinAvgMaxRTX 4080 16GB -Pny14.072.7204.6RTX 4090 24GB -Nvidia7.193.5229.6RTX 2080 Ti 22GB -Dell22.8138.6261.5RTX 3090 24GB -Zotac27.5154.7304.3OpenBenchmarking.orgWatts, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818GPU Power Consumption Monitor80160240320400


Phoronix Test Suite v10.8.4