2307020-PTS-GPUREVIEW1

RTX 4080 16GB PNY REVIEW

HTML result view exported from: https://openbenchmarking.org/result/2307077-NE-2307020PT71&rdt&grs.

2307020-PTS-GPUREVIEW1 system details

Configurations: RTX 3090 24GB -Zotac, RTX 2080 Ti 22GB -Dell, RTX 4090 24GB -Nvidia, RTX 4080 16GB -Pny
The fields below are listed once for all four configurations; only Graphics is listed per configuration.

Processor: Intel Xeon w9-3495X @ 4.80GHz (56 Cores / 112 Threads)
Motherboard: ASUS Pro WS W790E-SAGE SE (0506 BIOS)
Chipset: Intel Device 7aa7
Memory: 8 x 32 GB DDR5-4812MT/s Hynix HMCG88AEBRA115N
Disk: 6401GB Micron_9300_MTFDHAL6T4TDR + 0GB Virtual HDisk0
Graphics:
- RTX 3090 24GB -Zotac: NVIDIA GeForce RTX 3090 24GB
- RTX 2080 Ti 22GB -Dell: NVIDIA GeForce RTX 2080 Ti 22GB
- RTX 4090 24GB -Nvidia: NVIDIA GeForce RTX 4090 24GB
- RTX 4080 16GB -Pny: NVIDIA GeForce RTX 4080 16GB
Audio: Realtek ALC1220
Monitor: BenQ PD2720U
Network: 2 x Intel X710 for 10GBASE-T
OS: Ubuntu 22.04
Kernel: 6.3.0-060300-generic (x86_64)
Desktop: GNOME Shell 42.5
Display Server: X Server 1.21.1.4
Display Driver: NVIDIA 530.41.03
OpenGL: 4.6.0
OpenCL: OpenCL 3.0 CUDA 12.1.98
Vulkan: 1.3.236
Compiler: GCC 11.3.0 + CUDA 12.1
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-aYxV0E/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate powersave (EPP: performance)
- CPU Microcode: 0x2b000390

Graphics Details
- RTX 3090 24GB -Zotac: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 94.02.26.48.65
- RTX 2080 Ti 22GB -Dell: BAR1 / Visible vRAM Size: 256 MiB - vBIOS Version: 90.02.30.40.4d
- RTX 4090 24GB -Nvidia: BAR1 / Visible vRAM Size: 32768 MiB - vBIOS Version: 95.02.20.00.03
- RTX 4080 16GB -Pny: BAR1 / Visible vRAM Size: 16384 MiB - vBIOS Version: 95.03.0e.00.67

OpenCL Details
- RTX 3090 24GB -Zotac: GPU Compute Cores: 10496
- RTX 2080 Ti 22GB -Dell: GPU Compute Cores: 4352
- RTX 4090 24GB -Nvidia: GPU Compute Cores: 16384
- RTX 4080 16GB -Pny: GPU Compute Cores: 9728

Python Details
- Python 3.10.6

Security Details
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

2307020-PTS-GPUREVIEW1ncnn: Vulkan GPU - squeezenet_ssdclpeak: Single-Precision Floatshoc: OpenCL - GEMM SGEMM_Nshoc: OpenCL - Max SP Flopsncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - resnet18v-ray: NVIDIA CUDA GPUncnn: Vulkan GPU - alexnetv-ray: NVIDIA RTX GPUncnn: Vulkan GPU - mobilenetoctanebench: Total Scorevkpeak: fp32-vec4clpeak: Integer Compute INThashcat: 7-Zipblender: Pabellon Barcelona - NVIDIA OptiXhashcat: TrueCrypt RIPEMD160 + XTSindigobench: OpenCL GPU - Bedroomhashcat: SHA-512financebench: Black-Scholes OpenCLblender: Barbershop - NVIDIA OptiXblender: Classroom - NVIDIA OptiXhashcat: SHA1vkpeak: int16-vec4hashcat: MD5blender: Fishy Cat - NVIDIA OptiXviennacl: OpenCL BLAS - dGEMM-TTshoc: OpenCL - MD5 Hashvkpeak: int16-scalarvkpeak: fp16-vec4vkpeak: fp16-scalargromacs: NVIDIA CUDA GPU - water_GMX50_bareviennacl: OpenCL BLAS - dGEMM-TNvkpeak: int32-vec4shoc: OpenCL - Reductionvkpeak: fp64-scalarviennacl: OpenCL BLAS - dGEMM-NTvkpeak: fp32-scalarvkpeak: int32-scalarvkpeak: fp64-vec4clpeak: Double-Precision Doublevkresample: 2x - Doubleshoc: OpenCL - Texture Read Bandwidthrealsr-ncnn: 4x - Yesviennacl: OpenCL BLAS - dGEMM-NNindigobench: OpenCL GPU - Supercarshoc: OpenCL - S3Dmandelgpu: GPUshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Triadneatbench: GPUshoc: OpenCL - Bus Speed Downloadarrayfire: Conjugate Gradient OpenCLvkresample: 2x - Singleshoc: OpenCL - FFT SPrealsr-ncnn: 4x - Novkfft: cl-mem: Writewaifu2x-ncnn: 2x - 3 - Yesclpeak: Global Memory Bandwidthcl-mem: Readviennacl: OpenCL BLAS - dGEMV-Ncaffe: AlexNet - NVIDIA CUDA - 1000caffe: AlexNet - NVIDIA CUDA - 200viennacl: OpenCL BLAS - dAXPYncnn: Vulkan GPU - yolov4-tinycaffe: AlexNet - NVIDIA CUDA - 100caffe: GoogleNet - NVIDIA CUDA - 1000caffe: GoogleNet - NVIDIA CUDA - 200caffe: GoogleNet - NVIDIA CUDA - 100viennacl: OpenCL BLAS - sDOTviennacl: OpenCL BLAS - sCOPYfahbench: viennacl: OpenCL BLAS - sAXPYviennacl: OpenCL BLAS - dCOPYviennacl: OpenCL BLAS - dDOTcl-mem: Copyncnn: Vulkan GPU - 
vision_transformerviennacl: OpenCL BLAS - dGEMV-Tlczero: OpenCLviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - sDOTncnn: Vulkan GPU - FastestDetncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2blender: BMW27 - NVIDIA OptiXviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Nrodinia: OpenCL Particle Filternamd-cuda: ATPase Simulation - 327,506 AtomsRTX 3090 24GB -ZotacRTX 2080 Ti 22GB -DellRTX 4090 24GB -NvidiaRTX 4080 16GB -Pny57.2735000.447912.5937738.428.4421.45205221.31287837.93669.37102626392.0917864.54109595016.0773132720.77225952500005.79652.2814.252063405000016205.986370092666710.8458441.332413295.7240002.9320161.5923.74358720074.10392.804645.2458820403.3520305.75645.31640.15121.3532148.8631.17058551.701427.491568293355.526.397524.5486309025.19301.5839.3592349.396.50941547734.33.536814.92823.31856584.811321.3671649.10664.59724142.84830.592418.86372363316.5721495600658359.2324.88371143168346069841842227274976413.5225.046.354.405.3823.9719.1617.5718.7818.256.131431411201071723.7680.0674261.7414076.714858.1316106.428.2222.1395021.50130638.20349.49018515859.1211826.7684027527.4258026011.15220471833338.84092.8722.691630090000012975.655150325000016.8945832.518110270.2630649.1315521.1215.37746015752.98366.058500.9146115974.2215963.59503.53506.01153.1461147.2652.75545932.598270.801442866316.913.204612.6064208012.85301.67914.7901504.879.23933404452.24.307504.66543.829751852.48305306292.6940392475524318.1363.75370132289236581058175522617427476.8023.135.857.075.4926.097.036.1118.776.358.93145142119106.81794.5270.095158.9979463.362
7175.988606.05.774.5042655.0854789.211326.23767458607.4640697.7327469388.42185780035.29662977666672.89030.557.474939336666739280.821543333333335.73134093.901929507.1787698.3944269.7943.470129344062.52967.1421394.44128044315.3544276.721396.261391.5155.9252980.1520.187115078.445644.603951815274.226.372425.0717409025.13110.88037.7992779.485.10959427785.82.490871.56886.02204379.70883.95377235.23447.34116499.33304.151660.01446444423.2069568661719410.3289.04443150979466711098187423607717845.085.467.2825.143.956.424.394.854.974.423.701441371151201882.0850.048509.1347802.8716952.655455.85.714.78312617.5740809.36977.5281436655.6924524.34174735010.41114656026.08039363500004.40437.979.323088153333324616.84968919666677.5984960.558118441.4654827.6627669.3732.04483127595.511020.235873.5279527771.8827758.93873.67869.2589.2373065.7624.37477466.030423.578772392077.326.393524.7590408025.10071.12211.5431831.865.63048201567.72.761611.67623.22245228.001053.8860835.87537.49116310.53279.171649.29417385424.3492487540597382.0290.15435153169566781085190124227877835.185.325.167.903.896.304.374.785.024.324.43137135122103.51872.6850.05427OpenBenchmarking.org

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20220729 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 57.27 (SE +/- 0.43, N = 3; MIN: 16.86 / MAX: 94.95)
RTX 2080 Ti 22GB -Dell: 61.74 (SE +/- 0.50, N = 3; MIN: 23.2 / MAX: 85.24)
RTX 4090 24GB -Nvidia: 8.99 (SE +/- 0.05, N = 3; MIN: 8.18 / MAX: 62.47)
RTX 4080 16GB -Pny: 9.13 (SE +/- 0.11, N = 3; MIN: 7.71 / MAX: 66.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

clpeak

OpenCL Test: Single-Precision Float

clpeak 1.1.2 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 35000.44 (SE +/- 10.34, N = 15)
RTX 2080 Ti 22GB -Dell: 14076.71 (SE +/- 80.60, N = 11)
RTX 4090 24GB -Nvidia: 79463.36 (SE +/- 103.68, N = 14)
RTX 4080 16GB -Pny: 47802.87 (SE +/- 50.90, N = 14)
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: GEMM SGEMM_N

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 7912.59 (SE +/- 16.79, N = 11)
RTX 2080 Ti 22GB -Dell: 4858.13 (SE +/- 33.05, N = 10)
RTX 4090 24GB -Nvidia: 27175.90 (SE +/- 164.66, N = 13)
RTX 4080 16GB -Pny: 16952.60 (SE +/- 224.54, N = 15)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Max SP Flops

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 37738.4 (SE +/- 292.47, N = 3)
RTX 2080 Ti 22GB -Dell: 16106.4 (SE +/- 102.77, N = 3)
RTX 4090 24GB -Nvidia: 88606.0 (SE +/- 19.25, N = 3)
RTX 4080 16GB -Pny: 55455.8 (SE +/- 73.47, N = 3)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20220729 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 28.44 (SE +/- 0.13, N = 3; MIN: 14.74 / MAX: 44.82)
RTX 2080 Ti 22GB -Dell: 28.22 (SE +/- 0.49, N = 3; MIN: 15.18 / MAX: 38.54)
RTX 4090 24GB -Nvidia: 5.77 (SE +/- 0.10, N = 3; MIN: 4.99 / MAX: 22.76)
RTX 4080 16GB -Pny: 5.71 (SE +/- 0.11, N = 3; MIN: 5.03 / MAX: 22.85)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20220729 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 21.45 (SE +/- 0.21, N = 3; MIN: 8.94 / MAX: 40.36)
RTX 2080 Ti 22GB -Dell: 22.13 (SE +/- 0.55, N = 3; MIN: 9.01 / MAX: 43.23)
RTX 4090 24GB -Nvidia: 4.50 (SE +/- 0.10, N = 3; MIN: 3.89 / MAX: 21.9)
RTX 4080 16GB -Pny: 4.78 (SE +/- 0.06, N = 3; MIN: 4.18 / MAX: 28.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Chaos Group V-RAY

Mode: NVIDIA CUDA GPU

Chaos Group V-RAY 5.02 - vpaths, More Is Better
RTX 3090 24GB -Zotac: 2052 (SE +/- 2.85, N = 3)
RTX 2080 Ti 22GB -Dell: 950 (SE +/- 0.67, N = 3)
RTX 4090 24GB -Nvidia: 4265 (SE +/- 2.33, N = 3)
RTX 4080 16GB -Pny: 3126 (SE +/- 2.73, N = 3)

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20220729 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 21.31 (SE +/- 0.22, N = 3; MIN: 7.31 / MAX: 35.87)
RTX 2080 Ti 22GB -Dell: 21.50 (SE +/- 0.30, N = 3; MIN: 3.33 / MAX: 40.17)
RTX 4090 24GB -Nvidia: 5.08 (SE +/- 0.08, N = 3; MIN: 3.13 / MAX: 27.91)
RTX 4080 16GB -Pny: 17.57 (SE +/- 0.19, N = 3; MIN: 7.8 / MAX: 33.45)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Chaos Group V-RAY

Mode: NVIDIA RTX GPU

Chaos Group V-RAY 5.02 - vrays, More Is Better
RTX 3090 24GB -Zotac: 2878 (SE +/- 18.12, N = 3)
RTX 2080 Ti 22GB -Dell: 1306 (SE +/- 10.17, N = 3)
RTX 4090 24GB -Nvidia: 5478 (SE +/- 63.00, N = 3)
RTX 4080 16GB -Pny: 4080 (SE +/- 30.20, N = 3)

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20220729 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 37.93 (SE +/- 0.21, N = 3; MIN: 14.41 / MAX: 61.76)
RTX 2080 Ti 22GB -Dell: 38.20 (SE +/- 0.18, N = 3; MIN: 10.75 / MAX: 66.72)
RTX 4090 24GB -Nvidia: 9.21 (SE +/- 0.03, N = 3; MIN: 8.39 / MAX: 47.45)
RTX 4080 16GB -Pny: 9.36 (SE +/- 0.06, N = 3; MIN: 8.28 / MAX: 49.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OctaneBench

Total Score

OctaneBench 2020.1 - Score, More Is Better
RTX 3090 24GB -Zotac: 669.37
RTX 2080 Ti 22GB -Dell: 349.49
RTX 4090 24GB -Nvidia: 1326.24
RTX 4080 16GB -Pny: 977.53

vkpeak

fp32-vec4

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 26392.09 (SE +/- 43.15, N = 3)
RTX 2080 Ti 22GB -Dell: 15859.12 (SE +/- 77.45, N = 3)
RTX 4090 24GB -Nvidia: 58607.46 (SE +/- 13.83, N = 3)
RTX 4080 16GB -Pny: 36655.69 (SE +/- 13.71, N = 3)

clpeak

OpenCL Test: Integer Compute INT

clpeak 1.1.2 - GIOPS, More Is Better
RTX 3090 24GB -Zotac: 17864.54 (SE +/- 36.29, N = 15)
RTX 2080 Ti 22GB -Dell: 11826.76 (SE +/- 86.60, N = 15)
RTX 4090 24GB -Nvidia: 40697.73 (SE +/- 49.03, N = 14)
RTX 4080 16GB -Pny: 24524.34 (SE +/- 20.64, N = 14)
1. (CXX) g++ options: -O3

Hashcat

Benchmark: 7-Zip

Hashcat 6.2.4 - H/s, More Is Better
RTX 3090 24GB -Zotac: 1095950 (SE +/- 1997.77, N = 8)
RTX 2080 Ti 22GB -Dell: 840275 (SE +/- 1608.10, N = 8)
RTX 4090 24GB -Nvidia: 2746938 (SE +/- 1870.54, N = 8)
RTX 4080 16GB -Pny: 1747350 (SE +/- 1869.68, N = 8)

Blender

Blend File: Pabellon Barcelona - Compute: NVIDIA OptiX

Blender 3.6 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 16.07 (SE +/- 0.01, N = 3)
RTX 2080 Ti 22GB -Dell: 27.42 (SE +/- 0.04, N = 3)
RTX 4090 24GB -Nvidia: 8.42 (SE +/- 0.02, N = 5)
RTX 4080 16GB -Pny: 10.41 (SE +/- 0.01, N = 5)

Hashcat

Benchmark: TrueCrypt RIPEMD160 + XTS

Hashcat 6.2.4 - H/s, More Is Better
RTX 3090 24GB -Zotac: 731327 (SE +/- 5945.49, N = 15)
RTX 2080 Ti 22GB -Dell: 580260 (SE +/- 4392.30, N = 10)
RTX 4090 24GB -Nvidia: 1857800 (SE +/- 13557.46, N = 7)
RTX 4080 16GB -Pny: 1146560 (SE +/- 8051.58, N = 15)

IndigoBench

Acceleration: OpenCL GPU - Scene: Bedroom

IndigoBench 4.4 - M samples/s, More Is Better
RTX 3090 24GB -Zotac: 20.77 (SE +/- 0.01, N = 3)
RTX 2080 Ti 22GB -Dell: 11.15 (SE +/- 0.00, N = 3)
RTX 4090 24GB -Nvidia: 35.30 (SE +/- 0.03, N = 3)
RTX 4080 16GB -Pny: 26.08 (SE +/- 0.02, N = 3)

Hashcat

Benchmark: SHA-512

Hashcat 6.2.4 - H/s, More Is Better
RTX 3090 24GB -Zotac: 2595250000 (SE +/- 2013247.79, N = 6)
RTX 2080 Ti 22GB -Dell: 2047183333 (SE +/- 6223365.47, N = 6)
RTX 4090 24GB -Nvidia: 6297766667 (SE +/- 2101057.93, N = 6)
RTX 4080 16GB -Pny: 3936350000 (SE +/- 1634982.16, N = 6)

FinanceBench

Benchmark: Black-Scholes OpenCL

FinanceBench 2016-07-25 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 5.796 (SE +/- 0.002, N = 14)
RTX 2080 Ti 22GB -Dell: 8.840 (SE +/- 0.110, N = 15)
RTX 4090 24GB -Nvidia: 2.890 (SE +/- 0.005, N = 15)
RTX 4080 16GB -Pny: 4.404 (SE +/- 0.013, N = 15)
1. (CXX) g++ options: -O3 -march=native -fopenmp

Blender

Blend File: Barbershop - Compute: NVIDIA OptiX

Blender 3.6 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 52.28 (SE +/- 0.01, N = 3)
RTX 2080 Ti 22GB -Dell: 92.87 (SE +/- 0.09, N = 3)
RTX 4090 24GB -Nvidia: 30.55 (SE +/- 0.13, N = 3)
RTX 4080 16GB -Pny: 37.97 (SE +/- 0.07, N = 3)

Blender

Blend File: Classroom - Compute: NVIDIA OptiX

Blender 3.6 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 14.25 (SE +/- 0.01, N = 4)
RTX 2080 Ti 22GB -Dell: 22.69 (SE +/- 0.03, N = 3)
RTX 4090 24GB -Nvidia: 7.47 (SE +/- 0.01, N = 6)
RTX 4080 16GB -Pny: 9.32 (SE +/- 0.02, N = 5)

Hashcat

Benchmark: SHA1

Hashcat 6.2.4 - H/s, More Is Better
RTX 3090 24GB -Zotac: 20634050000 (SE +/- 15661518.66, N = 6)
RTX 2080 Ti 22GB -Dell: 16300900000 (SE +/- 39682498.24, N = 6)
RTX 4090 24GB -Nvidia: 49393366667 (SE +/- 18148896.51, N = 6)
RTX 4080 16GB -Pny: 30881533333 (SE +/- 19028429.02, N = 6)

vkpeak

int16-vec4

vkpeak 20210424 - GIOPS, More Is Better
RTX 3090 24GB -Zotac: 16205.98 (SE +/- 4.87, N = 3)
RTX 2080 Ti 22GB -Dell: 12975.65 (SE +/- 4.25, N = 3)
RTX 4090 24GB -Nvidia: 39280.82 (SE +/- 1.77, N = 3)
RTX 4080 16GB -Pny: 24616.84 (SE +/- 0.85, N = 3)

Hashcat

Benchmark: MD5

Hashcat 6.2.4 - H/s, More Is Better
RTX 3090 24GB -Zotac: 63700926667 (SE +/- 474267340.07, N = 15)
RTX 2080 Ti 22GB -Dell: 51503250000 (SE +/- 232076903.56, N = 6)
RTX 4090 24GB -Nvidia: 154333333333 (SE +/- 243127767.05, N = 6)
RTX 4080 16GB -Pny: 96891966667 (SE +/- 34207656.71, N = 6)

Blender

Blend File: Fishy Cat - Compute: NVIDIA OptiX

Blender 3.6 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 10.84 (SE +/- 0.07, N = 15)
RTX 2080 Ti 22GB -Dell: 16.89 (SE +/- 0.21, N = 4)
RTX 4090 24GB -Nvidia: 5.73 (SE +/- 0.07, N = 15)
RTX 4080 16GB -Pny: 7.59 (SE +/- 0.07, N = 15)

ViennaCL

Test: OpenCL BLAS - dGEMM-TT

ViennaCL 1.7.1 - GFLOPs/s, More Is Better
RTX 3090 24GB -Zotac: 584
RTX 2080 Ti 22GB -Dell: 458
RTX 4090 24GB -Nvidia: 1340
RTX 4080 16GB -Pny: 849
Standard errors as exported (three entries for the four results, in source order): SE +/- 1.45, N = 3; SE +/- 0.00, N = 3; SE +/- 1.20, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: MD5 Hash

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GHash/s, More Is Better
RTX 3090 24GB -Zotac: 41.33 (SE +/- 0.05, N = 14)
RTX 2080 Ti 22GB -Dell: 32.52 (SE +/- 0.06, N = 14)
RTX 4090 24GB -Nvidia: 93.90 (SE +/- 0.95, N = 15)
RTX 4080 16GB -Pny: 60.56 (SE +/- 0.65, N = 15)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

vkpeak

int16-scalar

vkpeak 20210424 - GIOPS, More Is Better
RTX 3090 24GB -Zotac: 13295.72 (SE +/- 0.25, N = 3)
RTX 2080 Ti 22GB -Dell: 10270.26 (SE +/- 0.89, N = 3)
RTX 4090 24GB -Nvidia: 29507.17 (SE +/- 4.42, N = 3)
RTX 4080 16GB -Pny: 18441.46 (SE +/- 0.34, N = 3)

vkpeak

fp16-vec4

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 40002.93 (SE +/- 64.52, N = 3)
RTX 2080 Ti 22GB -Dell: 30649.13 (SE +/- 26.46, N = 3)
RTX 4090 24GB -Nvidia: 87698.39 (SE +/- 98.17, N = 3)
RTX 4080 16GB -Pny: 54827.66 (SE +/- 0.37, N = 3)

vkpeak

fp16-scalar

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 20161.59 (SE +/- 30.30, N = 3)
RTX 2080 Ti 22GB -Dell: 15521.12 (SE +/- 61.99, N = 3)
RTX 4090 24GB -Nvidia: 44269.79 (SE +/- 7.21, N = 3)
RTX 4080 16GB -Pny: 27669.37 (SE +/- 0.86, N = 3)

GROMACS

Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bare

GROMACS 2023 - Ns Per Day, More Is Better
RTX 3090 24GB -Zotac: 23.74 (SE +/- 0.05, N = 3)
RTX 2080 Ti 22GB -Dell: 15.38 (SE +/- 0.00, N = 3)
RTX 4090 24GB -Nvidia: 43.47 (SE +/- 0.02, N = 3)
RTX 4080 16GB -Pny: 32.04 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

ViennaCL

Test: OpenCL BLAS - dGEMM-TN

ViennaCL 1.7.1 - GFLOPs/s, More Is Better
RTX 3090 24GB -Zotac: 587 (SE +/- 1.67, N = 3)
RTX 2080 Ti 22GB -Dell: 460 (SE +/- 0.50, N = 2)
RTX 4090 24GB -Nvidia: 1293 (SE +/- 3.33, N = 3)
RTX 4080 16GB -Pny: 831 (SE +/- 0.67, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

vkpeak

int32-vec4

vkpeak 20210424 - GIOPS, More Is Better
RTX 3090 24GB -Zotac: 20074.10 (SE +/- 27.49, N = 3)
RTX 2080 Ti 22GB -Dell: 15752.98 (SE +/- 15.45, N = 3)
RTX 4090 24GB -Nvidia: 44062.52 (SE +/- 3.11, N = 3)
RTX 4080 16GB -Pny: 27595.51 (SE +/- 37.47, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Reduction

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 392.80 (SE +/- 0.11, N = 13)
RTX 2080 Ti 22GB -Dell: 366.06 (SE +/- 0.06, N = 12)
RTX 4090 24GB -Nvidia: 967.14 (SE +/- 9.75, N = 15)
RTX 4080 16GB -Pny: 1020.24 (SE +/- 13.90, N = 15)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

vkpeak

fp64-scalar

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 645.24 (SE +/- 1.62, N = 3)
RTX 2080 Ti 22GB -Dell: 500.91 (SE +/- 1.28, N = 3)
RTX 4090 24GB -Nvidia: 1394.44 (SE +/- 0.73, N = 3)
RTX 4080 16GB -Pny: 873.52 (SE +/- 0.08, N = 3)

ViennaCL

Test: OpenCL BLAS - dGEMM-NT

ViennaCL 1.7.1 - GFLOPs/s, More Is Better
RTX 3090 24GB -Zotac: 588 (SE +/- 1.67, N = 3)
RTX 2080 Ti 22GB -Dell: 461 (SE +/- 0.58, N = 3)
RTX 4090 24GB -Nvidia: 1280 (SE +/- 0.00, N = 3)
RTX 4080 16GB -Pny: 795 (SE +/- 1.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

vkpeak

fp32-scalar

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 20403.35 (SE +/- 34.37, N = 3)
RTX 2080 Ti 22GB -Dell: 15974.22 (SE +/- 136.44, N = 3)
RTX 4090 24GB -Nvidia: 44315.35 (SE +/- 10.09, N = 3)
RTX 4080 16GB -Pny: 27771.88 (SE +/- 35.38, N = 3)

vkpeak

int32-scalar

vkpeak 20210424 - GIOPS, More Is Better
RTX 3090 24GB -Zotac: 20305.75 (SE +/- 10.90, N = 3)
RTX 2080 Ti 22GB -Dell: 15963.59 (SE +/- 39.60, N = 3)
RTX 4090 24GB -Nvidia: 44276.72 (SE +/- 25.10, N = 3)
RTX 4080 16GB -Pny: 27758.93 (SE +/- 2.25, N = 3)

vkpeak

fp64-vec4

vkpeak 20210424 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 645.31 (SE +/- 1.70, N = 3)
RTX 2080 Ti 22GB -Dell: 503.53 (SE +/- 0.03, N = 3)
RTX 4090 24GB -Nvidia: 1396.26 (SE +/- 1.25, N = 3)
RTX 4080 16GB -Pny: 873.67 (SE +/- 0.01, N = 3)

clpeak

OpenCL Test: Double-Precision Double

clpeak 1.1.2 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 640.15 (SE +/- 0.46, N = 5)
RTX 2080 Ti 22GB -Dell: 506.01 (SE +/- 1.50, N = 4)
RTX 4090 24GB -Nvidia: 1391.51 (SE +/- 1.85, N = 6)
RTX 4080 16GB -Pny: 869.25 (SE +/- 1.29, N = 6)
1. (CXX) g++ options: -O3

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 121.35 (SE +/- 0.06, N = 3)
RTX 2080 Ti 22GB -Dell: 153.15 (SE +/- 0.26, N = 3)
RTX 4090 24GB -Nvidia: 55.93 (SE +/- 0.05, N = 3)
RTX 4080 16GB -Pny: 89.24 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Texture Read Bandwidth

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 2148.86 (SE +/- 0.38, N = 3)
RTX 2080 Ti 22GB -Dell: 1147.26 (SE +/- 0.54, N = 3)
RTX 4090 24GB -Nvidia: 2980.15 (SE +/- 0.95, N = 6)
RTX 4080 16GB -Pny: 3065.76 (SE +/- 2.39, N = 7)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

RealSR-NCNN

Scale: 4x - TAA: Yes

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 31.17 (SE +/- 0.18, N = 3)
RTX 2080 Ti 22GB -Dell: 52.76 (SE +/- 0.07, N = 3)
RTX 4090 24GB -Nvidia: 20.19 (SE +/- 0.04, N = 3)
RTX 4080 16GB -Pny: 24.37 (SE +/- 0.04, N = 3)

ViennaCL

Test: OpenCL BLAS - dGEMM-NN

ViennaCL 1.7.1 - GFLOPs/s, More Is Better
RTX 3090 24GB -Zotac: 585 (SE +/- 1.67, N = 3)
RTX 2080 Ti 22GB -Dell: 459 (SE +/- 0.88, N = 3)
RTX 4090 24GB -Nvidia: 1150 (SE +/- 0.00, N = 3)
RTX 4080 16GB -Pny: 774 (SE +/- 0.88, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

IndigoBench

Acceleration: OpenCL GPU - Scene: Supercar

IndigoBench 4.4 - M samples/s, More Is Better
RTX 3090 24GB -Zotac: 51.70 (SE +/- 0.02, N = 3)
RTX 2080 Ti 22GB -Dell: 32.60 (SE +/- 0.05, N = 3)
RTX 4090 24GB -Nvidia: 78.45 (SE +/- 0.01, N = 3)
RTX 4080 16GB -Pny: 66.03 (SE +/- 0.07, N = 3)

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: S3D

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 427.49 (SE +/- 0.35, N = 12)
RTX 2080 Ti 22GB -Dell: 270.80 (SE +/- 0.15, N = 12)
RTX 4090 24GB -Nvidia: 644.60 (SE +/- 0.22, N = 12)
RTX 4080 16GB -Pny: 423.58 (SE +/- 0.38, N = 13)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

MandelGPU

OpenCL Device: GPU

MandelGPU 1.3pts1 - Samples/sec, More Is Better
RTX 3090 24GB -Zotac: 568293355.5 (SE +/- 959338.71, N = 11)
RTX 2080 Ti 22GB -Dell: 442866316.9 (SE +/- 1540493.49, N = 10)
RTX 4090 24GB -Nvidia: 951815274.2 (SE +/- 3825360.26, N = 12)
RTX 4080 16GB -Pny: 772392077.3 (SE +/- 1340958.21, N = 12)
1. (CC) gcc options: -O3 -lm -ftree-vectorize -funroll-loops -lglut -lOpenCL -lGL

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Readback

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 26.40 (SE +/- 0.00, N = 13)
RTX 2080 Ti 22GB -Dell: 13.20 (SE +/- 0.00, N = 12)
RTX 4090 24GB -Nvidia: 26.37 (SE +/- 0.00, N = 13)
RTX 4080 16GB -Pny: 26.39 (SE +/- 0.00, N = 14)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Triad

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 24.55 (SE +/- 0.00, N = 12)
RTX 2080 Ti 22GB -Dell: 12.61 (SE +/- 0.00, N = 12)
RTX 4090 24GB -Nvidia: 25.07 (SE +/- 0.01, N = 12)
RTX 4080 16GB -Pny: 24.76 (SE +/- 0.01, N = 12)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

NeatBench

Acceleration: GPU

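NeatBench 5 - FPS, More Is Better
RTX 3090 24GB -Zotac: 3090 (SE +/- 0.00, N = 13)
RTX 2080 Ti 22GB -Dell: 2080 (SE +/- 0.00, N = 13)
RTX 4090 24GB -Nvidia: 4090 (SE +/- 0.00, N = 14)
RTX 4080 16GB -Pny: 4080 (SE +/- 0.00, N = 14)
Each reported value equals the card's model number with zero variance across runs, which suggests a result-parsing artifact rather than a measured frame rate.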

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: Bus Speed Download

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 25.19 (SE +/- 0.02, N = 13)
RTX 2080 Ti 22GB -Dell: 12.85 (SE +/- 0.00, N = 12)
RTX 4090 24GB -Nvidia: 25.13 (SE +/- 0.02, N = 13)
RTX 4080 16GB -Pny: 25.10 (SE +/- 0.02, N = 13)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

ArrayFire

Test: Conjugate Gradient OpenCL

ArrayFire 3.7 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 1.5830 (SE +/- 0.0020, N = 9)
RTX 2080 Ti 22GB -Dell: 1.6790 (SE +/- 0.0055, N = 9)
RTX 4090 24GB -Nvidia: 0.8803 (SE +/- 0.0014, N = 9)
RTX 4080 16GB -Pny: 1.1220 (SE +/- 0.0013, N = 10)
1. (CXX) g++ options: -rdynamic

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0 - ms, Fewer Is Better
RTX 3090 24GB -Zotac: 9.359 (SE +/- 0.008, N = 5)
RTX 2080 Ti 22GB -Dell: 14.790 (SE +/- 0.023, N = 5)
RTX 4090 24GB -Nvidia: 7.799 (SE +/- 0.003, N = 5)
RTX 4080 16GB -Pny: 11.543 (SE +/- 0.003, N = 5)
1. (CXX) g++ options: -O3

SHOC Scalable HeterOgeneous Computing

Target: OpenCL - Benchmark: FFT SP

SHOC Scalable HeterOgeneous Computing 2020-04-17 - GFLOPS, More Is Better
RTX 3090 24GB -Zotac: 2349.39 (SE +/- 0.59, N = 12)
RTX 2080 Ti 22GB -Dell: 1504.87 (SE +/- 0.70, N = 11)
RTX 4090 24GB -Nvidia: 2779.48 (SE +/- 1.36, N = 12)
RTX 4080 16GB -Pny: 1831.86 (SE +/- 2.58, N = 12)
1. (CXX) g++ options: -O2 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -lmpi_cxx -lmpi

RealSR-NCNN

Scale: 4x - TAA: No

RealSR-NCNN 20200818 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 6.509 (SE +/- 0.015, N = 6)
RTX 2080 Ti 22GB -Dell: 9.239 (SE +/- 0.025, N = 5)
RTX 4090 24GB -Nvidia: 5.109 (SE +/- 0.018, N = 7)
RTX 4080 16GB -Pny: 5.630 (SE +/- 0.015, N = 7)

VkFFT

VkFFT 1.1.1 - Benchmark Score, More Is Better
RTX 3090 24GB -Zotac: 41547 (SE +/- 475.08, N = 9)
RTX 2080 Ti 22GB -Dell: 33404 (SE +/- 163.05, N = 3)
RTX 4090 24GB -Nvidia: 59427 (SE +/- 72.97, N = 3)
RTX 4080 16GB -Pny: 48201 (SE +/- 213.00, N = 3)
1. (CXX) g++ options: -O3

cl-mem

Benchmark: Write

cl-mem 2017-01-13 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 734.3 (SE +/- 0.49, N = 10)
RTX 2080 Ti 22GB -Dell: 452.2 (SE +/- 0.85, N = 8)
RTX 4090 24GB -Nvidia: 785.8 (SE +/- 0.36, N = 10)
RTX 4080 16GB -Pny: 567.7 (SE +/- 0.54, N = 9)
1. (CC) gcc options: -O2 -flto -lOpenCL

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

Waifu2x-NCNN Vulkan 20200818 - Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 3.536 (SE +/- 0.005, N = 9)
RTX 2080 Ti 22GB -Dell: 4.307 (SE +/- 0.007, N = 8)
RTX 4090 24GB -Nvidia: 2.490 (SE +/- 0.003, N = 10)
RTX 4080 16GB -Pny: 2.761 (SE +/- 0.004, N = 10)

clpeak

OpenCL Test: Global Memory Bandwidth

clpeak 1.1.2 - GBPS, More Is Better
RTX 3090 24GB -Zotac: 814.92 (SE +/- 0.01, N = 10)
RTX 2080 Ti 22GB -Dell: 504.66 (SE +/- 0.36, N = 9)
RTX 4090 24GB -Nvidia: 871.56 (SE +/- 0.04, N = 10)
RTX 4080 16GB -Pny: 611.67 (SE +/- 0.17, N = 10)
1. (CXX) g++ options: -O3

cl-mem

Benchmark: Read

cl-mem 2017-01-13 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 823.3 (SE +/- 0.81, N = 9)
RTX 2080 Ti 22GB -Dell: 543.8 (SE +/- 0.15, N = 8)
RTX 4090 24GB -Nvidia: 886.0 (SE +/- 0.31, N = 10)
RTX 4080 16GB -Pny: 623.2 (SE +/- 0.71, N = 9)
1. (CC) gcc options: -O2 -flto -lOpenCL

ViennaCL

Test: OpenCL BLAS - dGEMV-N

ViennaCL 1.7.1 - GB/s, More Is Better
RTX 3090 24GB -Zotac: 185 (SE +/- 0.33, N = 3)
RTX 2080 Ti 22GB -Dell: 297 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 220 (SE +/- 0.00, N = 3)
RTX 4080 16GB -Pny: 224 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Caffe 2020-02-13 - Milli-Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 6584.81 (SE +/- 2.76, N = 6)
RTX 4090 24GB -Nvidia: 4379.70 (SE +/- 2.74, N = 7)
RTX 4080 16GB -Pny: 5228.00 (SE +/- 3.07, N = 6)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 200

Caffe 2020-02-13 - Milli-Seconds, Fewer Is Better
RTX 3090 24GB -Zotac: 1321.36 (SE +/- 1.00, N = 10)
RTX 4090 24GB -Nvidia: 883.95 (SE +/- 0.87, N = 11)
RTX 4080 16GB -Pny: 1053.88 (SE +/- 1.43, N = 10)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

ViennaCL

Test: OpenCL BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 716 (SE +/- 1.00, N = 3)
RTX 2080 Ti 22GB -Dell: 518 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 772 (SE +/- 0.33, N = 3)
RTX 4080 16GB -Pny: 608 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 49.10 (SE +/- 0.40, N = 3; MIN: 17.59 / MAX: 76.07)
RTX 2080 Ti 22GB -Dell: 52.48 (SE +/- 0.13, N = 3; MIN: 20.11 / MAX: 75.46)
RTX 4090 24GB -Nvidia: 35.23 (SE +/- 0.10, N = 3; MIN: 11.4 / MAX: 59.2)
RTX 4080 16GB -Pny: 35.87 (SE +/- 0.08, N = 3; MIN: 11.56 / MAX: 61.96)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Caffe

Model: AlexNet - Acceleration: NVIDIA CUDA - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
RTX 3090 24GB -Zotac: 664.60 (SE +/- 1.37, N = 11)
RTX 4090 24GB -Nvidia: 447.34 (SE +/- 1.11, N = 11)
RTX 4080 16GB -Pny: 537.49 (SE +/- 1.06, N = 11)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 1000

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
RTX 3090 24GB -Zotac: 24142.8 (SE +/- 30.41, N = 3)
RTX 4090 24GB -Nvidia: 16499.3 (SE +/- 24.30, N = 3)
RTX 4080 16GB -Pny: 16310.5 (SE +/- 18.97, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 200

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
RTX 3090 24GB -Zotac: 4830.59 (SE +/- 2.81, N = 7)
RTX 4090 24GB -Nvidia: 3304.15 (SE +/- 3.64, N = 8)
RTX 4080 16GB -Pny: 3279.17 (SE +/- 3.25, N = 8)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: NVIDIA CUDA - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
RTX 3090 24GB -Zotac: 2418.86 (SE +/- 1.69, N = 9)
RTX 4090 24GB -Nvidia: 1660.01 (SE +/- 1.34, N = 10)
RTX 4080 16GB -Pny: 1649.29 (SE +/- 2.33, N = 10)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

ViennaCL

Test: OpenCL BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 372 (SE +/- 0.33, N = 3)
RTX 2080 Ti 22GB -Dell: 305 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 446 (SE +/- 0.33, N = 3)
RTX 4080 16GB -Pny: 417 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 363 (SE +/- 1.20, N = 3)
RTX 2080 Ti 22GB -Dell: 306 (SE +/- 1.73, N = 3)
RTX 4090 24GB -Nvidia: 444 (SE +/- 0.88, N = 3)
RTX 4080 16GB -Pny: 385 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

FAHBench

FAHBench 2.3.2 (Ns Per Day, More Is Better)
RTX 3090 24GB -Zotac: 316.57 (SE +/- 0.41, N = 3)
RTX 2080 Ti 22GB -Dell: 292.69 (SE +/- 0.79, N = 3)
RTX 4090 24GB -Nvidia: 423.21 (SE +/- 0.24, N = 3)
RTX 4080 16GB -Pny: 424.35 (SE +/- 0.63, N = 3)

ViennaCL

Test: OpenCL BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 495 (SE +/- 0.67, N = 3)
RTX 2080 Ti 22GB -Dell: 392 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 568 (SE +/- 0.00, N = 3)
RTX 4080 16GB -Pny: 487 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 600 (SE +/- 1.20, N = 3)
RTX 2080 Ti 22GB -Dell: 475 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 661 (SE +/- 0.33, N = 3)
RTX 4080 16GB -Pny: 540 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: OpenCL BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 658 (SE +/- 0.33, N = 3)
RTX 2080 Ti 22GB -Dell: 524 (SE +/- 0.33, N = 3)
RTX 4090 24GB -Nvidia: 719 (SE +/- 0.67, N = 3)
RTX 4080 16GB -Pny: 597 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

cl-mem

Benchmark: Copy

cl-mem 2017-01-13 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 359.2 (SE +/- 0.21, N = 10)
RTX 2080 Ti 22GB -Dell: 318.1 (SE +/- 0.27, N = 8)
RTX 4090 24GB -Nvidia: 410.3 (SE +/- 0.05, N = 10)
RTX 4080 16GB -Pny: 382.0 (SE +/- 0.03, N = 9)
1. (CC) gcc options: -O2 -flto -lOpenCL

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 324.88 (SE +/- 0.55, N = 3; MIN: 283.26 / MAX: 423.43)
RTX 2080 Ti 22GB -Dell: 363.75 (SE +/- 0.57, N = 3; MIN: 316.04 / MAX: 471.43)
RTX 4090 24GB -Nvidia: 289.04 (SE +/- 1.34, N = 3; MIN: 260.04 / MAX: 661.43)
RTX 4080 16GB -Pny: 290.15 (SE +/- 3.05, N = 3; MIN: 247.6 / MAX: 837.91)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ViennaCL

Test: OpenCL BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 371 (SE +/- 0.58, N = 3)
RTX 2080 Ti 22GB -Dell: 370 (SE +/- 0.58, N = 3)
RTX 4090 24GB -Nvidia: 443 (SE +/- 0.00, N = 2)
RTX 4080 16GB -Pny: 435 (SE +/- 0.67, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

LeelaChessZero

Backend: OpenCL

LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
RTX 3090 24GB -Zotac: 14316 (SE +/- 52.37, N = 3)
RTX 2080 Ti 22GB -Dell: 13228 (SE +/- 135.58, N = 4)
RTX 4090 24GB -Nvidia: 15097 (SE +/- 198.64, N = 3)
RTX 4080 16GB -Pny: 15316 (SE +/- 126.87, N = 3)
1. (CXX) g++ options: -flto -pthread

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 834 (SE +/- 7.74, N = 15)
RTX 2080 Ti 22GB -Dell: 923 (SE +/- 8.86, N = 15)
RTX 4090 24GB -Nvidia: 946 (SE +/- 11.59, N = 5)
RTX 4080 16GB -Pny: 956 (SE +/- 9.35, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 606 (SE +/- 1.68, N = 15)
RTX 2080 Ti 22GB -Dell: 658 (SE +/- 4.70, N = 15)
RTX 4090 24GB -Nvidia: 671 (SE +/- 6.71, N = 5)
RTX 4080 16GB -Pny: 678 (SE +/- 8.14, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 984 (SE +/- 6.89, N = 15)
RTX 2080 Ti 22GB -Dell: 1058 (SE +/- 5.45, N = 15)
RTX 4090 24GB -Nvidia: 1098 (SE +/- 22.23, N = 5)
RTX 4080 16GB -Pny: 1085 (SE +/- 7.44, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 1842 (SE +/- 12.27, N = 15)
RTX 2080 Ti 22GB -Dell: 1755 (SE +/- 17.80, N = 15)
RTX 4090 24GB -Nvidia: 1874 (SE +/- 19.65, N = 5)
RTX 4080 16GB -Pny: 1901 (SE +/- 13.45, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 2272 (SE +/- 18.29, N = 15)
RTX 2080 Ti 22GB -Dell: 2261 (SE +/- 16.08, N = 15)
RTX 4090 24GB -Nvidia: 2360 (SE +/- 13.04, N = 5)
RTX 4080 16GB -Pny: 2422 (SE +/- 31.16, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 749 (SE +/- 4.88, N = 15)
RTX 2080 Ti 22GB -Dell: 742 (SE +/- 6.17, N = 14)
RTX 4090 24GB -Nvidia: 771 (SE +/- 2.58, N = 5)
RTX 4080 16GB -Pny: 787 (SE +/- 7.65, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
RTX 3090 24GB -Zotac: 764 (SE +/- 6.38, N = 15)
RTX 2080 Ti 22GB -Dell: 747 (SE +/- 4.44, N = 15)
RTX 4090 24GB -Nvidia: 784 (SE +/- 7.38, N = 5)
RTX 4080 16GB -Pny: 783 (SE +/- 5.06, N = 12)
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GPU Temperature Monitor

Phoronix Test Suite System Monitoring

Phoronix Test Suite System Monitoring (Celsius)
RTX 3090 24GB -Zotac: Min: 41 / Avg: 65.62 / Max: 84
RTX 2080 Ti 22GB -Dell: Min: 44 / Avg: 69.38 / Max: 83
RTX 4090 24GB -Nvidia: Min: 32 / Avg: 43.13 / Max: 66
RTX 4080 16GB -Pny: Min: 31 / Avg: 44.05 / Max: 73

GPU Power Consumption Monitor

Phoronix Test Suite System Monitoring

Phoronix Test Suite System Monitoring (Watts)
RTX 3090 24GB -Zotac: Min: 18.42 / Avg: 170.36 / Max: 367.21
RTX 2080 Ti 22GB -Dell: Min: 18.41 / Avg: 148.08 / Max: 347.3
RTX 4090 24GB -Nvidia: Min: 6 / Avg: 110.92 / Max: 456.05
RTX 4080 16GB -Pny: Min: 4 / Avg: 92.31 / Max: 331.21

Waifu2x-NCNN Vulkan

GPU Temperature Monitor

Waifu2x-NCNN Vulkan 20200818 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 61.2 / Max: 73.0
RTX 2080 Ti 22GB -Dell: Min: 59.0 / Avg: 68.2 / Max: 75.0
RTX 4090 24GB -Nvidia: Min: 39.0 / Avg: 41.6 / Max: 46.0
RTX 4080 16GB -Pny: Min: 40.0 / Avg: 42.9 / Max: 51.0

Waifu2x-NCNN Vulkan

GPU Power Consumption Monitor

Waifu2x-NCNN Vulkan 20200818 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 27.5 / Avg: 154.7 / Max: 304.3
RTX 2080 Ti 22GB -Dell: Min: 22.8 / Avg: 138.6 / Max: 261.5
RTX 4090 24GB -Nvidia: Min: 7.1 / Avg: 93.5 / Max: 229.6
RTX 4080 16GB -Pny: Min: 14.0 / Avg: 72.7 / Max: 204.6

RealSR-NCNN

GPU Temperature Monitor

RealSR-NCNN 20200818 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 61.7 / Max: 77.0
RTX 2080 Ti 22GB -Dell: Min: 60.0 / Avg: 69.3 / Max: 76.0
RTX 4090 24GB -Nvidia: Min: 42.0 / Avg: 45.1 / Max: 53.0
RTX 4080 16GB -Pny: Min: 42.0 / Avg: 47.6 / Max: 58.0

RealSR-NCNN

GPU Power Consumption Monitor

RealSR-NCNN 20200818 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 28.7 / Avg: 166.5 / Max: 354.3
RTX 2080 Ti 22GB -Dell: Min: 23.3 / Avg: 150.8 / Max: 256.7
RTX 4090 24GB -Nvidia: Min: 7.1 / Avg: 105.0 / Max: 327.6
RTX 4080 16GB -Pny: Min: 14.1 / Avg: 102.2 / Max: 294.6

RealSR-NCNN

GPU Temperature Monitor

RealSR-NCNN 20200818 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52.0 / Avg: 74.9 / Max: 84.0
RTX 2080 Ti 22GB -Dell: Min: 46.0 / Avg: 74.2 / Max: 80.0
RTX 4090 24GB -Nvidia: Min: 41.0 / Avg: 51.6 / Max: 56.0
RTX 4080 16GB -Pny: Min: 42.0 / Avg: 57.3 / Max: 64.0

RealSR-NCNN

GPU Power Consumption Monitor

RealSR-NCNN 20200818 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 33.0 / Avg: 292.5 / Max: 356.8
RTX 2080 Ti 22GB -Dell: Min: 20.4 / Avg: 225.5 / Max: 257.1
RTX 4090 24GB -Nvidia: Min: 6.6 / Avg: 234.7 / Max: 332.3
RTX 4080 16GB -Pny: Min: 13.9 / Avg: 221.3 / Max: 300.8

NCNN

GPU Temperature Monitor

NCNN 20220729 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 42.0 / Avg: 51.6 / Max: 64.0
RTX 2080 Ti 22GB -Dell: Min: 46.0 / Avg: 49.8 / Max: 61.0
RTX 4090 24GB -Nvidia: Min: 33.0 / Avg: 37.4 / Max: 45.0
RTX 4080 16GB -Pny: Min: 35.0 / Avg: 38.2 / Max: 43.0

NCNN

GPU Power Consumption Monitor

NCNN 20220729 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 18.8 / Avg: 36.1 / Max: 148.3
RTX 2080 Ti 22GB -Dell: Min: 19.2 / Avg: 31.7 / Max: 108.8
RTX 4090 24GB -Nvidia: Min: 6.4 / Avg: 13.6 / Max: 59.6
RTX 4080 16GB -Pny: Min: 4.2 / Avg: 18.5 / Max: 66.4

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 13.52 (SE +/- 3.64, N = 3; MIN: 5.33 / MAX: 35.07)
RTX 2080 Ti 22GB -Dell: 6.80 (SE +/- 0.10, N = 3; MIN: 5.02 / MAX: 32.78)
RTX 4090 24GB -Nvidia: 5.08 (SE +/- 0.04, N = 3; MIN: 4.3 / MAX: 23.08)
RTX 4080 16GB -Pny: 5.18 (SE +/- 0.16, N = 3; MIN: 4.28 / MAX: 25.38)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 25.04 (SE +/- 0.25, N = 3; MIN: 11.25 / MAX: 44.6)
RTX 2080 Ti 22GB -Dell: 23.13 (SE +/- 2.01, N = 3; MIN: 8.74 / MAX: 39.68)
RTX 4090 24GB -Nvidia: 5.46 (SE +/- 0.04, N = 3; MIN: 4.78 / MAX: 28.36)
RTX 4080 16GB -Pny: 5.32 (SE +/- 0.02, N = 3; MIN: 4.68 / MAX: 22.84)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 6.35 (SE +/- 0.14, N = 3; MIN: 5 / MAX: 21.95)
RTX 2080 Ti 22GB -Dell: 5.85 (SE +/- 0.14, N = 3; MIN: 4.75 / MAX: 26.01)
RTX 4090 24GB -Nvidia: 7.28 (SE +/- 0.77, N = 3; MIN: 3.88 / MAX: 30.89)
RTX 4080 16GB -Pny: 5.16 (SE +/- 0.04, N = 3; MIN: 4.78 / MAX: 21.85)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 4.40 (SE +/- 0.11, N = 3; MIN: 3.72 / MAX: 24.39)
RTX 2080 Ti 22GB -Dell: 7.07 (SE +/- 1.00, N = 3; MIN: 4.46 / MAX: 48.35)
RTX 4090 24GB -Nvidia: 25.14 (SE +/- 0.10, N = 3; MIN: 12.85 / MAX: 33.3)
RTX 4080 16GB -Pny: 7.90 (SE +/- 3.46, N = 3; MIN: 3.84 / MAX: 36.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 5.38 (SE +/- 0.22, N = 3; MIN: 4.06 / MAX: 22.14)
RTX 2080 Ti 22GB -Dell: 5.49 (SE +/- 0.07, N = 3; MIN: 3.72 / MAX: 33.58)
RTX 4090 24GB -Nvidia: 3.95 (SE +/- 0.08, N = 3; MIN: 3.22 / MAX: 30.17)
RTX 4080 16GB -Pny: 3.89 (SE +/- 0.10, N = 3; MIN: 3.33 / MAX: 36.65)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 23.97 (SE +/- 0.63, N = 3; MIN: 10.15 / MAX: 37.01)
RTX 2080 Ti 22GB -Dell: 26.09 (SE +/- 0.20, N = 3; MIN: 12.37 / MAX: 40.64)
RTX 4090 24GB -Nvidia: 6.42 (SE +/- 0.36, N = 3; MIN: 5.35 / MAX: 28.33)
RTX 4080 16GB -Pny: 6.30 (SE +/- 0.10, N = 3; MIN: 5.32 / MAX: 26.26)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 19.16 (SE +/- 0.13, N = 3; MIN: 7.92 / MAX: 33.04)
RTX 2080 Ti 22GB -Dell: 7.03 (SE +/- 0.73, N = 3; MIN: 4.55 / MAX: 25.62)
RTX 4090 24GB -Nvidia: 4.39 (SE +/- 0.10, N = 2; MIN: 3.79 / MAX: 22)
RTX 4080 16GB -Pny: 4.37 (SE +/- 0.02, N = 3; MIN: 3.71 / MAX: 23.07)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 17.57 (SE +/- 0.06, N = 3; MIN: 8.24 / MAX: 39.36)
RTX 2080 Ti 22GB -Dell: 6.11 (SE +/- 0.28, N = 3; MIN: 4.77 / MAX: 33.2)
RTX 4090 24GB -Nvidia: 4.85 (SE +/- 0.04, N = 3; MIN: 4.28 / MAX: 25.38)
RTX 4080 16GB -Pny: 4.78 (SE +/- 0.01, N = 3; MIN: 4.03 / MAX: 23.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 18.78 (SE +/- 0.18, N = 3; MIN: 8.92 / MAX: 34.42)
RTX 2080 Ti 22GB -Dell: 18.77 (SE +/- 0.83, N = 3; MIN: 7.68 / MAX: 35.81)
RTX 4090 24GB -Nvidia: 4.97 (SE +/- 0.06, N = 3; MIN: 4.23 / MAX: 23.03)
RTX 4080 16GB -Pny: 5.02 (SE +/- 0.11, N = 3; MIN: 4.12 / MAX: 23.74)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20220729 (ms, Fewer Is Better)
RTX 3090 24GB -Zotac: 18.25 (SE +/- 0.28, N = 3; MIN: 7.4 / MAX: 39.06)
RTX 2080 Ti 22GB -Dell: 6.35 (SE +/- 0.39, N = 3; MIN: 4.5 / MAX: 27.11)
RTX 4090 24GB -Nvidia: 4.42 (SE +/- 0.01, N = 3; MIN: 3.68 / MAX: 20.27)
RTX 4080 16GB -Pny: 4.32 (SE +/- 0.10, N = 3; MIN: 3.68 / MAX: 23.77)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

vkpeak

GPU Temperature Monitor

vkpeak 20210424 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 42.0 / Avg: 71.3 / Max: 79.0
RTX 2080 Ti 22GB -Dell: Min: 56.0 / Avg: 77.3 / Max: 83.0
RTX 4090 24GB -Nvidia: Min: 42.0 / Avg: 53.9 / Max: 65.0
RTX 4080 16GB -Pny: Min: 43.0 / Avg: 57.3 / Max: 66.0

vkpeak

GPU Power Consumption Monitor

vkpeak 20210424 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 24.9 / Avg: 265.1 / Max: 330.5
RTX 2080 Ti 22GB -Dell: Min: 21.4 / Avg: 217.8 / Max: 258.8
RTX 4090 24GB -Nvidia: Min: 6.6 / Avg: 203.4 / Max: 347.2
RTX 4080 16GB -Pny: Min: 13.6 / Avg: 171.1 / Max: 245.8

VkResample

GPU Temperature Monitor

VkResample 1.0 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 59.2 / Max: 74.0
RTX 2080 Ti 22GB -Dell: Min: 55.0 / Avg: 65.4 / Max: 73.0
RTX 4090 24GB -Nvidia: Min: 38.0 / Avg: 41.6 / Max: 47.0
RTX 4080 16GB -Pny: Min: 38.0 / Avg: 43.6 / Max: 50.0

VkResample

GPU Power Consumption Monitor

VkResample 1.0 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.1 / Avg: 124.7 / Max: 254.8
RTX 2080 Ti 22GB -Dell: Min: 21.1 / Avg: 114.5 / Max: 208.0
RTX 4090 24GB -Nvidia: Min: 6.7 / Avg: 58.1 / Max: 182.3
RTX 4080 16GB -Pny: Min: 4.2 / Avg: 59.1 / Max: 135.5

VkResample

GPU Temperature Monitor

VkResample 1.0 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 53.5 / Max: 73.0
RTX 2080 Ti 22GB -Dell: Min: 57.0 / Avg: 59.7 / Max: 69.0
RTX 4090 24GB -Nvidia: Min: 35.0 / Avg: 37.0 / Max: 43.0
RTX 4080 16GB -Pny: Min: 36.0 / Avg: 38.1 / Max: 46.0

VkResample

GPU Power Consumption Monitor

VkResample 1.0 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.2 / Avg: 78.0 / Max: 357.0
RTX 2080 Ti 22GB -Dell: Min: 21.4 / Avg: 60.8 / Max: 255.9
RTX 4090 24GB -Nvidia: Min: 6.5 / Avg: 37.5 / Max: 270.4
RTX 4080 16GB -Pny: Min: 4.3 / Avg: 35.1 / Max: 203.1

VkFFT

GPU Temperature Monitor

VkFFT 1.1.1 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47 / Avg: 62.8 / Max: 74
RTX 2080 Ti 22GB -Dell: Min: 54 / Avg: 68.6 / Max: 77
RTX 4090 24GB -Nvidia: Min: 35 / Avg: 36.89 / Max: 43
RTX 4080 16GB -Pny: Min: 37 / Avg: 39.61 / Max: 46

VkFFT

GPU Power Consumption Monitor

VkFFT 1.1.1 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.05 / Avg: 140.55 / Max: 367.21
RTX 2080 Ti 22GB -Dell: Min: 21.45 / Avg: 119.47 / Max: 264.1
RTX 4090 24GB -Nvidia: Min: 6.91 / Avg: 61.06 / Max: 277.78
RTX 4080 16GB -Pny: Min: 11.18 / Avg: 53.17 / Max: 215.86

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 72.8 / Max: 78.0
RTX 4090 24GB -Nvidia: Min: 37.0 / Avg: 42.3 / Max: 45.0
RTX 4080 16GB -Pny: Min: 37.0 / Avg: 46.1 / Max: 50.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 18.5 / Avg: 252.8 / Max: 299.7
RTX 4090 24GB -Nvidia: Min: 6.9 / Avg: 129.1 / Max: 162.7
RTX 4080 16GB -Pny: Min: 13.4 / Avg: 121.4 / Max: 156.0

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 64.6 / Max: 76.0
RTX 4090 24GB -Nvidia: Min: 36.0 / Avg: 39.3 / Max: 43.0
RTX 4080 16GB -Pny: Min: 37.0 / Avg: 41.8 / Max: 48.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 25.4 / Avg: 184.4 / Max: 290.0
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 87.3 / Max: 162.9
RTX 4080 16GB -Pny: Min: 12.6 / Avg: 78.5 / Max: 154.8

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 61.3 / Max: 74.0
RTX 4090 24GB -Nvidia: Min: 37.0 / Avg: 38.5 / Max: 42.0
RTX 4080 16GB -Pny: Min: 38.0 / Avg: 40.8 / Max: 47.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.3 / Avg: 154.8 / Max: 289.3
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 71.0 / Max: 160.9
RTX 4080 16GB -Pny: Min: 13.4 / Avg: 62.2 / Max: 154.8

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 68.9 / Max: 80.0
RTX 4090 24GB -Nvidia: Min: 36.0 / Avg: 40.7 / Max: 45.0
RTX 4080 16GB -Pny: Min: 37.0 / Avg: 43.9 / Max: 50.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.1 / Avg: 224.3 / Max: 341.6
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 114.8 / Max: 203.5
RTX 4080 16GB -Pny: Min: 10.7 / Avg: 104.4 / Max: 178.5

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 50.0 / Avg: 61.4 / Max: 77.0
RTX 4090 24GB -Nvidia: Min: 38.0 / Avg: 39.5 / Max: 44.0
RTX 4080 16GB -Pny: Min: 38.0 / Avg: 40.3 / Max: 48.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.3 / Avg: 144.4 / Max: 336.9
RTX 4090 24GB -Nvidia: Min: 6.8 / Avg: 69.2 / Max: 202.4
RTX 4080 16GB -Pny: Min: 13.4 / Avg: 60.0 / Max: 179.5

Caffe

GPU Temperature Monitor

Caffe 2020-02-13 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 60.1 / Max: 75.0
RTX 4090 24GB -Nvidia: Min: 40.0 / Avg: 42.4 / Max: 49.0
RTX 4080 16GB -Pny: Min: 39.0 / Avg: 42.1 / Max: 50.0

Caffe

GPU Power Consumption Monitor

Caffe 2020-02-13 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 18.7 / Avg: 125.3 / Max: 335.1
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 56.9 / Max: 202.9
RTX 4080 16GB -Pny: Min: 14.0 / Avg: 48.0 / Max: 178.0

Blender

GPU Temperature Monitor

Blender 3.6 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 75.6 / Max: 82.0
RTX 2080 Ti 22GB -Dell: Min: 59.0 / Avg: 78.7 / Max: 82.0
RTX 4090 24GB -Nvidia: Min: 40.0 / Avg: 48.5 / Max: 54.0
RTX 4080 16GB -Pny: Min: 40.0 / Avg: 51.0 / Max: 56.0

Blender

GPU Power Consumption Monitor

Blender 3.6 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 18.6 / Avg: 292.2 / Max: 358.2
RTX 2080 Ti 22GB -Dell: Min: 22.8 / Avg: 226.2 / Max: 257.6
RTX 4090 24GB -Nvidia: Min: 6.8 / Avg: 189.4 / Max: 276.9
RTX 4080 16GB -Pny: Min: 13.7 / Avg: 161.8 / Max: 215.2

Blender

GPU Temperature Monitor

Blender 3.6 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 71.9 / Max: 80.0
RTX 2080 Ti 22GB -Dell: Min: 60.0 / Avg: 75.8 / Max: 80.0
RTX 4090 24GB -Nvidia: Min: 39.0 / Avg: 45.3 / Max: 50.0
RTX 4080 16GB -Pny: Min: 39.0 / Avg: 48.0 / Max: 54.0

Blender

GPU Power Consumption Monitor

Blender 3.6 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 20.5 / Avg: 264.0 / Max: 361.3
RTX 2080 Ti 22GB -Dell: Min: 22.7 / Avg: 212.9 / Max: 257.2
RTX 4090 24GB -Nvidia: Min: 6.9 / Avg: 172.2 / Max: 302.0
RTX 4080 16GB -Pny: Min: 12.8 / Avg: 140.8 / Max: 239.5

Blender

GPU Temperature Monitor

Blender 3.6 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 70.7 / Max: 82.0
RTX 2080 Ti 22GB -Dell: Min: 60.0 / Avg: 74.4 / Max: 80.0
RTX 4090 24GB -Nvidia: Min: 41.0 / Avg: 45.8 / Max: 54.0
RTX 4080 16GB -Pny: Min: 40.0 / Avg: 48.2 / Max: 57.0

Blender

GPU Power Consumption Monitor

Blender 3.6 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.4 / Avg: 234.9 / Max: 344.1
RTX 2080 Ti 22GB -Dell: Min: 22.9 / Avg: 195.0 / Max: 256.9
RTX 4090 24GB -Nvidia: Min: 6.9 / Avg: 136.7 / Max: 303.2
RTX 4080 16GB -Pny: Min: 13.4 / Avg: 116.8 / Max: 223.6

Blender

GPU Temperature Monitor

Blender 3.6 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 73.0 / Max: 81.0
RTX 2080 Ti 22GB -Dell: Min: 59.0 / Avg: 75.4 / Max: 81.0
RTX 4090 24GB -Nvidia: Min: 40.0 / Avg: 46.5 / Max: 52.0
RTX 4080 16GB -Pny: Min: 39.0 / Avg: 48.3 / Max: 55.0

Blender

GPU Power Consumption Monitor

Blender 3.6 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.2 / Avg: 267.4 / Max: 353.6
RTX 2080 Ti 22GB -Dell: Min: 22.5 / Avg: 212.6 / Max: 255.0
RTX 4090 24GB -Nvidia: Min: 7.1 / Avg: 162.1 / Max: 291.8
RTX 4080 16GB -Pny: Min: 5.3 / Avg: 142.2 / Max: 230.1

Blender

GPU Temperature Monitor

Blender 3.6 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 46.0 / Avg: 67.3 / Max: 79.0
RTX 2080 Ti 22GB -Dell: Min: 60.0 / Avg: 72.6 / Max: 78.0
RTX 4090 24GB -Nvidia: Min: 42.0 / Avg: 44.8 / Max: 50.0
RTX 4080 16GB -Pny: Min: 41.0 / Avg: 45.3 / Max: 51.0

Blender

GPU Power Consumption Monitor

Blender 3.6 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.2 / Avg: 215.9 / Max: 349.9
RTX 2080 Ti 22GB -Dell: Min: 23.3 / Avg: 173.8 / Max: 254.3
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 107.7 / Max: 242.2
RTX 4080 16GB -Pny: Min: 14.3 / Avg: 95.0 / Max: 198.6

Blender

Blend File: BMW27 - Compute: NVIDIA OptiX

Blender 3.6 (Seconds, Fewer Is Better)
RTX 3090 24GB -Zotac: 6.13 (SE +/- 0.06, N = 15)
RTX 2080 Ti 22GB -Dell: 8.93 (SE +/- 0.06, N = 15)
RTX 4090 24GB -Nvidia: 3.70 (SE +/- 0.06, N = 15)
RTX 4080 16GB -Pny: 4.43 (SE +/- 0.06, N = 15)

Chaos Group V-RAY

GPU Temperature Monitor

Chaos Group V-RAY 5.02 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 47.0 / Avg: 68.3 / Max: 80.0
RTX 2080 Ti 22GB -Dell: Min: 55.0 / Avg: 71.7 / Max: 82.0
RTX 4090 24GB -Nvidia: Min: 41.0 / Avg: 48.6 / Max: 54.0
RTX 4080 16GB -Pny: Min: 39.0 / Avg: 48.0 / Max: 54.0

Chaos Group V-RAY

GPU Power Consumption Monitor

Chaos Group V-RAY 5.02 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 18.9 / Avg: 235.8 / Max: 358.0
RTX 2080 Ti 22GB -Dell: Min: 21.6 / Avg: 167.9 / Max: 253.1
RTX 4090 24GB -Nvidia: Min: 6.8 / Avg: 169.0 / Max: 272.7
RTX 4080 16GB -Pny: Min: 4.5 / Avg: 130.0 / Max: 211.7

Chaos Group V-RAY

GPU Temperature Monitor

Chaos Group V-RAY 5.02 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 48.0 / Avg: 71.5 / Max: 82.0
RTX 2080 Ti 22GB -Dell: Min: 57.0 / Avg: 73.5 / Max: 82.0
RTX 4090 24GB -Nvidia: Min: 42.0 / Avg: 50.8 / Max: 57.0
RTX 4080 16GB -Pny: Min: 42.0 / Avg: 51.8 / Max: 57.0

Chaos Group V-RAY

GPU Power Consumption Monitor

Chaos Group V-RAY 5.02 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.2 / Avg: 250.0 / Max: 356.2
RTX 2080 Ti 22GB -Dell: Min: 22.7 / Avg: 181.0 / Max: 256.4
RTX 4090 24GB -Nvidia: Min: 7.2 / Avg: 199.1 / Max: 296.3
RTX 4080 16GB -Pny: Min: 13.4 / Avg: 159.5 / Max: 227.6

IndigoBench

GPU Temperature Monitor

IndigoBench 4.4 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 50.0 / Avg: 77.7 / Max: 84.0
RTX 2080 Ti 22GB -Dell: Min: 61.0 / Avg: 76.6 / Max: 82.0
RTX 4090 24GB -Nvidia: Min: 41.0 / Avg: 51.7 / Max: 57.0
RTX 4080 16GB -Pny: Min: 42.0 / Avg: 54.0 / Max: 59.0

IndigoBench

GPU Power Consumption Monitor

IndigoBench 4.4 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 20.2 / Avg: 287.2 / Max: 353.5
RTX 2080 Ti 22GB -Dell: Min: 23.1 / Avg: 213.7 / Max: 254.5
RTX 4090 24GB -Nvidia: Min: 6.7 / Avg: 211.2 / Max: 296.7
RTX 4080 16GB -Pny: Min: 14.0 / Avg: 180.5 / Max: 230.3

IndigoBench

GPU Temperature Monitor

IndigoBench 4.4 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52.0 / Avg: 77.6 / Max: 83.0
RTX 2080 Ti 22GB -Dell: Min: 57.0 / Avg: 76.0 / Max: 81.0
RTX 4090 24GB -Nvidia: Min: 42.0 / Avg: 49.4 / Max: 52.0
RTX 4080 16GB -Pny: Min: 40.0 / Avg: 51.9 / Max: 56.0

IndigoBench

GPU Power Consumption Monitor

IndigoBench 4.4 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 20.9 / Avg: 282.9 / Max: 356.8
RTX 2080 Ti 22GB -Dell: Min: 21.3 / Avg: 211.7 / Max: 259.5
RTX 4090 24GB -Nvidia: Min: 6.7 / Avg: 172.9 / Max: 252.7
RTX 4080 16GB -Pny: Min: 13.0 / Avg: 159.4 / Max: 207.9

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52.0 / Avg: 58.1 / Max: 62.0
RTX 2080 Ti 22GB -Dell: Min: 56.0 / Avg: 58.0 / Max: 60.0
RTX 4090 24GB -Nvidia: Min: 40.0 / Avg: 41.5 / Max: 43.0
RTX 4080 16GB -Pny: Min: 38.0 / Avg: 39.5 / Max: 41.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.8 / Avg: 85.0 / Max: 125.6
RTX 2080 Ti 22GB -Dell: Min: 21.3 / Avg: 56.5 / Max: 80.7
RTX 4090 24GB -Nvidia: Min: 6.6 / Avg: 39.8 / Max: 58.8
RTX 4080 16GB -Pny: Min: 12.2 / Avg: 30.6 / Max: 47.0

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52.0 / Avg: 56.2 / Max: 60.0
RTX 2080 Ti 22GB -Dell: Min: 57.0 / Avg: 59.4 / Max: 62.0
RTX 4090 24GB -Nvidia: Min: 38.0 / Avg: 39.2 / Max: 40.0
RTX 4080 16GB -Pny: Min: 35.0 / Avg: 37.0 / Max: 38.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.8 / Avg: 72.1 / Max: 117.2
RTX 2080 Ti 22GB -Dell: Min: 22.0 / Avg: 53.4 / Max: 76.5
RTX 4090 24GB -Nvidia: Min: 6.5 / Avg: 33.8 / Max: 55.0
RTX 4080 16GB -Pny: Min: 12.1 / Avg: 28.5 / Max: 40.3

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52.0 / Avg: 62.9 / Max: 72.0
RTX 2080 Ti 22GB -Dell: Min: 57.0 / Avg: 62.9 / Max: 68.0
RTX 4090 24GB -Nvidia: Min: 34.0 / Avg: 36.4 / Max: 40.0
RTX 4080 16GB -Pny: Min: 36.0 / Avg: 36.7 / Max: 39.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 20.0 / Avg: 125.2 / Max: 273.7
RTX 2080 Ti 22GB -Dell: Min: 21.7 / Avg: 93.4 / Max: 198.2
RTX 4090 24GB -Nvidia: Min: 6.4 / Avg: 41.6 / Max: 108.0
RTX 4080 16GB -Pny: Min: 13.3 / Avg: 39.3 / Max: 98.6

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 50.0 / Avg: 60.2 / Max: 73.0
RTX 2080 Ti 22GB -Dell: Min: 59.0 / Avg: 62.4 / Max: 68.0
RTX 4090 24GB -Nvidia: Min: 35.0 / Avg: 35.6 / Max: 37.0
RTX 4080 16GB -Pny: Min: 36.0 / Avg: 38.6 / Max: 55.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 19.5 / Avg: 105.9 / Max: 323.3
RTX 2080 Ti 22GB -Dell: Min: 22.1 / Avg: 84.4 / Max: 261.2
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 41.7 / Max: 150.7
RTX 4080 16GB -Pny: Min: 12.8 / Avg: 44.7 / Max: 278.8

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 53.0 / Avg: 63.8 / Max: 74.0
RTX 2080 Ti 22GB -Dell: Min: 56.0 / Avg: 64.4 / Max: 71.0
RTX 4090 24GB -Nvidia: Min: 36.0 / Avg: 37.1 / Max: 42.0
RTX 4080 16GB -Pny: Min: 36.0 / Avg: 38.5 / Max: 46.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 25.8 / Avg: 138.9 / Max: 317.8
RTX 2080 Ti 22GB -Dell: Min: 22.3 / Avg: 107.8 / Max: 225.4
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 59.0 / Max: 207.2
RTX 4080 16GB -Pny: Min: 11.9 / Avg: 51.2 / Max: 195.6

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 49.0 / Avg: 58.0 / Max: 65.0
RTX 2080 Ti 22GB -Dell: Min: 58.0 / Avg: 60.7 / Max: 64.0
RTX 4090 24GB -Nvidia: Min: 37.0 / Avg: 38.0 / Max: 39.0
RTX 4080 16GB -Pny: Min: 37.0 / Avg: 38.1 / Max: 40.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 24.9 / Avg: 98.4 / Max: 223.7
RTX 2080 Ti 22GB -Dell: Min: 23.2 / Avg: 73.4 / Max: 184.7
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 45.5 / Max: 106.3
RTX 4080 16GB -Pny: Min: 12.2 / Avg: 38.7 / Max: 72.2

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 50.0 / Avg: 72.2 / Max: 79.0
RTX 2080 Ti 22GB -Dell: Min: 59.0 / Avg: 74.0 / Max: 78.0
RTX 4090 24GB -Nvidia: Min: 37.0 / Avg: 41.8 / Max: 47.0
RTX 4080 16GB -Pny: Min: 37.0 / Avg: 43.1 / Max: 49.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Watts, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 22.8 / Avg: 253.9 / Max: 357.5
RTX 2080 Ti 22GB -Dell: Min: 22.2 / Avg: 184.6 / Max: 246.0
RTX 4090 24GB -Nvidia: Min: 7.0 / Avg: 125.0 / Max: 250.8
RTX 4080 16GB -Pny: Min: 13.5 / Avg: 98.3 / Max: 224.9

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

SHOC Scalable HeterOgeneous Computing 2020-04-17 (Celsius, Fewer Is Better)
RTX 3090 24GB -Zotac: Min: 52 / Avg: 70.48 / Max: 82
RTX 2080 Ti 22GB -Dell: Min: 55 / Avg: 73.85 / Max: 82
RTX 4090 24GB -Nvidia: Min: 39 / Avg: 43.63 / Max: 66
RTX 4080 16GB -Pny: Min: 38 / Avg: 43.44 / Max: 73

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

Watts, Fewer Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac     20.35  201.61  329.19
RTX 2080 Ti 22GB -Dell   21.69  175.84  257.38
RTX 4090 24GB -Nvidia     6.77   116.7  447.83
RTX 4080 16GB -Pny       13.84   90.17  331.21

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

Celsius, Fewer Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      52.0    58.8    65.0
RTX 2080 Ti 22GB -Dell    51.0    55.1    59.0
RTX 4090 24GB -Nvidia     37.0    39.1    41.0
RTX 4080 16GB -Pny        41.0    42.7    45.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

Watts, Fewer Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.7    86.6   154.5
RTX 2080 Ti 22GB -Dell    20.6    66.6   107.8
RTX 4090 24GB -Nvidia      6.6    43.8    72.7
RTX 4080 16GB -Pny        13.9    38.2    62.5

SHOC Scalable HeterOgeneous Computing

GPU Temperature Monitor

Celsius, Fewer Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      55.0    61.1    68.0
RTX 2080 Ti 22GB -Dell    44.0    49.7    54.0
RTX 4090 24GB -Nvidia     34.0    36.1    38.0
RTX 4080 16GB -Pny        38.0    40.1    42.0

SHOC Scalable HeterOgeneous Computing

GPU Power Consumption Monitor

Watts, Fewer Is Better (SHOC Scalable HeterOgeneous Computing 2020-04-17)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      20.3    93.5   158.5
RTX 2080 Ti 22GB -Dell    20.1    63.6   102.9
RTX 4090 24GB -Nvidia      6.4    43.4    72.8
RTX 4080 16GB -Pny        13.7    37.0    60.9

ViennaCL

GPU Temperature Monitor

Celsius, Fewer Is Better (ViennaCL 1.7.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      48.0    53.3    57.0
RTX 2080 Ti 22GB -Dell    44.0    49.2    58.0
RTX 4090 24GB -Nvidia     32.0    33.9    37.0
RTX 4080 16GB -Pny        34.0    35.7    38.0

ViennaCL

GPU Power Consumption Monitor

Watts, Fewer Is Better (ViennaCL 1.7.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.0    26.2    37.2
RTX 2080 Ti 22GB -Dell    18.4    21.6    43.4
RTX 4090 24GB -Nvidia      6.0     8.0    15.8
RTX 4080 16GB -Pny         4.0    12.1    18.8

ViennaCL

Test: CPU BLAS - dGEMM-TT

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac       143    5.83  15
RTX 2080 Ti 22GB -Dell     145    4.93  15
RTX 4090 24GB -Nvidia      144    9.73   5
RTX 4080 16GB -Pny         137    6.46  11
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL
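
The CPU BLAS results differ only slightly between configurations, and each carries a sizeable standard error. A minimal Python sketch for judging whether the gap between the highest and lowest dGEMM-TT result exceeds run-to-run noise is shown here. The means and SE values are copied from the result above. The helper name `gap_in_se` and the idea of comparing the gap against the combined standard error are illustrative assumptions, not something Phoronix Test Suite itself reports.

```python
import math

# (mean GFLOPs/s, standard error) as reported for Test: CPU BLAS - dGEMM-TT
results = {
    "RTX 3090 24GB -Zotac": (143.0, 5.83),
    "RTX 2080 Ti 22GB -Dell": (145.0, 4.93),
    "RTX 4090 24GB -Nvidia": (144.0, 9.73),
    "RTX 4080 16GB -Pny": (137.0, 6.46),
}


def gap_in_se(a, b):
    """Hypothetical helper: difference of two means in units of their combined SE."""
    (mean_a, se_a), (mean_b, se_b) = a, b
    return abs(mean_a - mean_b) / math.hypot(se_a, se_b)


# Pick the configurations with the highest and lowest mean result.
fastest = max(results, key=lambda name: results[name][0])
slowest = min(results, key=lambda name: results[name][0])
ratio = gap_in_se(results[fastest], results[slowest])

print(f"{fastest} vs {slowest}: gap = {ratio:.2f} combined standard errors")
```

A gap well under about two combined standard errors suggests the spread is not distinguishable from noise, which would be expected if this test mainly exercises the host CPU rather than the GPU.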

ViennaCL

Test: CPU BLAS - dGEMM-TN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac       141    5.51  15
RTX 2080 Ti 22GB -Dell     142    4.47  15
RTX 4090 24GB -Nvidia      137    9.50   5
RTX 4080 16GB -Pny         135    4.72  12
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac       120    4.37  15
RTX 2080 Ti 22GB -Dell     119    2.19  15
RTX 4090 24GB -Nvidia      115    0.48   4
RTX 4080 16GB -Pny         122    4.18  12
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

GFLOPs/s, More Is Better (ViennaCL 1.7.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac     107.0    1.05  15
RTX 2080 Ti 22GB -Dell   106.8    1.59  15
RTX 4090 24GB -Nvidia    120.0   10.59   5
RTX 4080 16GB -Pny       103.5    1.29  12
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

GB/s, More Is Better (ViennaCL 1.7.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac       172    8.48  15
RTX 2080 Ti 22GB -Dell     179    1.84  15
RTX 4090 24GB -Nvidia      188    1.74   5
RTX 4080 16GB -Pny         187    0.91  12
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

GPU Temperature Monitor

Celsius, Fewer Is Better (ViennaCL 1.7.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      49.0    70.8    83.0
RTX 2080 Ti 22GB -Dell    59.0    72.2    78.0
RTX 4090 24GB -Nvidia     36.0    41.5    48.0
RTX 4080 16GB -Pny        37.0    44.0    54.0

ViennaCL

GPU Power Consumption Monitor

Watts, Fewer Is Better (ViennaCL 1.7.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      28.9   232.9   357.9
RTX 2080 Ti 22GB -Dell    22.4   173.6   255.0
RTX 4090 24GB -Nvidia      6.9   128.0   256.4
RTX 4080 16GB -Pny        13.4   109.2   223.4

MandelGPU

GPU Temperature Monitor

Celsius, Fewer Is Better (MandelGPU 1.3pts1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    62.9    79.0
RTX 2080 Ti 22GB -Dell    60.0    67.6    74.0
RTX 4090 24GB -Nvidia     36.0    38.8    46.0
RTX 4080 16GB -Pny        37.0    40.9    52.0

MandelGPU

GPU Power Consumption Monitor

Watts, Fewer Is Better (MandelGPU 1.3pts1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.7   149.1   321.4
RTX 2080 Ti 22GB -Dell    22.6   126.4   254.8
RTX 4090 24GB -Nvidia      6.9    68.6   215.0
RTX 4080 16GB -Pny        14.2    65.5   184.4

cl-mem

GPU Temperature Monitor

Celsius, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    64.4    74.0
RTX 2080 Ti 22GB -Dell    60.0    67.6    73.0
RTX 4090 24GB -Nvidia     35.0    38.0    40.0
RTX 4080 16GB -Pny        37.0    40.3    43.0

cl-mem

GPU Power Consumption Monitor

Watts, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.4   168.8   328.7
RTX 2080 Ti 22GB -Dell    23.0   127.1   210.3
RTX 4090 24GB -Nvidia      6.8    90.7   205.9
RTX 4080 16GB -Pny        10.7    80.4   155.4

cl-mem

GPU Temperature Monitor

Celsius, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      49.0    64.5    74.0
RTX 2080 Ti 22GB -Dell    60.0    67.7    73.0
RTX 4090 24GB -Nvidia     36.0    37.9    40.0
RTX 4080 16GB -Pny        38.0    40.7    43.0

cl-mem

GPU Power Consumption Monitor

Watts, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.7   171.2   327.8
RTX 2080 Ti 22GB -Dell    23.2   126.9   212.8
RTX 4090 24GB -Nvidia      6.8    87.4   204.4
RTX 4080 16GB -Pny        12.8    77.9   155.5

cl-mem

GPU Temperature Monitor

Celsius, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      48.0    63.9    73.0
RTX 2080 Ti 22GB -Dell    59.0    67.8    73.0
RTX 4090 24GB -Nvidia     38.0    39.6    42.0
RTX 4080 16GB -Pny        39.0    41.6    44.0

cl-mem

GPU Power Consumption Monitor

Watts, Fewer Is Better (cl-mem 2017-01-13)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.8   166.2   329.4
RTX 2080 Ti 22GB -Dell    22.9   130.3   212.4
RTX 4090 24GB -Nvidia      6.8    92.1   205.2
RTX 4080 16GB -Pny        12.9    78.5   155.7

LeelaChessZero

GPU Temperature Monitor

Celsius, Fewer Is Better (LeelaChessZero 0.28)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      53.0    75.4    83.0
RTX 2080 Ti 22GB -Dell    55.0    79.4    82.0
RTX 4090 24GB -Nvidia     36.0    43.6    46.0
RTX 4080 16GB -Pny        38.0    47.0    50.0

LeelaChessZero

GPU Power Consumption Monitor

Watts, Fewer Is Better (LeelaChessZero 0.28)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      28.5   263.6   349.6
RTX 2080 Ti 22GB -Dell    21.9   229.3   264.6
RTX 4090 24GB -Nvidia      6.5   122.6   165.3
RTX 4080 16GB -Pny        14.0   117.3   160.6

FinanceBench

GPU Temperature Monitor

Celsius, Fewer Is Better (FinanceBench 2016-07-25)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      52.0    55.8    60.0
RTX 2080 Ti 22GB -Dell    55.0    56.8    58.0
RTX 4090 24GB -Nvidia     33.0    34.7    36.0
RTX 4080 16GB -Pny        35.0    36.4    38.0

FinanceBench

GPU Power Consumption Monitor

Watts, Fewer Is Better (FinanceBench 2016-07-25)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.5    66.6   119.2
RTX 2080 Ti 22GB -Dell    21.3    45.4    73.1
RTX 4090 24GB -Nvidia      6.4    29.3    51.3
RTX 4080 16GB -Pny        13.0    26.6    39.8

NeatBench

GPU Temperature Monitor

Celsius, Fewer Is Better (NeatBench 5)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    57.3    63.0
RTX 2080 Ti 22GB -Dell    56.0    58.3    61.0
RTX 4090 24GB -Nvidia     33.0    34.4    36.0
RTX 4080 16GB -Pny        34.0    34.9    37.0

NeatBench

GPU Power Consumption Monitor

Watts, Fewer Is Better (NeatBench 5)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      23.6    85.4   168.2
RTX 2080 Ti 22GB -Dell    22.1    68.0   119.6
RTX 4090 24GB -Nvidia      6.9    36.3    73.8
RTX 4080 16GB -Pny         4.6    27.9    89.5

clpeak

GPU Temperature Monitor

Celsius, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    61.6    72.0
RTX 2080 Ti 22GB -Dell    59.0    62.5    68.0
RTX 4090 24GB -Nvidia     36.0    38.5    55.0
RTX 4080 16GB -Pny        35.0    38.5    56.0

clpeak

GPU Power Consumption Monitor

Watts, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      23.7   127.5   323.8
RTX 2080 Ti 22GB -Dell    22.3    90.1   254.1
RTX 4090 24GB -Nvidia      7.1    83.9   421.2
RTX 4080 16GB -Pny        14.8    58.3   297.3

clpeak

GPU Temperature Monitor

Celsius, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      52.0    67.1    73.0
RTX 2080 Ti 22GB -Dell    59.0    70.0    75.0
RTX 4090 24GB -Nvidia     36.0    39.5    41.0
RTX 4080 16GB -Pny        35.0    39.2    41.0

clpeak

GPU Power Consumption Monitor

Watts, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      20.3   170.7   209.4
RTX 2080 Ti 22GB -Dell    22.4   145.6   183.3
RTX 4090 24GB -Nvidia      7.0    90.3   122.6
RTX 4080 16GB -Pny        12.8    70.5    93.0

clpeak

GPU Temperature Monitor

Celsius, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    61.1    73.0
RTX 2080 Ti 22GB -Dell    60.0    65.9    73.0
RTX 4090 24GB -Nvidia     36.0    38.8    52.0
RTX 4080 16GB -Pny        36.0    38.0    52.0

clpeak

GPU Power Consumption Monitor

Watts, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      22.4   114.3   268.7
RTX 2080 Ti 22GB -Dell    21.8   109.9   254.5
RTX 4090 24GB -Nvidia      6.9    70.4   356.0
RTX 4080 16GB -Pny        14.3    56.1   255.4

clpeak

GPU Temperature Monitor

Celsius, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    62.0    75.0
RTX 2080 Ti 22GB -Dell    59.0    66.4    74.0
RTX 4090 24GB -Nvidia     35.0    37.4    40.0
RTX 4080 16GB -Pny        36.0    38.4    43.0

clpeak

GPU Power Consumption Monitor

Watts, Fewer Is Better (clpeak 1.1.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      20.3   147.9   355.6
RTX 2080 Ti 22GB -Dell    22.0   120.7   246.3
RTX 4090 24GB -Nvidia      6.8    79.8   231.0
RTX 4080 16GB -Pny        14.6    68.3   177.2

ArrayFire

GPU Temperature Monitor

Celsius, Fewer Is Better (ArrayFire 3.7)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      49.0    61.9    72.0
RTX 2080 Ti 22GB -Dell    59.0    64.4    71.0
RTX 4090 24GB -Nvidia     36.0    38.5    41.0
RTX 4080 16GB -Pny        37.0    39.4    42.0

ArrayFire

GPU Power Consumption Monitor

Watts, Fewer Is Better (ArrayFire 3.7)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      24.2   149.2   279.0
RTX 2080 Ti 22GB -Dell    22.4   103.9   212.7
RTX 4090 24GB -Nvidia      6.7    47.6   113.6
RTX 4080 16GB -Pny        14.7    58.6   132.1

Rodinia

GPU Temperature Monitor

Celsius, Fewer Is Better (Rodinia 3.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      52.0    64.9    77.0
RTX 2080 Ti 22GB -Dell    62.0    69.2    74.0
RTX 4090 24GB -Nvidia     41.0    44.6    50.0
RTX 4080 16GB -Pny        39.0    43.8    49.0

Rodinia

GPU Power Consumption Monitor

Watts, Fewer Is Better (Rodinia 3.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.9   148.1   254.9
RTX 2080 Ti 22GB -Dell    23.3   133.0   219.9
RTX 4090 24GB -Nvidia      6.8    67.5   167.3
RTX 4080 16GB -Pny        14.5    63.1   123.6

Rodinia

Test: OpenCL Particle Filter

Seconds, Fewer Is Better (Rodinia 3.1)
GPU                     Result  SE +/-   N
RTX 3090 24GB -Zotac     3.768   0.029  10
RTX 2080 Ti 22GB -Dell   4.527   0.031   8
RTX 4090 24GB -Nvidia    2.085   0.080   3
RTX 4080 16GB -Pny       2.685   0.005  10
1. (CXX) g++ options: -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl

OctaneBench

GPU Temperature Monitor

Celsius, Fewer Is Better (OctaneBench 2020.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    79.3    82.0
RTX 2080 Ti 22GB -Dell    58.0    80.0    83.0
RTX 4090 24GB -Nvidia     35.0    55.0    60.0
RTX 4080 16GB -Pny        35.0    54.7    59.0

OctaneBench

GPU Power Consumption Monitor

Watts, Fewer Is Better (OctaneBench 2020.1)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      28.6   337.3   362.8
RTX 2080 Ti 22GB -Dell    22.6   242.0   259.3
RTX 4090 24GB -Nvidia      6.8   279.5   322.5
RTX 4080 16GB -Pny        12.7   220.4   259.6

NAMD CUDA

GPU Temperature Monitor

Celsius, Fewer Is Better (NAMD CUDA 2.14)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      47.0    54.3    78.0
RTX 2080 Ti 22GB -Dell    60.0    67.3    74.0
RTX 4090 24GB -Nvidia     34.0    37.3    44.0
RTX 4080 16GB -Pny        36.0    39.5    52.0

NAMD CUDA

GPU Power Consumption Monitor

Watts, Fewer Is Better (NAMD CUDA 2.14)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.0    68.9   327.9
RTX 2080 Ti 22GB -Dell    23.6   127.2   297.8
RTX 4090 24GB -Nvidia      6.4    54.4   281.7
RTX 4080 16GB -Pny        13.4    69.1   263.4

NAMD CUDA

ATPase Simulation - 327,506 Atoms

days/ns, Fewer Is Better (NAMD CUDA 2.14)
GPU                      Result   SE +/-   N
RTX 3090 24GB -Zotac    0.06742  0.00094   3
RTX 2080 Ti 22GB -Dell  0.09515  0.00074  15
RTX 4090 24GB -Nvidia   0.04850  0.00119  15
RTX 4080 16GB -Pny      0.05427  0.00113  15
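
Because NAMD reports days/ns (lower is better), it can be easier to compare cards after inverting to ns/day. The Python sketch below does that conversion using the four values reported above. The helper name `ns_per_day` and the choice of the RTX 4080 as the comparison baseline are illustrative assumptions, not part of the exported result.

```python
# days/ns as reported for NAMD CUDA 2.14, ATPase Simulation - 327,506 Atoms
days_per_ns = {
    "RTX 3090 24GB -Zotac": 0.06742,
    "RTX 2080 Ti 22GB -Dell": 0.09515,
    "RTX 4090 24GB -Nvidia": 0.04850,
    "RTX 4080 16GB -Pny": 0.05427,
}


def ns_per_day(days_per_nanosecond):
    """Hypothetical helper: invert days/ns into simulated nanoseconds per day."""
    return 1.0 / days_per_nanosecond


# Baseline chosen for illustration only: the card under review.
baseline = "RTX 4080 16GB -Pny"
throughput = {gpu: ns_per_day(d) for gpu, d in days_per_ns.items()}
relative = {gpu: t / throughput[baseline] for gpu, t in throughput.items()}

for gpu in days_per_ns:
    print(f"{gpu:24s} {throughput[gpu]:6.2f} ns/day  {relative[gpu]:.2f}x vs {baseline}")
```

The relative column is a ratio of throughputs, so values above 1 mean faster than the baseline card on this workload.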

GROMACS

GPU Temperature Monitor

Celsius, Fewer Is Better (GROMACS 2023)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      48.0    71.1    82.0
RTX 2080 Ti 22GB -Dell    59.0    73.6    80.0
RTX 4090 24GB -Nvidia     36.0    45.6    57.0
RTX 4080 16GB -Pny        35.0    48.9    62.0

GROMACS

GPU Power Consumption Monitor

Watts, Fewer Is Better (GROMACS 2023)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      28.0   243.2   365.1
RTX 2080 Ti 22GB -Dell    22.7   194.8   276.3
RTX 4090 24GB -Nvidia      6.7   182.8   404.4
RTX 4080 16GB -Pny        12.5   176.8   317.1

FAHBench

GPU Temperature Monitor

Celsius, Fewer Is Better (FAHBench 2.3.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      46.0    66.8    80.0
RTX 2080 Ti 22GB -Dell    55.0    69.5    81.0
RTX 4090 24GB -Nvidia     35.0    40.0    44.0
RTX 4080 16GB -Pny        33.0    39.7    46.0

FAHBench

GPU Power Consumption Monitor

Watts, Fewer Is Better (FAHBench 2.3.2)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.0   192.6   299.0
RTX 2080 Ti 22GB -Dell    21.7   160.0   255.5
RTX 4090 24GB -Nvidia      6.5    96.4   148.8
RTX 4080 16GB -Pny        10.4    88.1   141.6

Hashcat

GPU Temperature Monitor

Celsius, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      47.0    60.7    77.0
RTX 2080 Ti 22GB -Dell    59.0    65.3    73.0
RTX 4090 24GB -Nvidia     39.0    44.8    62.0
RTX 4080 16GB -Pny        35.0    42.8    63.0

Hashcat

GPU Power Consumption Monitor

Watts, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.1   138.5   326.4
RTX 2080 Ti 22GB -Dell    22.8   115.4   347.3
RTX 4090 24GB -Nvidia      6.6   136.8   444.0
RTX 4080 16GB -Pny        12.4    94.7   308.4

Hashcat

GPU Temperature Monitor

Celsius, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      45.0    60.4    76.0
RTX 2080 Ti 22GB -Dell    59.0    65.6    73.0
RTX 4090 24GB -Nvidia     41.0    46.6    62.0
RTX 4080 16GB -Pny        36.0    43.0    63.0

Hashcat

GPU Power Consumption Monitor

Watts, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.1   155.4   349.1
RTX 2080 Ti 22GB -Dell    22.9   121.2   333.4
RTX 4090 24GB -Nvidia      6.8   144.6   456.1
RTX 4080 16GB -Pny        13.2   101.3   323.7

Hashcat

GPU Temperature Monitor

Celsius, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      45.0    65.2    78.0
RTX 2080 Ti 22GB -Dell    58.0    69.2    76.0
RTX 4090 24GB -Nvidia     41.0    52.5    65.0
RTX 4080 16GB -Pny        35.0    48.9    64.0

Hashcat

GPU Power Consumption Monitor

Watts, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      20.5   192.6   326.5
RTX 2080 Ti 22GB -Dell    22.3   156.5   321.2
RTX 4090 24GB -Nvidia      6.8   231.7   417.6
RTX 4080 16GB -Pny        12.9   157.1   290.4

Hashcat

GPU Temperature Monitor

Celsius, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      46.0    65.4    78.0
RTX 2080 Ti 22GB -Dell    58.0    68.8    76.0
RTX 4090 24GB -Nvidia     40.0    50.8    64.0
RTX 4080 16GB -Pny        34.0    47.5    62.0

Hashcat

GPU Power Consumption Monitor

Watts, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      19.1   192.7   330.0
RTX 2080 Ti 22GB -Dell    22.5   154.3   261.7
RTX 4090 24GB -Nvidia      6.7   190.1   401.1
RTX 4080 16GB -Pny        12.7   148.9   281.6

Hashcat

GPU Temperature Monitor

Celsius, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      50.0    67.5    82.0
RTX 2080 Ti 22GB -Dell    50.0    65.0    74.0
RTX 4090 24GB -Nvidia     36.0    49.1    65.0
RTX 4080 16GB -Pny        31.0    45.7    63.0

Hashcat

GPU Power Consumption Monitor

Watts, Fewer Is Better (Hashcat 6.2.4)
GPU                        Min     Avg     Max
RTX 3090 24GB -Zotac      20.4   191.9   326.0
RTX 2080 Ti 22GB -Dell    21.1   147.0   255.1
RTX 4090 24GB -Nvidia      6.7   197.8   444.1
RTX 4080 16GB -Pny         6.8   142.4   304.5


Phoronix Test Suite v10.8.4