NCNN Vulkan - AMD vs. NVIDIA

NCNN Vulkan benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2009260-PTS-NCNNVULK08&sor&grs.

NCNN Vulkan - AMD vs. NVIDIAProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionGTX 1060GTX 1070GTX 1080GTX 1650GTX 1650 SUPERGTX 1660GTX 1660 SUPERGTX 1660 TiRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiRX Vega 56RX 5600 XTRX 5700RX 5700 XTRadeon VIIAMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBNVIDIA GeForce GTX 1060 6GB (1506/4006MHz)NVIDIA GP106 HD AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.4.0-48-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8NVIDIA 450.664.6.0OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.2281.2.133GCC 9.3.0 + CUDA 11.0ext43840x2160NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)NVIDIA GP104 HD AudioNVIDIA GeForce GTX 1080 8GB (1607/5005MHz)ASUS NVIDIA GeForce GTX 1650 4GB (1485/4001MHz)NVIDIA Device 10faASUS NVIDIA GeForce GTX 1650 SUPER 4GB (375/810MHz)NVIDIA TU116 HD AudioASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)eVGA NVIDIA GeForce GTX 1660 Ti 6GB (1500/6000MHz)NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)NVIDIA TU106 HD AudioNVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)ASUS NVIDIA GeForce RTX 2070 8GB (420/405MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (375/405MHz)NVIDIA TU104 HD AudioZotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)NVIDIA GeForce RTX 2080 SUPER 8GB (405/405MHz)NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)NVIDIA TU102 HD AudioAMD Radeon RX 56/64 8GB (1590/800MHz)AMD Vega 10 HDMI Audio5.9.0-050900rc6daily20200925-generic (x86_64) 202009244.6 Mesa 20.3.0-devel (git-3173367 2020-09-25 focal-oibaf-ppa) (LLVM 10.0.1)OpenCL 2.0 AMD-APP (3182.0)1.2.145Sapphire AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1780/875MHz)AMD Navi 10 HDMI AudioAMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz)AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioOpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Details- Scaling Governor: acpi-cpufreq ondemand - CPU Microcode: 0x8701013Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

NCNN Vulkan - AMD vs. NVIDIArealsr-ncnn: 4x - Yesncnn: Vulkan GPU - vgg16realsr-ncnn: 4x - Noncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - squeezenetncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2GTX 1060GTX 1070GTX 1080GTX 1650GTX 1650 SUPERGTX 1660GTX 1660 SUPERGTX 1660 TiRTX 2060RTX 2060 SUPERRTX 2070RTX 2070 SUPERRTX 2080RTX 2080 SUPERRTX 2080 TiRX Vega 56RX 5600 XTRX 5700RX 5700 XTRadeon VII143.94920.2020.5435.408.007.775.9910.523.135.450.703.652.071.602.421.97102.78714.9815.3743.845.916.525.479.152.394.580.713.491.961.552.321.85116.45215.0417.2143.784.915.835.268.842.014.130.713.401.931.472.252.23215.51824.1529.3723.748.426.836.2911.953.375.310.703.502.171.572.291.94186.17921.5525.6582.955.835.805.4210.672.614.090.643.211.631.401.881.60138.72116.9819.8933.055.755.525.579.922.374.330.673.192.001.512.141.77120.14815.417.5382.755.025.205.099.412.383.940.632.871.791.331.981.48117.67415.3317.1502.745.045.245.139.452.263.780.662.841.631.461.831.6186.37211.0813.4002.434.264.724.908.742.003.560.692.841.641.421.791.5476.37310.3112.1292.223.844.604.628.541.723.360.622.611.681.321.931.4575.4379.6412.0072.243.884.674.838.421.733.710.622.891.771.321.701.4563.8638.4710.6152.063.754.334.658.301.683.230.662.691.541.361.751.4460.3608.0510.0611.983.574.284.658.221.633.330.712.761.511.441.761.4954.8407.639.4631.933.534.304.458.121.513.230.632.711.511.391.621.4844.9865.608.2601.613.144.014.537.771.423.050.672.731.481.361.731.4466.10811.5610.2544.386.544.896.317.672.205.740.8710.543.092.344.652.9580.26015.2412.0795.245.023.586.548.531.734.240.767.252.331.843.112.1558.92914.919.3585.385.243.586.608.491.714.270.777.852.251.863.132.1855.67813.708.9824.814.463.366.588.321.573.870.736.632.081.732.962.0541.28810.427.1773.786.625.039.0210.002.047.220.9211.073.152.604.132.84OpenBenchmarking.org

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesRadeon VIIRTX 2080 TiRTX 2080 SUPERRX 5700 XTRX 5700RTX 2080RTX 2070 SUPERRX Vega 56RTX 2070RTX 2060 SUPERRX 5600 XTRTX 2060GTX 1070GTX 1080GTX 1660 TiGTX 1660 SUPERGTX 1660GTX 1060GTX 1650 SUPERGTX 165050100150200250SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 3SE +/- 0.38, N = 3SE +/- 0.31, N = 3SE +/- 0.08, N = 3SE +/- 0.56, N = 3SE +/- 0.43, N = 3SE +/- 0.13, N = 3SE +/- 0.26, N = 3SE +/- 0.36, N = 3SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 3SE +/- 0.27, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3SE +/- 0.29, N = 341.2944.9954.8455.6858.9360.3663.8666.1175.4476.3780.2686.37102.79116.45117.67120.15138.72143.95186.18215.52

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg16RTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2070RTX 2060 SUPERRadeon VIIRTX 2060RX Vega 56RX 5700 XTRX 5700GTX 1070GTX 1080RX 5600 XTGTX 1660 TiGTX 1660 SUPERGTX 1660GTX 1060GTX 1650 SUPERGTX 1650612182430SE +/- 0.03, N = 15SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.03, N = 15SE +/- 0.22, N = 3SE +/- 0.56, N = 3SE +/- 0.11, N = 4SE +/- 0.06, N = 15SE +/- 0.13, N = 3SE +/- 0.08, N = 14SE +/- 0.04, N = 3SE +/- 0.08, N = 15SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 15SE +/- 0.17, N = 3SE +/- 0.12, N = 7SE +/- 0.19, N = 4SE +/- 0.27, N = 3SE +/- 0.03, N = 45.607.638.058.479.6410.3110.4211.0811.5613.7014.9114.9815.0415.2415.3315.4016.9820.2021.5524.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoRadeon VIIRTX 2080 TiRX 5700 XTRX 5700RTX 2080 SUPERRTX 2080RX Vega 56RTX 2070 SUPERRTX 2070RX 5600 XTRTX 2060 SUPERRTX 2060GTX 1070GTX 1660 TiGTX 1080GTX 1660 SUPERGTX 1660GTX 1060GTX 1650 SUPERGTX 1650714212835SE +/- 0.018, N = 6SE +/- 0.020, N = 6SE +/- 0.020, N = 5SE +/- 0.035, N = 5SE +/- 0.041, N = 5SE +/- 0.038, N = 5SE +/- 0.021, N = 5SE +/- 0.034, N = 5SE +/- 0.039, N = 4SE +/- 0.024, N = 4SE +/- 0.039, N = 4SE +/- 0.021, N = 4SE +/- 0.035, N = 4SE +/- 0.030, N = 3SE +/- 0.068, N = 3SE +/- 0.010, N = 3SE +/- 0.045, N = 3SE +/- 0.020, N = 3SE +/- 0.050, N = 3SE +/- 0.039, N = 37.1778.2608.9829.3589.46310.06110.25410.61512.00712.07912.12913.40015.37417.15017.21417.53819.89320.54325.65829.372

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnetRTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 2060GTX 1660 TiGTX 1660 SUPERGTX 1650 SUPERGTX 1660GTX 1650GTX 1080Radeon VIIGTX 1070RX Vega 56RX 5700 XTRX 5600 XTRX 5700GTX 10601.2152.433.6454.866.075SE +/- 0.02, N = 15SE +/- 0.03, N = 15SE +/- 0.01, N = 14SE +/- 0.02, N = 15SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 7SE +/- 0.03, N = 4SE +/- 0.04, N = 3SE +/- 0.00, N = 4SE +/- 0.04, N = 14SE +/- 0.00, N = 3SE +/- 0.01, N = 14SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 41.611.931.982.062.222.242.432.742.752.953.053.743.783.783.844.384.815.245.385.401. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet50RTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 2060RX 5700 XTGTX 1080GTX 1660 SUPERRX 5600 XTGTX 1660 TiRX 5700GTX 1660GTX 1650 SUPERGTX 1070RX Vega 56Radeon VIIGTX 1060GTX 1650246810SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 15SE +/- 0.00, N = 14SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 14SE +/- 0.16, N = 3SE +/- 0.02, N = 7SE +/- 0.03, N = 3SE +/- 0.04, N = 15SE +/- 0.07, N = 3SE +/- 0.08, N = 4SE +/- 0.09, N = 4SE +/- 0.18, N = 43.143.533.573.753.843.884.264.464.915.025.025.045.245.755.835.916.546.628.008.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenetRX 5700 XTRX 5600 XTRX 5700RTX 2080 TiRTX 2080RTX 2080 SUPERRTX 2070 SUPERRTX 2060 SUPERRTX 2070RTX 2060RX Vega 56Radeon VIIGTX 1660 SUPERGTX 1660 TiGTX 1660GTX 1650 SUPERGTX 1080GTX 1070GTX 1650GTX 1060246810SE +/- 0.03, N = 14SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 15SE +/- 0.04, N = 15SE +/- 0.03, N = 15SE +/- 0.05, N = 15SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 15SE +/- 0.00, N = 3SE +/- 0.07, N = 4SE +/- 0.03, N = 3SE +/- 0.17, N = 15SE +/- 0.06, N = 7SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 15SE +/- 0.09, N = 4SE +/- 0.11, N = 43.363.583.584.014.284.304.334.604.674.724.895.035.205.245.525.805.836.526.837.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenetRTX 2080 SUPERRTX 2080 TiRTX 2060 SUPERRTX 2070 SUPERRTX 2080RTX 2070RTX 2060GTX 1660 SUPERGTX 1660 TiGTX 1080GTX 1650 SUPERGTX 1070GTX 1660GTX 1060GTX 1650RX Vega 56RX 5600 XTRX 5700 XTRX 5700Radeon VII3691215SE +/- 0.05, N = 15SE +/- 0.03, N = 14SE +/- 0.10, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.03, N = 3SE +/- 0.05, N = 15SE +/- 0.12, N = 3SE +/- 0.04, N = 15SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 15SE +/- 0.01, N = 7SE +/- 0.08, N = 4SE +/- 0.02, N = 4SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 14SE +/- 0.01, N = 3SE +/- 0.06, N = 44.454.534.624.654.654.834.905.095.135.265.425.475.575.996.296.316.546.586.609.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tinyRX Vega 56RTX 2080 TiRTX 2080 SUPERRTX 2080RTX 2070 SUPERRX 5700 XTRTX 2070RX 5700RX 5600 XTRTX 2060 SUPERRTX 2060GTX 1080GTX 1070GTX 1660 SUPERGTX 1660 TiGTX 1660Radeon VIIGTX 1060GTX 1650 SUPERGTX 16503691215SE +/- 0.02, N = 3SE +/- 0.07, N = 15SE +/- 0.08, N = 14SE +/- 0.06, N = 15SE +/- 0.06, N = 15SE +/- 0.01, N = 14SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 15SE +/- 0.11, N = 3SE +/- 0.04, N = 15SE +/- 0.12, N = 3SE +/- 0.04, N = 15SE +/- 0.12, N = 7SE +/- 0.01, N = 4SE +/- 0.08, N = 4SE +/- 0.08, N = 3SE +/- 0.11, N = 47.677.778.128.228.308.328.428.498.538.548.748.849.159.419.459.9210.0010.5210.6711.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GPU Temperature Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgCelsiusGPU Temperature MonitorPhoronix Test Suite System MonitoringRadeon VIIRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 2060RX Vega 56RX 5700RX 5700 XTRTX 2060 SUPERGTX 1660 TiRTX 2070GTX 1650 SUPERGTX 1060GTX 1650RX 5600 XTRTX 2080GTX 1660 SUPERGTX 1070GTX 1660GTX 10801632486480Min: 29 / Avg: 44.95 / Max: 66Min: 26 / Avg: 46.54 / Max: 68Min: 28 / Avg: 48.45 / Max: 72Min: 32 / Avg: 49.95 / Max: 71Min: 28 / Avg: 50.87 / Max: 70Min: 29 / Avg: 51.18 / Max: 65Min: 38 / Avg: 52.05 / Max: 67Min: 35 / Avg: 52.14 / Max: 74Min: 29 / Avg: 52.23 / Max: 71Min: 29 / Avg: 52.99 / Max: 69Min: 29 / Avg: 54.07 / Max: 74Min: 30 / Avg: 55.56 / Max: 66Min: 27 / Avg: 55.8 / Max: 69Min: 31 / Avg: 56.9 / Max: 66Min: 44 / Avg: 57.67 / Max: 66Min: 29 / Avg: 58.81 / Max: 82Min: 33 / Avg: 59.37 / Max: 70Min: 29 / Avg: 60.05 / Max: 78Min: 29 / Avg: 60.56 / Max: 79Min: 29 / Avg: 62.17 / Max: 79

GPU Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsGPU Power Consumption MonitorPhoronix Test Suite System MonitoringGTX 1650GTX 1650 SUPERGTX 1660GTX 1660 TiGTX 1060RX 5700 XTRX 5700GTX 1660 SUPERRTX 2060GTX 1070RX Vega 56RX 5600 XTRadeon VIIRTX 2070RTX 2070 SUPERRTX 2060 SUPERRTX 2080RTX 2080 SUPERGTX 1080RTX 2080 Ti50100150200250Min: 5.43 / Avg: 53.96 / Max: 68.86Min: 6.51 / Avg: 71.29 / Max: 96.43Min: 6.69 / Avg: 72.09 / Max: 109.23Min: 7.41 / Avg: 78.56 / Max: 133.19Min: 5.95 / Avg: 86 / Max: 131.08Min: 29 / Avg: 88.95 / Max: 225Min: 31 / Avg: 89.52 / Max: 165Min: 10.75 / Avg: 90.37 / Max: 129.22Min: 9.49 / Avg: 91.04 / Max: 164.41Min: 6.75 / Avg: 92.52 / Max: 162.74Min: 11 / Avg: 96.87 / Max: 167Min: 22 / Avg: 98.3 / Max: 171Min: 21 / Avg: 104.49 / Max: 267Min: 7.53 / Avg: 105.98 / Max: 181.01Min: 14.42 / Avg: 107.67 / Max: 220.78Min: 10.34 / Avg: 108.02 / Max: 179.76Min: 13.31 / Avg: 108.43 / Max: 221.85Min: 8.89 / Avg: 110.83 / Max: 255.3Min: 6.57 / Avg: 115.01 / Max: 188.22Min: 7.79 / Avg: 115.51 / Max: 274.43

RealSR-NCNN

GPU Temperature Monitor

OpenBenchmarking.orgCelsius, Fewer Is BetterRealSR-NCNN 20200818GPU Temperature MonitorRadeon VIIRX Vega 56RX 5600 XTRX 5700GTX 1660 TiRTX 2070 SUPERGTX 1660 SUPERGTX 1650RTX 2060RTX 2060 SUPERGTX 1060GTX 1650 SUPERRTX 2080 TiRTX 2080 SUPERRTX 2070RX 5700 XTGTX 1660RTX 2080GTX 1070GTX 10801428425670Min: 48 / Avg: 53.03 / Max: 58Min: 49 / Avg: 54.75 / Max: 59Min: 51 / Avg: 57.75 / Max: 63Min: 52 / Avg: 57.79 / Max: 63Min: 48 / Avg: 58.39 / Max: 64Min: 53 / Avg: 58.67 / Max: 63Min: 46 / Avg: 59.14 / Max: 65Min: 50 / Avg: 59.36 / Max: 62Min: 52 / Avg: 60.15 / Max: 65Min: 53 / Avg: 60.46 / Max: 65Min: 54 / Avg: 60.61 / Max: 64Min: 55 / Avg: 60.68 / Max: 63Min: 57 / Avg: 61.2 / Max: 66Min: 56 / Avg: 61.59 / Max: 66Min: 57 / Avg: 62.21 / Max: 66Min: 57 / Avg: 62.43 / Max: 68Min: 59 / Avg: 67.45 / Max: 72Min: 64 / Avg: 68.73 / Max: 73Min: 65 / Avg: 68.84 / Max: 71Min: 65 / Avg: 69.26 / Max: 72

RealSR-NCNN

GPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterRealSR-NCNN 20200818GPU Power Consumption MonitorGTX 1650GTX 1650 SUPERGTX 1660RX 5600 XTGTX 1660 TiGTX 1060RX 5700GTX 1660 SUPERRX Vega 56RTX 2060GTX 1070RX 5700 XTRadeon VIIRTX 2070RTX 2060 SUPERGTX 1080RTX 2070 SUPERRTX 2080RTX 2080 TiRTX 2080 SUPER50100150200250Min: 6.5 / Avg: 55.46 / Max: 68.27Min: 7 / Avg: 72.64 / Max: 95.3Min: 7.2 / Avg: 77.74 / Max: 106.6Min: 22 / Avg: 89.3 / Max: 156Min: 8.15 / Avg: 90.4 / Max: 132.9Min: 7.31 / Avg: 91.17 / Max: 124.12Min: 31 / Avg: 92.51 / Max: 165Min: 12.58 / Avg: 94.43 / Max: 127.87Min: 11 / Avg: 97.73 / Max: 167Min: 9.95 / Avg: 106.24 / Max: 163.21Min: 10.43 / Avg: 107.9 / Max: 153.11Min: 32 / Avg: 109.14 / Max: 212Min: 21 / Avg: 114.21 / Max: 262Min: 10.76 / Avg: 115.01 / Max: 178.57Min: 10.8 / Avg: 116.43 / Max: 178.77Min: 9.76 / Avg: 120.6 / Max: 186.3Min: 15.5 / Avg: 128.75 / Max: 219.45Min: 18.23 / Avg: 129.31 / Max: 221.85Min: 10.28 / Avg: 138.39 / Max: 267.04Min: 10.06 / Avg: 141.36 / Max: 255.21

RealSR-NCNN

GPU Temperature Monitor

OpenBenchmarking.orgCelsius, Fewer Is BetterRealSR-NCNN 20200818GPU Temperature MonitorRadeon VIIRTX 2070 SUPERRX 5700RTX 2080 TiRX Vega 56GTX 1650 SUPERRX 5600 XTRTX 2080 SUPERRTX 2060 SUPERGTX 1650RTX 2060GTX 1060GTX 1660 TiRTX 2070RX 5700 XTGTX 1660 SUPERGTX 1070GTX 1080GTX 1660RTX 20801632486480Min: 32 / Avg: 56.43 / Max: 66Min: 32 / Avg: 58.46 / Max: 68Min: 40 / Avg: 59.96 / Max: 67Min: 38 / Avg: 60.3 / Max: 71Min: 37 / Avg: 60.4 / Max: 65Min: 34 / Avg: 60.94 / Max: 66Min: 45 / Avg: 61.02 / Max: 66Min: 34 / Avg: 61.48 / Max: 72Min: 31 / Avg: 61.48 / Max: 71Min: 34 / Avg: 62.36 / Max: 66Min: 34 / Avg: 63.16 / Max: 70Min: 32 / Avg: 63.49 / Max: 69Min: 35 / Avg: 63.96 / Max: 69Min: 33 / Avg: 63.99 / Max: 74Min: 42 / Avg: 65.79 / Max: 74Min: 41 / Avg: 66.66 / Max: 70Min: 41 / Avg: 70.65 / Max: 78Min: 37 / Avg: 70.82 / Max: 79Min: 37 / Avg: 71.23 / Max: 79Min: 43 / Avg: 73.55 / Max: 82

RealSR-NCNN

GPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterRealSR-NCNN 20200818GPU Power Consumption MonitorGTX 1650GTX 1650 SUPERGTX 1660GTX 1060GTX 1660 SUPERGTX 1660 TiRX 5600 XTRX 5700GTX 1070RTX 2060RX Vega 56GTX 1080RX 5700 XTRTX 2070RTX 2060 SUPERRTX 2070 SUPERRTX 2080Radeon VIIRTX 2080 SUPERRTX 2080 Ti50100150200250Min: 5.59 / Avg: 64.99 / Max: 68.86Min: 6.74 / Avg: 88 / Max: 96.43Min: 6.81 / Avg: 99.98 / Max: 109.23Min: 6.15 / Avg: 112.02 / Max: 131.08Min: 11.09 / Avg: 119.27 / Max: 129.22Min: 7.82 / Avg: 122.28 / Max: 133.19Min: 22 / Avg: 124.95 / Max: 171Min: 31 / Avg: 136.68 / Max: 165Min: 8.06 / Avg: 139.07 / Max: 162.74Min: 9.49 / Avg: 148.27 / Max: 164.41Min: 12 / Avg: 150.39 / Max: 167Min: 7.24 / Avg: 152.95 / Max: 188.22Min: 32 / Avg: 159.84 / Max: 225Min: 7.96 / Avg: 160.99 / Max: 181.01Min: 10.5 / Avg: 162.04 / Max: 179.76Min: 14.42 / Avg: 196.82 / Max: 220.78Min: 14.09 / Avg: 197.01 / Max: 220.77Min: 21 / Avg: 208.15 / Max: 267Min: 9.25 / Avg: 219.85 / Max: 255.3Min: 8.53 / Avg: 226.48 / Max: 274.43

NCNN

GPU Temperature Monitor

OpenBenchmarking.orgCelsius, Fewer Is BetterNCNN 20200916GPU Temperature MonitorRadeon VIIGTX 1650 SUPERRTX 2070RTX 2060 SUPERRTX 2070 SUPERGTX 1060GTX 1080RX Vega 56RTX 2080 SUPERGTX 1660GTX 1650RX 5700RTX 2060RTX 2080 TiGTX 1660 TiRX 5700 XTGTX 1660 SUPERRTX 2080RX 5600 XTGTX 10701224364860Min: 29 / Avg: 35.12 / Max: 39Min: 30 / Avg: 35.87 / Max: 41Min: 29 / Avg: 36.15 / Max: 43Min: 29 / Avg: 36.86 / Max: 43Min: 26 / Avg: 38.38 / Max: 45Min: 27 / Avg: 38.51 / Max: 46Min: 29 / Avg: 38.57 / Max: 48Min: 29 / Avg: 40.31 / Max: 46Min: 28 / Avg: 40.39 / Max: 47Min: 29 / Avg: 41.23 / Max: 50Min: 31 / Avg: 41.54 / Max: 49Min: 38 / Avg: 41.97 / Max: 44Min: 28 / Avg: 41.98 / Max: 50Min: 32 / Avg: 43.67 / Max: 50Min: 29 / Avg: 44.32 / Max: 52Min: 35 / Avg: 45.08 / Max: 50Min: 33 / Avg: 47.22 / Max: 58Min: 29 / Avg: 49.32 / Max: 61Min: 45 / Avg: 49.92 / Max: 54Min: 29 / Avg: 51.63 / Max: 61

NCNN

GPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterNCNN 20200916GPU Power Consumption MonitorGTX 1650GTX 1660GTX 1650 SUPERRX 5600 XTGTX 1660 TiGTX 1060RX 5700Radeon VIIGTX 1660 SUPERRTX 2070RX 5700 XTRTX 2060RTX 2060 SUPERGTX 1080GTX 1070RX Vega 56RTX 2080 SUPERRTX 2070 SUPERRTX 2080RTX 2080 Ti4080120160200Min: 5.43 / Avg: 36.97 / Max: 58.71Min: 6.75 / Avg: 41.49 / Max: 72.99Min: 6.69 / Avg: 41.85 / Max: 70.35Min: 22 / Avg: 50.03 / Max: 95Min: 7.41 / Avg: 52.59 / Max: 95.92Min: 5.95 / Avg: 52.99 / Max: 89.44Min: 31 / Avg: 54.05 / Max: 117Min: 21 / Avg: 56.3 / Max: 178Min: 10.79 / Avg: 58.28 / Max: 103.39Min: 7.53 / Avg: 63.22 / Max: 132.69Min: 29 / Avg: 64.95 / Max: 172Min: 9.5 / Avg: 66.76 / Max: 125.56Min: 10.49 / Avg: 67.01 / Max: 138.49Min: 6.66 / Avg: 67.08 / Max: 127.61Min: 6.75 / Avg: 72.5 / Max: 121.46Min: 12 / Avg: 73.93 / Max: 164Min: 8.89 / Avg: 75.86 / Max: 159.23Min: 14.5 / Avg: 76.37 / Max: 153.13Min: 13.31 / Avg: 80.01 / Max: 161.84Min: 8.14 / Avg: 92.05 / Max: 219.12

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet18RTX 2080 TiRTX 2080 SUPERRX 5700 XTRTX 2080RTX 2070 SUPERRX 5700RTX 2060 SUPERRTX 2070RX 5600 XTRTX 2060GTX 1080Radeon VIIRX Vega 56GTX 1660 TiGTX 1660GTX 1660 SUPERGTX 1070GTX 1650 SUPERGTX 1060GTX 16500.75831.51662.27493.03323.7915SE +/- 0.06, N = 14SE +/- 0.02, N = 15SE +/- 0.00, N = 14SE +/- 0.04, N = 14SE +/- 0.04, N = 15SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 15SE +/- 0.01, N = 3SE +/- 0.00, N = 4SE +/- 0.01, N = 3SE +/- 0.03, N = 15SE +/- 0.01, N = 7SE +/- 0.12, N = 3SE +/- 0.03, N = 15SE +/- 0.02, N = 3SE +/- 0.06, N = 4SE +/- 0.06, N = 41.421.511.571.631.681.711.721.731.732.002.012.042.202.262.372.382.392.613.133.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenetRTX 2080 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080RTX 2060 SUPERRTX 2060RTX 2070GTX 1660 TiRX 5700 XTGTX 1660 SUPERGTX 1650 SUPERGTX 1080RX 5600 XTRX 5700GTX 1660GTX 1070GTX 1650GTX 1060RX Vega 56Radeon VII246810SE +/- 0.04, N = 15SE +/- 0.06, N = 15SE +/- 0.05, N = 15SE +/- 0.07, N = 15SE +/- 0.09, N = 3SE +/- 0.04, N = 15SE +/- 0.22, N = 3SE +/- 0.04, N = 15SE +/- 0.00, N = 14SE +/- 0.17, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 7SE +/- 0.04, N = 15SE +/- 0.01, N = 4SE +/- 0.01, N = 4SE +/- 0.01, N = 3SE +/- 0.15, N = 43.053.233.233.333.363.563.713.783.873.944.094.134.244.274.334.585.315.455.747.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazefaceRTX 2060 SUPERRTX 2070GTX 1660 SUPERRTX 2080 SUPERGTX 1650 SUPERGTX 1660 TiRTX 2070 SUPERGTX 1660RTX 2080 TiRTX 2060GTX 1060GTX 1650GTX 1070GTX 1080RTX 2080RX 5700 XTRX 5600 XTRX 5700RX Vega 56Radeon VII0.2070.4140.6210.8281.035SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 15SE +/- 0.01, N = 3SE +/- 0.03, N = 15SE +/- 0.03, N = 15SE +/- 0.00, N = 7SE +/- 0.05, N = 15SE +/- 0.04, N = 15SE +/- 0.01, N = 4SE +/- 0.01, N = 4SE +/- 0.03, N = 15SE +/- 0.05, N = 3SE +/- 0.04, N = 15SE +/- 0.00, N = 14SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 40.620.620.630.630.640.660.660.670.670.690.700.700.710.710.710.730.760.770.870.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b0RTX 2060 SUPERRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 2080GTX 1660 TiRTX 2060GTX 1660 SUPERRTX 2070GTX 1660GTX 1650 SUPERGTX 1080GTX 1070GTX 1650GTX 1060RX 5700 XTRX 5600 XTRX 5700RX Vega 56Radeon VII3691215SE +/- 0.01, N = 3SE +/- 0.04, N = 15SE +/- 0.05, N = 15SE +/- 0.08, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 7SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 15SE +/- 0.08, N = 4SE +/- 0.10, N = 4SE +/- 0.05, N = 14SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 42.612.692.712.732.762.842.842.872.893.193.213.403.493.503.656.637.257.8510.5411.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnetRTX 2080 TiRTX 2080RTX 2080 SUPERRTX 2070 SUPERGTX 1650 SUPERGTX 1660 TiRTX 2060RTX 2060 SUPERRTX 2070GTX 1660 SUPERGTX 1080GTX 1070GTX 1660GTX 1060RX 5700 XTGTX 1650RX 5700RX 5600 XTRX Vega 56Radeon VII0.70881.41762.12642.83523.544SE +/- 0.04, N = 14SE +/- 0.03, N = 15SE +/- 0.05, N = 15SE +/- 0.04, N = 15SE +/- 0.01, N = 3SE +/- 0.04, N = 15SE +/- 0.05, N = 15SE +/- 0.17, N = 2SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 15SE +/- 0.07, N = 7SE +/- 0.03, N = 4SE +/- 0.02, N = 14SE +/- 0.10, N = 4SE +/- 0.00, N = 3SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.01, N = 41.481.511.511.541.631.631.641.681.771.791.931.962.002.072.082.172.252.333.093.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v2RTX 2060 SUPERRTX 2070GTX 1660 SUPERRTX 2070 SUPERRTX 2080 TiRTX 2080 SUPERGTX 1650 SUPERRTX 2060RTX 2080GTX 1660 TiGTX 1080GTX 1660GTX 1070GTX 1650GTX 1060RX 5700 XTRX 5600 XTRX 5700RX Vega 56Radeon VII0.5851.171.7552.342.925SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.05, N = 15SE +/- 0.02, N = 3SE +/- 0.04, N = 15SE +/- 0.08, N = 15SE +/- 0.05, N = 15SE +/- 0.03, N = 3SE +/- 0.05, N = 7SE +/- 0.02, N = 15SE +/- 0.00, N = 3SE +/- 0.01, N = 4SE +/- 0.00, N = 14SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 41.321.321.331.361.361.391.401.421.441.461.471.511.551.571.601.731.841.862.342.601. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3RTX 2080 SUPERRTX 2070RTX 2080 TiRTX 2070 SUPERRTX 2080RTX 2060GTX 1660 TiGTX 1650 SUPERRTX 2060 SUPERGTX 1660 SUPERGTX 1660GTX 1080GTX 1650GTX 1070GTX 1060RX 5700 XTRX 5600 XTRX 5700Radeon VIIRX Vega 561.04632.09263.13894.18525.2315SE +/- 0.03, N = 15SE +/- 0.02, N = 3SE +/- 0.05, N = 15SE +/- 0.04, N = 15SE +/- 0.06, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 7SE +/- 0.11, N = 3SE +/- 0.12, N = 4SE +/- 0.05, N = 15SE +/- 0.09, N = 4SE +/- 0.12, N = 14SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 4SE +/- 0.93, N = 31.621.701.731.751.761.791.831.881.931.982.142.252.292.322.422.963.113.134.134.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2RTX 2070 SUPERRTX 2080 TiRTX 2060 SUPERRTX 2070GTX 1660 SUPERRTX 2080 SUPERRTX 2080RTX 2060GTX 1650 SUPERGTX 1660 TiGTX 1660GTX 1070GTX 1650GTX 1060RX 5700 XTRX 5600 XTRX 5700GTX 1080Radeon VIIRX Vega 560.66381.32761.99142.65523.319SE +/- 0.03, N = 15SE +/- 0.04, N = 15SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 15SE +/- 0.06, N = 15SE +/- 0.04, N = 15SE +/- 0.02, N = 3SE +/- 0.04, N = 15SE +/- 0.06, N = 7SE +/- 0.02, N = 15SE +/- 0.00, N = 4SE +/- 0.02, N = 4SE +/- 0.07, N = 14SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 4SE +/- 0.36, N = 31.441.441.451.451.481.481.491.541.601.611.771.851.941.972.052.152.182.232.842.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.5