NCNN Vulkan - AMD vs. NVIDIA

NCNN Vulkan benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2009260-PTS-NCNNVULK08
Run Management

Result Identifier   Date Run             Test Duration
GTX 1060            September 25 2020    34 Minutes
GTX 1070            September 26 2020    1 Hour, 20 Minutes
GTX 1080            September 25 2020    23 Minutes
GTX 1650            September 25 2020    38 Minutes
GTX 1650 SUPER      September 26 2020    29 Minutes
GTX 1660            September 26 2020    41 Minutes
GTX 1660 SUPER      September 26 2020    23 Minutes
GTX 1660 Ti         September 26 2020    1 Hour, 15 Minutes
RTX 2060            September 25 2020    1 Hour, 7 Minutes
RTX 2060 SUPER      September 25 2020    19 Minutes
RTX 2070            September 26 2020    19 Minutes
RTX 2070 SUPER      September 25 2020    1 Hour, 1 Minute
RTX 2080            September 26 2020    1 Hour
RTX 2080 SUPER      September 26 2020    58 Minutes
RTX 2080 Ti         September 26 2020    55 Minutes
RX Vega 56          September 25 2020    20 Minutes
RX 5600 XT          September 25 2020    19 Minutes
RX 5700             September 25 2020    18 Minutes
RX 5700 XT          September 25 2020    1 Hour, 5 Minutes
Radeon VII          September 25 2020    25 Minutes
Average                                  42 Minutes



NCNN Vulkan - AMD vs. NVIDIA: System Details

Systems tested: RX 5600 XT, RX 5700, RX Vega 56, Radeon VII, RX 5700 XT, GTX 1650, GTX 1080, GTX 1060, RTX 2060, RTX 2060 SUPER, RTX 2070 SUPER, GTX 1660 Ti, GTX 1660 SUPER, GTX 1650 SUPER, GTX 1070, RTX 2080 Ti, RTX 2080, RTX 2070, RTX 2080 SUPER, GTX 1660

Configuration as listed for the first system (RX 5600 XT):

Processor:          AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)
Motherboard:        ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)
Chipset:            AMD Starship/Matisse
Memory:             16GB
Disk:               2000GB Corsair Force MP600 + 2000GB
Graphics:           Sapphire AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1780/875MHz)
Audio:              AMD Navi 10 HDMI Audio
Monitor:            DELL P2415Q
Network:            Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
OS:                 Ubuntu 20.04
Kernel:             5.9.0-050900rc6daily20200925-generic (x86_64) 20200924
Desktop:            GNOME Shell 3.36.4
Display Server:     X Server 1.20.8
OpenGL:             4.6 Mesa 20.3.0-devel (git-3173367 2020-09-25 focal-oibaf-ppa) (LLVM 10.0.1)
OpenCL:             OpenCL 2.0 AMD-APP (3182.0)
Vulkan:             1.2.145
Compiler:           GCC 9.3.0 + CUDA 11.0
File-System:        ext4
Screen Resolution:  3840x2160

Per-system values as listed (only fields that change are shown):

RX 5700
  Graphics:        AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (1750/875MHz)
RX Vega 56
  Graphics:        AMD Radeon RX 56/64 8GB (1590/800MHz)
  Audio:           AMD Vega 10 HDMI Audio
Radeon VII
  Graphics:        AMD Radeon VII 16GB (1801/1000MHz)
  Audio:           AMD Vega 20 HDMI Audio
RX 5700 XT
  Graphics:        AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)
  Audio:           AMD Navi 10 HDMI Audio
GTX 1650
  Graphics:        ASUS NVIDIA GeForce GTX 1650 4GB (1485/4001MHz)
  Audio:           NVIDIA Device 10fa
  Kernel:          5.4.0-48-generic (x86_64)
  Display Driver:  NVIDIA 450.66
  OpenGL:          4.6.0
  OpenCL:          OpenCL 2.0 AMD-APP (3182.0) + OpenCL 1.2 CUDA 11.0.228
  Vulkan:          1.2.133
GTX 1080
  Graphics:        NVIDIA GeForce GTX 1080 8GB (1607/5005MHz)
  Audio:           NVIDIA GP104 HD Audio
GTX 1060
  Graphics:        NVIDIA GeForce GTX 1060 6GB (1506/4006MHz)
  Audio:           NVIDIA GP106 HD Audio
RTX 2060
  Graphics:        NVIDIA GeForce RTX 2060 6GB (1365/7000MHz)
  Audio:           NVIDIA TU106 HD Audio
RTX 2060 SUPER
  Graphics:        NVIDIA GeForce RTX 2060 SUPER 8GB (1470/7000MHz)
RTX 2070 SUPER
  Graphics:        NVIDIA GeForce RTX 2070 SUPER 8GB (375/405MHz)
  Audio:           NVIDIA TU104 HD Audio
GTX 1660 Ti
  Graphics:        eVGA NVIDIA GeForce GTX 1660 Ti 6GB (1500/6000MHz)
  Audio:           NVIDIA TU116 HD Audio
GTX 1660 SUPER
  Graphics:        eVGA NVIDIA GeForce GTX 1660 SUPER 6GB (1530/7000MHz)
GTX 1650 SUPER
  Graphics:        ASUS NVIDIA GeForce GTX 1650 SUPER 4GB (375/810MHz)
GTX 1070
  Graphics:        NVIDIA GeForce GTX 1070 8GB (1506/4006MHz)
  Audio:           NVIDIA GP104 HD Audio
RTX 2080 Ti
  Graphics:        NVIDIA GeForce RTX 2080 Ti 11GB (1350/7000MHz)
  Audio:           NVIDIA TU102 HD Audio
RTX 2080
  Graphics:        Zotac NVIDIA GeForce RTX 2080 8GB (1515/7000MHz)
  Audio:           NVIDIA TU104 HD Audio
RTX 2070
  Graphics:        ASUS NVIDIA GeForce RTX 2070 8GB (420/405MHz)
  Audio:           NVIDIA TU106 HD Audio
RTX 2080 SUPER
  Graphics:        NVIDIA GeForce RTX 2080 SUPER 8GB (405/405MHz)
  Audio:           NVIDIA TU104 HD Audio
GTX 1660
  Graphics:        ASUS NVIDIA GeForce GTX 1660 6GB (1530/4001MHz)
  Audio:           NVIDIA TU116 HD Audio

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq ondemand
- CPU Microcode: 0x8701013

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): chart of relative performance for all 20 GPUs on a scale from 100% to 522%, covering RealSR-NCNN (4x - Yes, 4x - No) and NCNN Vulkan GPU (vgg16, efficientnet-b0, alexnet, mobilenet-v3, resnet50, resnet18, googlenet, squeezenet, mnasnet, mobilenet-v2, mobilenet, shufflenet-v2, yolov4-tiny, blazeface).

NCNN Vulkan - AMD vs. NVIDIA: Result Table

ncnn, Target: Vulkan GPU (ms, fewer is better), models 1-7

GPU             squeezenet  mobilenet  mobilenet-v2  mobilenet-v3  shufflenet-v2  mnasnet  efficientnet-b0
RX 5600 XT      3.58        6.54       2.15          3.11          1.84           2.33     7.25
RX 5700         3.58        6.60       2.18          3.13          1.86           2.25     7.85
RX Vega 56      4.89        6.31       2.95          4.65          2.34           3.09     10.54
Radeon VII      5.03        9.02       2.84          4.13          2.60           3.15     11.07
RX 5700 XT      3.36        6.58       2.05          2.96          1.73           2.08     6.63
GTX 1650        6.83        6.29       1.94          2.29          1.57           2.17     3.50
GTX 1080        5.83        5.26       2.23          2.25          1.47           1.93     3.40
GTX 1060        7.77        5.99       1.97          2.42          1.60           2.07     3.65
RTX 2060        4.72        4.90       1.54          1.79          1.42           1.64     2.84
RTX 2060 SUPER  4.60        4.62       1.45          1.93          1.32           1.68     2.61
RTX 2070 SUPER  4.33        4.65       1.44          1.75          1.36           1.54     2.69
GTX 1660 Ti     5.24        5.13       1.61          1.83          1.46           1.63     2.84
GTX 1660 SUPER  5.20        5.09       1.48          1.98          1.33           1.79     2.87
GTX 1650 SUPER  5.80        5.42       1.60          1.88          1.40           1.63     3.21
GTX 1070        6.52        5.47       1.85          2.32          1.55           1.96     3.49
RTX 2080 Ti     4.01        4.53       1.44          1.73          1.36           1.48     2.73
RTX 2080        4.28        4.65       1.49          1.76          1.44           1.51     2.76
RTX 2070        4.67        4.83       1.45          1.70          1.32           1.77     2.89
RTX 2080 SUPER  4.30        4.45       1.48          1.62          1.39           1.51     2.71
GTX 1660        5.52        5.57       1.77          2.14          1.51           2.00     3.19

ncnn, Target: Vulkan GPU (ms, fewer is better), models 8-14

GPU             blazeface  googlenet  vgg16  resnet18  alexnet  resnet50  yolov4-tiny
RX 5600 XT      0.76       4.24       15.24  1.73      5.24     5.02      8.53
RX 5700         0.77       4.27       14.91  1.71      5.38     5.24      8.49
RX Vega 56      0.87       5.74       11.56  2.20      4.38     6.54      7.67
Radeon VII      0.92       7.22       10.42  2.04      3.78     6.62      10.00
RX 5700 XT      0.73       3.87       13.70  1.57      4.81     4.46      8.32
GTX 1650        0.70       5.31       24.15  3.37      3.74     8.42      11.95
GTX 1080        0.71       4.13       15.04  2.01      3.78     4.91      8.84
GTX 1060        0.70       5.45       20.20  3.13      5.40     8.00      10.52
RTX 2060        0.69       3.56       11.08  2.00      2.43     4.26      8.74
RTX 2060 SUPER  0.62       3.36       10.31  1.72      2.22     3.84      8.54
RTX 2070 SUPER  0.66       3.23       8.47   1.68      2.06     3.75      8.30
GTX 1660 Ti     0.66       3.78       15.33  2.26      2.74     5.04      9.45
GTX 1660 SUPER  0.63       3.94       15.40  2.38      2.75     5.02      9.41
GTX 1650 SUPER  0.64       4.09       21.55  2.61      2.95     5.83      10.67
GTX 1070        0.71       4.58       14.98  2.39      3.84     5.91      9.15
RTX 2080 Ti     0.67       3.05       5.60   1.42      1.61     3.14      7.77
RTX 2080        0.71       3.33       8.05   1.63      1.98     3.57      8.22
RTX 2070        0.62       3.71       9.64   1.73      2.24     3.88      8.42
RTX 2080 SUPER  0.63       3.23       7.63   1.51      1.93     3.53      8.12
GTX 1660        0.67       4.33       16.98  2.37      3.05     5.75      9.92

realsr-ncnn (seconds, fewer is better)

GPU             4x - Yes  4x - No
RX 5600 XT      80.260    12.079
RX 5700         58.929    9.358
RX Vega 56      66.108    10.254
Radeon VII      41.288    7.177
RX 5700 XT      55.678    8.982
GTX 1650        215.518   29.372
GTX 1080        116.452   17.214
GTX 1060        143.949   20.543
RTX 2060        86.372    13.400
RTX 2060 SUPER  76.373    12.129
RTX 2070 SUPER  63.863    10.615
GTX 1660 Ti     117.674   17.150
GTX 1660 SUPER  120.148   17.538
GTX 1650 SUPER  186.179   25.658
GTX 1070        102.787   15.374
RTX 2080 Ti     44.986    8.260
RTX 2080        60.360    10.061
RTX 2070        75.437    12.007
RTX 2080 SUPER  54.840    9.463
GTX 1660        138.721   19.893
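The per-test numbers in the result table above can be condensed into one relative figure per GPU. The following is a minimal sketch, not the Phoronix Test Suite's own calculation: it takes three models and three GPUs from the table, divides a baseline card's latency by each card's latency (so larger means faster), and combines the ratios with a geometric mean. The choice of models, the GTX 1080 baseline, and the helper names `geometric_mean` and `relative_speed` are assumptions made for illustration.

```python
import math

# Latencies in ms (fewer is better), copied from the result table above.
# The selection of GPUs and models is arbitrary and only for illustration.
results_ms = {
    "RX 5700 XT":  {"squeezenet": 3.36, "vgg16": 13.70, "resnet50": 4.46},
    "GTX 1080":    {"squeezenet": 5.83, "vgg16": 15.04, "resnet50": 4.91},
    "RTX 2080 Ti": {"squeezenet": 4.01, "vgg16": 5.60,  "resnet50": 3.14},
}


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def relative_speed(results, baseline):
    """Per-GPU geometric mean of baseline_time / gpu_time over all models."""
    base = results[baseline]
    return {
        gpu: geometric_mean([base[model] / times[model] for model in base])
        for gpu, times in results.items()
    }


speedups = relative_speed(results_ms, baseline="GTX 1080")
for gpu, factor in sorted(speedups.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{gpu:12s} {factor:.2f}x")
```

Because the latencies are "fewer is better", the ratio is inverted before averaging; a geometric mean keeps one very slow or very fast model from dominating the summary.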

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20200916 - Target: Vulkan GPU - Model: squeezenet
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      3.58    0.02    3
RX 5700         3.58    0.01    3
RX Vega 56      4.89    0.00    3
Radeon VII      5.03    0.07    4
RX 5700 XT      3.36    0.03    14
GTX 1650        6.83    0.09    4
GTX 1080        5.83    0.02    3
GTX 1060        7.77    0.11    4
RTX 2060        4.72    0.05    15
RTX 2060 SUPER  4.60    0.05    3
RTX 2070 SUPER  4.33    0.05    15
GTX 1660 Ti     5.24    0.17    15
GTX 1660 SUPER  5.20    0.03    3
GTX 1650 SUPER  5.80    0.04    3
GTX 1070        6.52    0.21    15
RTX 2080 Ti     4.01    0.05    15
RTX 2080        4.28    0.04    15
RTX 2070        4.67    0.07    3
RTX 2080 SUPER  4.30    0.03    15
GTX 1660        5.52    0.06    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mobilenet
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      6.54    0.03    3
RX 5700         6.60    0.01    3
RX Vega 56      6.31    0.06    3
Radeon VII      9.02    0.06    4
RX 5700 XT      6.58    0.01    14
GTX 1650        6.29    0.02    4
GTX 1080        5.26    0.03    3
GTX 1060        5.99    0.08    4
RTX 2060        4.90    0.05    15
RTX 2060 SUPER  4.62    0.10    3
RTX 2070 SUPER  4.65    0.04    15
GTX 1660 Ti     5.13    0.04    15
GTX 1660 SUPER  5.09    0.12    3
GTX 1650 SUPER  5.42    0.04    3
GTX 1070        5.47    0.02    15
RTX 2080 Ti     4.53    0.03    14
RTX 2080        4.65    0.04    15
RTX 2070        4.83    0.03    3
RTX 2080 SUPER  4.45    0.05    15
GTX 1660        5.57    0.01    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      2.15    0.00    3
RX 5700         2.18    0.03    3
RX Vega 56      2.95    0.36    3
Radeon VII      2.84    0.01    4
RX 5700 XT      2.05    0.07    14
GTX 1650        1.94    0.00    4
GTX 1080        2.23    0.16    3
GTX 1060        1.97    0.02    4
RTX 2060        1.54    0.04    15
RTX 2060 SUPER  1.45    0.04    3
RTX 2070 SUPER  1.44    0.03    15
GTX 1660 Ti     1.61    0.04    15
GTX 1660 SUPER  1.48    0.01    3
GTX 1650 SUPER  1.60    0.02    3
GTX 1070        1.85    0.02    15
RTX 2080 Ti     1.44    0.04    15
RTX 2080        1.49    0.06    15
RTX 2070        1.45    0.02    3
RTX 2080 SUPER  1.48    0.05    15
GTX 1660        1.77    0.06    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      3.11    0.01    3
RX 5700         3.13    0.00    3
RX Vega 56      4.65    0.93    3
Radeon VII      4.13    0.12    4
RX 5700 XT      2.96    0.12    14
GTX 1650        2.29    0.12    4
GTX 1080        2.25    0.11    3
GTX 1060        2.42    0.09    4
RTX 2060        1.79    0.04    15
RTX 2060 SUPER  1.93    0.11    3
RTX 2070 SUPER  1.75    0.04    15
GTX 1660 Ti     1.83    0.04    15
GTX 1660 SUPER  1.98    0.15    3
GTX 1650 SUPER  1.88    0.12    3
GTX 1070        2.32    0.05    15
RTX 2080 Ti     1.73    0.05    15
RTX 2080        1.76    0.06    15
RTX 2070        1.70    0.02    3
RTX 2080 SUPER  1.62    0.03    15
GTX 1660        2.14    0.06    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: shufflenet-v2
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      1.84    0.01    3
RX 5700         1.86    0.00    3
RX Vega 56      2.34    0.01    3
Radeon VII      2.60    0.00    4
RX 5700 XT      1.73    0.00    14
GTX 1650        1.57    0.00    3
GTX 1080        1.47    0.03    3
GTX 1060        1.60    0.01    4
RTX 2060        1.42    0.04    15
RTX 2060 SUPER  1.32    0.03    3
RTX 2070 SUPER  1.36    0.04    15
GTX 1660 Ti     1.46    0.05    15
GTX 1660 SUPER  1.33    0.01    3
GTX 1650 SUPER  1.40    0.02    3
GTX 1070        1.55    0.02    15
RTX 2080 Ti     1.36    0.04    15
RTX 2080        1.44    0.08    15
RTX 2070        1.32    0.02    3
RTX 2080 SUPER  1.39    0.05    15
GTX 1660        1.51    0.05    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: mnasnet
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      2.33    0.08    3
RX 5700         2.25    0.00    3
RX Vega 56      3.09    0.31    3
Radeon VII      3.15    0.01    4
RX 5700 XT      2.08    0.02    14
GTX 1650        2.17    0.10    4
GTX 1080        1.93    0.12    3
GTX 1060        2.07    0.03    4
RTX 2060        1.64    0.05    15
RTX 2060 SUPER  1.68    0.17    2
RTX 2070 SUPER  1.54    0.04    15
GTX 1660 Ti     1.63    0.04    15
GTX 1660 SUPER  1.79    0.13    3
GTX 1650 SUPER  1.63    0.01    3
GTX 1070        1.96    0.03    15
RTX 2080 Ti     1.48    0.04    14
RTX 2080        1.51    0.03    15
RTX 2070        1.77    0.14    3
RTX 2080 SUPER  1.51    0.05    15
GTX 1660        2.00    0.07    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: efficientnet-b0
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      7.25    0.12    3
RX 5700         7.85    0.05    3
RX Vega 56      10.54   0.19    3
Radeon VII      11.07   0.06    4
RX 5700 XT      6.63    0.05    14
GTX 1650        3.50    0.08    4
GTX 1080        3.40    0.10    3
GTX 1060        3.65    0.10    4
RTX 2060        2.84    0.04    15
RTX 2060 SUPER  2.61    0.01    3
RTX 2070 SUPER  2.69    0.04    15
GTX 1660 Ti     2.84    0.04    15
GTX 1660 SUPER  2.87    0.12    3
GTX 1650 SUPER  3.21    0.25    3
GTX 1070        3.49    0.04    15
RTX 2080 Ti     2.73    0.08    15
RTX 2080        2.76    0.04    15
RTX 2070        2.89    0.11    3
RTX 2080 SUPER  2.71    0.05    15
GTX 1660        3.19    0.05    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: blazeface
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      0.76    0.01    3
RX 5700         0.77    0.01    3
RX Vega 56      0.87    0.01    3
Radeon VII      0.92    0.00    4
RX 5700 XT      0.73    0.00    14
GTX 1650        0.70    0.01    4
GTX 1080        0.71    0.05    3
GTX 1060        0.70    0.01    4
RTX 2060        0.69    0.04    15
RTX 2060 SUPER  0.62    0.00    3
RTX 2070 SUPER  0.66    0.03    15
GTX 1660 Ti     0.66    0.03    15
GTX 1660 SUPER  0.63    0.00    3
GTX 1650 SUPER  0.64    0.01    3
GTX 1070        0.71    0.03    15
RTX 2080 Ti     0.67    0.05    15
RTX 2080        0.71    0.04    15
RTX 2070        0.62    0.01    3
RTX 2080 SUPER  0.63    0.02    15
GTX 1660        0.67    0.00    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: googlenet
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      4.24    0.00    3
RX 5700         4.27    0.00    3
RX Vega 56      5.74    0.01    3
Radeon VII      7.22    0.15    4
RX 5700 XT      3.87    0.00    14
GTX 1650        5.31    0.01    4
GTX 1080        4.13    0.06    3
GTX 1060        5.45    0.01    4
RTX 2060        3.56    0.04    15
RTX 2060 SUPER  3.36    0.09    3
RTX 2070 SUPER  3.23    0.06    15
GTX 1660 Ti     3.78    0.04    15
GTX 1660 SUPER  3.94    0.17    3
GTX 1650 SUPER  4.09    0.16    3
GTX 1070        4.58    0.04    15
RTX 2080 Ti     3.05    0.04    15
RTX 2080        3.33    0.07    15
RTX 2070        3.71    0.22    3
RTX 2080 SUPER  3.23    0.05    15
GTX 1660        4.33    0.06    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: vgg16
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      15.24   0.15    3
RX 5700         14.91   0.04    3
RX Vega 56      11.56   0.13    3
Radeon VII      10.42   0.11    4
RX 5700 XT      13.70   0.08    14
GTX 1650        24.15   0.03    4
GTX 1080        15.04   0.14    3
GTX 1060        20.20   0.19    4
RTX 2060        11.08   0.06    15
RTX 2060 SUPER  10.31   0.56    3
RTX 2070 SUPER  8.47    0.03    15
GTX 1660 Ti     15.33   0.09    15
GTX 1660 SUPER  15.40   0.17    3
GTX 1650 SUPER  21.55   0.27    3
GTX 1070        14.98   0.08    15
RTX 2080 Ti     5.60    0.03    15
RTX 2080        8.05    0.06    15
RTX 2070        9.64    0.22    3
RTX 2080 SUPER  7.63    0.06    15
GTX 1660        16.98   0.12    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet18
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      1.73    0.00    3
RX 5700         1.71    0.00    3
RX Vega 56      2.20    0.01    3
Radeon VII      2.04    0.00    4
RX 5700 XT      1.57    0.00    14
GTX 1650        3.37    0.06    4
GTX 1080        2.01    0.01    3
GTX 1060        3.13    0.06    4
RTX 2060        2.00    0.04    15
RTX 2060 SUPER  1.72    0.04    3
RTX 2070 SUPER  1.68    0.04    15
GTX 1660 Ti     2.26    0.03    15
GTX 1660 SUPER  2.38    0.12    3
GTX 1650 SUPER  2.61    0.02    3
GTX 1070        2.39    0.03    15
RTX 2080 Ti     1.42    0.06    14
RTX 2080        1.63    0.04    14
RTX 2070        1.73    0.03    3
RTX 2080 SUPER  1.51    0.02    15
GTX 1660        2.37    0.01    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: alexnet
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      5.24    0.04    3
RX 5700         5.38    0.02    3
RX Vega 56      4.38    0.00    3
Radeon VII      3.78    0.00    4
RX 5700 XT      4.81    0.01    14
GTX 1650        3.74    0.03    4
GTX 1080        3.78    0.04    3
GTX 1060        5.40    0.02    4
RTX 2060        2.43    0.04    15
RTX 2060 SUPER  2.22    0.12    3
RTX 2070 SUPER  2.06    0.02    15
GTX 1660 Ti     2.74    0.04    15
GTX 1660 SUPER  2.75    0.03    3
GTX 1650 SUPER  2.95    0.03    3
GTX 1070        3.84    0.04    14
RTX 2080 Ti     1.61    0.02    15
RTX 2080        1.98    0.01    14
RTX 2070        2.24    0.10    3
RTX 2080 SUPER  1.93    0.03    15
GTX 1660        3.05    0.06    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: resnet50
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      5.02    0.04    3
RX 5700         5.24    0.16    3
RX Vega 56      6.54    0.07    3
Radeon VII      6.62    0.08    4
RX 5700 XT      4.46    0.00    14
GTX 1650        8.42    0.18    4
GTX 1080        4.91    0.10    3
GTX 1060        8.00    0.09    4
RTX 2060        4.26    0.04    15
RTX 2060 SUPER  3.84    0.08    3
RTX 2070 SUPER  3.75    0.04    15
GTX 1660 Ti     5.04    0.03    14
GTX 1660 SUPER  5.02    0.13    3
GTX 1650 SUPER  5.83    0.03    3
GTX 1070        5.91    0.04    15
RTX 2080 Ti     3.14    0.04    15
RTX 2080        3.57    0.04    15
RTX 2070        3.88    0.09    3
RTX 2080 SUPER  3.53    0.04    15
GTX 1660        5.75    0.02    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN 20200916 - Target: Vulkan GPU - Model: yolov4-tiny
ms, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      8.53    0.03    3
RX 5700         8.49    0.02    3
RX Vega 56      7.67    0.02    3
Radeon VII      10.00   0.01    4
RX 5700 XT      8.32    0.01    14
GTX 1650        11.95   0.11    4
GTX 1080        8.84    0.11    3
GTX 1060        10.52   0.08    4
RTX 2060        8.74    0.05    15
RTX 2060 SUPER  8.54    0.10    3
RTX 2070 SUPER  8.30    0.06    15
GTX 1660 Ti     9.45    0.04    15
GTX 1660 SUPER  9.41    0.12    3
GTX 1650 SUPER  10.67   0.08    3
GTX 1070        9.15    0.04    15
RTX 2080 Ti     7.77    0.07    15
RTX 2080        8.22    0.06    15
RTX 2070        8.42    0.07    3
RTX 2080 SUPER  8.12    0.08    14
GTX 1660        9.92    0.12    7

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
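
Each NCNN result above is reported with "SE +/-" and "N", the standard error of the mean and the number of runs. As a rough sketch of how such a figure can be derived, the snippet below computes the standard error as the sample standard deviation divided by the square root of N. The run times in `runs_ms` are hypothetical values invented for illustration, and whether the Phoronix Test Suite uses exactly this formula (sample rather than population deviation) is an assumption.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run latencies in ms; NOT taken from the result file.
runs_ms = [3.55, 3.58, 3.61]

mean_ms = statistics.mean(runs_ms)
se_ms = standard_error(runs_ms)
print(f"{mean_ms:.2f} ms, SE +/- {se_ms:.2f}, N = {len(runs_ms)}")
```

A small SE relative to the result, as in most rows above, suggests the runs were consistent; results with a large SE and low N, such as mobilenet-v3 on the RX Vega 56, deserve more caution.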

RealSR-NCNN

RealSR-NCNN is an NCNN implementation of the RealSR project, accelerated using the Vulkan API. RealSR stands for Real-World Super-Resolution via Kernel Estimation and Noise Injection. NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. This test profile times how long it takes to upscale a sample image by a factor of 4x with Vulkan. Learn more via the OpenBenchmarking.org test page.

RealSR-NCNN 20200818 - Scale: 4x - TAA: Yes
Seconds, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      80.26   0.13    3
RX 5700         58.93   0.04    3
RX Vega 56      66.11   0.08    3
Radeon VII      41.29   0.12    3
RX 5700 XT      55.68   0.14    3
GTX 1650        215.52  0.29    3
GTX 1080        116.45  0.08    3
GTX 1060        143.95  0.33    3
RTX 2060        86.37   0.26    3
RTX 2060 SUPER  76.37   0.43    3
RTX 2070 SUPER  63.86   0.31    3
GTX 1660 Ti     117.67  0.19    3
GTX 1660 SUPER  120.15  0.07    3
GTX 1650 SUPER  186.18  0.09    3
GTX 1070        102.79  0.36    3
RTX 2080 Ti     44.99   0.23    3
RTX 2080        60.36   0.38    3
RTX 2070        75.44   0.56    3
RTX 2080 SUPER  54.84   0.19    3
GTX 1660        138.72  0.27    3

RealSR-NCNN 20200818 - Scale: 4x - TAA: No
Seconds, Fewer Is Better (OpenBenchmarking.org)

GPU             Result  SE +/-  N
RX 5600 XT      12.079  0.024   4
RX 5700         9.358   0.035   5
RX Vega 56      10.254  0.021   5
Radeon VII      7.177   0.018   6
RX 5700 XT      8.982   0.020   5
GTX 1650        29.372  0.039   3
GTX 1080        17.214  0.068   3
GTX 1060        20.543  0.020   3
RTX 2060        13.400  0.021   4
RTX 2060 SUPER  12.129  0.039   4
RTX 2070 SUPER  10.615  0.034   5
GTX 1660 Ti     17.150  0.030   3
GTX 1660 SUPER  17.538  0.010   3
GTX 1650 SUPER  25.658  0.050   3
GTX 1070        15.374  0.035   4
RTX 2080 Ti     8.260   0.020   6
RTX 2080        10.061  0.038   5
RTX 2070        12.007  0.039   4
RTX 2080 SUPER  9.463   0.041   5
GTX 1660        19.893  0.045   3
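
The two RealSR-NCNN results above differ only in the TAA setting, so their ratio gives a rough idea of what that option costs on each card. The sketch below computes that ratio for four GPUs using the times reported in the result file; the selection of GPUs and the helper name `taa_slowdown` are illustrative assumptions, not part of the test profile.

```python
# (TAA: Yes, TAA: No) run times in seconds, copied from the result file.
# The four GPUs are an arbitrary selection for illustration.
realsr_seconds = {
    "RX 5600 XT":  (80.260, 12.079),
    "Radeon VII":  (41.288, 7.177),
    "GTX 1650":    (215.518, 29.372),
    "RTX 2080 Ti": (44.986, 8.260),
}


def taa_slowdown(with_taa, without_taa):
    """How many times longer the run takes with TAA enabled."""
    return with_taa / without_taa


slowdowns = {
    gpu: taa_slowdown(yes, no) for gpu, (yes, no) in realsr_seconds.items()
}
for gpu, factor in slowdowns.items():
    print(f"{gpu:12s} {factor:.1f}x slower with TAA")
```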

GPU Power Consumption Monitor

GPU Power Consumption Monitor - Phoronix Test Suite System Monitoring
Watts (OpenBenchmarking.org)

GPU             Min     Avg     Max
RX 5600 XT      22      98.3    171
RX 5700         31      89.52   165
RX Vega 56      11      96.87   167
Radeon VII      21      104.49  267
RX 5700 XT      29      88.95   225
GTX 1650        5.43    53.96   68.86
GTX 1080        6.57    115.01  188.22
GTX 1060        5.95    86      131.08
RTX 2060        9.49    91.04   164.41
RTX 2060 SUPER  10.34   108.02  179.76
RTX 2070 SUPER  14.42   107.67  220.78
GTX 1660 Ti     7.41    78.56   133.19
GTX 1660 SUPER  10.75   90.37   129.22
GTX 1650 SUPER  6.51    71.29   96.43
GTX 1070        6.75    92.52   162.74
RTX 2080 Ti     7.79    115.51  274.43
RTX 2080        13.31   108.43  221.85
RTX 2070        7.53    105.98  181.01
RTX 2080 SUPER  8.89    110.83  255.3
GTX 1660        6.69    72.09   109.23

GPU Temperature Monitor

GPU Temperature Monitor - Phoronix Test Suite System Monitoring
Celsius (OpenBenchmarking.org)

GPU             Min  Avg    Max
RX 5600 XT      44   57.67  66
RX 5700         38   52.05  67
RX Vega 56      29   51.18  65
Radeon VII      29   44.95  66
RX 5700 XT      35   52.14  74
GTX 1650        31   56.9   66
GTX 1080        29   62.17  79
GTX 1060        27   55.8   69
RTX 2060        28   50.87  70
RTX 2060 SUPER  29   52.23  71
RTX 2070 SUPER  26   46.54  68
GTX 1660 Ti     29   52.99  69
GTX 1660 SUPER  33   59.37  70
GTX 1650 SUPER  30   55.56  66
GTX 1070        29   60.05  78
RTX 2080 Ti     32   49.95  71
RTX 2080        29   58.81  82
RTX 2070        29   54.07  74
RTX 2080 SUPER  28   48.45  72
GTX 1660        29   60.56  79