Feb 2021 Vulkan Compute

Vulkan benchmark comparison for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2102067-HA-FEB2021VU83&grr&sor.

Feb 2021 Vulkan ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLRX 5700 XTRadeon VIIRX 6800RX 6800 XTRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiGTX 3060 TiRTX 3080AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)AMD Starship/Matisse32GB2000GB Corsair Force MP600 + 2000GBAMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.105.11.0-051100rc5daily20210129-generic (x86_64) 20210128GNOME Shell 3.38.2X Server 1.20.9amd4.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)1.2.145GCC 10.2.0 + Clang 11.0.1-1~oibaf~gext43840x2160AMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioAMD SIENNA_CICHLID 16GB (2475/1000MHz)AMD Device ab28AMD SIENNA_CICHLID 16GB (2575/1000MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)NVIDIA TU104 HD Audio5.8.0-41-generic (x86_64)NVIDIA 460.394.6.0OpenCL 1.2 CUDA 11.2.1361.2.155NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz)NVIDIA TU102 HD AudioNVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz)NVIDIA Device 228bNVIDIA GeForce RTX 3080 10GB (390/405MHz)NVIDIA Device 1aefOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Graphics Details- RX 5700 XT, Radeon VII, RX 6800, RX 6800 XT: GLAMORSecurity Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Feb 2021 Vulkan Computevkfft: realsr-ncnn: 4x - Yesvkresample: 2x - Singlerealsr-ncnn: 4x - Nowaifu2x-ncnn: 2x - 3 - YesRX 5700 XTRadeon VIIRX 6800RX 6800 XTRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiGTX 3060 TiRTX 30802067658.01132.0648.9185.0663391239.46513.7236.6125.7694480432.20918.3755.7333.4764973030.57617.9475.5553.2733190764.31819.87910.1494.8133250958.43018.6299.6794.6724163545.78315.0138.5484.0213208758.55119.4049.3984.7545203536.93212.1846.9133.981OpenBenchmarking.org

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1RTX 3080RX 6800 XTRX 6800RTX 2080 TiRadeon VIIRTX 2080 SUPERGTX 3060 TiRTX 2070 SUPERRX 5700 XT11K22K33K44K55KSE +/- 193.08, N = 3SE +/- 576.21, N = 3SE +/- 137.01, N = 3SE +/- 22.81, N = 3SE +/- 1.53, N = 3SE +/- 76.95, N = 3SE +/- 19.65, N = 3SE +/- 31.18, N = 3SE +/- 15.62, N = 35203549730448044163533912325093208731907206761. (CXX) g++ options: -O3 -pthread

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesRX 6800 XTRX 6800RTX 3080Radeon VIIRTX 2080 TiRX 5700 XTRTX 2080 SUPERGTX 3060 TiRTX 2070 SUPER1428425670SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.13, N = 330.5832.2136.9339.4745.7858.0158.4358.5564.32

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleRTX 3080Radeon VIIRTX 2080 TiRX 6800 XTRX 6800RTX 2080 SUPERGTX 3060 TiRTX 2070 SUPERRX 5700 XT714212835SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 312.1813.7215.0117.9518.3818.6319.4019.8832.061. (CXX) g++ options: -O3 -pthread

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoRX 6800 XTRX 6800Radeon VIIRTX 3080RTX 2080 TiRX 5700 XTGTX 3060 TiRTX 2080 SUPERRTX 2070 SUPER3691215SE +/- 0.010, N = 3SE +/- 0.015, N = 3SE +/- 0.051, N = 3SE +/- 0.077, N = 15SE +/- 0.082, N = 3SE +/- 0.036, N = 3SE +/- 0.107, N = 3SE +/- 0.094, N = 3SE +/- 0.103, N = 35.5555.7336.6126.9138.5488.9189.3989.67910.149

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesRX 6800 XTRX 6800RTX 3080RTX 2080 TiRTX 2080 SUPERGTX 3060 TiRTX 2070 SUPERRX 5700 XTRadeon VII1.2982.5963.8945.1926.49SE +/- 0.011, N = 3SE +/- 0.006, N = 3SE +/- 0.031, N = 3SE +/- 0.046, N = 4SE +/- 0.031, N = 14SE +/- 0.061, N = 3SE +/- 0.067, N = 3SE +/- 0.014, N = 3SE +/- 0.013, N = 33.2733.4763.9814.0214.6724.7544.8135.0665.769


Phoronix Test Suite v10.8.4