Feb 2021 Vulkan Compute

Vulkan benchmark comparison for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2102067-HA-FEB2021VU83&gru&sro.

Feb 2021 Vulkan ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLRX 5700 XTRadeon VIIRX 6800RX 6800 XTRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiGTX 3060 TiRTX 3080AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (3202 BIOS)AMD Starship/Matisse32GB2000GB Corsair Force MP600 + 2000GBAMD Radeon RX 5600 OEM/5600 XT / 5700/5700 8GB (2100/875MHz)AMD Navi 10 HDMI AudioASUS MG28URealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.105.11.0-051100rc5daily20210129-generic (x86_64) 20210128GNOME Shell 3.38.2X Server 1.20.9amd4.6 Mesa 21.1.0-devel (git-824ae64 2021-02-01 groovy-oibaf-ppa) (LLVM 11.0.1)1.2.145GCC 10.2.0 + Clang 11.0.1-1~oibaf~gext43840x2160AMD Radeon VII 16GB (1801/1000MHz)AMD Vega 20 HDMI AudioAMD SIENNA_CICHLID 16GB (2475/1000MHz)AMD Device ab28AMD SIENNA_CICHLID 16GB (2575/1000MHz)NVIDIA GeForce RTX 2070 SUPER 8GB (1605/7000MHz)NVIDIA TU104 HD Audio5.8.0-41-generic (x86_64)NVIDIA 460.394.6.0OpenCL 1.2 CUDA 11.2.1361.2.155NVIDIA GeForce RTX 2080 SUPER 8GB (1650/7750MHz)NVIDIA GeForce RTX 2080 Ti 11GB (420/405MHz)NVIDIA TU102 HD AudioNVIDIA GeForce RTX 3060 Ti 8GB (1665/7000MHz)NVIDIA Device 228bNVIDIA GeForce RTX 3080 10GB (390/405MHz)NVIDIA Device 1aefOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009Graphics Details- RX 5700 XT, Radeon VII, RX 6800, RX 6800 XT: GLAMORSecurity Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Feb 2021 Vulkan Computevkfft: vkresample: 2x - Singlerealsr-ncnn: 4x - Norealsr-ncnn: 4x - Yeswaifu2x-ncnn: 2x - 3 - YesRX 5700 XTRadeon VIIRX 6800RX 6800 XTRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiGTX 3060 TiRTX 30802067632.0648.91858.0115.0663391213.7236.61239.4655.7694480418.3755.73332.2093.4764973017.9475.55530.5763.2733190719.87910.14964.3184.8133250918.6299.67958.4304.6724163515.0138.54845.7834.0213208719.4049.39858.5514.7545203512.1846.91336.9323.981OpenBenchmarking.org

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1GTX 3060 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 3080RX 5700 XTRX 6800RX 6800 XTRadeon VII11K22K33K44K55KSE +/- 19.65, N = 3SE +/- 31.18, N = 3SE +/- 76.95, N = 3SE +/- 22.81, N = 3SE +/- 193.08, N = 3SE +/- 15.62, N = 3SE +/- 137.01, N = 3SE +/- 576.21, N = 3SE +/- 1.53, N = 33208731907325094163552035206764480449730339121. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleGTX 3060 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 3080RX 5700 XTRX 6800RX 6800 XTRadeon VII714212835SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 319.4019.8818.6315.0112.1832.0618.3817.9513.721. (CXX) g++ options: -O3 -pthread

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoGTX 3060 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 3080RX 5700 XTRX 6800RX 6800 XTRadeon VII3691215SE +/- 0.107, N = 3SE +/- 0.103, N = 3SE +/- 0.094, N = 3SE +/- 0.082, N = 3SE +/- 0.077, N = 15SE +/- 0.036, N = 3SE +/- 0.015, N = 3SE +/- 0.010, N = 3SE +/- 0.051, N = 39.39810.1499.6798.5486.9138.9185.7335.5556.612

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesGTX 3060 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 3080RX 5700 XTRX 6800RX 6800 XTRadeon VII1428425670SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 358.5564.3258.4345.7836.9358.0132.2130.5839.47

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesGTX 3060 TiRTX 2070 SUPERRTX 2080 SUPERRTX 2080 TiRTX 3080RX 5700 XTRX 6800RX 6800 XTRadeon VII1.2982.5963.8945.1926.49SE +/- 0.061, N = 3SE +/- 0.067, N = 3SE +/- 0.031, N = 14SE +/- 0.046, N = 4SE +/- 0.031, N = 3SE +/- 0.014, N = 3SE +/- 0.006, N = 3SE +/- 0.011, N = 3SE +/- 0.013, N = 34.7544.8134.6724.0213.9815.0663.4763.2735.769


Phoronix Test Suite v10.8.4