Vulkan Compute

AMD Ryzen 5 3600 6-Core testing with a Gigabyte X570 AORUS PRO (F34 BIOS) and AMD Radeon VII on ManjaroLinux 21.1.0 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2108171-IB-2107307PT42&grs.

Vulkan ComputeProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen ResolutionOpenCLNVIDIA RTX 3060Radeon VIIAMD Ryzen 9 5900X 12-Core @ 3.70GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (3501 BIOS)AMD Starship/Matisse16GB1000GB Sabrent Rocket 4.0 Plus + 2000GBeVGA NVIDIA GeForce RTX 3060 12GBNVIDIA Device 228eASUS VP28URealtek RTL8125 2.5GbE + Intel I211Ubuntu 21.045.11.0-25-generic (x86_64)GNOME Shell 3.38.4X Server 1.20.11NVIDIA 470.57.024.6.01.2.175GCC 10.3.0ext43840x2160AMD Ryzen 5 3600 6-Core @ 3.60GHz (6 Cores / 12 Threads)Gigabyte X570 AORUS PRO (F34 BIOS)32GB1000GB Sabrent Rocket 4.0 1TB + 240GB SanDisk SDSSDA24 + 256GB SanDisk SD8SN8U2 + 0GB Multiple ReaderAMD Radeon VII (1801/1000MHz)AMD Vega 20 HDMI AudioIntel I211 + Intel Wi-Fi 6 AX200ManjaroLinux 21.1.05.14.0-1-MANJARO (x86_64)X Server 1.20.134.6 Mesa 21.1.6 (LLVM 12.0.1)OpenCL 2.0 AMD-APP.dbg (3305.0)1.2.174GCC 11.1.0 + Clang 12.0.1 + CUDA 11.4f2fs2560x1440OpenBenchmarking.orgKernel Details- NVIDIA RTX 3060: Transparent Huge Pages: madvise- Radeon VII: amdgpu.ppfeaturemask=0xffffffff - Transparent Huge Pages: madviseCompiler Details- NVIDIA RTX 3060: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Radeon VII: --disable-libssp --disable-libstdcxx-pch --disable-libunwind-exceptions --disable-werror --enable-__cxa_atexit --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-install-libiberty --enable-languages=c,c++,ada,fortran,go,lto,objc,obj-c++,d --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-isl --with-linker-hash-style=gnu Processor Details- NVIDIA RTX 3060: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa201009- Radeon VII: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x8701021Security Details- NVIDIA RTX 3060: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected - Radeon VII: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Graphics Details- Radeon VII: GLAMOR

Vulkan Computencnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - blazefacevkpeak: fp64-vec4vkpeak: fp64-scalarvkpeak: fp32-scalarvkpeak: int16-vec4vkpeak: int32-vec4vkpeak: int32-scalarvkpeak: fp16-vec4vkpeak: int16-scalarvkpeak: fp32-vec4realsr-ncnn: 4x - Yesrealsr-ncnn: 4x - Nowaifu2x-ncnn: 2x - 3 - Yesvkpeak: fp16-scalarvkresample: 2x - Singlevkfft: NVIDIA RTX 3060Radeon VII1.937.34.192.104.882.447.214.574.271.741.953.252.042.180.98214.29214.256829.755957.526766.466830.2813242.604480.119079.5067.90310.5054.9746849.6423.1492733715.9655.3528.1812.8721.879.7125.8216.2014.924.575.027.064.244.441.973327.363285.0413306.1311149.714216.744374.3419985.456562.7412171.1151.0338.6585.7316619.51OpenBenchmarking.org

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18NVIDIA RTX 3060Radeon VII48121620SE +/- 0.00, N = 2SE +/- 0.08, N = 31.9315.96MIN: 1.91 / MAX: 2.25MIN: 15.56 / MAX: 23.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16NVIDIA RTX 3060Radeon VII1224364860SE +/- 0.00, N = 3SE +/- 0.11, N = 37.3055.35MIN: 7.17 / MAX: 15.83MIN: 53.94 / MAX: 79.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50NVIDIA RTX 3060Radeon VII714212835SE +/- 0.01, N = 3SE +/- 0.65, N = 34.1928.18MIN: 4.17 / MAX: 4.84MIN: 27.11 / MAX: 241.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetNVIDIA RTX 3060Radeon VII3691215SE +/- 0.00, N = 3SE +/- 0.18, N = 32.1012.87MIN: 2.07 / MAX: 4.52MIN: 12.33 / MAX: 18.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdNVIDIA RTX 3060Radeon VII510152025SE +/- 0.07, N = 3SE +/- 0.14, N = 34.8821.87MIN: 4.67 / MAX: 10.31MIN: 21.28 / MAX: 43.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mNVIDIA RTX 3060Radeon VII3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 32.449.71MIN: 2.41 / MAX: 3.44MIN: 9.48 / MAX: 16.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyNVIDIA RTX 3060Radeon VII612182430SE +/- 0.01, N = 3SE +/- 0.08, N = 37.2125.82MIN: 7.02 / MAX: 7.52MIN: 25.32 / MAX: 69.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetNVIDIA RTX 3060Radeon VII48121620SE +/- 0.01, N = 3SE +/- 0.08, N = 34.5716.20MIN: 4.52 / MAX: 4.92MIN: 15.73 / MAX: 24.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetNVIDIA RTX 3060Radeon VII48121620SE +/- 0.08, N = 3SE +/- 0.09, N = 34.2714.92MIN: 3.95 / MAX: 15.21MIN: 14.55 / MAX: 20.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2NVIDIA RTX 3060Radeon VII1.02832.05663.08494.11325.1415SE +/- 0.00, N = 3SE +/- 0.06, N = 31.744.57MIN: 1.71 / MAX: 3.74MIN: 4.38 / MAX: 9.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2NVIDIA RTX 3060Radeon VII1.12952.2593.38854.5185.6475SE +/- 0.00, N = 3SE +/- 0.02, N = 31.955.02MIN: 1.92 / MAX: 2.86MIN: 4.79 / MAX: 14.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0NVIDIA RTX 3060Radeon VII246810SE +/- 0.01, N = 3SE +/- 0.11, N = 33.257.06MIN: 3.22 / MAX: 3.7MIN: 6.78 / MAX: 61.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetNVIDIA RTX 3060Radeon VII0.9541.9082.8623.8164.77SE +/- 0.01, N = 3SE +/- 0.03, N = 32.044.24MIN: 2.02 / MAX: 4.46MIN: 4.1 / MAX: 9.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3NVIDIA RTX 3060Radeon VII0.9991.9982.9973.9964.995SE +/- 0.00, N = 3SE +/- 0.02, N = 32.184.44MIN: 2.16 / MAX: 3.53MIN: 4.27 / MAX: 14.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceNVIDIA RTX 3060Radeon VII0.44330.88661.32991.77322.2165SE +/- 0.00, N = 3SE +/- 0.05, N = 30.981.97MIN: 0.95 / MAX: 2.33MIN: 1.81 / MAX: 49.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

vkpeak

fp64-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-vec4NVIDIA RTX 3060Radeon VII7001400210028003500SE +/- 0.01, N = 3SE +/- 56.99, N = 3214.293327.36

vkpeak

fp64-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp64-scalarNVIDIA RTX 3060Radeon VII7001400210028003500SE +/- 0.03, N = 3SE +/- 61.71, N = 3214.253285.04

vkpeak

fp32-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-scalarNVIDIA RTX 3060Radeon VII3K6K9K12K15KSE +/- 10.62, N = 3SE +/- 139.79, N = 36829.7513306.13

vkpeak

int16-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-vec4NVIDIA RTX 3060Radeon VII2K4K6K8K10KSE +/- 0.03, N = 3SE +/- 177.74, N = 35957.5211149.71

vkpeak

int32-vec4

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-vec4NVIDIA RTX 3060Radeon VII15003000450060007500SE +/- 17.45, N = 3SE +/- 71.24, N = 36766.464216.74

vkpeak

int32-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int32-scalarNVIDIA RTX 3060Radeon VII15003000450060007500SE +/- 0.58, N = 3SE +/- 85.69, N = 36830.284374.34

vkpeak

fp16-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-vec4NVIDIA RTX 3060Radeon VII4K8K12K16K20KSE +/- 2.31, N = 3SE +/- 540.60, N = 313242.6019985.45

vkpeak

int16-scalar

OpenBenchmarking.orgGIOPS, More Is Bettervkpeak 20210424int16-scalarNVIDIA RTX 3060Radeon VII14002800420056007000SE +/- 0.29, N = 3SE +/- 58.06, N = 34480.116562.74

vkpeak

fp32-vec4

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp32-vec4NVIDIA RTX 3060Radeon VII3K6K9K12K15KSE +/- 0.30, N = 3SE +/- 255.84, N = 39079.5012171.11

RealSR-NCNN

Scale: 4x - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: YesNVIDIA RTX 3060Radeon VII1530456075SE +/- 0.06, N = 3SE +/- 0.33, N = 367.9051.03

RealSR-NCNN

Scale: 4x - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterRealSR-NCNN 20200818Scale: 4x - TAA: NoNVIDIA RTX 3060Radeon VII3691215SE +/- 0.003, N = 3SE +/- 0.054, N = 310.5058.658

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: YesNVIDIA RTX 3060Radeon VII1.28952.5793.86855.1586.4475SE +/- 0.004, N = 3SE +/- 0.018, N = 34.9745.731

vkpeak

fp16-scalar

OpenBenchmarking.orgGFLOPS, More Is Bettervkpeak 20210424fp16-scalarNVIDIA RTX 3060Radeon VII15003000450060007500SE +/- 17.95, N = 3SE +/- 43.13, N = 36849.646619.51

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: SingleNVIDIA RTX 3060612182430SE +/- 0.02, N = 323.151. (CXX) g++ options: -O3 -pthread

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.1NVIDIA RTX 30606K12K18K24K30KSE +/- 308.54, N = 3273371. (CXX) g++ options: -O3 -pthread


Phoronix Test Suite v10.8.4