TNN NCNN

AMD Ryzen 9 3950X 16-Core testing with a ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS) and Sapphire AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2009258-PTS-TNNNCNN783.

TNN NCNNProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution3950X + Navi23AMD Ryzen 9 3950X 16-Core @ 3.50GHz (16 Cores / 32 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (1302 BIOS)AMD Starship/Matisse16GB2000GB Corsair Force MP600 + 2000GBSapphire AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1780/875MHz)AMD Navi 10 HDMI AudioDELL P2415QRealtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200Ubuntu 20.045.9.0-050900rc6daily20200925-generic (x86_64) 20200924GNOME Shell 3.36.4X Server 1.20.84.6 Mesa 20.3.0-devel (git-3173367 2020-09-25 focal-oibaf-ppa) (LLVM 10.0.1)OpenCL 2.0 AMD-APP (3182.0)1.2.145GCC 9.3.0 + CUDA 11.0ext43840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance - CPU Microcode: 0x8701013Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

TNN NCNNncnn: CPU - squeezenetncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: Vulkan GPU - squeezenetncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinytnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.13950X + Navi2315.9617.575.925.155.15.217.052.0119.2270.2917.6916.5829.7029.263.586.582.143.11.842.236.830.764.2514.971.735.234.958.65244.905236.99615.8517.185.835.175.105.187.202.0119.3370.0117.6016.6329.3628.943.586.582.143.101.862.236.870.764.2514.971.725.214.958.55240.615225.24715.9017.165.865.165.095.197.012.0118.9070.2917.4316.5029.5328.823.606.592.143.091.832.236.800.764.2614.981.735.194.968.55240.195238.144OpenBenchmarking.org

NCNN

Target: CPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenet3950X + Navi2348121620SE +/- 0.14, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 315.9615.8515.90MIN: 15.48 / MAX: 17.29MIN: 15.47 / MAX: 16.5MIN: 15.48 / MAX: 16.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenet3950X + Navi2348121620SE +/- 0.34, N = 3SE +/- 0.21, N = 3SE +/- 0.10, N = 317.5717.1817.16MIN: 16.92 / MAX: 18.71MIN: 16.64 / MAX: 18.06MIN: 16.72 / MAX: 17.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v23950X + Navi231.3322.6643.9965.3286.66SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 35.925.835.86MIN: 5.69 / MAX: 29.4MIN: 5.69 / MAX: 7.35MIN: 5.7 / MAX: 7.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v33950X + Navi231.16332.32663.48994.65325.8165SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 35.155.175.16MIN: 5.04 / MAX: 16.03MIN: 5.05 / MAX: 6.68MIN: 5.04 / MAX: 6.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v23950X + Navi231.14752.2953.44254.595.7375SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.105.105.09MIN: 5.04 / MAX: 6.5MIN: 5.02 / MAX: 6.45MIN: 5.01 / MAX: 6.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnet3950X + Navi231.17232.34463.51694.68925.8615SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.215.185.19MIN: 5.1 / MAX: 7.28MIN: 5.07 / MAX: 7.05MIN: 5.1 / MAX: 6.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b03950X + Navi23246810SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 37.057.207.01MIN: 6.94 / MAX: 7.27MIN: 6.89 / MAX: 32.31MIN: 6.92 / MAX: 7.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazeface3950X + Navi230.45230.90461.35691.80922.2615SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 32.012.012.01MIN: 1.97 / MAX: 2.14MIN: 1.97 / MAX: 2.11MIN: 1.98 / MAX: 2.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenet3950X + Navi23510152025SE +/- 0.23, N = 3SE +/- 0.35, N = 3SE +/- 0.01, N = 319.2219.3318.90MIN: 18.31 / MAX: 56.39MIN: 18.21 / MAX: 23.94MIN: 18.17 / MAX: 19.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg163950X + Navi231632486480SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 370.2970.0170.29MIN: 68.71 / MAX: 72.85MIN: 68.34 / MAX: 79.57MIN: 68.56 / MAX: 118.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet183950X + Navi2348121620SE +/- 0.17, N = 3SE +/- 0.16, N = 3SE +/- 0.01, N = 317.6917.6017.43MIN: 17.36 / MAX: 18.26MIN: 17.3 / MAX: 18.73MIN: 17.29 / MAX: 17.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnet3950X + Navi2348121620SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 316.5816.6316.50MIN: 16.38 / MAX: 18.55MIN: 16.38 / MAX: 35.76MIN: 16.37 / MAX: 17.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet503950X + Navi23714212835SE +/- 0.16, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 329.7029.3629.53MIN: 29.28 / MAX: 71.36MIN: 29.03 / MAX: 30.44MIN: 29.11 / MAX: 67.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tiny3950X + Navi23714212835SE +/- 0.18, N = 3SE +/- 0.16, N = 3SE +/- 0.08, N = 329.2628.9428.82MIN: 28.61 / MAX: 39.41MIN: 28.45 / MAX: 30.87MIN: 28.43 / MAX: 30.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: squeezenet3950X + Navi230.811.622.433.244.05SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.583.583.60MIN: 3.47 / MAX: 3.8MIN: 3.47 / MAX: 3.76MIN: 3.48 / MAX: 3.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mobilenet3950X + Navi23246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.586.586.59MIN: 6.53 / MAX: 7.07MIN: 6.54 / MAX: 6.79MIN: 6.53 / MAX: 12.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v2-v2 - Model: mobilenet-v23950X + Navi230.48150.9631.44451.9262.4075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.142.142.14MIN: 2.09 / MAX: 2.55MIN: 2.09 / MAX: 3.7MIN: 2.09 / MAX: 3.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU-v3-v3 - Model: mobilenet-v33950X + Navi230.69751.3952.09252.793.4875SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.103.103.09MIN: 3.05 / MAX: 4.87MIN: 3.05 / MAX: 3.51MIN: 3.05 / MAX: 3.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: shufflenet-v23950X + Navi230.41850.8371.25551.6742.0925SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 31.841.861.83MIN: 1.82 / MAX: 3.18MIN: 1.82 / MAX: 11.33MIN: 1.82 / MAX: 1.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: mnasnet3950X + Navi230.50181.00361.50542.00722.509SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.232.232.23MIN: 2.19 / MAX: 2.47MIN: 2.19 / MAX: 2.46MIN: 2.19 / MAX: 2.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: efficientnet-b03950X + Navi23246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 36.836.876.80MIN: 6.71 / MAX: 15.49MIN: 6.71 / MAX: 17.28MIN: 6.7 / MAX: 10.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: blazeface3950X + Navi230.1710.3420.5130.6840.855SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 30.760.760.76MIN: 0.75 / MAX: 0.99MIN: 0.75 / MAX: 0.96MIN: 0.75 / MAX: 0.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: googlenet3950X + Navi230.95851.9172.87553.8344.7925SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.254.254.26MIN: 4.22 / MAX: 4.91MIN: 4.22 / MAX: 5.32MIN: 4.23 / MAX: 9.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: vgg163950X + Navi2348121620SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 314.9714.9714.98MIN: 14.41 / MAX: 15.48MIN: 14.46 / MAX: 15.66MIN: 14.38 / MAX: 19.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet183950X + Navi230.38930.77861.16791.55721.9465SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.731.721.73MIN: 1.7 / MAX: 2.77MIN: 1.7 / MAX: 2.25MIN: 1.71 / MAX: 2.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: alexnet3950X + Navi231.17682.35363.53044.70725.884SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 35.235.215.19MIN: 4.89 / MAX: 16.85MIN: 4.86 / MAX: 20.01MIN: 4.9 / MAX: 9.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: resnet503950X + Navi231.1162.2323.3484.4645.58SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.954.954.96MIN: 4.88 / MAX: 8.9MIN: 4.9 / MAX: 8.86MIN: 4.89 / MAX: 10.051. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: Vulkan GPU - Model: yolov4-tiny3950X + Navi23246810SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.658.558.55MIN: 8.5 / MAX: 46.41MIN: 8.46 / MAX: 13.54MIN: 8.5 / MAX: 8.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v23950X + Navi2350100150200250SE +/- 2.51, N = 3SE +/- 0.38, N = 3SE +/- 0.31, N = 3244.91240.62240.20MIN: 240.06 / MAX: 267.36MIN: 239.1 / MAX: 274.3MIN: 238.85 / MAX: 245.711. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.13950X + Navi2350100150200250SE +/- 0.72, N = 3SE +/- 0.72, N = 3SE +/- 0.09, N = 3237.00225.25238.14MIN: 234.3 / MAX: 240MIN: 223.02 / MAX: 233.9MIN: 235.38 / MAX: 238.841. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl


Phoronix Test Suite v10.8.4