mnn ncnn apple m1

Apple M1 testing with a Apple Mac mini (M1 2020) and llvmpipe on Arch Linux ARM via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2208136-NE-MNNNCNNAP23&sor.

mnn ncnn apple m1ProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionABCDApple M1 @ 2.06GHz (4 Cores / 8 Threads)Apple Mac mini (M1 2020)8GB251GB APPLE SSD AP0256Q + 2 x 0GB APPLE SSD AP0256QllvmpipeBroadcom NetXtreme BCM57762 PCIe + Broadcom BRCM4378 + Broadcom Device 5f69Arch Linux ARM5.19.0-rc7-asahi-2-1-ARCH (aarch64)KDE Plasma 5.25.4X Server 1.21.1.44.5 Mesa 22.1.4 (LLVM 14.0.6 128 bits)GCC 12.1.0 + Clang 14.0.6ext41920x1080OpenBenchmarking.orgCompiler Details- --build=aarch64-unknown-linux-gnu --disable-libssp --disable-libstdcxx-pch --disable-multilib --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-fix-cortex-a53-835769 --enable-fix-cortex-a53-843419 --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,fortran,go,lto,objc,obj-c++ --enable-lto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-unknown-linux-gnu --mandir=/usr/share/man --with-arch=armv8-a --with-linker-hash-style=gnu Processor Details- Scaling Governor: apple-cpufreq schedutilSecurity Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

mnn ncnn apple m1mnn: mobilenetV3mnn: squeezenetv1.1mnn: resnet-v2-50mnn: SqueezeNetV1.0mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - blazefacencnn: CPU - FastestDetABCD2.0124.12031.2657.2024.6785.75244.47614.512.492.552.132.633.8514.3834.136.6313.4317.6916.9310.945.81556.152.831.912.1224.13931.5127.2255.0255.74143.88414.672.462.522.182.453.8614.7734.076.8213.6217.5417.0911.135.82557.832.792.151.9294.11531.3467.1664.5115.73544.07014.892.462.432.192.563.9014.3233.916.5613.7317.5617.0411.385.86558.922.812.011.9764.11931.3337.1654.5365.74943.92714.622.482.562.162.433.9014.2833.946.7813.6017.8117.1311.105.95557.342.862.00OpenBenchmarking.org

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: mobilenetV3CDAB0.47750.9551.43251.912.3875SE +/- 0.034, N = 12SE +/- 0.043, N = 12SE +/- 0.042, N = 12SE +/- 0.020, N = 31.9291.9762.0122.122MIN: 1.78 / MAX: 2.9MIN: 1.78 / MAX: 2.46MIN: 1.79 / MAX: 5.19MIN: 2.07 / MAX: 2.371. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: squeezenetv1.1CDAB0.93131.86262.79393.72524.6565SE +/- 0.010, N = 12SE +/- 0.009, N = 12SE +/- 0.012, N = 12SE +/- 0.014, N = 34.1154.1194.1204.139MIN: 4.04 / MAX: 7.45MIN: 4.04 / MAX: 9.15MIN: 4.03 / MAX: 4.54MIN: 4.07 / MAX: 8.541. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: resnet-v2-50ADCB714212835SE +/- 0.06, N = 12SE +/- 0.05, N = 12SE +/- 0.07, N = 12SE +/- 0.12, N = 331.2731.3331.3531.51MIN: 30.89 / MAX: 36.52MIN: 31.02 / MAX: 36.84MIN: 30.87 / MAX: 35.51MIN: 30.98 / MAX: 35.981. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: SqueezeNetV1.0DCAB246810SE +/- 0.049, N = 12SE +/- 0.051, N = 12SE +/- 0.051, N = 12SE +/- 0.102, N = 37.1657.1667.2027.225MIN: 6.94 / MAX: 8.77MIN: 6.93 / MAX: 11.58MIN: 6.94 / MAX: 10.14MIN: 6.99 / MAX: 7.661. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: MobileNetV2_224CDAB1.13062.26123.39184.52245.653SE +/- 0.116, N = 12SE +/- 0.110, N = 12SE +/- 0.117, N = 12SE +/- 0.031, N = 34.5114.5364.6785.025MIN: 4.09 / MAX: 10MIN: 4.16 / MAX: 10.91MIN: 4.14 / MAX: 8.31MIN: 4.97 / MAX: 5.271. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: mobilenet-v1-1.0CBDA1.29422.58843.88265.17686.471SE +/- 0.008, N = 12SE +/- 0.031, N = 3SE +/- 0.010, N = 12SE +/- 0.008, N = 125.7355.7415.7495.752MIN: 5.68 / MAX: 11.55MIN: 5.66 / MAX: 8.59MIN: 5.65 / MAX: 11.69MIN: 5.68 / MAX: 9.41. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: inception-v3BDCA1020304050SE +/- 0.81, N = 3SE +/- 0.23, N = 12SE +/- 0.23, N = 12SE +/- 0.27, N = 1243.8843.9344.0744.48MIN: 42.56 / MAX: 52.14MIN: 42.55 / MAX: 50.76MIN: 43.4 / MAX: 51.58MIN: 43.4 / MAX: 71.041. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenetADBC48121620SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 314.5114.6214.6714.89MIN: 9.47 / MAX: 25.78MIN: 9.37 / MAX: 25.12MIN: 9.45 / MAX: 26.22MIN: 9.41 / MAX: 25.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v2BCDA0.56031.12061.68092.24122.8015SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 32.462.462.482.49MIN: 2.43 / MAX: 2.9MIN: 2.44 / MAX: 4.62MIN: 2.44 / MAX: 11.8MIN: 2.44 / MAX: 13.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v3CBAD0.5761.1521.7282.3042.88SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 2SE +/- 0.01, N = 32.432.522.552.56MIN: 2.27 / MAX: 2.59MIN: 2.46 / MAX: 2.78MIN: 2.5 / MAX: 4.94MIN: 2.51 / MAX: 5.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v2ADBC0.49280.98561.47841.97122.464SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 32.132.162.182.19MIN: 2.09 / MAX: 2.44MIN: 2.12 / MAX: 3.03MIN: 2.12 / MAX: 7.27MIN: 2.11 / MAX: 20.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetDBCA0.59181.18361.77542.36722.959SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 32.432.452.562.63MIN: 2.41 / MAX: 3.11MIN: 2.41 / MAX: 4.31MIN: 2.41 / MAX: 4.91MIN: 2.41 / MAX: 12.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0ABCD0.87751.7552.63253.514.3875SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 33.853.863.903.90MIN: 3.81 / MAX: 6.92MIN: 3.81 / MAX: 14.93MIN: 3.81 / MAX: 15.96MIN: 3.81 / MAX: 13.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetDCAB48121620SE +/- 0.26, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.19, N = 214.2814.3214.3814.77MIN: 9.29 / MAX: 23.48MIN: 9.41 / MAX: 23.61MIN: 9.51 / MAX: 22.97MIN: 9.79 / MAX: 24.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg16CDBA816243240SE +/- 0.08, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 333.9133.9434.0734.13MIN: 30.14 / MAX: 47.42MIN: 30.26 / MAX: 46.13MIN: 30.33 / MAX: 45.17MIN: 30.19 / MAX: 46.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet18CADB246810SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 36.566.636.786.82MIN: 5.75 / MAX: 20.92MIN: 5.77 / MAX: 15.63MIN: 5.74 / MAX: 16.12MIN: 5.74 / MAX: 16.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnetADBC48121620SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 313.4313.6013.6213.73MIN: 9.68 / MAX: 21.9MIN: 9.79 / MAX: 21.97MIN: 9.81 / MAX: 22.21MIN: 9.79 / MAX: 22.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50BCAD48121620SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 317.5417.5617.6917.81MIN: 15.32 / MAX: 28.27MIN: 15.37 / MAX: 27.48MIN: 15.3 / MAX: 27.57MIN: 15.42 / MAX: 30.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tinyACBD48121620SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 316.9317.0417.0917.13MIN: 14.21 / MAX: 26.86MIN: 14.15 / MAX: 25.65MIN: 13.92 / MAX: 30.17MIN: 14.16 / MAX: 27.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssdADBC3691215SE +/- 0.17, N = 3SE +/- 0.20, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 310.9411.1011.1311.38MIN: 8.03 / MAX: 20.45MIN: 8.04 / MAX: 21.85MIN: 8.08 / MAX: 21.53MIN: 8.02 / MAX: 21.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mABCD1.33882.67764.01645.35526.694SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 35.815.825.865.95MIN: 5.78 / MAX: 6.09MIN: 5.78 / MAX: 10.73MIN: 5.79 / MAX: 7.87MIN: 5.79 / MAX: 8.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerADBC120240360480600SE +/- 0.36, N = 3SE +/- 1.03, N = 3SE +/- 0.64, N = 3SE +/- 0.98, N = 3556.15557.34557.83558.92MIN: 516.47 / MAX: 589.99MIN: 523.45 / MAX: 591.96MIN: 520.8 / MAX: 597.49MIN: 522.41 / MAX: 596.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceBCAD0.64351.2871.93052.5743.2175SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 2SE +/- 0.03, N = 32.792.812.832.86MIN: 1.38 / MAX: 10.17MIN: 1.17 / MAX: 9.42MIN: 1.26 / MAX: 9.38MIN: 1.14 / MAX: 9.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetADCB0.48380.96761.45141.93522.419SE +/- 0.01, N = 2SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 31.912.002.012.15MIN: 1.88 / MAX: 3.36MIN: 1.89 / MAX: 2.18MIN: 1.88 / MAX: 8.67MIN: 2.08 / MAX: 9.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.4