mnn ncnn apple m1

Apple M1 testing with a Apple Mac mini (M1 2020) and llvmpipe on Arch Linux ARM via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2208136-NE-MNNNCNNAP23
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
A
August 13 2022
  6 Hours, 18 Minutes
B
August 13 2022
  3 Hours, 44 Minutes
C
August 13 2022
  6 Hours, 17 Minutes
D
August 13 2022
  6 Hours, 17 Minutes
Invert Behavior (Only Show Selected Data)
  5 Hours, 39 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


mnn ncnn apple m1OpenBenchmarking.orgPhoronix Test SuiteApple M1 @ 2.06GHz (4 Cores / 8 Threads)Apple Mac mini (M1 2020)8GB251GB APPLE SSD AP0256Q + 2 x 0GB APPLE SSD AP0256QllvmpipeBroadcom NetXtreme BCM57762 PCIe + Broadcom BRCM4378 + Broadcom Device 5f69Arch Linux ARM5.19.0-rc7-asahi-2-1-ARCH (aarch64)KDE Plasma 5.25.4X Server 1.21.1.44.5 Mesa 22.1.4 (LLVM 14.0.6 128 bits)GCC 12.1.0 + Clang 14.0.6ext41920x1080ProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionMnn Ncnn Apple M1 BenchmarksSystem Logs- --build=aarch64-unknown-linux-gnu --disable-libssp --disable-libstdcxx-pch --disable-multilib --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-fix-cortex-a53-835769 --enable-fix-cortex-a53-843419 --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=c,c++,fortran,go,lto,objc,obj-c++ --enable-lto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-unknown-linux-gnu --mandir=/usr/share/man --with-arch=armv8-a --with-linker-hash-style=gnu - Scaling Governor: apple-cpufreq schedutil- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

ABCDResult OverviewPhoronix Test Suite100%103%106%109%113%NCNNMobile Neural NetworkMobile Neural NetworkNCNNNCNNNCNNNCNNNCNNNCNNNCNNNCNNNCNNNCNNNCNNMobile Neural NetworkNCNNNCNNNCNNMobile Neural NetworkMobile Neural NetworkNCNNMobile Neural NetworkNCNNMobile Neural NetworkCPU - FastestDetMobileNetV2_224mobilenetV3CPU - mnasnetCPU-v3-v3 - mobilenet-v3CPU - squeezenet_ssdCPU - resnet18CPU - googlenetCPU - shufflenet-v2CPU - mobilenetCPU - blazefaceCPU - regnety_400mCPU - alexnetCPU - resnet50inception-v3CPU - efficientnet-b0CPU-v2-v2 - mobilenet-v2CPU - yolov4-tinySqueezeNetV1.0resnet-v2-50CPU - vgg16squeezenetv1.1CPU - vision_transformermobilenet-v1-1.0

mnn ncnn apple m1mnn: mobilenetV3mnn: squeezenetv1.1mnn: resnet-v2-50mnn: SqueezeNetV1.0mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - blazefacencnn: CPU - FastestDetABCD2.0124.12031.2657.2024.6785.75244.47614.512.492.552.132.633.8514.3834.136.6313.4317.6916.9310.945.81556.152.831.912.1224.13931.5127.2255.0255.74143.88414.672.462.522.182.453.8614.7734.076.8213.6217.5417.0911.135.82557.832.792.151.9294.11531.3467.1664.5115.73544.07014.892.462.432.192.563.9014.3233.916.5613.7317.5617.0411.385.86558.922.812.011.9764.11931.3337.1654.5365.74943.92714.622.482.562.162.433.9014.2833.946.7813.6017.8117.1311.105.95557.342.862.00OpenBenchmarking.org

Mobile Neural Network

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: mobilenetV3CDAB0.47750.9551.43251.912.3875SE +/- 0.034, N = 12SE +/- 0.043, N = 12SE +/- 0.042, N = 12SE +/- 0.020, N = 31.9291.9762.0122.122MIN: 1.78 / MAX: 2.9MIN: 1.78 / MAX: 2.46MIN: 1.79 / MAX: 5.19MIN: 2.07 / MAX: 2.371. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: squeezenetv1.1CDAB0.93131.86262.79393.72524.6565SE +/- 0.010, N = 12SE +/- 0.009, N = 12SE +/- 0.012, N = 12SE +/- 0.014, N = 34.1154.1194.1204.139MIN: 4.04 / MAX: 7.45MIN: 4.04 / MAX: 9.15MIN: 4.03 / MAX: 4.54MIN: 4.07 / MAX: 8.541. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: resnet-v2-50ADCB714212835SE +/- 0.06, N = 12SE +/- 0.05, N = 12SE +/- 0.07, N = 12SE +/- 0.12, N = 331.2731.3331.3531.51MIN: 30.89 / MAX: 36.52MIN: 31.02 / MAX: 36.84MIN: 30.87 / MAX: 35.51MIN: 30.98 / MAX: 35.981. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: SqueezeNetV1.0DCAB246810SE +/- 0.049, N = 12SE +/- 0.051, N = 12SE +/- 0.051, N = 12SE +/- 0.102, N = 37.1657.1667.2027.225MIN: 6.94 / MAX: 8.77MIN: 6.93 / MAX: 11.58MIN: 6.94 / MAX: 10.14MIN: 6.99 / MAX: 7.661. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: MobileNetV2_224CDAB1.13062.26123.39184.52245.653SE +/- 0.116, N = 12SE +/- 0.110, N = 12SE +/- 0.117, N = 12SE +/- 0.031, N = 34.5114.5364.6785.025MIN: 4.09 / MAX: 10MIN: 4.16 / MAX: 10.91MIN: 4.14 / MAX: 8.31MIN: 4.97 / MAX: 5.271. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: mobilenet-v1-1.0CBDA1.29422.58843.88265.17686.471SE +/- 0.008, N = 12SE +/- 0.031, N = 3SE +/- 0.010, N = 12SE +/- 0.008, N = 125.7355.7415.7495.752MIN: 5.68 / MAX: 11.55MIN: 5.66 / MAX: 8.59MIN: 5.65 / MAX: 11.69MIN: 5.68 / MAX: 9.41. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.0Model: inception-v3BDCA1020304050SE +/- 0.81, N = 3SE +/- 0.23, N = 12SE +/- 0.23, N = 12SE +/- 0.27, N = 1243.8843.9344.0744.48MIN: 42.56 / MAX: 52.14MIN: 42.55 / MAX: 50.76MIN: 43.4 / MAX: 51.58MIN: 43.4 / MAX: 71.041. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenetADBC48121620SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 314.5114.6214.6714.89MIN: 9.47 / MAX: 25.78MIN: 9.37 / MAX: 25.12MIN: 9.45 / MAX: 26.22MIN: 9.41 / MAX: 25.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v2BCDA0.56031.12061.68092.24122.8015SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 32.462.462.482.49MIN: 2.43 / MAX: 2.9MIN: 2.44 / MAX: 4.62MIN: 2.44 / MAX: 11.8MIN: 2.44 / MAX: 13.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v3CBAD0.5761.1521.7282.3042.88SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 2SE +/- 0.01, N = 32.432.522.552.56MIN: 2.27 / MAX: 2.59MIN: 2.46 / MAX: 2.78MIN: 2.5 / MAX: 4.94MIN: 2.51 / MAX: 5.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v2ADBC0.49280.98561.47841.97122.464SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 32.132.162.182.19MIN: 2.09 / MAX: 2.44MIN: 2.12 / MAX: 3.03MIN: 2.12 / MAX: 7.27MIN: 2.11 / MAX: 20.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetDBCA0.59181.18361.77542.36722.959SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 32.432.452.562.63MIN: 2.41 / MAX: 3.11MIN: 2.41 / MAX: 4.31MIN: 2.41 / MAX: 4.91MIN: 2.41 / MAX: 12.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0ABCD0.87751.7552.63253.514.3875SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 33.853.863.903.90MIN: 3.81 / MAX: 6.92MIN: 3.81 / MAX: 14.93MIN: 3.81 / MAX: 15.96MIN: 3.81 / MAX: 13.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetDCAB48121620SE +/- 0.26, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.19, N = 214.2814.3214.3814.77MIN: 9.29 / MAX: 23.48MIN: 9.41 / MAX: 23.61MIN: 9.51 / MAX: 22.97MIN: 9.79 / MAX: 24.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg16CDBA816243240SE +/- 0.08, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 333.9133.9434.0734.13MIN: 30.14 / MAX: 47.42MIN: 30.26 / MAX: 46.13MIN: 30.33 / MAX: 45.17MIN: 30.19 / MAX: 46.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet18CADB246810SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 36.566.636.786.82MIN: 5.75 / MAX: 20.92MIN: 5.77 / MAX: 15.63MIN: 5.74 / MAX: 16.12MIN: 5.74 / MAX: 16.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnetADBC48121620SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 313.4313.6013.6213.73MIN: 9.68 / MAX: 21.9MIN: 9.79 / MAX: 21.97MIN: 9.81 / MAX: 22.21MIN: 9.79 / MAX: 22.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50BCAD48121620SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 317.5417.5617.6917.81MIN: 15.32 / MAX: 28.27MIN: 15.37 / MAX: 27.48MIN: 15.3 / MAX: 27.57MIN: 15.42 / MAX: 30.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tinyACBD48121620SE +/- 0.11, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 316.9317.0417.0917.13MIN: 14.21 / MAX: 26.86MIN: 14.15 / MAX: 25.65MIN: 13.92 / MAX: 30.17MIN: 14.16 / MAX: 27.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssdADBC3691215SE +/- 0.17, N = 3SE +/- 0.20, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 310.9411.1011.1311.38MIN: 8.03 / MAX: 20.45MIN: 8.04 / MAX: 21.85MIN: 8.08 / MAX: 21.53MIN: 8.02 / MAX: 21.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mABCD1.33882.67764.01645.35526.694SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 35.815.825.865.95MIN: 5.78 / MAX: 6.09MIN: 5.78 / MAX: 10.73MIN: 5.79 / MAX: 7.87MIN: 5.79 / MAX: 8.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerADBC120240360480600SE +/- 0.36, N = 3SE +/- 1.03, N = 3SE +/- 0.64, N = 3SE +/- 0.98, N = 3556.15557.34557.83558.92MIN: 516.47 / MAX: 589.99MIN: 523.45 / MAX: 591.96MIN: 520.8 / MAX: 597.49MIN: 522.41 / MAX: 596.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceBCAD0.64351.2871.93052.5743.2175SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 2SE +/- 0.03, N = 32.792.812.832.86MIN: 1.38 / MAX: 10.17MIN: 1.17 / MAX: 9.42MIN: 1.26 / MAX: 9.38MIN: 1.14 / MAX: 9.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetADCB0.48380.96761.45141.93522.419SE +/- 0.01, N = 2SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 31.912.002.012.15MIN: 1.88 / MAX: 3.36MIN: 1.89 / MAX: 2.18MIN: 1.88 / MAX: 8.67MIN: 2.08 / MAX: 9.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread