nn

AMD Ryzen Threadripper PRO 5965WX 24-Cores testing with an ASUS Pro WS WRX80E-SAGE SE WIFI (1003 BIOS) and ASUS NVIDIA NV106 2GB on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2208137-NE-NN282134066.

nn: System Details

Result identifiers: a, b, c

Processor: AMD Ryzen Threadripper PRO 5965WX 24-Cores @ 3.80GHz (24 Cores / 48 Threads)
Motherboard: ASUS Pro WS WRX80E-SAGE SE WIFI (1003 BIOS)
Chipset: AMD Starship/Matisse
Memory: 128GB
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0
Graphics: ASUS NVIDIA NV106 2GB
Audio: AMD Starship/Matisse
Monitor: VA2431
Network: 2 x Intel 10G X550T + Intel Wi-Fi 6 AX200
OS: Ubuntu 22.04
Kernel: 5.19.0-051900daily20220809-generic (x86_64)
Desktop: GNOME Shell 42.2
Display Server: X Server 1.21.1.3 + Wayland
Display Driver: nouveau
OpenGL: 4.3 Mesa 22.0.1
Vulkan: 1.2.204
Compiler: GCC 11.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
- CPU Microcode: 0xa008203

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

nn: Results Overview (all values in ms, fewer is better)

Test                                a         b         c
mnn: mobilenetV3                    2.257     2.281     2.038
mnn: squeezenetv1.1                 3.648     3.789     3.291
mnn: resnet-v2-50                   18.433    18.815    17.115
mnn: SqueezeNetV1.0                 5.946     5.983     5.658
mnn: MobileNetV2_224                3.820     3.901     3.545
mnn: mobilenet-v1-1.0               2.700     2.720     2.729
mnn: inception-v3                   22.134    22.399    19.854
ncnn: CPU - mobilenet               13.12     12.93     12.84
ncnn: CPU-v2-v2 - mobilenet-v2      6.17      5.94      5.91
ncnn: CPU-v3-v3 - mobilenet-v3      5.69      5.45      5.42
ncnn: CPU - shufflenet-v2           7.25      7.04      6.96
ncnn: CPU - mnasnet                 5.69      5.46      5.46
ncnn: CPU - efficientnet-b0         8.28      7.98      7.90
ncnn: CPU - blazeface               3.09      2.99      2.96
ncnn: CPU - googlenet               14.16     13.84     13.80
ncnn: CPU - vgg16                   23.28     23.25     23.32
ncnn: CPU - resnet18                9.25      9.13      9.11
ncnn: CPU - alexnet                 5.96      5.89      5.98
ncnn: CPU - resnet50                17.10     16.81     16.72
ncnn: CPU - yolov4-tiny             19.49     19.16     18.99
ncnn: CPU - squeezenet_ssd          15.51     15.24     15.23
ncnn: CPU - regnety_400m            23.84     22.46     22.26
ncnn: CPU - vision_transformer      110.26    110.27    110.27
ncnn: CPU - FastestDet              8.76      8.59      8.43
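To make the run-to-run differences in the overview easier to read, the following is a minimal Python sketch that computes the signed percentage change of run c relative to run a for the Mobile Neural Network rows. The values are copied from the overview above. The dictionary name `mnn_ms` and the helper `percent_change` are illustrative assumptions and are not part of the Phoronix Test Suite.

```python
# Latencies in ms (fewer is better) for runs a and c, copied from the overview.
# Only the Mobile Neural Network rows are included to keep the sketch short.
mnn_ms = {
    "mobilenetV3": (2.257, 2.038),
    "squeezenetv1.1": (3.648, 3.291),
    "resnet-v2-50": (18.433, 17.115),
    "SqueezeNetV1.0": (5.946, 5.658),
    "MobileNetV2_224": (3.820, 3.545),
    "mobilenet-v1-1.0": (2.700, 2.729),
    "inception-v3": (22.134, 19.854),
}


def percent_change(baseline, other):
    """Signed change of `other` relative to `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Negative values mean run c was faster than run a for that model.
changes = {model: percent_change(a, c) for model, (a, c) in mnn_ms.items()}

for model, delta in changes.items():
    print(f"{model:18s} {delta:+6.2f}%")
```

Because the metric is latency, a negative change indicates an improvement. In this data only mobilenet-v1-1.0 comes out positive for run c against run a.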

Mobile Neural Network

Model: mobilenetV3

Mobile Neural Network 2.0 (ms, fewer is better)
a: 2.257 (MIN: 2.09 / MAX: 3.98)
b: 2.281 (MIN: 2.06 / MAX: 3.83)
c: 2.038 (MIN: 2.01 / MAX: 3.61)
SE +/- 0.029, N = 15; SE +/- 0.026, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
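The "SE +/- 0.029, N = 15" figure reported for this test can be converted into an approximate sample standard deviation, which is sometimes easier to compare against the MIN/MAX spread. The sketch below assumes that SE denotes the standard error of the mean over N samples, so that SD = SE * sqrt(N). The function name `implied_std_dev` is hypothetical, and the export does not state which run each SE value belongs to.

```python
import math


def implied_std_dev(standard_error, n_samples):
    """Sample standard deviation implied by a standard error of the mean.

    Assumes SE = SD / sqrt(N), which is the usual definition.
    """
    return standard_error * math.sqrt(n_samples)


# SE and N as reported for the mobilenetV3 test.
sd = implied_std_dev(0.029, 15)
print(f"implied standard deviation: {sd:.3f} ms")
```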

Mobile Neural Network

Model: squeezenetv1.1

Mobile Neural Network 2.0 (ms, fewer is better)
a: 3.648 (MIN: 3.33 / MAX: 5.24)
b: 3.789 (MIN: 3.38 / MAX: 4.92)
c: 3.291 (MIN: 3.26 / MAX: 4.16)
SE +/- 0.056, N = 15; SE +/- 0.046, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 2.0 (ms, fewer is better)
a: 18.43 (MIN: 17.17 / MAX: 21.39)
b: 18.82 (MIN: 17.52 / MAX: 21.68)
c: 17.12 (MIN: 16.97 / MAX: 18.67)
SE +/- 0.26, N = 15; SE +/- 0.25, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 2.0 (ms, fewer is better)
a: 5.946 (MIN: 5.24 / MAX: 7.59)
b: 5.983 (MIN: 5.55 / MAX: 7.35)
c: 5.658 (MIN: 5.59 / MAX: 7.09)
SE +/- 0.055, N = 15; SE +/- 0.052, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 2.0 (ms, fewer is better)
a: 3.820 (MIN: 3.54 / MAX: 5.26)
b: 3.901 (MIN: 3.55 / MAX: 5.3)
c: 3.545 (MIN: 3.51 / MAX: 4.49)
SE +/- 0.034, N = 15; SE +/- 0.047, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 2.0 (ms, fewer is better)
a: 2.700 (MIN: 2.59 / MAX: 3.54)
b: 2.720 (MIN: 2.57 / MAX: 4.14)
c: 2.729 (MIN: 2.7 / MAX: 3.56)
SE +/- 0.019, N = 15; SE +/- 0.019, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 2.0 (ms, fewer is better)
a: 22.13 (MIN: 20.33 / MAX: 24.82)
b: 22.40 (MIN: 20.02 / MAX: 25.26)
c: 19.85 (MIN: 19.7 / MAX: 21.4)
SE +/- 0.25, N = 15; SE +/- 0.30, N = 15
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: mobilenet

NCNN 20220729 (ms, fewer is better)
a: 13.12 (MIN: 12.95 / MAX: 19.49)
b: 12.93 (MIN: 12.71 / MAX: 15)
c: 12.84 (MIN: 12.76 / MAX: 13.27)
SE +/- 0.00, N = 3; SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20220729 (ms, fewer is better)
a: 6.17 (MIN: 6.04 / MAX: 10.12)
b: 5.94 (MIN: 5.78 / MAX: 7.68)
c: 5.91 (MIN: 5.82 / MAX: 7.49)
SE +/- 0.02, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20220729 (ms, fewer is better)
a: 5.69 (MIN: 5.55 / MAX: 9.2)
b: 5.45 (MIN: 5.28 / MAX: 7.8)
c: 5.42 (MIN: 5.35 / MAX: 6.75)
SE +/- 0.02, N = 3; SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20220729 (ms, fewer is better)
a: 7.25 (MIN: 7.12 / MAX: 9.7)
b: 7.04 (MIN: 6.85 / MAX: 8.14)
c: 6.96 (MIN: 6.9 / MAX: 7.93)
SE +/- 0.03, N = 3; SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20220729 (ms, fewer is better)
a: 5.69 (MIN: 5.59 / MAX: 9.66)
b: 5.46 (MIN: 5.33 / MAX: 7.09)
c: 5.46 (MIN: 5.39 / MAX: 7.02)
SE +/- 0.01, N = 2; SE +/- 0.07, N = 2
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20220729 (ms, fewer is better)
a: 8.28 (MIN: 8.13 / MAX: 13.55)
b: 7.98 (MIN: 7.78 / MAX: 9.57)
c: 7.90 (MIN: 7.84 / MAX: 9.08)
SE +/- 0.03, N = 3; SE +/- 0.07, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20220729 (ms, fewer is better)
a: 3.09 (MIN: 3.02 / MAX: 5.92)
b: 2.99 (MIN: 2.89 / MAX: 3.67)
c: 2.96 (MIN: 2.92 / MAX: 3.42)
SE +/- 0.01, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20220729 (ms, fewer is better)
a: 14.16 (MIN: 13.93 / MAX: 19.53)
b: 13.84 (MIN: 13.5 / MAX: 17.24)
c: 13.80 (MIN: 13.66 / MAX: 15.75)
SE +/- 0.04, N = 3; SE +/- 0.09, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20220729 (ms, fewer is better)
a: 23.28 (MIN: 22.77 / MAX: 28.58)
b: 23.25 (MIN: 22.64 / MAX: 33.92)
c: 23.32 (MIN: 22.65 / MAX: 28)
SE +/- 0.04, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20220729 (ms, fewer is better)
a: 9.25 (MIN: 9.06 / MAX: 12.11)
b: 9.13 (MIN: 8.93 / MAX: 13.18)
c: 9.11 (MIN: 8.93 / MAX: 13.05)
SE +/- 0.01, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20220729 (ms, fewer is better)
a: 5.96 (MIN: 5.71 / MAX: 9.01)
b: 5.89 (MIN: 5.69 / MAX: 9.06)
c: 5.98 (MIN: 5.7 / MAX: 8.24)
SE +/- 0.07, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20220729 (ms, fewer is better)
a: 17.10 (MIN: 16.78 / MAX: 21.76)
b: 16.81 (MIN: 16.47 / MAX: 21.24)
c: 16.72 (MIN: 16.54 / MAX: 22.07)
SE +/- 0.04, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20220729 (ms, fewer is better)
a: 19.49 (MIN: 18.87 / MAX: 21.72)
b: 19.16 (MIN: 18.61 / MAX: 28.33)
c: 18.99 (MIN: 18.66 / MAX: 20.52)
SE +/- 0.17, N = 3; SE +/- 0.21, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20220729 (ms, fewer is better)
a: 15.51 (MIN: 15.09 / MAX: 18.17)
b: 15.24 (MIN: 14.8 / MAX: 18.11)
c: 15.23 (MIN: 14.9 / MAX: 23.96)
SE +/- 0.04, N = 3; SE +/- 0.09, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20220729 (ms, fewer is better)
a: 23.84 (MIN: 23.38 / MAX: 26.29)
b: 22.46 (MIN: 21.61 / MAX: 28.98)
c: 22.26 (MIN: 21.92 / MAX: 24.69)
SE +/- 0.13, N = 3; SE +/- 0.29, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

NCNN 20220729 (ms, fewer is better)
a: 110.26 (MIN: 109.56 / MAX: 114.27)
b: 110.27 (MIN: 109.69 / MAX: 117.06)
c: 110.27 (MIN: 109.27 / MAX: 117.21)
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

NCNN 20220729 (ms, fewer is better)
a: 8.76 (MIN: 8.6 / MAX: 10.6)
b: 8.59 (MIN: 8.26 / MAX: 10.5)
c: 8.43 (MIN: 8.35 / MAX: 10.14)
SE +/- 0.04, N = 3; SE +/- 0.14, N = 3
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread


Phoronix Test Suite v10.8.4