AMD EPYC 7551 32-Core testing with a GIGABYTE MZ31-AR0-00 v01010101 (F10 BIOS) and ASPEED on Debian 11 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2208156-NE-MNNNCNNZE48
HTML result view exported from: https://openbenchmarking.org/result/2208156-NE-MNNNCNNZE48&grw&sro .
mnn ncnn zen 1 epyc

Configurations: A, B, C

Processor: AMD EPYC 7551 32-Core @ 2.00GHz (32 Cores / 64 Threads)
Motherboard: GIGABYTE MZ31-AR0-00 v01010101 (F10 BIOS)
Chipset: AMD 17h
Memory: 8 x 4 GB DDR4-2133MT/s 9ASF51272PZ-2G6E1
Disk: Samsung SSD 960 EVO 500GB
Graphics: ASPEED
Network: Realtek RTL8111/8168/8411 + 2 x Broadcom NetXtreme II BCM57810 10
OS: Debian 11
Kernel: 5.10.0-9-amd64 (x86_64)
Compiler: GCC 10.2.1 20210110
File-System: ext4
Screen Resolution: 1024x768

Kernel Details
  Transparent Huge Pages: always

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled)
  CPU Microcode: 0x8001227

Security Details
  itlb_multihit: Not affected
  l1tf: Not affected
  mds: Not affected
  meltdown: Not affected
  spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
  spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
  spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling
  srbds: Not affected
  tsx_async_abort: Not affected
mnn ncnn zen 1 epyc: result overview (ms, Fewer Is Better)

Test                              A         B         C
mnn: mobilenetV3                  5.281     5.464     6.070
mnn: squeezenetv1.1               9.157     9.009     9.516
mnn: resnet-v2-50                 52.380    54.152    57.826
mnn: SqueezeNetV1.0               14.090    13.983    14.492
mnn: MobileNetV2_224              10.048    8.595     9.898
mnn: mobilenet-v1-1.0             8.341     8.119     8.783
mnn: inception-v3                 59.589    57.847    59.955
ncnn: CPU - mobilenet             45.00     41.52     45.88
ncnn: CPU-v2-v2 - mobilenet-v2    24.04     24.63     27.05
ncnn: CPU-v3-v3 - mobilenet-v3    21.60     23.51     22.35
ncnn: CPU - shufflenet-v2         29.16     26.61     27.07
ncnn: CPU - mnasnet               22.37     22.07     26.29
ncnn: CPU - efficientnet-b0       33.19     34.65     31.80
ncnn: CPU - blazeface             12.28     14.47     11.38
ncnn: CPU - googlenet             59.41     53.82     53.24
ncnn: CPU - vgg16                 94.47     98.99     92.70
ncnn: CPU - resnet18              45.42     45.65     40.39
ncnn: CPU - alexnet               41.36     40.94     42.47
ncnn: CPU - resnet50              71.34     77.61     65.41
ncnn: CPU - yolov4-tiny           63.28     55.48     60.06
ncnn: CPU - squeezenet_ssd        60.88     56.80     58.65
ncnn: CPU - regnety_400m          103.41    96.94     100.75
ncnn: CPU - vision_transformer    353.68    352.36    350.58
ncnn: CPU - FastestDet            32.28     32.75     30.45
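The overview above lists raw latencies only. A minimal Python sketch of one way to read it is shown here: expressing B and C as a signed percentage change relative to A. The choice of A as the baseline, the three sample rows, and the helper names `percent_change` and `compare_to_a` are assumptions made for illustration and are not part of the Phoronix Test Suite. Because the unit is ms with fewer being better, a negative value means the configuration was faster than A.

```python
# Results in ms (fewer is better), copied from the overview table for three
# of the 24 tests. Tuple order is (A, B, C).
results = {
    "mnn: MobileNetV2_224": (10.048, 8.595, 9.898),
    "ncnn: CPU - blazeface": (12.28, 14.47, 11.38),
    "ncnn: CPU - vision_transformer": (353.68, 352.36, 350.58),
}


def percent_change(baseline, value):
    """Signed change of value relative to baseline, in percent."""
    return (value - baseline) / baseline * 100.0


def compare_to_a(rows):
    """Map each test name to (B vs A, C vs A) percentage changes."""
    changes = {}
    for test, (a, b, c) in rows.items():
        changes[test] = (percent_change(a, b), percent_change(a, c))
    return changes


if __name__ == "__main__":
    for test, (b_pct, c_pct) in compare_to_a(results).items():
        print(f"{test}: B {b_pct:+.2f}%  C {c_pct:+.2f}%")
```

Percentages on their own say nothing about run-to-run noise, so they are best read together with the SE values reported for each test.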
Mobile Neural Network 2.0 (ms, Fewer Is Better)

Model: mobilenetV3
  A: 5.281 (SE +/- 0.074, N = 5; MIN: 5 / MAX: 10.66)
  B: 5.464 (SE +/- 0.230, N = 9; MIN: 4.57 / MAX: 11.22)
  C: 6.070 (SE +/- 0.285, N = 9; MIN: 4.96 / MAX: 9.8)

Model: squeezenetv1.1
  A: 9.157 (SE +/- 0.160, N = 5; MIN: 8.54 / MAX: 14.69)
  B: 9.009 (SE +/- 0.283, N = 9; MIN: 7.68 / MAX: 15.21)
  C: 9.516 (SE +/- 0.364, N = 9; MIN: 8.27 / MAX: 17.73)

Model: resnet-v2-50
  A: 52.38 (SE +/- 1.12, N = 5; MIN: 47.96 / MAX: 141.5)
  B: 54.15 (SE +/- 2.76, N = 9; MIN: 47.92 / MAX: 164.33)
  C: 57.83 (SE +/- 2.46, N = 9; MIN: 50.22 / MAX: 132.71)

Model: SqueezeNetV1.0
  A: 14.09 (SE +/- 0.32, N = 5; MIN: 12.99 / MAX: 16.82)
  B: 13.98 (SE +/- 0.36, N = 9; MIN: 12.79 / MAX: 21.34)
  C: 14.49 (SE +/- 0.35, N = 9; MIN: 12.97 / MAX: 21.04)

Model: MobileNetV2_224
  A: 10.048 (SE +/- 0.209, N = 5; MIN: 8.59 / MAX: 32.81)
  B: 8.595 (SE +/- 0.066, N = 9; MIN: 7.48 / MAX: 39.13)
  C: 9.898 (SE +/- 0.196, N = 9; MIN: 8.56 / MAX: 34.58)

Model: mobilenet-v1-1.0
  A: 8.341 (SE +/- 0.583, N = 5; MIN: 7.04 / MAX: 16.49)
  B: 8.119 (SE +/- 0.478, N = 9; MIN: 7.14 / MAX: 17.24)
  C: 8.783 (SE +/- 0.610, N = 9; MIN: 6.3 / MAX: 16.9)

Model: inception-v3
  A: 59.59 (SE +/- 2.14, N = 5; MIN: 54.7 / MAX: 74.6)
  B: 57.85 (SE +/- 3.36, N = 9; MIN: 51.06 / MAX: 261.45)
  C: 59.96 (SE +/- 2.98, N = 9; MIN: 51.78 / MAX: 271.46)

1. (CXX) g++ options (all Mobile Neural Network tests): -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NCNN 20220729 (ms, Fewer Is Better)

Target: CPU - Model: mobilenet
  A: 45.00 (SE +/- 2.07, N = 9; MIN: 35.78 / MAX: 540.85)
  B: 41.52 (SE +/- 1.16, N = 9; MIN: 35.31 / MAX: 563.93)
  C: 45.88 (SE +/- 3.76, N = 9; MIN: 35.78 / MAX: 543.71)

Target: CPU-v2-v2 - Model: mobilenet-v2
  A: 24.04 (SE +/- 1.12, N = 9; MIN: 19.32 / MAX: 466.34)
  B: 24.63 (SE +/- 2.10, N = 9; MIN: 18.22 / MAX: 475.35)
  C: 27.05 (SE +/- 2.11, N = 9; MIN: 18.97 / MAX: 481.63)

Target: CPU-v3-v3 - Model: mobilenet-v3
  A: 21.60 (SE +/- 1.04, N = 9; MIN: 18.42 / MAX: 485.33)
  B: 23.51 (SE +/- 1.97, N = 9; MIN: 17.83 / MAX: 487.74)
  C: 22.35 (SE +/- 1.30, N = 9; MIN: 18.39 / MAX: 509.43)

Target: CPU - Model: shufflenet-v2
  A: 29.16 (SE +/- 4.25, N = 9; MIN: 21.67 / MAX: 560.88)
  B: 26.61 (SE +/- 1.64, N = 9; MIN: 21.24 / MAX: 559.88)
  C: 27.07 (SE +/- 1.75, N = 9; MIN: 21.66 / MAX: 552.29)

Target: CPU - Model: mnasnet
  A: 22.37 (SE +/- 1.65, N = 9; MIN: 17.86 / MAX: 463.92)
  B: 22.07 (SE +/- 1.74, N = 9; MIN: 17 / MAX: 472.21)
  C: 26.29 (SE +/- 2.10, N = 9; MIN: 17.4 / MAX: 477.82)

Target: CPU - Model: efficientnet-b0
  A: 33.19 (SE +/- 2.26, N = 9; MIN: 24.9 / MAX: 696.74)
  B: 34.65 (SE +/- 2.89, N = 9; MIN: 24.08 / MAX: 682.94)
  C: 31.80 (SE +/- 1.79, N = 9; MIN: 24.43 / MAX: 693.07)

Target: CPU - Model: blazeface
  A: 12.28 (SE +/- 0.71, N = 9; MIN: 9.88 / MAX: 292.47)
  B: 14.47 (SE +/- 2.43, N = 9; MIN: 9.52 / MAX: 295.9)
  C: 11.38 (SE +/- 0.13, N = 9; MIN: 9.98 / MAX: 289.17)

Target: CPU - Model: googlenet
  A: 59.41 (SE +/- 4.24, N = 9; MIN: 40.06 / MAX: 739.56)
  B: 53.82 (SE +/- 4.18, N = 9; MIN: 39.13 / MAX: 737.69)
  C: 53.24 (SE +/- 3.42, N = 9; MIN: 40.33 / MAX: 740.64)

Target: CPU - Model: vgg16
  A: 94.47 (SE +/- 4.80, N = 9; MIN: 60.89 / MAX: 237.15)
  B: 98.99 (SE +/- 7.53, N = 9; MIN: 56.24 / MAX: 223.87)
  C: 92.70 (SE +/- 6.48, N = 9; MIN: 61.84 / MAX: 235.51)

Target: CPU - Model: resnet18
  A: 45.42 (SE +/- 4.60, N = 9; MIN: 28.71 / MAX: 297.65)
  B: 45.65 (SE +/- 5.31, N = 9; MIN: 25.28 / MAX: 296.12)
  C: 40.39 (SE +/- 3.06, N = 9; MIN: 28.07 / MAX: 300.7)

Target: CPU - Model: alexnet
  A: 41.36 (SE +/- 4.07, N = 9; MIN: 22.74 / MAX: 146.56)
  B: 40.94 (SE +/- 5.19, N = 9; MIN: 18.3 / MAX: 145.6)
  C: 42.47 (SE +/- 6.22, N = 9; MIN: 20.35 / MAX: 145.06)

Target: CPU - Model: resnet50
  A: 71.34 (SE +/- 4.54, N = 9; MIN: 49.22 / MAX: 673.07)
  B: 77.61 (SE +/- 4.17, N = 9; MIN: 43.89 / MAX: 663.09)
  C: 65.41 (SE +/- 4.24, N = 9; MIN: 47.44 / MAX: 680.35)

Target: CPU - Model: yolov4-tiny
  A: 63.28 (SE +/- 3.13, N = 9; MIN: 47.49 / MAX: 295.76)
  B: 55.48 (SE +/- 1.09, N = 9; MIN: 47.61 / MAX: 284.14)
  C: 60.06 (SE +/- 2.83, N = 9; MIN: 47.24 / MAX: 293.12)

Target: CPU - Model: squeezenet_ssd
  A: 60.88 (SE +/- 3.79, N = 9; MIN: 42.5 / MAX: 643.01)
  B: 56.80 (SE +/- 3.24, N = 9; MIN: 41.76 / MAX: 641.24)
  C: 58.65 (SE +/- 2.64, N = 8; MIN: 42.73 / MAX: 650.92)

Target: CPU - Model: regnety_400m
  A: 103.41 (SE +/- 6.20, N = 9; MIN: 81.96 / MAX: 3182)
  B: 96.94 (SE +/- 5.95, N = 9; MIN: 81.39 / MAX: 3202.54)
  C: 100.75 (SE +/- 4.65, N = 9; MIN: 82.33 / MAX: 2992.23)

Target: CPU - Model: vision_transformer
  A: 353.68 (SE +/- 2.70, N = 9; MIN: 271.26 / MAX: 973.13)
  B: 352.36 (SE +/- 2.37, N = 9; MIN: 270.26 / MAX: 969.01)
  C: 350.58 (SE +/- 1.57, N = 9; MIN: 285.44 / MAX: 1166.88)

Target: CPU - Model: FastestDet
  A: 32.28 (SE +/- 1.74, N = 9; MIN: 26.06 / MAX: 617.85)
  B: 32.75 (SE +/- 2.48, N = 9; MIN: 25.68 / MAX: 619.82)
  C: 30.45 (SE +/- 1.25, N = 9; MIN: 25.8 / MAX: 616.56)

1. (CXX) g++ options (all NCNN tests): -O3 -rdynamic -lgomp -lpthread -pthread
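Each result above is reported with a standard error (SE), which gives a rough way to judge whether two configurations really differ. The Python sketch below divides the gap between two means by their combined SE. It assumes the runs are independent, and the cut-off of 2 combined standard errors is an illustrative choice rather than a Phoronix Test Suite setting. The helper name `difference_in_se` is hypothetical.

```python
import math


def difference_in_se(mean_x, se_x, mean_y, se_y):
    """Absolute difference of two means, in units of their combined SE."""
    combined_se = math.sqrt(se_x ** 2 + se_y ** 2)
    return abs(mean_x - mean_y) / combined_se


# Assumption: a rough cut-off for "larger than run-to-run noise".
THRESHOLD = 2.0

# Mobile Neural Network, MobileNetV2_224: A (10.048 +/- 0.209) vs B (8.595 +/- 0.066)
mnn_ratio = difference_in_se(10.048, 0.209, 8.595, 0.066)

# NCNN, vision_transformer: A (353.68 +/- 2.70) vs C (350.58 +/- 1.57)
vit_ratio = difference_in_se(353.68, 2.70, 350.58, 1.57)

if __name__ == "__main__":
    for name, ratio in (("MobileNetV2_224 A vs B", mnn_ratio),
                        ("vision_transformer A vs C", vit_ratio)):
        verdict = "exceeds" if ratio > THRESHOLD else "is within"
        print(f"{name}: {ratio:.2f} combined SE, {verdict} the cut-off")
```

Under these assumptions the MobileNetV2_224 gap between A and B is well beyond the cut-off, while the vision_transformer gap between A and C is not.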
Phoronix Test Suite v10.8.4