8380 2P August 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108108-IB-83802PAUG13&rdt&gru .
8380 2P August Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution 1 2 2a 3 4 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 3841GB Micron_9300_MTFDHAL3T8TDP + 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 20.04 5.14.0-rc1-folio (x86_64) 20210715 GNOME Shell 3.36.4 X Server 1.20.9 GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270 Java Details - OpenJDK Runtime Environment (build 11.0.10+9-Ubuntu-0ubuntu1.20.04) Python Details - Python 2.7.18 + Python 3.8.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
8380 2P August dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit cassandra: Reads cassandra: Writes cassandra: Mixed 1:1 cassandra: Mixed 1:3 ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m qe: AUSURF112 yafaray: Total Time For Sample Scene ecp-candle: P1B2 ecp-candle: P3B1 1 2 2a 3 4 356.36 203.86 382.15 266.34 189245 138008 153944 177132 19.81 11.22 11.9 11.24 10.92 14.17 6.7 22.7 28.14 13.96 10.46 26.59 24.83 23.57 36.95 560.09 70.086 47.259 1214.331 353.82 201.44 387.27 264.56 20.72 11.29 11.3 11.41 11.05 13.96 6.68 23.92 29.33 15.37 10.22 26.07 26.51 23.77 38.66 560.54 88.255 352.60 203.27 392.58 266.46 176738 95548 174685 157216 20.15 11.11 10.82 11.29 10.58 13.22 6.64 21.80 29.96 14.03 9.69 25.27 24.77 23.47 37.02 560.30 73.184 41.763 1192.239 354.92 203.59 384.02 263.66 179346 101816 145550 171347 20.56 11.29 11.08 12.17 10.81 13.98 6.76 22.84 29.52 14.63 10.73 25.51 25.25 24.62 38.3 567.44 78.424 42.581 1201.416 354.67 199.48 386.51 264.38 191414 100063 163400 160361 20.02 10.94 10.71 11.01 10.45 13.41 6.54 22.17 28.19 13.68 10.22 24.68 24.54 23.21 36.3 560.33 91.043 42.55 OpenBenchmarking.org
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Chimera 1080p 1 2 2a 3 4 80 160 240 320 400 SE +/- 1.12, N = 3 356.36 353.82 352.60 354.92 354.67 MIN: 167.09 / MAX: 462.27 MIN: 161.26 / MAX: 452.29 MIN: 157.05 / MAX: 459.21 MIN: 162.67 / MAX: 462.54 MIN: 165.08 / MAX: 462.64 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Summer Nature 4K 1 2 2a 3 4 40 80 120 160 200 SE +/- 0.88, N = 3 203.86 201.44 203.27 203.59 199.48 MIN: 52.43 / MAX: 238.5 MIN: 50.53 / MAX: 236.45 MIN: 50.77 / MAX: 237.86 MIN: 52.05 / MAX: 238.93 MIN: 48.3 / MAX: 234.85 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Summer Nature 1080p 1 2 2a 3 4 90 180 270 360 450 SE +/- 2.83, N = 3 382.15 387.27 392.58 384.02 386.51 MIN: 86.09 / MAX: 461.12 MIN: 90.2 / MAX: 464.45 MIN: 91.07 / MAX: 466.17 MIN: 87.73 / MAX: 461.49 MIN: 90.67 / MAX: 463.81 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.1 Video Input: Chimera 1080p 10-bit 1 2 2a 3 4 60 120 180 240 300 SE +/- 1.52, N = 3 266.34 264.56 266.46 263.66 264.38 MIN: 132.54 / MAX: 328.22 MIN: 128.93 / MAX: 323.49 MIN: 127.15 / MAX: 334.54 MIN: 124.19 / MAX: 320.63 MIN: 124.79 / MAX: 322.35 1. (CC) gcc options: -pthread
Apache Cassandra Test: Reads OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Reads 1 2a 3 4 40K 80K 120K 160K 200K 189245 176738 179346 191414
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Writes 1 2a 3 4 30K 60K 90K 120K 150K 138008 95548 101816 100063
Apache Cassandra Test: Mixed 1:1 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:1 1 2a 3 4 40K 80K 120K 160K 200K 153944 174685 145550 163400
Apache Cassandra Test: Mixed 1:3 OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 4.0 Test: Mixed 1:3 1 2a 3 4 40K 80K 120K 160K 200K 177132 157216 171347 160361
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet 1 2 2a 3 4 5 10 15 20 25 SE +/- 0.29, N = 3 19.81 20.72 20.15 20.56 20.02 MIN: 19.28 / MAX: 20.68 MIN: 20.13 / MAX: 41.09 MIN: 19.15 / MAX: 42.96 MIN: 19.81 / MAX: 23.54 MIN: 19.65 / MAX: 20.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 1 2 2a 3 4 3 6 9 12 15 SE +/- 0.10, N = 3 11.22 11.29 11.11 11.29 10.94 MIN: 10.91 / MAX: 28.87 MIN: 11.05 / MAX: 12.14 MIN: 10.74 / MAX: 50.75 MIN: 10.95 / MAX: 31.3 MIN: 10.57 / MAX: 14.72 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 1 2 2a 3 4 3 6 9 12 15 SE +/- 0.01, N = 3 11.90 11.30 10.82 11.08 10.71 MIN: 10.7 / MAX: 232.13 MIN: 10.81 / MAX: 31.95 MIN: 10.51 / MAX: 18.14 MIN: 10.81 / MAX: 17.45 MIN: 10.41 / MAX: 30.63 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 1 2 2a 3 4 3 6 9 12 15 SE +/- 0.03, N = 3 11.24 11.41 11.29 12.17 11.01 MIN: 11 / MAX: 12.02 MIN: 11.18 / MAX: 12.02 MIN: 11.03 / MAX: 12.25 MIN: 11.15 / MAX: 209.8 MIN: 10.79 / MAX: 11.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet 1 2 2a 3 4 3 6 9 12 15 SE +/- 0.09, N = 3 10.92 11.05 10.58 10.81 10.45 MIN: 10.53 / MAX: 26.72 MIN: 10.76 / MAX: 30.77 MIN: 10.22 / MAX: 30.76 MIN: 10.55 / MAX: 21.52 MIN: 10.12 / MAX: 30.53 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 1 2 2a 3 4 4 8 12 16 20 SE +/- 0.07, N = 3 14.17 13.96 13.22 13.98 13.41 MIN: 13.75 / MAX: 15.44 MIN: 13.45 / MAX: 14.94 MIN: 12.83 / MAX: 35.44 MIN: 13.34 / MAX: 34.5 MIN: 12.99 / MAX: 15.94 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface 1 2 2a 3 4 2 4 6 8 10 SE +/- 0.09, N = 3 6.70 6.68 6.64 6.76 6.54 MIN: 6.45 / MAX: 25.59 MIN: 6.52 / MAX: 8.02 MIN: 6.39 / MAX: 29.27 MIN: 6.48 / MAX: 18.63 MIN: 6.34 / MAX: 8.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet 1 2 2a 3 4 6 12 18 24 30 SE +/- 0.83, N = 3 22.70 23.92 21.80 22.84 22.17 MIN: 21.89 / MAX: 34.16 MIN: 22.88 / MAX: 96.24 MIN: 20 / MAX: 55.53 MIN: 21.84 / MAX: 43.49 MIN: 21.57 / MAX: 23.42 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 1 2 2a 3 4 7 14 21 28 35 SE +/- 1.81, N = 3 28.14 29.33 29.96 29.52 28.19 MIN: 27.61 / MAX: 47.7 MIN: 28.75 / MAX: 42.35 MIN: 27.33 / MAX: 54.01 MIN: 28.95 / MAX: 30.57 MIN: 27.48 / MAX: 48.48 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 1 2 2a 3 4 4 8 12 16 20 SE +/- 0.52, N = 3 13.96 15.37 14.03 14.63 13.68 MIN: 13.41 / MAX: 34.68 MIN: 14.63 / MAX: 16.73 MIN: 12.99 / MAX: 48.17 MIN: 14.04 / MAX: 15.62 MIN: 13.11 / MAX: 36.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet 1 2 2a 3 4 3 6 9 12 15 SE +/- 0.44, N = 3 10.46 10.22 9.69 10.73 10.22 MIN: 10.14 / MAX: 11.45 MIN: 9.87 / MAX: 30.4 MIN: 8.44 / MAX: 30.13 MIN: 10.33 / MAX: 30.83 MIN: 9.95 / MAX: 11.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 1 2 2a 3 4 6 12 18 24 30 SE +/- 0.44, N = 3 26.59 26.07 25.27 25.51 24.68 MIN: 24.41 / MAX: 306.36 MIN: 25.33 / MAX: 47.05 MIN: 23.96 / MAX: 45.64 MIN: 24.79 / MAX: 50.63 MIN: 23.97 / MAX: 45.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny 1 2 2a 3 4 6 12 18 24 30 SE +/- 0.20, N = 3 24.83 26.51 24.77 25.25 24.54 MIN: 23.75 / MAX: 39.21 MIN: 25.6 / MAX: 45.59 MIN: 23.49 / MAX: 42.86 MIN: 24.06 / MAX: 42.3 MIN: 23.82 / MAX: 52.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd 1 2 2a 3 4 6 12 18 24 30 SE +/- 0.28, N = 3 23.57 23.77 23.47 24.62 23.21 MIN: 22.79 / MAX: 37.78 MIN: 23.14 / MAX: 45.98 MIN: 22.62 / MAX: 45.32 MIN: 23.67 / MAX: 51.85 MIN: 22.77 / MAX: 26.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m 1 2 2a 3 4 9 18 27 36 45 SE +/- 0.20, N = 3 36.95 38.66 37.02 38.30 36.30 MIN: 36.59 / MAX: 46.2 MIN: 37.27 / MAX: 172.89 MIN: 36.03 / MAX: 140.65 MIN: 37.23 / MAX: 79.12 MIN: 35.72 / MAX: 56.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.8 Input: AUSURF112 1 2 2a 3 4 120 240 360 480 600 SE +/- 0.49, N = 3 560.09 560.54 560.30 567.44 560.33 1. (F9X) gfortran options: -ldevXlib -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.5.1 Total Time For Sample Scene 1 2 2a 3 4 20 40 60 80 100 SE +/- 0.86, N = 3 70.09 88.26 73.18 78.42 91.04 1. (CXX) g++ options: -std=c++11 -pthread -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P1B2 1 2a 3 4 11 22 33 44 55 47.26 41.76 42.58 42.55
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B1 1 2a 3 300 600 900 1200 1500 1214.33 1192.24 1201.42
Phoronix Test Suite v10.8.4