lczero onnx Ice Lake 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2108264-TJ-LCZEROONN24&grw&sro .
lczero onnx Ice Lake Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution 1 2 3 4 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 21.04 5.14.0-rc1-folio (x86_64) 20210715 GNOME Shell 3.38.4 X Server 1.20.11 GCC 10.3.0 ext4 1920x1080 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd0002a0 Python Details - Python 3.9.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
lczero onnx Ice Lake synthmark: VoiceMark_100 lczero: BLAS lczero: Eigen onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU 1 2 3 4 552.043 882 4120 531 721 465 11748 6771 551.034 835 4260 528 697 463 11396 6359 550.938 910 3954 538 718 461 11597 7050 553.201 873 4148 528 735 460 11638 OpenBenchmarking.org
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 1 2 3 4 120 240 360 480 600 SE +/- 2.04, N = 3 SE +/- 2.57, N = 3 SE +/- 2.48, N = 3 552.04 551.03 550.94 553.20 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: BLAS 1 2 3 4 200 400 600 800 1000 SE +/- 31.49, N = 6 SE +/- 19.54, N = 6 SE +/- 7.79, N = 9 882 835 910 873 1. (CXX) g++ options: -flto -pthread
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: Eigen 1 2 3 4 900 1800 2700 3600 4500 SE +/- 3.93, N = 3 SE +/- 46.87, N = 3 SE +/- 40.81, N = 3 4120 4260 3954 4148 1. (CXX) g++ options: -flto -pthread
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: yolov4 - Device: OpenMP CPU 1 2 3 4 120 240 360 480 600 SE +/- 6.09, N = 3 SE +/- 4.33, N = 3 SE +/- 2.75, N = 3 531 528 538 528 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: bertsquad-10 - Device: OpenMP CPU 1 2 3 4 160 320 480 640 800 SE +/- 10.36, N = 12 SE +/- 19.35, N = 12 SE +/- 16.15, N = 12 721 697 718 735 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: fcn-resnet101-11 - Device: OpenMP CPU 1 2 3 4 100 200 300 400 500 SE +/- 3.47, N = 3 SE +/- 1.42, N = 3 SE +/- 1.17, N = 3 465 463 461 460 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: shufflenet-v2-10 - Device: OpenMP CPU 1 2 3 4 3K 6K 9K 12K 15K SE +/- 68.29, N = 3 SE +/- 51.60, N = 3 SE +/- 73.24, N = 3 11748 11396 11597 11638 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.8.2 Model: super-resolution-10 - Device: OpenMP CPU 1 2 3 1500 3000 4500 6000 7500 SE +/- 233.89, N = 12 SE +/- 203.29, N = 12 6771 6359 7050 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Phoronix Test Suite v10.8.5