2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2101191-HA-XEONGOLD534 Xeon Gold 5220R 2P 2021 - Phoronix Test Suite Xeon Gold 5220R 2P 2021 2 x Intel Xeon Gold 5220R testing with a TYAN S7106 (V2.01.B40 BIOS) and llvmpipe on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2101191-HA-XEONGOLD534&gru&rdt&rro .
Xeon Gold 5220R 2P 2021 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Display Driver OpenGL Compiler File-System Screen Resolution 1 2 3 2 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads) TYAN S7106 (V2.01.B40 BIOS) Intel Sky Lake-E DMI3 Registers 94GB 500GB Samsung SSD 860 llvmpipe VE228 2 x Intel I210 + 2 x QLogic cLOM8214 1/10GbE Ubuntu 20.04 5.9.0-050900rc6-generic (x86_64) 20200920 GNOME Shell 3.36.4 X Server 1.20.8 modesetting 1.20.8 3.3 Mesa 20.0.4 (LLVM 9.0.1 256 bits) GCC 9.3.0 ext4 1920x1080 OpenBenchmarking.org Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5003003 Python Details - Python 2.7.18rc1 + Python 3.8.5 Security Details - itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Xeon Gold 5220R 2P 2021 amg: dav1d: Chimera 1080p dav1d: Summer Nature 4K dav1d: Summer Nature 1080p dav1d: Chimera 1080p 10-bit rav1e: 1 rav1e: 5 rav1e: 6 rav1e: 10 onnx: yolov4 - OpenMP CPU onnx: bertsquad-10 - OpenMP CPU onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU lammps: 20k Atoms lammps: Rhodopsin Protein kripke: synthmark: VoiceMark_100 lulesh: mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v1.1 cloverleaf: Lagrangian-Eulerian Hydrodynamics openfoam: Motorbike 30M openfoam: Motorbike 60M qe: AUSURF112 relion: Basic - CPU build-godot: Time To Compile 1 2 3 1108502667 338.35 186.73 339.62 70.83 0.361 0.964 1.223 2.573 346 396 111 6334 5319 18.033 16.728 76581811 535.222 13911.540 8.567 29.878 5.244 2.839 36.881 397.487 349.673 25.12 30.50 238.33 1645.38 652.598 82.401 1165849333 338.91 186.14 344.01 70.84 0.364 0.961 1.236 2.573 342 405 111 6582 5327 18.091 16.776 76420567 536.681 15312.502 8.493 31.064 4.764 2.901 37.414 386.480 349.815 24.96 30.62 234.22 1740.65 642.292 82.919 1174487333 338.75 187.89 343.24 71.34 0.363 0.955 1.239 2.548 346 398 109 6596 5289 18.063 16.827 75540700 534.631 15372.560 8.464 30.882 4.952 2.908 37.013 388.704 349.705 24.97 30.57 237.59 1764.97 654.522 82.656 OpenBenchmarking.org
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 3 2 1 300M 600M 900M 1200M 1500M SE +/- 7168295.30, N = 3 SE +/- 4505964.10, N = 3 SE +/- 10611072.21, N = 3 1174487333 1165849333 1108502667 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
dav1d Video Input: Chimera 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 3 2 1 70 140 210 280 350 SE +/- 0.29, N = 3 SE +/- 1.22, N = 3 SE +/- 1.04, N = 3 338.75 338.91 338.35 MIN: 238.25 / MAX: 436.84 MIN: 237.5 / MAX: 439.58 MIN: 226.18 / MAX: 437.6 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 4K 3 2 1 40 80 120 160 200 SE +/- 0.46, N = 3 SE +/- 1.69, N = 3 SE +/- 1.36, N = 3 187.89 186.14 186.73 MIN: 124.73 / MAX: 202.3 MIN: 107.72 / MAX: 201.56 MIN: 112.08 / MAX: 202.84 1. (CC) gcc options: -pthread
dav1d Video Input: Summer Nature 1080p OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Summer Nature 1080p 3 2 1 70 140 210 280 350 SE +/- 0.92, N = 3 SE +/- 1.08, N = 3 SE +/- 2.79, N = 3 343.24 344.01 339.62 MIN: 204.44 / MAX: 379.88 MIN: 204.26 / MAX: 378.62 MIN: 166.21 / MAX: 377.75 1. (CC) gcc options: -pthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.8.1 Video Input: Chimera 1080p 10-bit 3 2 1 16 32 48 64 80 SE +/- 0.13, N = 3 SE +/- 0.13, N = 3 SE +/- 0.16, N = 3 71.34 70.84 70.83 MIN: 54.55 / MAX: 109.86 MIN: 54.3 / MAX: 106.84 MIN: 54.39 / MAX: 106.93 1. (CC) gcc options: -pthread
rav1e Speed: 1 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 1 3 2 1 0.0819 0.1638 0.2457 0.3276 0.4095 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 0.363 0.364 0.361
rav1e Speed: 5 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 5 3 2 1 0.2169 0.4338 0.6507 0.8676 1.0845 SE +/- 0.006, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 0.955 0.961 0.964
rav1e Speed: 6 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 6 3 2 1 0.2788 0.5576 0.8364 1.1152 1.394 SE +/- 0.004, N = 3 SE +/- 0.004, N = 3 SE +/- 0.010, N = 3 1.239 1.236 1.223
rav1e Speed: 10 OpenBenchmarking.org Frames Per Second, More Is Better rav1e 0.4 Speed: 10 3 2 1 0.5789 1.1578 1.7367 2.3156 2.8945 SE +/- 0.020, N = 3 SE +/- 0.007, N = 3 SE +/- 0.002, N = 3 2.548 2.573 2.573
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU 3 2 1 80 160 240 320 400 SE +/- 1.15, N = 3 SE +/- 5.07, N = 3 SE +/- 1.59, N = 3 346 342 346 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU 3 2 1 90 180 270 360 450 SE +/- 5.16, N = 12 SE +/- 8.06, N = 12 SE +/- 6.36, N = 3 398 405 396 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU 3 2 1 20 40 60 80 100 SE +/- 1.09, N = 3 SE +/- 0.58, N = 3 SE +/- 0.29, N = 3 109 111 111 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU 3 2 1 1400 2800 4200 5600 7000 SE +/- 16.10, N = 3 SE +/- 9.53, N = 3 SE +/- 158.17, N = 12 6596 6582 6334 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 3 2 1 1100 2200 3300 4400 5500 SE +/- 27.57, N = 3 SE +/- 9.44, N = 3 SE +/- 12.57, N = 3 5289 5327 5319 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 3 2 1 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 18.06 18.09 18.03 1. (CXX) g++ options: -O3 -pthread -lm
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 3 2 1 4 8 12 16 20 SE +/- 0.22, N = 15 SE +/- 0.20, N = 15 SE +/- 0.08, N = 3 16.83 16.78 16.73 1. (CXX) g++ options: -O3 -pthread -lm
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 3 2 1 16M 32M 48M 64M 80M SE +/- 1808221.02, N = 12 SE +/- 1685258.80, N = 15 SE +/- 1820096.95, N = 12 75540700 76420567 76581811 1. (CXX) g++ options: -O3 -fopenmp
Google SynthMark Test: VoiceMark_100 OpenBenchmarking.org Voices, More Is Better Google SynthMark 20201109 Test: VoiceMark_100 3 2 1 120 240 360 480 600 SE +/- 0.43, N = 3 SE +/- 0.60, N = 3 SE +/- 0.90, N = 3 534.63 536.68 535.22 1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 3 2 1 3K 6K 9K 12K 15K SE +/- 131.93, N = 15 SE +/- 138.21, N = 15 SE +/- 150.18, N = 3 15372.56 15312.50 13911.54 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: SqueezeNetV1.0 3 2 1 2 4 6 8 10 SE +/- 0.097, N = 15 SE +/- 0.144, N = 3 SE +/- 0.111, N = 3 8.464 8.493 8.567 MIN: 7.25 / MAX: 15.53 MIN: 7.73 / MAX: 9.13 MIN: 7.54 / MAX: 9.36 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: resnet-v2-50 3 2 1 7 14 21 28 35 SE +/- 0.26, N = 15 SE +/- 0.41, N = 3 SE +/- 0.16, N = 3 30.88 31.06 29.88 MIN: 29.09 / MAX: 131.06 MIN: 29.7 / MAX: 81.8 MIN: 29.23 / MAX: 117.71 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: MobileNetV2_224 3 2 1 1.1799 2.3598 3.5397 4.7196 5.8995 SE +/- 0.064, N = 15 SE +/- 0.044, N = 3 SE +/- 0.081, N = 3 4.952 4.764 5.244 MIN: 3.83 / MAX: 25.34 MIN: 4.01 / MAX: 13.65 MIN: 4.52 / MAX: 14.09 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: mobilenet-v1-1.0 3 2 1 0.6543 1.3086 1.9629 2.6172 3.2715 SE +/- 0.053, N = 15 SE +/- 0.082, N = 3 SE +/- 0.005, N = 3 2.908 2.901 2.839 MIN: 2.55 / MAX: 8.41 MIN: 2.64 / MAX: 3.23 MIN: 2.59 / MAX: 3.94 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.1.1 Model: inception-v3 3 2 1 9 18 27 36 45 SE +/- 0.14, N = 15 SE +/- 0.24, N = 3 SE +/- 0.12, N = 3 37.01 37.41 36.88 MIN: 35.64 / MAX: 118.76 MIN: 36.55 / MAX: 86.82 MIN: 35.86 / MAX: 106.43 1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: MobileNet v2 3 2 1 90 180 270 360 450 SE +/- 2.72, N = 3 SE +/- 0.65, N = 3 SE +/- 5.67, N = 3 388.70 386.48 397.49 MIN: 383.05 / MAX: 540.22 MIN: 383.42 / MAX: 469.27 MIN: 383.2 / MAX: 556.49 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.2.3 Target: CPU - Model: SqueezeNet v1.1 3 2 1 80 160 240 320 400 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.07, N = 3 349.71 349.82 349.67 MIN: 348.99 / MAX: 358.54 MIN: 349.05 / MAX: 360.06 MIN: 349.01 / MAX: 351.27 1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 3 2 1 6 12 18 24 30 SE +/- 0.35, N = 3 SE +/- 0.06, N = 3 SE +/- 0.34, N = 3 24.97 24.96 25.12 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 3 2 1 7 14 21 28 35 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 30.57 30.62 30.50 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M 3 2 1 50 100 150 200 250 SE +/- 0.16, N = 3 SE +/- 1.31, N = 3 SE +/- 0.25, N = 3 237.59 234.22 238.33 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 3 2 1 400 800 1200 1600 2000 SE +/- 29.14, N = 3 SE +/- 33.06, N = 9 SE +/- 47.91, N = 7 1764.97 1740.65 1645.38 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU 3 2 1 140 280 420 560 700 SE +/- 3.01, N = 3 SE +/- 3.96, N = 3 SE +/- 7.75, N = 9 654.52 642.29 652.60 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 3 2 1 20 40 60 80 100 SE +/- 0.32, N = 3 SE +/- 0.95, N = 3 SE +/- 0.55, N = 3 82.66 82.92 82.40
Phoronix Test Suite v10.8.4