single Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106046-IB-SINGLE80720&sor&grs .
single Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution 2 x Xeon Platinum 8380 Xeon Platinum 8380 Xeon Platinum 8380 rest Intel Xeon Platinum 8380 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 7682GB INTEL SSDPF2KX076TZ llvmpipe VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 21.04 5.13.0-051300rc4-generic (x86_64) 20210530 GNOME Shell 3.38.4 X Server 4.5 Mesa 21.0.1 (LLVM 11.0.1 256 bits) GCC 10.3.0 ext4 1920x1080 Intel Xeon Platinum 8380 @ 3.40GHz (40 Cores / 80 Threads) 252GB ASPEED OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270 Python Details - 2 x Xeon Platinum 8380, Xeon Platinum 8380, Intel Xeon Platinum 8380: Python 3.9.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
single brl-cad: VGR Performance Metric cpuminer-opt: Magi svt-av1: Preset 8 - Bosphorus 4K cpuminer-opt: Garlicoin svt-av1: Preset 4 - Bosphorus 4K npb: SP.B npb: SP.C pennant: leblancbig incompact3d: input.i3d 193 Cells Per Direction openfoam: Motorbike 60M toybrot: TBB npb: MG.C liquid-dsp: 128 - 256 - 57 incompact3d: input.i3d 129 Cells Per Direction rocksdb: Rand Read openssl: RSA 4096-bit Performance aircrack-ng: blender: Classroom - CPU-Only coremark: CoreMark Size 666 - Iterations Per Second askap: tConvolve MPI - Degridding helsing: 14 digit john-the-ripper: MD5 john-the-ripper: Blowfish askap: tConvolve MPI - Gridding c-ray: Total Time - 4K, 16 Rays Per Pixel tachyon: Total Time astcenc: Exhaustive embree: Pathtracer - Asian Dragon relion: Basic - CPU m-queens: Time To Solve liquid-dsp: 160 - 256 - 57 primesieve: 1e12 Prime Number Generation onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU npb: EP.D asmfish: 1024 Hash Memory, 26 Depth npb: LU.C toybrot: OpenMP toybrot: C++ Threads amg: stockfish: Total Time namd: ATPase Simulation - 327,506 Atoms mt-dgemm: Sustained Floating-Point Rate incompact3d: X3D-benchmarking input.i3d lulesh: v-ray: CPU rays1bench: Large Scene blender: Barbershop - CPU-Only wrf: conus 2.5km blender: Pabellon Barcelona - CPU-Only toktx: UASTC 4 + Zstd Compression 19 embree: Pathtracer ISPC - Asian Dragon rodinia: OpenMP LavaMD liquid-dsp: 64 - 256 - 57 npb: EP.C graphics-magick: Sharpen xmrig: Wownero - 1M embree: Pathtracer - Asian Dragon Obj gromacs: MPI CPU - water_GMX50_bare xmrig: Monero - 1M blender: BMW27 - CPU-Only embree: Pathtracer - Crown embree: Pathtracer ISPC - Asian Dragon Obj tensorflow-lite: Inception ResNet V2 graphics-magick: Enhanced tensorflow-lite: Inception V4 compress-7zip: Compress Speed Test npb: FT.C tensorflow-lite: Mobilenet Float npb: CG.C askap: tConvolve MT - Degridding onednn: Deconvolution Batch shapes_3d - f32 - CPU toybrot: C++ Tasks luxcorerender: DLSC - CPU tensorflow-lite: Mobilenet Quant embree: Pathtracer ISPC - Crown tensorflow-lite: SqueezeNet onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU oidn: RTLightmap.hdr.4096x4096 npb: BT.C oidn: RT.hdr_alb_nrm.3840x2160 blender: Fishy Cat - CPU-Only povray: Trace Time ebizzy: appleseed: Material Tester nwchem: C240 Buckyball oidn: RT.ldr_alb_nrm.3840x2160 onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU pennant: sedovbig onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU openfoam: Motorbike 30M onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU build-llvm: Ninja luxcorerender: Orange Juice - CPU luxcorerender: Danish Mood - CPU cloverleaf: Lagrangian-Eulerian Hydrodynamics lammps: 20k Atoms tungsten: Hair svt-hevc: 1 - Bosphorus 1080p build-nodejs: Time To Compile lammps: Rhodopsin Protein graphics-magick: Swirl build-linux-kernel: Time To Compile qe: AUSURF112 askap: tConvolve OpenMP - Degridding basis: UASTC Level 3 onednn: IP Shapes 1D - u8s8f32 - CPU keydb: askap: tConvolve OpenMP - Gridding onednn: IP Shapes 1D - f32 - CPU kvazaar: Bosphorus 4K - Very Fast rocksdb: Read While Writing svt-vp9: VMAF Optimized - Bosphorus 1080p onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onnx: yolov4 - OpenMP CPU onednn: IP Shapes 1D - bf16bf16bf16 - CPU build-ffmpeg: Time To Compile rodinia: OpenMP CFD Solver appleseed: Disney Material build-llvm: Unix Makefiles npb: IS.D dav1d: Summer Nature 4K mysqlslap: 256 basis: UASTC Level 2 mysqlslap: 512 toktx: UASTC 3 graphics-magick: Noise-Gaussian openvkl: vklBenchmark minife: Small onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU build-imagemagick: Time To Compile build-mesa: Time To Compile build-godot: Time To Compile plaidml: No - Inference - ResNet 50 - CPU onnx: bertsquad-10 - OpenMP CPU build2: Time To Compile tensorflow-lite: NASNet Mobile graphics-magick: Rotate toktx: UASTC 3 + Zstd Compression 19 appleseed: Emily rodinia: OpenMP Streamcluster onednn: IP Shapes 3D - bf16bf16bf16 - CPU kvazaar: Bosphorus 4K - Ultra Fast plaidml: No - Inference - VGG19 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU graphics-magick: HWB Color Space dav1d: Chimera 1080p 10-bit onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU tungsten: Non-Exponential build-php: Time To Compile onednn: Recurrent Neural Network Inference - u8s8f32 - CPU x265: Bosphorus 4K plaidml: No - Inference - VGG16 - CPU svt-hevc: 7 - Bosphorus 1080p onnx: shufflenet-v2-10 - OpenMP CPU build-gdb: Time To Compile askap: tConvolve MT - Gridding onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM onednn: IP Shapes 3D - f32 - CPU onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU onnx: fcn-resnet101-11 - OpenMP CPU srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM webp2: Quality 100, Lossless Compression svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p tungsten: Water Caustic webp2: Quality 75, Compression Effort 7 ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping avifenc: 6, Lossless webp2: Quality 95, Compression Effort 7 onednn: IP Shapes 3D - u8s8f32 - CPU svt-vp9: Visual Quality Optimized - Bosphorus 1080p askap: Hogbom Clean OpenMP kripke: srsran: 4G PHY_DL_Test 100 PRB SISO 256-QAM srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM webp2: Quality 100, Compression Effort 5 astcenc: Medium openvkl: vklBenchmarkStructuredVolume srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM vpxenc: Speed 5 - Bosphorus 1080p srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM srsran: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM astcenc: Thorough srsran: 4G PHY_DL_Test 100 PRB MIMO 64-QAM build-apache: Time To Compile openvkl: vklBenchmarkVdbVolume srsran: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM build-wasmer: Time To Compile svt-hevc: 10 - Bosphorus 1080p srsran: 4G PHY_DL_Test 100 PRB MIMO 256-QAM rocksdb: Rand Fill Sync vpxenc: Speed 5 - Bosphorus 4K srsran: OFDM_Test srsran: 4G PHY_DL_Test 100 PRB SISO 64-QAM astcenc: Exhaustive vpxenc: Speed 0 - Bosphorus 1080p vpxenc: Speed 0 - Bosphorus 4K onnx: super-resolution-10 - OpenMP CPU cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Myriad-Groestl cpuminer-opt: Skeincoin cpuminer-opt: Blake-2 S cpuminer-opt: Deepcoin cpuminer-opt: x25x onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU yafaray: Total Time For Sample Scene luxcorerender: Rainbow Colors and Prism - CPU luxcorerender: LuxCore Benchmark - CPU rodinia: OpenMP Leukocyte 2 x Xeon Platinum 8380 Xeon Platinum 8380 Xeon Platinum 8380 rest Intel Xeon Platinum 8380 2411587 3651.80 57.416 41269 4.734 123538.14 92499.87 4.707610 11.0581346 103.61 6924 118831.52 3279266667 2.57966831 373831966 17835.8 211019.990 71.11 2365407.624214 18222.5 81.464 10200000 118869 20662.0 11.008 13.6825 16.6421 83.5723 350.743 11.301 3086533333 3.693 2.08924 8890.34 171121628 188790.14 7281 6992 2082103333 180945380 0.27064 28.146733 291.416199 35311.845 66328 346.31 103.87 9822.727 87.63 54.917 107.9025 39.069 3047333333 7939.02 672 42770.8 72.4517 9.063 26504.2 28.24 64.7709 89.6186 572750 1125 666967 355140 100812.54 32738.9 40188.84 12928.8 0.839556 7883 9.41 34155.0 75.4967 47974.6 0.190628 1.44 198052.38 2.98 44.45 9.257 2035986 177.779916 1846.8 2.99 3.25912 3.56396 14.41379 0.604378 14.30 0.915086 127.993 14.33 6.97 10.08 35.989 5.73331 37.49 90.570 31.561 2191 21.627 1169.15 24810.2 15.513 1.26700 541996.24 18859.8 0.914865 38.71 9170260 475.45 0.249611 482 2.99485 16.426 4.709 57.396983 190.683 3086.24 532.57 697 11.415 664 4.658 732 820 28391.4 675.163 675.115 11.962 18.712 48.068 6.21 516 57.639 74702.4 753 9.111 123.700529 7.652 1.81271 47.84 33.43 439.372 1042 861.39 439.640 2.56596 35.344 441.260 28.65 38.51 313.93 8402 39.835 4997.05 0.363977 1.36768 1.39717 0.228632 197 413.223 469.70 20.3009 117.573 1354.59 29.692 218.163 0.438511 371.88 1238.71 178965233 6.251 104922070 19.790 29143044 37.688 584.72 482191 7609 402726 84872 774163 3662348 75061 2725.03 757.062 6.96910 81.513 14.11 6.89 47.443 423324 1346.63 21.801 15618 1.859 50133.58 37645.41 11.32899 24.0432320 224.58 13486 56287.53 1565266667 5.26849863 182180961 8726.8 105477.352 142.20 1206142.696836 9239.59 161.647 5305667 60553 10580.9 21.645 26.8736 32.4903 42.6100 686.961 22.128 1579300000 7.166 4.02680 4593.84 89220384 98453.55 14057 13439 1084411667 94879448 0.51889 14.696138 554.938761 18779.645 35347 184.56 195.16 18356.463 164.01 102.030 58.2919 71.991 1654066667 4334.33 370 23539.9 39.9934 5.017 14658.2 51.05 36.0739 50.0994 1026060 641 1178463 201596 57275.08 56410.0 22890.25 7452.07 1.37155 13726 5.50 57591.0 44.1689 80455.7 0.310362 0.85 117333.47 1.78 74.51 15.515 1215587 106.17271 3085.1 1.79 5.37652 5.87433 23.84300 0.960174 23.49 1.48256 208.651 8.95 4.42 15.92 22.682 9.15446 23.70 142.509 20.841 1452 32.012 1783.19 16445.2 22.870 0.872017 771533.42 14169.2 1.21203 27.57 6676556 485.12 0.329636 659 4.03139 22.316 6.211 77.521925 256.630 2333.56 404.27 918 14.930 870 6.013 586 641 23125.0 812.260 810.358 14.369 21.794 56.580 7.30 586 67.161 81669.1 867 10.387 140.509089 6.762 2.02106 42.65 31.06 476.024 1159 775.38 476.214 2.81248 38.269 475.861 31.23 36.97 290.36 9056 42.710 5305.67 0.343258 1.42703 1.45952 0.234327 204 425.324 485.54 20.9080 120.690 1356.01 30.533 223.969 0.427987 370.64 1266.33 175890300 6.303 105023963 19.788 29092155 37.683 583.64 481504 6381 273780 75707 407502 2476303 33583 1022.32 813.025 4.51612 74.903 14.39 4.50 44.021 110.1 306.8 207.5 129.6 273.2 4.8512 98.3 25.46 95.9 132.2 7.4107 279.9 71.4 305.6 13.97 120400000 174.3 19.5959 11.80 5.67 420308 1350.2 21.37 19310 1.848 50295.8 37582.44 11.32109 23.9703636 224.44 14660 56347.39 1565800000 5.34965897 180764311 8708.9 105738.891 141.61 1190387.619969 9174.67 161.418 5163000 60339 10495.8 21.298 26.8592 32.6752 44.0619 687.41 22.113 1582500000 7.099 4.0518 4588.57 88559064 97768.8 14058 13486 1082117000 94073662 0.51829 14.690897 556.783875 18688.073 35107 183.45 195.1 18393.741 163.77 102.132 58.6048 71.881 1662300000 4493.03 368 23489.6 39.7975 5.011 14707.2 50.74 36.5306 50.0223 1024320 629 1180890 200732 57129.81 57520.4 22960.66 7408.86 1.4641 13739 5.49 58467.7 44.1713 81701.5 0.323917 0.85 116983.5 1.77 74.49 15.436 1250548 107.456913 2993.9 1.79 5.44095 5.91004 23.78728 0.997116 23.39 1.50009 209.62 8.88 4.34 16.10 22.538 9.04144 23.57 143.458 20.339 1425 33.056 1596.34 19018.3 22.962 0.918737 758663.03 13312.8 1.29513 27.42 6553007 349.52 0.344166 661 4.08878 22.243 6.395 77.159473 248.889 2321.21 405.76 870 15.021 866 6.037 567 639 23015.8 823.733 821.59 14.187 22.122 56.793 7.11 605 67.39 87000.9 841 10.291 140.182233 6.797 2.04221 42.56 29.94 488.713 1093 774.74 485.533 2.79965 38.671 482.101 31.26 35.38 289.72 9037 42.922 5290.73 0.344966 104.2 1.4445 1.4731 0.239514 206 295.9 427.16 484.52 20.9618 121.36 1317.18 30.551 224.435 0.43564 363.45 1250 175168000 203.4 127.1 268.2 6.32 4.9012 103982528 99.1 25.3 95.3 131.4 7.4554 278.4 19.894 28998877 71.7 37.543 583.09 304.8 482217 13.99 120300000 174.2 19.5906 11.8 5.67 6055 246000 127280 440310 2546270 42720 1036.39 832.645 3.63762 72.499 13.35 4.35 50.889 OpenBenchmarking.org
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.32.2 VGR Performance Metric 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 500K 1000K 1500K 2000K 2500K 2411587 423324 420308 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 800 1600 2400 3200 4000 SE +/- 39.37, N = 15 SE +/- 12.34, N = 15 3651.80 1350.20 1346.63 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 13 26 39 52 65 SE +/- 0.11, N = 4 SE +/- 0.05, N = 3 57.42 21.80 21.37 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 9K 18K 27K 36K 45K SE +/- 585.01, N = 15 SE +/- 181.40, N = 4 41269 19310 15618 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 1.0652 2.1304 3.1956 4.2608 5.326 SE +/- 0.036, N = 3 SE +/- 0.007, N = 3 4.734 1.859 1.848 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 30K 60K 90K 120K 150K SE +/- 276.83, N = 9 SE +/- 122.82, N = 6 123538.14 50295.80 50133.58 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20K 40K 60K 80K 100K SE +/- 74.38, N = 4 SE +/- 75.47, N = 3 92499.87 37645.41 37582.44 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 3 6 9 12 15 SE +/- 0.011962, N = 7 SE +/- 0.018568, N = 4 4.707610 11.321090 11.328990 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.02, N = 4 SE +/- 0.01, N = 3 11.06 23.97 24.04 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 50 100 150 200 250 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 103.61 224.44 224.58 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3K 6K 9K 12K 15K SE +/- 55.68, N = 9 SE +/- 115.12, N = 8 6924 13486 14660 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 30K 60K 90K 120K 150K SE +/- 268.68, N = 11 SE +/- 182.12, N = 9 118831.52 56347.39 56287.53 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 700M 1400M 2100M 2800M 3500M SE +/- 4603018.33, N = 3 SE +/- 3090487.20, N = 3 3279266667 1565800000 1565266667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 1.2037 2.4074 3.6111 4.8148 6.0185 SE +/- 0.00858289, N = 9 SE +/- 0.00960015, N = 7 2.57966831 5.26849863 5.34965897 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Read 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 80M 160M 240M 320M 400M SE +/- 1003430.17, N = 3 SE +/- 1359270.00, N = 3 373831966 182180961 180764311 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4K 8K 12K 16K 20K SE +/- 75.18, N = 3 SE +/- 55.43, N = 3 17835.8 8726.8 8708.9 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 50K 100K 150K 200K 250K SE +/- 353.02, N = 3 SE +/- 128.50, N = 3 211019.99 105738.89 105477.35 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.25, N = 3 71.11 141.61 142.20
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 500K 1000K 1500K 2000K 2500K SE +/- 1412.48, N = 3 SE +/- 1975.05, N = 3 2365407.62 1206142.70 1190387.62 1. (CC) gcc options: -O2 -lrt" -lrt
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4K 8K 12K 16K 20K SE +/- 73.04, N = 3 SE +/- 37.57, N = 3 18222.50 9239.59 9174.67 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Helsing Digit Range: 14 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 14 digit 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 40 80 120 160 200 SE +/- 0.89, N = 3 SE +/- 0.57, N = 3 81.46 161.42 161.65 1. (CC) gcc options: -O2 -pthread
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2M 4M 6M 8M 10M SE +/- 14502.87, N = 3 SE +/- 15762.12, N = 3 10200000 5305667 5163000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 30K 60K 90K 120K 150K SE +/- 139.20, N = 3 SE +/- 132.64, N = 3 118869 60553 60339 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4K 8K 12K 16K 20K SE +/- 93.94, N = 3 SE +/- 49.28, N = 3 20662.0 10580.9 10495.8 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.01, N = 5 SE +/- 0.17, N = 3 11.01 21.30 21.65 1. (CC) gcc options: -lm -lpthread -O3
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.05, N = 4 SE +/- 0.07, N = 3 13.68 26.86 26.87 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 8 16 24 32 40 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 16.64 32.49 32.68 1. (CXX) g++ options: -O3 -flto -pthread
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.22, N = 5 SE +/- 0.50, N = 4 83.57 44.06 42.61 MIN: 69.17 / MAX: 92.21 MIN: 41.96 / MAX: 49.48 MIN: 39.84 / MAX: 49.36
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 150 300 450 600 750 SE +/- 1.68, N = 3 SE +/- 1.68, N = 3 350.74 686.96 687.41 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
m-queens Time To Solve OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.2 Time To Solve 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.03, N = 5 SE +/- 0.04, N = 3 11.30 22.11 22.13 1. (CXX) g++ options: -fopenmp -O2 -march=native
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 700M 1400M 2100M 2800M 3500M SE +/- 284800.12, N = 3 SE +/- 2451530.13, N = 3 3086533333 1582500000 1579300000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.010, N = 9 SE +/- 0.020, N = 6 3.693 7.099 7.166 1. (CXX) g++ options: -O3 -lpthread
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.9117 1.8234 2.7351 3.6468 4.5585 SE +/- 0.00172, N = 7 SE +/- 0.00912, N = 7 2.08924 4.02680 4.05180 MIN: 2.03 MIN: 3.98 MIN: 3.98 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2K 4K 6K 8K 10K SE +/- 95.83, N = 5 SE +/- 12.97, N = 3 8890.34 4593.84 4588.57 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 40M 80M 120M 160M 200M SE +/- 1594885.41, N = 12 SE +/- 767934.53, N = 3 171121628 89220384 88559064
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 40K 80K 120K 160K 200K SE +/- 58.73, N = 4 SE +/- 47.08, N = 3 188790.14 98453.55 97768.80 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3K 6K 9K 12K 15K SE +/- 8.35, N = 6 SE +/- 13.75, N = 4 7281 14057 14058 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3K 6K 9K 12K 15K SE +/- 17.11, N = 6 SE +/- 27.98, N = 4 6992 13439 13486 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 400M 800M 1200M 1600M 2000M SE +/- 1152755.01, N = 3 SE +/- 408650.35, N = 3 2082103333 1084411667 1082117000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 40M 80M 120M 160M 200M SE +/- 1599896.72, N = 3 SE +/- 570049.11, N = 3 180945380 94879448 94073662 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 0.1168 0.2336 0.3504 0.4672 0.584 SE +/- 0.00027, N = 3 SE +/- 0.00101, N = 3 0.27064 0.51829 0.51889
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 7 14 21 28 35 SE +/- 0.08, N = 6 SE +/- 0.03, N = 4 28.15 14.70 14.69 1. (CC) gcc options: -O3 -march=native -fopenmp
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 120 240 360 480 600 SE +/- 1.61, N = 3 SE +/- 0.44, N = 3 291.42 554.94 556.78 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 8K 16K 24K 32K 40K SE +/- 62.90, N = 4 SE +/- 16.09, N = 5 35311.85 18779.65 18688.07 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 14K 28K 42K 56K 70K SE +/- 308.74, N = 3 SE +/- 131.93, N = 3 66328 35347 35107
rays1bench Large Scene OpenBenchmarking.org mrays/s, More Is Better rays1bench 2020-01-09 Large Scene 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 80 160 240 320 400 SE +/- 0.57, N = 8 SE +/- 0.17, N = 7 346.31 184.56 183.45
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: CPU-Only 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 40 80 120 160 200 SE +/- 0.05, N = 3 SE +/- 0.09, N = 3 103.87 195.10 195.16
WRF Input: conus 2.5km OpenBenchmarking.org Seconds, Fewer Is Better WRF 4.2.2 Input: conus 2.5km 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4K 8K 12K 16K 20K 9822.73 18356.46 18393.74 1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CPU-Only 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 40 80 120 160 200 SE +/- 0.02, N = 3 SE +/- 0.11, N = 3 87.63 163.77 164.01
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.33, N = 3 SE +/- 0.27, N = 3 54.92 102.03 102.13
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.34, N = 6 SE +/- 0.10, N = 5 107.90 58.60 58.29 MIN: 96.18 / MAX: 112.26 MIN: 53.96 / MAX: 61.75 MIN: 53.64 / MAX: 61.72
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 16 32 48 64 80 SE +/- 0.24, N = 3 SE +/- 0.66, N = 3 39.07 71.88 71.99 1. (CXX) g++ options: -O2 -lOpenCL
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 700M 1400M 2100M 2800M 3500M SE +/- 4053941.84, N = 3 SE +/- 1039764.93, N = 3 3047333333 1662300000 1654066667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 2K 4K 6K 8K 10K SE +/- 69.57, N = 15 SE +/- 32.20, N = 11 7939.02 4493.03 4334.33 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 150 300 450 600 750 SE +/- 1.45, N = 3 672 370 368 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 9K 18K 27K 36K 45K SE +/- 97.10, N = 3 SE +/- 73.93, N = 3 42770.8 23539.9 23489.6 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Obj 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 16 32 48 64 80 SE +/- 0.27, N = 3 SE +/- 0.12, N = 3 72.45 39.99 39.80 MIN: 62.12 / MAX: 82.66 MIN: 38.39 / MAX: 44.94 MIN: 38.22 / MAX: 44.71
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3 6 9 12 15 SE +/- 0.005, N = 3 SE +/- 0.010, N = 3 9.063 5.017 5.011 1. (CXX) g++ options: -O3 -pthread
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 6K 12K 18K 24K 30K SE +/- 67.30, N = 3 SE +/- 6.99, N = 3 26504.2 14707.2 14658.2 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 12 24 36 48 60 SE +/- 0.23, N = 3 SE +/- 0.09, N = 3 28.24 50.74 51.05
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 14 28 42 56 70 SE +/- 0.10, N = 5 SE +/- 0.07, N = 3 64.77 36.53 36.07 MIN: 59.87 / MAX: 79.46 MIN: 35.31 / MAX: 40.33 MIN: 34.8 / MAX: 40.77
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 0.05, N = 3 89.62 50.10 50.02 MIN: 70.4 / MAX: 98.44 MIN: 46.69 / MAX: 54.05 MIN: 46.7 / MAX: 53.93
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 200K 400K 600K 800K 1000K SE +/- 1484.41, N = 3 SE +/- 276.22, N = 3 572750 1024320 1026060
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 1125 641 629 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 300K 600K 900K 1200K 1500K SE +/- 2539.56, N = 3 SE +/- 1036.96, N = 3 666967 1178463 1180890
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 80K 160K 240K 320K 400K SE +/- 3489.62, N = 3 SE +/- 596.42, N = 3 355140 201596 200732 1. (CXX) g++ options: -pipe -lpthread
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20K 40K 60K 80K 100K SE +/- 106.40, N = 7 SE +/- 41.49, N = 6 100812.54 57275.08 57129.81 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 12K 24K 36K 48K 60K SE +/- 83.39, N = 3 SE +/- 122.59, N = 3 32738.9 56410.0 57520.4
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 9K 18K 27K 36K 45K SE +/- 84.43, N = 8 SE +/- 41.74, N = 6 40188.84 22960.66 22890.25 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3K 6K 9K 12K 15K SE +/- 404.92, N = 3 SE +/- 5.70, N = 3 12928.80 7452.07 7408.86 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.3294 0.6588 0.9882 1.3176 1.647 SE +/- 0.000669, N = 9 SE +/- 0.000612, N = 9 0.839556 1.371550 1.464100 MIN: 0.8 MIN: 1.31 MIN: 1.31 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3K 6K 9K 12K 15K SE +/- 27.45, N = 6 SE +/- 31.23, N = 4 7883 13726 13739 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: DLSC - Acceleration: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 3 6 9 12 15 SE +/- 0.08, N = 3 SE +/- 0.00, N = 3 9.41 5.50 5.49 MIN: 8.72 / MAX: 12.17 MIN: 5.29 / MAX: 6.23 MIN: 5.31 / MAX: 6.18
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 13K 26K 39K 52K 65K SE +/- 253.99, N = 12 SE +/- 71.21, N = 3 34155.0 57591.0 58467.7
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.26, N = 5 SE +/- 0.09, N = 4 75.50 44.17 44.17 MIN: 65.3 / MAX: 94.46 MIN: 42.4 / MAX: 48.84 MIN: 42.27 / MAX: 48.76
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20K 40K 60K 80K 100K SE +/- 433.14, N = 7 SE +/- 65.86, N = 3 47974.6 80455.7 81701.5
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.0729 0.1458 0.2187 0.2916 0.3645 SE +/- 0.000612, N = 9 SE +/- 0.002283, N = 15 0.190628 0.310362 0.323917 MIN: 0.18 MIN: 0.29 MIN: 0.29 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 0.324 0.648 0.972 1.296 1.62 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.44 0.85 0.85
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 40K 80K 120K 160K 200K SE +/- 236.31, N = 4 SE +/- 204.16, N = 3 198052.38 117333.47 116983.50 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.hdr_alb_nrm.3840x2160 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.00, N = 5 SE +/- 0.00, N = 3 2.98 1.78 1.77
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 44.45 74.49 74.51
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.039, N = 5 SE +/- 0.041, N = 3 9.257 15.436 15.515 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
ebizzy OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 400K 800K 1200K 1600K 2000K SE +/- 19700.68, N = 15 SE +/- 17257.96, N = 15 2035986 1250548 1215587 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
Appleseed Scene: Material Tester OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Material Tester Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 40 80 120 160 200 106.17 107.46 177.78
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 700 1400 2100 2800 3500 1846.8 2993.9 3085.1 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.ldr_alb_nrm.3840x2160 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 0.6728 1.3456 2.0184 2.6912 3.364 SE +/- 0.00, N = 5 SE +/- 0.00, N = 3 2.99 1.79 1.79
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 1.2242 2.4484 3.6726 4.8968 6.121 SE +/- 0.00173, N = 3 SE +/- 0.01029, N = 3 3.25912 5.37652 5.44095 MIN: 3.09 MIN: 5.25 MIN: 5.25 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 1.3298 2.6596 3.9894 5.3192 6.649 SE +/- 0.00332, N = 9 SE +/- 0.00133, N = 9 3.56396 5.87433 5.91004 MIN: 3.49 MIN: 5.71 MIN: 5.76 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.01, N = 4 SE +/- 0.02, N = 3 14.41 23.79 23.84 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.2244 0.4488 0.6732 0.8976 1.122 SE +/- 0.001365, N = 4 SE +/- 0.001560, N = 4 0.604378 0.960174 0.997116 MIN: 0.56 MIN: 0.91 MIN: 0.92 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 14.30 23.39 23.49 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.3375 0.675 1.0125 1.35 1.6875 SE +/- 0.002416, N = 7 SE +/- 0.000981, N = 7 0.915086 1.482560 1.500090 MIN: 0.85 MIN: 1.32 MIN: 1.41 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 50 100 150 200 250 SE +/- 0.54, N = 3 SE +/- 0.93, N = 3 127.99 208.65 209.62
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Orange Juice - Acceleration: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 14.33 8.95 8.88 MIN: 11.53 / MAX: 18.29 MIN: 7.51 / MAX: 10.05 MIN: 7.54 / MAX: 10.04
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Danish Mood - Acceleration: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.07, N = 3 SE +/- 0.02, N = 3 6.97 4.42 4.34 MIN: 3.13 / MAX: 8.11 MIN: 1.74 / MAX: 5.11 MIN: 1.86 / MAX: 5.01
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.04, N = 5 SE +/- 0.01, N = 4 10.08 15.92 16.10 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 8 16 24 32 40 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 35.99 22.68 22.54 1. (CXX) g++ options: -O3 -pthread -lm
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 3 6 9 12 15 SE +/- 0.05061, N = 8 SE +/- 0.04039, N = 5 5.73331 9.04144 9.15446 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 9 18 27 36 45 SE +/- 0.32, N = 3 SE +/- 0.06, N = 3 37.49 23.70 23.57 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 30 60 90 120 150 SE +/- 0.22, N = 3 SE +/- 0.25, N = 3 90.57 142.51 143.46
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 7 14 21 28 35 SE +/- 0.33, N = 15 SE +/- 0.23, N = 15 31.56 20.84 20.34 1. (CXX) g++ options: -O3 -pthread -lm
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 500 1000 1500 2000 2500 SE +/- 4.10, N = 3 SE +/- 0.67, N = 3 2191 1452 1425 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 8 16 24 32 40 SE +/- 0.24, N = 5 SE +/- 0.41, N = 3 21.63 32.01 33.06
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 400 800 1200 1600 2000 SE +/- 12.08, N = 9 SE +/- 29.35, N = 9 1169.15 1596.34 1783.19 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 5K 10K 15K 20K 25K SE +/- 396.15, N = 8 SE +/- 195.78, N = 5 24810.2 19018.3 16445.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.04, N = 4 SE +/- 0.01, N = 3 15.51 22.87 22.96 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 0.2851 0.5702 0.8553 1.1404 1.4255 SE +/- 0.006531, N = 15 SE +/- 0.010254, N = 9 0.872017 0.918737 1.267000 MIN: 0.77 MIN: 0.77 MIN: 0.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 170K 340K 510K 680K 850K SE +/- 10511.44, N = 3 SE +/- 5876.97, N = 15 771533.42 758663.03 541996.24 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4K 8K 12K 16K 20K SE +/- 158.49, N = 8 SE +/- 155.70, N = 5 18859.8 14169.2 13312.8 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.2914 0.5828 0.8742 1.1656 1.457 SE +/- 0.003790, N = 4 SE +/- 0.000855, N = 4 0.914865 1.212030 1.295130 MIN: 0.85 MIN: 1.16 MIN: 1.16 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 9 18 27 36 45 SE +/- 0.19, N = 4 SE +/- 0.02, N = 3 38.71 27.57 27.42 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Read While Writing 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2M 4M 6M 8M 10M SE +/- 103809.83, N = 15 SE +/- 63537.44, N = 3 9170260 6676556 6553007 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 100 200 300 400 500 SE +/- 4.09, N = 15 SE +/- 5.22, N = 15 485.12 475.45 349.52 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.0774 0.1548 0.2322 0.3096 0.387 SE +/- 0.000574, N = 4 SE +/- 0.000427, N = 4 0.249611 0.329636 0.344166 MIN: 0.23 MIN: 0.29 MIN: 0.29 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 140 280 420 560 700 SE +/- 2.17, N = 3 SE +/- 5.49, N = 3 661 659 482 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.92 1.84 2.76 3.68 4.6 SE +/- 0.00418, N = 4 SE +/- 0.02087, N = 4 2.99485 4.03139 4.08878 MIN: 2.85 MIN: 3.2 MIN: 3.19 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.4 Time To Compile 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 16.43 22.24 22.32
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.024, N = 8 SE +/- 0.007, N = 7 4.709 6.211 6.395 1. (CXX) g++ options: -O2 -lOpenCL
Appleseed Scene: Disney Material OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Disney Material 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 20 40 60 80 100 57.40 77.16 77.52
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Unix Makefiles 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 60 120 180 240 300 SE +/- 0.65, N = 3 SE +/- 0.72, N = 3 190.68 248.89 256.63
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 700 1400 2100 2800 3500 SE +/- 20.88, N = 4 SE +/- 6.51, N = 3 3086.24 2333.56 2321.21 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 4K 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 120 240 360 480 600 SE +/- 0.65, N = 3 SE +/- 0.60, N = 3 532.57 405.76 404.27 MIN: 189.3 / MAX: 586.86 MIN: 281.5 / MAX: 454.16 MIN: 275.43 / MAX: 456.48 1. (CC) gcc options: -pthread -lm
MariaDB Clients: 256 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 10.5.2 Clients: 256 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 200 400 600 800 1000 SE +/- 1.12, N = 3 SE +/- 1.84, N = 3 918 870 697 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -ldl -lz -lrt
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.02, N = 4 SE +/- 0.02, N = 4 11.42 14.93 15.02 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
MariaDB Clients: 512 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 10.5.2 Clients: 512 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 200 400 600 800 1000 SE +/- 0.76, N = 3 SE +/- 2.75, N = 3 870 866 664 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -ldl -lz -lrt
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.005, N = 8 SE +/- 0.006, N = 7 4.658 6.013 6.037
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 160 320 480 640 800 SE +/- 2.03, N = 3 SE +/- 2.33, N = 3 732 586 567 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
OpenVKL Benchmark: vklBenchmark OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 3.53, N = 3 820 641 639 MIN: 1 / MAX: 3230 MIN: 1 / MAX: 2858 MIN: 1 / MAX: 2861
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 6K 12K 18K 24K 30K SE +/- 90.91, N = 4 SE +/- 13.81, N = 4 28391.4 23125.0 23015.8 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 2.14, N = 3 SE +/- 4.02, N = 3 675.16 812.26 823.73 MIN: 645.81 MIN: 778.91 MIN: 781.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 0.92, N = 3 SE +/- 2.62, N = 3 675.12 810.36 821.59 MIN: 649.43 MIN: 779.4 MIN: 782.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.14, N = 4 SE +/- 0.06, N = 4 11.96 14.19 14.37
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.12, N = 3 SE +/- 0.01, N = 3 18.71 21.79 22.12
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 13 26 39 52 65 SE +/- 0.08, N = 3 SE +/- 0.18, N = 3 48.07 56.58 56.79
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 7.30 7.11 6.21
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 130 260 390 520 650 SE +/- 6.23, N = 5 SE +/- 8.14, N = 12 605 586 516 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 15 30 45 60 75 SE +/- 0.32, N = 3 SE +/- 0.25, N = 3 57.64 67.16 67.39
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 20K 40K 60K 80K 100K SE +/- 439.19, N = 3 SE +/- 82.87, N = 3 74702.4 81669.1 87000.9
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 200 400 600 800 1000 SE +/- 11.50, N = 3 SE +/- 5.94, N = 10 867 841 753 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 3 6 9 12 15 SE +/- 0.024, N = 5 SE +/- 0.026, N = 5 9.111 10.291 10.387
Appleseed Scene: Emily OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Emily 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 30 60 90 120 150 123.70 140.18 140.51
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.063, N = 15 SE +/- 0.063, N = 15 6.762 6.797 7.652 1. (CXX) g++ options: -O2 -lOpenCL
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.4595 0.919 1.3785 1.838 2.2975 SE +/- 0.00217, N = 5 SE +/- 0.00210, N = 5 1.81271 2.02106 2.04221 MIN: 1.67 MIN: 1.69 MIN: 1.72 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 11 22 33 44 55 SE +/- 0.33, N = 4 SE +/- 0.08, N = 4 47.84 42.65 42.56 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 8 16 24 32 40 SE +/- 0.33, N = 3 SE +/- 0.21, N = 3 33.43 31.06 29.94
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 110 220 330 440 550 SE +/- 1.03, N = 3 SE +/- 1.28, N = 3 439.37 476.02 488.71 MIN: 423.06 MIN: 462.14 MIN: 463.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 200 400 600 800 1000 SE +/- 2.33, N = 3 1159 1093 1042 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 10-bit 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 1.95, N = 3 SE +/- 0.41, N = 3 861.39 775.38 774.74 MIN: 524.86 / MAX: 1144.29 MIN: 588.21 / MAX: 1071.6 MIN: 590.4 / MAX: 1067.33 1. (CC) gcc options: -pthread -lm
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 110 220 330 440 550 SE +/- 2.02, N = 3 SE +/- 0.97, N = 3 439.64 476.21 485.53 MIN: 422.12 MIN: 462.69 MIN: 463.97 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 0.6328 1.2656 1.8984 2.5312 3.164 SE +/- 0.01197, N = 10 SE +/- 0.00595, N = 10 2.56596 2.79965 2.81248 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.13, N = 3 35.34 38.27 38.67
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 100 200 300 400 500 SE +/- 2.16, N = 3 SE +/- 1.74, N = 3 441.26 475.86 482.10 MIN: 423.47 MIN: 461.87 MIN: 463.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.08, N = 3 31.26 31.23 28.65 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 9 18 27 36 45 SE +/- 0.31, N = 3 SE +/- 0.12, N = 3 38.51 36.97 35.38
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 70 140 210 280 350 SE +/- 2.91, N = 15 SE +/- 0.54, N = 10 313.93 290.36 289.72 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 2K 4K 6K 8K 10K SE +/- 18.20, N = 3 SE +/- 2.92, N = 3 9056 9037 8402 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 10 20 30 40 50 SE +/- 0.06, N = 3 SE +/- 0.06, N = 3 39.84 42.71 42.92
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 1100 2200 3300 4400 5500 SE +/- 4.90, N = 3 SE +/- 46.19, N = 3 5305.67 5290.73 4997.05 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 0.0819 0.1638 0.2457 0.3276 0.4095 SE +/- 0.000930, N = 3 SE +/- 0.001200, N = 3 0.343258 0.344966 0.363977 MIN: 0.3 MIN: 0.3 MIN: 0.32 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 20 40 60 80 100 SE +/- 2.67, N = 3 110.1 104.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.325 0.65 0.975 1.3 1.625 SE +/- 0.00190, N = 5 SE +/- 0.00142, N = 5 1.36768 1.42703 1.44450 MIN: 1.33 MIN: 1.4 MIN: 1.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.3314 0.6628 0.9942 1.3256 1.657 SE +/- 0.00305, N = 7 SE +/- 0.00041, N = 7 1.39717 1.45952 1.47310 MIN: 1.24 MIN: 1.43 MIN: 1.43 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 0.0539 0.1078 0.1617 0.2156 0.2695 SE +/- 0.001610, N = 4 SE +/- 0.000310, N = 4 0.228632 0.234327 0.239514 MIN: 0.2 MIN: 0.21 MIN: 0.21 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 50 100 150 200 250 SE +/- 1.48, N = 3 SE +/- 1.69, N = 3 206 204 197 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 70 140 210 280 350 SE +/- 1.16, N = 3 306.8 295.9 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 90 180 270 360 450 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 413.22 425.32 427.16 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 110 220 330 440 550 SE +/- 0.89, N = 11 SE +/- 1.83, N = 10 485.54 484.52 469.70 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 20.30 20.91 20.96 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 30 60 90 120 150 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 117.57 120.69 121.36 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 300 600 900 1200 1500 SE +/- 3.16, N = 3 SE +/- 13.05, N = 3 1356.01 1354.59 1317.18 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 7 14 21 28 35 SE +/- 0.31, N = 3 SE +/- 0.18, N = 3 29.69 30.53 30.55 1. (CXX) g++ options: -O3 -fPIC -lm
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 50 100 150 200 250 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 218.16 223.97 224.44 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 0.0987 0.1974 0.2961 0.3948 0.4935 SE +/- 0.000558, N = 5 SE +/- 0.001219, N = 5 0.427987 0.435640 0.438511 MIN: 0.41 MIN: 0.41 MIN: 0.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 80 160 240 320 400 SE +/- 2.30, N = 9 SE +/- 2.16, N = 10 371.88 370.64 363.45 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Xeon Platinum 8380 Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 300 600 900 1200 1500 SE +/- 14.64, N = 4 SE +/- 11.55, N = 4 1266.33 1250.00 1238.71 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 40M 80M 120M 160M 200M SE +/- 1334158.46, N = 15 SE +/- 953041.14, N = 3 178965233 175890300 175168000 1. (CXX) g++ options: -O3 -fopenmp
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 50 100 150 200 250 SE +/- 0.49, N = 3 207.5 203.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 30 60 90 120 150 SE +/- 0.60, N = 3 129.6 127.1 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 60 120 180 240 300 SE +/- 0.45, N = 3 273.2 268.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.007, N = 6 SE +/- 0.002, N = 6 6.251 6.303 6.320 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
ASTC Encoder Preset: Medium OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Medium Xeon Platinum 8380 rest Intel Xeon Platinum 8380 1.1028 2.2056 3.3084 4.4112 5.514 SE +/- 0.0185, N = 3 4.8512 4.9012 1. (CXX) g++ options: -O3 -flto -pthread
OpenVKL Benchmark: vklBenchmarkStructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 20M 40M 60M 80M 100M SE +/- 47834.23, N = 3 SE +/- 177762.64, N = 3 105023963 104922070 103982528 MIN: 1379843 / MAX: 899785764 MIN: 1391949 / MAX: 891929412 MIN: 1363161 / MAX: 855550476
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 20 40 60 80 100 SE +/- 0.52, N = 3 99.1 98.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 1080p Xeon Platinum 8380 rest Intel Xeon Platinum 8380 6 12 18 24 30 SE +/- 0.02, N = 3 25.46 25.30 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 20 40 60 80 100 SE +/- 0.06, N = 3 95.9 95.3 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 270 PRB SISO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 30 60 90 120 150 SE +/- 0.12, N = 3 132.2 131.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
ASTC Encoder Preset: Thorough OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Thorough Xeon Platinum 8380 rest Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.0247, N = 3 7.4107 7.4554 1. (CXX) g++ options: -O3 -flto -pthread
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 64-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 60 120 180 240 300 SE +/- 0.25, N = 3 279.9 278.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 5 10 15 20 25 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 19.79 19.79 19.89
OpenVKL Benchmark: vklBenchmarkVdbVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 6M 12M 18M 24M 30M SE +/- 89609.36, N = 3 SE +/- 65507.38, N = 3 29143044 29092155 28998877 MIN: 1069452 / MAX: 176387184 MIN: 1047483 / MAX: 175929480 MIN: 1042294 / MAX: 173633256
srsRAN Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 5G PHY_DL_NR Test 52 PRB SISO 64-QAM Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 16 32 48 64 80 SE +/- 0.21, N = 3 71.7 71.4 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 9 18 27 36 45 SE +/- 0.18, N = 3 SE +/- 0.14, N = 3 37.54 37.68 37.69 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 130 260 390 520 650 SE +/- 3.29, N = 11 SE +/- 1.28, N = 12 584.72 583.64 583.09 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
srsRAN Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM OpenBenchmarking.org eNb Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB MIMO 256-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 70 140 210 280 350 SE +/- 0.12, N = 3 305.6 304.8 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
Facebook RocksDB Test: Random Fill Sync OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill Sync Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 Xeon Platinum 8380 100K 200K 300K 400K 500K SE +/- 556.11, N = 3 SE +/- 985.63, N = 3 482217 482191 481504 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
VP9 libvpx Encoding Speed: Speed 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 5 - Input: Bosphorus 4K Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 4 8 12 16 20 SE +/- 0.07, N = 3 13.99 13.97 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
srsRAN Test: OFDM_Test OpenBenchmarking.org Samples / Second, More Is Better srsRAN 21.04 Test: OFDM_Test Xeon Platinum 8380 rest Intel Xeon Platinum 8380 30M 60M 90M 120M 150M SE +/- 305505.05, N = 3 120400000 120300000 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
srsRAN Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM OpenBenchmarking.org UE Mb/s, More Is Better srsRAN 21.04 Test: 4G PHY_DL_Test 100 PRB SISO 64-QAM Xeon Platinum 8380 rest Intel Xeon Platinum 8380 40 80 120 160 200 SE +/- 0.03, N = 3 174.3 174.2 1. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lbladeRF -lm -lfftw3f -lmbedcrypto
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 3.0 Preset: Exhaustive Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 5 10 15 20 25 SE +/- 0.02, N = 3 19.59 19.60 1. (CXX) g++ options: -O3 -flto -pthread
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 1080p Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 3 6 9 12 15 SE +/- 0.01, N = 3 11.80 11.80 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
VP9 libvpx Encoding Speed: Speed 0 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.10.0 Speed: Speed 0 - Input: Bosphorus 4K Intel Xeon Platinum 8380 Xeon Platinum 8380 rest 1.2758 2.5516 3.8274 5.1032 6.379 SE +/- 0.00, N = 3 5.67 5.67 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 1600 3200 4800 6400 8000 SE +/- 15.90, N = 3 SE +/- 184.76, N = 12 7609 6381 6055 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 90K 180K 270K 360K 450K SE +/- 30611.01, N = 12 SE +/- 7613.62, N = 12 402726 273780 246000 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl Intel Xeon Platinum 8380 2 x Xeon Platinum 8380 Xeon Platinum 8380 30K 60K 90K 120K 150K SE +/- 1515.66, N = 15 SE +/- 5943.89, N = 15 127280 84872 75707 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 170K 340K 510K 680K 850K SE +/- 19372.86, N = 12 SE +/- 23435.25, N = 12 774163 440310 407502 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 800K 1600K 2400K 3200K 4000K SE +/- 58124.19, N = 12 SE +/- 117578.84, N = 12 3662348 2546270 2476303 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 16K 32K 48K 64K 80K SE +/- 2190.61, N = 15 SE +/- 1301.01, N = 15 75061 42720 33583 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 Xeon Platinum 8380 600 1200 1800 2400 3000 SE +/- 50.45, N = 15 SE +/- 14.01, N = 3 2725.03 1036.39 1022.32 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 200 400 600 800 1000 SE +/- 68.99, N = 15 SE +/- 2.97, N = 3 757.06 813.03 832.65 MIN: 648.2 MIN: 779.61 MIN: 780.27 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.21071, N = 12 SE +/- 0.01372, N = 3 3.63762 4.51612 6.96910 MIN: 3.35 MIN: 3.3 MIN: 6.54 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Intel Xeon Platinum 8380 Xeon Platinum 8380 2 x Xeon Platinum 8380 20 40 60 80 100 SE +/- 1.05, N = 3 SE +/- 2.28, N = 15 72.50 74.90 81.51 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Rainbow Colors and Prism - Acceleration: CPU Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 4 8 12 16 20 SE +/- 0.27, N = 15 SE +/- 0.63, N = 15 14.39 14.11 13.35 MIN: 12.31 / MAX: 19.48 MIN: 10.63 / MAX: 19.45 MIN: 12.25 / MAX: 13.95
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: LuxCore Benchmark - Acceleration: CPU 2 x Xeon Platinum 8380 Xeon Platinum 8380 Intel Xeon Platinum 8380 2 4 6 8 10 SE +/- 0.14, N = 15 SE +/- 0.06, N = 15 6.89 4.50 4.35 MIN: 2.38 / MAX: 8.45 MIN: 1.72 / MAX: 5.53 MIN: 1.8 / MAX: 5.01
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Xeon Platinum 8380 2 x Xeon Platinum 8380 Intel Xeon Platinum 8380 11 22 33 44 55 SE +/- 0.81, N = 15 SE +/- 0.66, N = 3 44.02 47.44 50.89 1. (CXX) g++ options: -O2 -lOpenCL
Phoronix Test Suite v10.8.4