Server CPUs June 2021 2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2106013-IB-SERVERCPU45&grr .
Server CPUs June 2021 Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution Xeon Platinum 8380 2P 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads) Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) Intel Device 0998 504GB 7682GB INTEL SSDPF2KX076TZ ASPEED VE228 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP Ubuntu 21.04 5.13.0-051300rc4-generic (x86_64) 20210530 GNOME Shell 3.38.4 X Server 4.5 Mesa 21.0.1 (LLVM 11.0.1 256 bits) GCC 10.3.0 ext4 1920x1080 OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270 - Python 3.9.5 - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Server CPUs June 2021 wrf: conus 2.5km qe: AUSURF112 openvkl: vklBenchmarkUnstructuredVolume nwchem: C240 Buckyball brl-cad: VGR Performance Metric onnx: bertsquad-10 - OpenMP CPU onnx: super-resolution-10 - OpenMP CPU openvkl: vklBenchmark helsing: 14 digit webp2: Quality 100, Lossless Compression yafaray: Total Time For Sample Scene onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU plaidml: No - Inference - ResNet 50 - CPU relion: Basic - CPU luxcorerender: Danish Mood - CPU luxcorerender: LuxCore Benchmark - CPU mysqlslap: 512 graphics-magick: Resizing askap: tConvolve MT - Degridding askap: tConvolve MT - Gridding incompact3d: X3D-benchmarking input.i3d mysqlslap: 256 rocksdb: Seq Fill luxcorerender: DLSC - CPU lammps: 20k Atoms webp2: Quality 95, Compression Effort 7 onnx: yolov4 - OpenMP CPU build-llvm: Unix Makefiles openfoam: Motorbike 60M cpuminer-opt: Blake-2 S cpuminer-opt: x25x cpuminer-opt: Skeincoin cpuminer-opt: Magi cpuminer-opt: Myriad-Groestl asmfish: 1024 Hash Memory, 26 Depth cpuminer-opt: Deepcoin build-llvm: Ninja cpuminer-opt: Garlicoin cpuminer-opt: LBC, LBRY Credits onnx: fcn-resnet101-11 - OpenMP CPU onnx: shufflenet-v2-10 - OpenMP CPU appleseed: Material Tester stockfish: Total Time webp2: Quality 75, Compression Effort 7 blender: Barbershop - CPU-Only rodinia: OpenMP HotSpot3D ebizzy: keydb: build-nodejs: Time To Compile blender: Pabellon Barcelona - CPU-Only appleseed: Emily v-ray: CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU plaidml: No - Inference - VGG19 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU compress-7zip: Compress Speed Test blender: Classroom - CPU-Only tungsten: Volumetric Caustic openvkl: vklBenchmarkVdbVolume plaidml: No - Inference - VGG16 - CPU luxcorerender: Orange Juice - CPU tensorflow-lite: Inception V4 tensorflow-lite: Inception ResNet V2 tensorflow-lite: SqueezeNet tensorflow-lite: NASNet Mobile tensorflow-lite: Mobilenet Quant tensorflow-lite: Mobilenet Float john-the-ripper: MD5 rocksdb: Rand Fill rocksdb: Rand Fill Sync rocksdb: Read While Writing rocksdb: Rand Read graphics-magick: Sharpen graphics-magick: Rotate graphics-magick: Enhanced graphics-magick: Noise-Gaussian graphics-magick: HWB Color Space graphics-magick: Swirl build2: Time To Compile toktx: UASTC 4 + Zstd Compression 19 luxcorerender: Rainbow Colors and Prism - CPU rust-mandel: Time To Complete Serial/Parallel Mandelbrot build-godot: Time To Compile openvkl: vklBenchmarkStructuredVolume blender: Fishy Cat - CPU-Only rodinia: OpenMP Leukocyte askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding npb: EP.D askap: tConvolve MPI - Gridding askap: tConvolve MPI - Degridding build-gdb: Time To Compile xmrig: Monero - 1M svt-av1: Preset 4 - Bosphorus 4K rodinia: OpenMP LavaMD appleseed: Disney Material build-wasmer: Time To Compile build-linux-kernel: Time To Compile build-php: Time To Compile namd: ATPase Simulation - 327,506 Atoms coremark: CoreMark Size 666 - Iterations Per Second gromacs: MPI CPU - water_GMX50_bare john-the-ripper: Blowfish aircrack-ng: avifenc: 6, Lossless tungsten: Water Caustic blender: BMW27 - CPU-Only basis: ETC1S xmrig: Wownero - 1M openfoam: Motorbike 30M embree: Pathtracer - Asian Dragon Obj amg: astcenc: Exhaustive embree: Pathtracer ISPC - Asian Dragon Obj oidn: RTLightmap.hdr.4096x4096 x265: Bosphorus 4K onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU kvazaar: Bosphorus 4K - Ultra Fast liquid-dsp: 160 - 256 - 57 openssl: RSA 4096-bit Performance liquid-dsp: 128 - 256 - 57 liquid-dsp: 64 - 256 - 57 build-apache: Time To Compile kripke: build-mesa: Time To Compile ttsiod-renderer: Phong Rendering With Soft-Shadow Mapping svt-hevc: 1 - Bosphorus 1080p askap: Hogbom Clean OpenMP npb: SP.C build-ffmpeg: Time To Compile svt-av1: Preset 8 - Bosphorus 4K basis: UASTC Level 3 kvazaar: Bosphorus 4K - Very Fast npb: BT.C pennant: sedovbig onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - bf16bf16bf16 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU tachyon: Total Time minife: Small svt-vp9: VMAF Optimized - Bosphorus 1080p dav1d: Chimera 1080p 10-bit npb: IS.D lulesh: toktx: UASTC 3 + Zstd Compression 19 onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU onednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU build-imagemagick: Time To Compile avifenc: 6 incompact3d: input.i3d 193 Cells Per Direction npb: LU.C povray: Trace Time basis: UASTC Level 2 c-ray: Total Time - 4K, 16 Rays Per Pixel m-queens: Time To Solve oidn: RT.hdr_alb_nrm.3840x2160 oidn: RT.ldr_alb_nrm.3840x2160 embree: Pathtracer - Crown cloverleaf: Lagrangian-Eulerian Hydrodynamics onednn: IP Shapes 3D - f32 - CPU embree: Pathtracer ISPC - Crown onednn: IP Shapes 3D - bf16bf16bf16 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU embree: Pathtracer - Asian Dragon mt-dgemm: Sustained Floating-Point Rate toybrot: C++ Tasks basis: UASTC Level 0 rodinia: OpenMP Streamcluster lammps: Rhodopsin Protein toybrot: OpenMP avifenc: 10, Lossless toybrot: C++ Threads dav1d: Summer Nature 4K embree: Pathtracer ISPC - Asian Dragon toybrot: TBB webp2: Quality 100, Compression Effort 5 onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPU tungsten: Hair pennant: leblancbig npb: FT.C rodinia: OpenMP CFD Solver svt-hevc: 7 - Bosphorus 1080p toktx: UASTC 3 npb: CG.C avifenc: 10 rays1bench: Large Scene primesieve: 1e12 Prime Number Generation npb: SP.B svt-vp9: Visual Quality Optimized - Bosphorus 1080p incompact3d: input.i3d 129 Cells Per Direction svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU tungsten: Non-Exponential npb: EP.C npb: MG.C svt-hevc: 10 - Bosphorus 1080p Xeon Platinum 8380 2P 9806.656 1166.80 1794150 1880.6 2414033 499 7327 814 82.917 413.224 82.107 446.076 6.15 350.060 6.83 6.98 664 378 9286.52 5192.96 291.414846 694 597789 9.18 36.049 218.108 476 191.646 103.86 3581744 2841.22 803416 3764.88 89832 171766875 75288 127.374 40542 356768 199 8201 179.192446 178977022 117.627 104.18 104.312 2002029 542875.78 91.001 87.69 123.622839 66647 675.111 674.916 679.322 440.641 34.04 439.162 351890 71.19 14.0630 29321974 39.39 14.37 664404 568468 47633.4 73580.0 34437.6 32738.8 10252333 589877 484574 8685478 367939009 672 780 1127 731 1048 2170 57.333 56.557 12.60 48.740 48.338 105233677 44.46 45.211 25496.0 18680.2 8886.39 20825.1 18180.0 39.944 26540.4 4.788 38.968 57.669194 37.457 21.506 35.402 0.27293 2366961.039552 9.038 118496 210909.563 29.956 20.3189 28.36 26.484 42661.3 14.34 72.9156 2078582667 16.6304 89.1757 1.44 28.60 6.97288 3.26354 0.360963 48.04 3089933333 17870.9 3289000000 3062166667 19.775 176213733 18.589 1363.68 38.27 1220.00 92298.34 16.249 58.185 15.561 38.93 198263.35 14.40466 0.925533 2.98665 1.26992 13.6431 28477.1 470.79 865.87 3030.35 35415.170 9.133 0.251017 0.605773 0.228136 12.130 12.027 11.0113789 188559.27 9.312 11.389 10.989 11.314 2.99 2.98 64.4853 9.95 1.36484 74.8384 1.81027 0.435839 83.0529 28.081573 7936 7.628 7.631 31.745 7278 7.295 7050 534.79 107.8659 6803 6.242 1.39960 0.912920 2.09290 5.72330 4.681398 101071.35 4.750 314.07 4.667 40407.42 4.337 346.01 3.658 123085.11 372.57 2.57972169 471.75 0.839584 3.56680 0.191254 2.53696 7792.57 120157.87 588.85 OpenBenchmarking.org
WRF Input: conus 2.5km OpenBenchmarking.org Seconds, Fewer Is Better WRF 4.2.2 Input: conus 2.5km Xeon Platinum 8380 2P 2K 4K 6K 8K 10K 9806.66 1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
Quantum ESPRESSO Input: AUSURF112 OpenBenchmarking.org Seconds, Fewer Is Better Quantum ESPRESSO 6.7 Input: AUSURF112 Xeon Platinum 8380 2P 300 600 900 1200 1500 SE +/- 6.32, N = 3 1166.80 1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
OpenVKL Benchmark: vklBenchmarkUnstructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkUnstructuredVolume Xeon Platinum 8380 2P 400K 800K 1200K 1600K 2000K SE +/- 8230.26, N = 3 1794150 MIN: 24223 / MAX: 5801986
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball Xeon Platinum 8380 2P 400 800 1200 1600 2000 1880.6 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
BRL-CAD VGR Performance Metric OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.32.2 VGR Performance Metric Xeon Platinum 8380 2P 500K 1000K 1500K 2000K 2500K 2414033 1. (CXX) g++ options: -std=c++11 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -pthread -ldl -lm
ONNX Runtime Model: bertsquad-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: bertsquad-10 - Device: OpenMP CPU Xeon Platinum 8380 2P 110 220 330 440 550 SE +/- 7.32, N = 12 499 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: super-resolution-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: super-resolution-10 - Device: OpenMP CPU Xeon Platinum 8380 2P 1600 3200 4800 6400 8000 SE +/- 194.09, N = 12 7327 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
OpenVKL Benchmark: vklBenchmark OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmark Xeon Platinum 8380 2P 200 400 600 800 1000 SE +/- 2.19, N = 3 814 MIN: 1 / MAX: 3191
Helsing Digit Range: 14 digit OpenBenchmarking.org Seconds, Fewer Is Better Helsing 1.0-beta Digit Range: 14 digit Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.78, N = 15 82.92 1. (CC) gcc options: -O2 -pthread
WebP2 Image Encode Encode Settings: Quality 100, Lossless Compression OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Lossless Compression Xeon Platinum 8380 2P 90 180 270 360 450 SE +/- 0.07, N = 3 413.22 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
YafaRay Total Time For Sample Scene OpenBenchmarking.org Seconds, Fewer Is Better YafaRay 3.4.1 Total Time For Sample Scene Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 2.43, N = 15 82.11 1. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 3.39, N = 15 446.08 MIN: 422.13 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.04, N = 3 6.15
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 3.1.1 Test: Basic - Device: CPU Xeon Platinum 8380 2P 80 160 240 320 400 SE +/- 2.01, N = 3 350.06 1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Danish Mood - Acceleration: CPU Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.07, N = 15 6.83 MIN: 2.78 / MAX: 8.24
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: LuxCore Benchmark - Acceleration: CPU Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.12, N = 15 6.98 MIN: 2.5 / MAX: 8.5
MariaDB Clients: 512 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 10.5.2 Clients: 512 Xeon Platinum 8380 2P 140 280 420 560 700 SE +/- 2.44, N = 3 664 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -ldl -lz -lrt
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Resizing Xeon Platinum 8380 2P 80 160 240 320 400 SE +/- 10.32, N = 15 378 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding Xeon Platinum 8380 2P 2K 4K 6K 8K 10K SE +/- 50.18, N = 3 9286.52 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Xeon Platinum 8380 2P 1100 2200 3300 4400 5500 SE +/- 9.97, N = 3 5192.96 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d Xeon Platinum 8380 2P 60 120 180 240 300 SE +/- 1.10, N = 3 291.41 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
MariaDB Clients: 256 OpenBenchmarking.org Queries Per Second, More Is Better MariaDB 10.5.2 Clients: 256 Xeon Platinum 8380 2P 150 300 450 600 750 SE +/- 1.37, N = 3 694 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -ldl -lz -lrt
Facebook RocksDB Test: Sequential Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Sequential Fill Xeon Platinum 8380 2P 130K 260K 390K 520K 650K SE +/- 219.25, N = 3 597789 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: DLSC - Acceleration: CPU Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.06, N = 13 9.18 MIN: 8.24 / MAX: 11.84
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms Xeon Platinum 8380 2P 8 16 24 32 40 SE +/- 0.04, N = 3 36.05 1. (CXX) g++ options: -O3 -pthread -lm
WebP2 Image Encode Encode Settings: Quality 95, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 95, Compression Effort 7 Xeon Platinum 8380 2P 50 100 150 200 250 SE +/- 0.05, N = 3 218.11 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
ONNX Runtime Model: yolov4 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: yolov4 - Device: OpenMP CPU Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 4.64, N = 5 476 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Timed LLVM Compilation Build System: Unix Makefiles OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Unix Makefiles Xeon Platinum 8380 2P 40 80 120 160 200 SE +/- 0.80, N = 3 191.65
OpenFOAM Input: Motorbike 60M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 60M Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.08, N = 3 103.86 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Blake-2 S Xeon Platinum 8380 2P 800K 1600K 2400K 3200K 4000K SE +/- 49135.02, N = 15 3581744 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: x25x OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: x25x Xeon Platinum 8380 2P 600 1200 1800 2400 3000 SE +/- 39.67, N = 15 2841.22 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Skeincoin Xeon Platinum 8380 2P 200K 400K 600K 800K 1000K SE +/- 12703.27, N = 15 803416 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Magi Xeon Platinum 8380 2P 800 1600 2400 3200 4000 SE +/- 69.20, N = 15 3764.88 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Myriad-Groestl Xeon Platinum 8380 2P 20K 40K 60K 80K 100K SE +/- 3690.26, N = 15 89832 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth Xeon Platinum 8380 2P 40M 80M 120M 160M 200M SE +/- 2422180.09, N = 3 171766875
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Deepcoin Xeon Platinum 8380 2P 16K 32K 48K 64K 80K SE +/- 2338.19, N = 13 75288 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 12.0 Build System: Ninja Xeon Platinum 8380 2P 30 60 90 120 150 SE +/- 0.08, N = 3 127.37
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: Garlicoin Xeon Platinum 8380 2P 9K 18K 27K 36K 45K SE +/- 691.85, N = 12 40542 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 3.15.5 Algorithm: LBC, LBRY Credits Xeon Platinum 8380 2P 80K 160K 240K 320K 400K SE +/- 37110.47, N = 12 356768 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
ONNX Runtime Model: fcn-resnet101-11 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: fcn-resnet101-11 - Device: OpenMP CPU Xeon Platinum 8380 2P 40 80 120 160 200 SE +/- 1.04, N = 3 199 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
ONNX Runtime Model: shufflenet-v2-10 - Device: OpenMP CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.6 Model: shufflenet-v2-10 - Device: OpenMP CPU Xeon Platinum 8380 2P 2K 4K 6K 8K 10K SE +/- 98.21, N = 3 8201 1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt
Appleseed Scene: Material Tester OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Material Tester Xeon Platinum 8380 2P 40 80 120 160 200 179.19
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 13 Total Time Xeon Platinum 8380 2P 40M 80M 120M 160M 200M SE +/- 1484193.33, N = 15 178977022 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
WebP2 Image Encode Encode Settings: Quality 75, Compression Effort 7 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 75, Compression Effort 7 Xeon Platinum 8380 2P 30 60 90 120 150 SE +/- 0.07, N = 3 117.63 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Barbershop - Compute: CPU-Only Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.09, N = 3 104.18
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.52, N = 3 104.31 1. (CXX) g++ options: -O2 -lOpenCL
ebizzy OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 Xeon Platinum 8380 2P 400K 800K 1200K 1600K 2000K SE +/- 29690.74, N = 15 2002029 1. (CC) gcc options: -pthread -lpthread -O3 -march=native
KeyDB OpenBenchmarking.org Ops/sec, More Is Better KeyDB 6.0.16 Xeon Platinum 8380 2P 120K 240K 360K 480K 600K SE +/- 6645.22, N = 4 542875.78 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 15.11 Time To Compile Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.41, N = 3 91.00
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Pabellon Barcelona - Compute: CPU-Only Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.07, N = 3 87.69
Appleseed Scene: Emily OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Emily Xeon Platinum 8380 2P 30 60 90 120 150 123.62
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5 Mode: CPU Xeon Platinum 8380 2P 14K 28K 42K 56K 70K SE +/- 427.71, N = 3 66647
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 150 300 450 600 750 SE +/- 1.36, N = 3 675.11 MIN: 648.46 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 150 300 450 600 750 SE +/- 1.50, N = 3 674.92 MIN: 647.95 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 150 300 450 600 750 SE +/- 3.62, N = 3 679.32 MIN: 650.53 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 1.60, N = 3 440.64 MIN: 422.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG19 - Device: CPU Xeon Platinum 8380 2P 8 16 24 32 40 SE +/- 0.15, N = 3 34.04
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 0.54, N = 3 439.16 MIN: 423.83 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
7-Zip Compression Compress Speed Test OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 16.02 Compress Speed Test Xeon Platinum 8380 2P 80K 160K 240K 320K 400K SE +/- 3382.48, N = 3 351890 1. (CXX) g++ options: -pipe -lpthread
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Classroom - Compute: CPU-Only Xeon Platinum 8380 2P 16 32 48 64 80 SE +/- 0.02, N = 3 71.19
Tungsten Renderer Scene: Volumetric Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Volumetric Caustic Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.46, N = 15 14.06 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
OpenVKL Benchmark: vklBenchmarkVdbVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkVdbVolume Xeon Platinum 8380 2P 6M 12M 18M 24M 30M SE +/- 48952.20, N = 3 29321974 MIN: 1065722 / MAX: 177593256
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.36, N = 3 39.39
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Orange Juice - Acceleration: CPU Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.16, N = 3 14.37 MIN: 11.84 / MAX: 18.68
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Xeon Platinum 8380 2P 140K 280K 420K 560K 700K SE +/- 3600.01, N = 3 664404
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Xeon Platinum 8380 2P 120K 240K 360K 480K 600K SE +/- 995.84, N = 3 568468
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Xeon Platinum 8380 2P 10K 20K 30K 40K 50K SE +/- 108.68, N = 3 47633.4
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Xeon Platinum 8380 2P 16K 32K 48K 64K 80K SE +/- 259.50, N = 3 73580.0
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Xeon Platinum 8380 2P 7K 14K 21K 28K 35K SE +/- 283.60, N = 3 34437.6
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Xeon Platinum 8380 2P 7K 14K 21K 28K 35K SE +/- 168.32, N = 3 32738.8
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: MD5 Xeon Platinum 8380 2P 2M 4M 6M 8M 10M SE +/- 7688.38, N = 3 10252333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Facebook RocksDB Test: Random Fill OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill Xeon Platinum 8380 2P 130K 260K 390K 520K 650K SE +/- 312.21, N = 3 589877 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Facebook RocksDB Test: Random Fill Sync OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Fill Sync Xeon Platinum 8380 2P 100K 200K 300K 400K 500K SE +/- 705.45, N = 3 484574 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Facebook RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Read While Writing Xeon Platinum 8380 2P 2M 4M 6M 8M 10M SE +/- 68643.06, N = 3 8685478 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
Facebook RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better Facebook RocksDB 6.3.6 Test: Random Read Xeon Platinum 8380 2P 80M 160M 240M 320M 400M SE +/- 3927743.74, N = 3 367939009 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Sharpen Xeon Platinum 8380 2P 150 300 450 600 750 SE +/- 0.33, N = 3 672 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Rotate Xeon Platinum 8380 2P 200 400 600 800 1000 SE +/- 6.00, N = 3 780 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Enhanced Xeon Platinum 8380 2P 200 400 600 800 1000 1127 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Noise-Gaussian Xeon Platinum 8380 2P 160 320 480 640 800 SE +/- 2.91, N = 3 731 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: HWB Color Space Xeon Platinum 8380 2P 200 400 600 800 1000 SE +/- 0.67, N = 3 1048 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.33 Operation: Swirl Xeon Platinum 8380 2P 500 1000 1500 2000 2500 SE +/- 7.33, N = 3 2170 1. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.13 Time To Compile Xeon Platinum 8380 2P 13 26 39 52 65 SE +/- 0.31, N = 3 57.33
KTX-Software toktx Settings: UASTC 4 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 4 + Zstd Compression 19 Xeon Platinum 8380 2P 13 26 39 52 65 SE +/- 0.55, N = 3 56.56
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.5 Scene: Rainbow Colors and Prism - Acceleration: CPU Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.45, N = 12 12.60 MIN: 9.89 / MAX: 16.35
Rust Mandelbrot Time To Complete Serial/Parallel Mandelbrot OpenBenchmarking.org Seconds, Fewer Is Better Rust Mandelbrot Time To Complete Serial/Parallel Mandelbrot Xeon Platinum 8380 2P 11 22 33 44 55 SE +/- 0.11, N = 3 48.74 1. (CC) gcc options: -m64 -pie -nodefaultlibs -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 3.2.3 Time To Compile Xeon Platinum 8380 2P 11 22 33 44 55 SE +/- 0.15, N = 3 48.34
OpenVKL Benchmark: vklBenchmarkStructuredVolume OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 0.9 Benchmark: vklBenchmarkStructuredVolume Xeon Platinum 8380 2P 20M 40M 60M 80M 100M SE +/- 536027.69, N = 3 105233677 MIN: 1381357 / MAX: 914006052
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: Fishy Cat - Compute: CPU-Only Xeon Platinum 8380 2P 10 20 30 40 50 SE +/- 0.15, N = 3 44.46
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte Xeon Platinum 8380 2P 10 20 30 40 50 SE +/- 0.43, N = 3 45.21 1. (CXX) g++ options: -O2 -lOpenCL
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding Xeon Platinum 8380 2P 5K 10K 15K 20K 25K SE +/- 322.73, N = 15 25496.0 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding Xeon Platinum 8380 2P 4K 8K 12K 16K 20K SE +/- 149.85, N = 15 18680.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D Xeon Platinum 8380 2P 2K 4K 6K 8K 10K SE +/- 73.52, N = 8 8886.39 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
ASKAP Test: tConvolve MPI - Gridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding Xeon Platinum 8380 2P 4K 8K 12K 16K 20K SE +/- 0.00, N = 3 20825.1 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MPI - Degridding OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding Xeon Platinum 8380 2P 4K 8K 12K 16K 20K SE +/- 41.87, N = 3 18180.0 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Timed GDB GNU Debugger Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed GDB GNU Debugger Compilation 10.2 Time To Compile Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.09, N = 3 39.94
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Monero - Hash Count: 1M Xeon Platinum 8380 2P 6K 12K 18K 24K 30K SE +/- 165.49, N = 3 26540.4 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K Xeon Platinum 8380 2P 1.0773 2.1546 3.2319 4.3092 5.3865 SE +/- 0.015, N = 3 4.788 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.21, N = 3 38.97 1. (CXX) g++ options: -O2 -lOpenCL
Appleseed Scene: Disney Material OpenBenchmarking.org Seconds, Fewer Is Better Appleseed 2.0 Beta Scene: Disney Material Xeon Platinum 8380 2P 13 26 39 52 65 57.67
Timed Wasmer Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Wasmer Compilation 1.0.2 Time To Compile Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.17, N = 3 37.46 1. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lgcc_s -lutil -lrt -lpthread -lm -lc
Timed Linux Kernel Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 5.10.20 Time To Compile Xeon Platinum 8380 2P 5 10 15 20 25 SE +/- 0.22, N = 5 21.51
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 7.4.2 Time To Compile Xeon Platinum 8380 2P 8 16 24 32 40 SE +/- 0.25, N = 3 35.40
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Xeon Platinum 8380 2P 0.0614 0.1228 0.1842 0.2456 0.307 SE +/- 0.00009, N = 3 0.27293
Coremark CoreMark Size 666 - Iterations Per Second OpenBenchmarking.org Iterations/Sec, More Is Better Coremark 1.0 CoreMark Size 666 - Iterations Per Second Xeon Platinum 8380 2P 500K 1000K 1500K 2000K 2500K SE +/- 3368.39, N = 3 2366961.04 1. (CC) gcc options: -O2 -lrt" -lrt
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.031, N = 3 9.038 1. (CXX) g++ options: -O3 -pthread
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 1.9.0-jumbo-1 Test: Blowfish Xeon Platinum 8380 2P 30K 60K 90K 120K 150K SE +/- 649.53, N = 3 118496 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2
Aircrack-ng OpenBenchmarking.org k/s, More Is Better Aircrack-ng 1.5.2 Xeon Platinum 8380 2P 50K 100K 150K 200K 250K SE +/- 51.66, N = 3 210909.56 1. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -fcommon -rdynamic -lpthread -lz -lcrypto -lhwloc -ldl -lm -pthread
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6, Lossless Xeon Platinum 8380 2P 7 14 21 28 35 SE +/- 0.16, N = 3 29.96 1. (CXX) g++ options: -O3 -fPIC -lm
Tungsten Renderer Scene: Water Caustic OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Water Caustic Xeon Platinum 8380 2P 5 10 15 20 25 SE +/- 0.02, N = 3 20.32 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 2.92 Blend File: BMW27 - Compute: CPU-Only Xeon Platinum 8380 2P 7 14 21 28 35 SE +/- 0.14, N = 3 28.36
Basis Universal Settings: ETC1S OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: ETC1S Xeon Platinum 8380 2P 6 12 18 24 30 SE +/- 0.09, N = 3 26.48 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.12.1 Variant: Wownero - Hash Count: 1M Xeon Platinum 8380 2P 9K 18K 27K 36K 45K SE +/- 98.02, N = 3 42661.3 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OpenFOAM Input: Motorbike 30M OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 8 Input: Motorbike 30M Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.01, N = 3 14.34 1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -ldecompose -lgenericPatchFields -lmetisDecomp -lscotchDecomp -llagrangian -lregionModels -lOpenFOAM -ldl -lm
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Obj Xeon Platinum 8380 2P 16 32 48 64 80 SE +/- 0.18, N = 3 72.92 MIN: 62.87 / MAX: 82.16
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 Xeon Platinum 8380 2P 400M 800M 1200M 1600M 2000M SE +/- 4482364.60, N = 3 2078582667 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org Seconds, Fewer Is Better ASTC Encoder 2.4 Preset: Exhaustive Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.02, N = 3 16.63 1. (CXX) g++ options: -O3 -flto -pthread
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Obj Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.25, N = 3 89.18 MIN: 70.6 / MAX: 98.78
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RTLightmap.hdr.4096x4096 Xeon Platinum 8380 2P 0.324 0.648 0.972 1.296 1.62 SE +/- 0.00, N = 3 1.44
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K Xeon Platinum 8380 2P 7 14 21 28 35 SE +/- 0.04, N = 3 28.60 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.00642, N = 3 6.97288 MIN: 6.48 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.7343 1.4686 2.2029 2.9372 3.6715 SE +/- 0.00402, N = 3 3.26354 MIN: 3.1 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.0812 0.1624 0.2436 0.3248 0.406 SE +/- 0.001970, N = 3 0.360963 MIN: 0.32 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Ultra Fast Xeon Platinum 8380 2P 11 22 33 44 55 SE +/- 0.48, N = 5 48.04 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Liquid-DSP Threads: 160 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 160 - Buffer Length: 256 - Filter Length: 57 Xeon Platinum 8380 2P 700M 1400M 2100M 2800M 3500M SE +/- 2057776.58, N = 3 3089933333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenSSL RSA 4096-bit Performance OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance Xeon Platinum 8380 2P 4K 8K 12K 16K 20K SE +/- 35.03, N = 3 17870.9 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Liquid-DSP Threads: 128 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 128 - Buffer Length: 256 - Filter Length: 57 Xeon Platinum 8380 2P 700M 1400M 2100M 2800M 3500M SE +/- 1609347.69, N = 3 3289000000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 64 - Buffer Length: 256 - Filter Length: 57 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 2021.01.31 Threads: 64 - Buffer Length: 256 - Filter Length: 57 Xeon Platinum 8380 2P 700M 1400M 2100M 2800M 3500M SE +/- 19173708.85, N = 3 3062166667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Timed Apache Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Apache Compilation 2.4.41 Time To Compile Xeon Platinum 8380 2P 5 10 15 20 25 SE +/- 0.03, N = 3 19.78
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Xeon Platinum 8380 2P 40M 80M 120M 160M 200M SE +/- 1163183.44, N = 3 176213733 1. (CXX) g++ options: -O3 -fopenmp
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile Xeon Platinum 8380 2P 5 10 15 20 25 SE +/- 0.12, N = 3 18.59
TTSIOD 3D Renderer Phong Rendering With Soft-Shadow Mapping OpenBenchmarking.org FPS, More Is Better TTSIOD 3D Renderer 2.3b Phong Rendering With Soft-Shadow Mapping Xeon Platinum 8380 2P 300 600 900 1200 1500 SE +/- 10.87, N = 3 1363.68 1. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++
SVT-HEVC Tuning: 1 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.23, N = 3 38.27 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Xeon Platinum 8380 2P 300 600 900 1200 1500 SE +/- 17.18, N = 3 1220.00 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C Xeon Platinum 8380 2P 20K 40K 60K 80K 100K SE +/- 167.38, N = 3 92298.34 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 4.4 Time To Compile Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.02, N = 3 16.25
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 0.8.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K Xeon Platinum 8380 2P 13 26 39 52 65 SE +/- 0.25, N = 3 58.19 1. (CXX) g++ options: -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie
Basis Universal Settings: UASTC Level 3 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 3 Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.03, N = 3 15.56 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.0 Video Input: Bosphorus 4K - Video Preset: Very Fast Xeon Platinum 8380 2P 9 18 27 36 45 SE +/- 0.06, N = 3 38.93 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C Xeon Platinum 8380 2P 40K 80K 120K 160K 200K SE +/- 72.10, N = 3 198263.35 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.03, N = 3 14.40 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 0.2082 0.4164 0.6246 0.8328 1.041 SE +/- 0.000773, N = 3 0.925533 MIN: 0.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.672 1.344 2.016 2.688 3.36 SE +/- 0.00109, N = 3 2.98665 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.2857 0.5714 0.8571 1.1428 1.4285 SE +/- 0.00859, N = 3 1.26992 MIN: 0.93 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Tachyon Total Time OpenBenchmarking.org Seconds, Fewer Is Better Tachyon 0.99b6 Total Time Xeon Platinum 8380 2P 4 8 12 16 20 SE +/- 0.06, N = 3 13.64 1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small Xeon Platinum 8380 2P 6K 12K 18K 24K 30K SE +/- 17.94, N = 3 28477.1 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 5.30, N = 14 470.79 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
dav1d Video Input: Chimera 1080p 10-bit OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Chimera 1080p 10-bit Xeon Platinum 8380 2P 200 400 600 800 1000 SE +/- 2.94, N = 3 865.87 MIN: 538.6 / MAX: 1163 1. (CC) gcc options: -pthread -lm
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D Xeon Platinum 8380 2P 700 1400 2100 2800 3500 SE +/- 6.59, N = 3 3030.35 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
LULESH OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 Xeon Platinum 8380 2P 8K 16K 24K 32K 40K SE +/- 58.62, N = 3 35415.17 1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
KTX-Software toktx Settings: UASTC 3 + Zstd Compression 19 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 + Zstd Compression 19 Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.095, N = 4 9.133
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 0.0565 0.113 0.1695 0.226 0.2825 SE +/- 0.000372, N = 3 0.251017 MIN: 0.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.1363 0.2726 0.4089 0.5452 0.6815 SE +/- 0.004005, N = 3 0.605773 MIN: 0.56 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.0513 0.1026 0.1539 0.2052 0.2565 SE +/- 0.001707, N = 3 0.228136 MIN: 0.2 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Timed ImageMagick Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed ImageMagick Compilation 6.9.0 Time To Compile Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.09, N = 3 12.13
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 6 Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.02, N = 3 12.03 1. (CXX) g++ options: -O3 -fPIC -lm
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.01, N = 3 11.01 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C Xeon Platinum 8380 2P 40K 80K 120K 160K 200K SE +/- 112.54, N = 3 188559.27 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
POV-Ray Trace Time OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray 3.7.0.7 Trace Time Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.066, N = 3 9.312 1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lSM -lICE -lX11 -lIlmImf -lIlmImf-2_5 -lImath-2_5 -lHalf-2_5 -lIex-2_5 -lIexMath-2_5 -lIlmThread-2_5 -lIlmThread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
Basis Universal Settings: UASTC Level 2 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 2 Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.01, N = 3 11.39 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.02, N = 3 10.99 1. (CC) gcc options: -lm -lpthread -O3
m-queens Time To Solve OpenBenchmarking.org Seconds, Fewer Is Better m-queens 1.2 Time To Solve Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.04, N = 3 11.31 1. (CXX) g++ options: -fopenmp -O2 -march=native
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.hdr_alb_nrm.3840x2160 Xeon Platinum 8380 2P 0.6728 1.3456 2.0184 2.6912 3.364 SE +/- 0.00, N = 3 2.99
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 1.4.0 Run: RT.ldr_alb_nrm.3840x2160 Xeon Platinum 8380 2P 0.6705 1.341 2.0115 2.682 3.3525 SE +/- 0.01, N = 3 2.98
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Crown Xeon Platinum 8380 2P 14 28 42 56 70 SE +/- 0.27, N = 3 64.49 MIN: 58.63 / MAX: 78.79
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics Xeon Platinum 8380 2P 3 6 9 12 15 SE +/- 0.07, N = 3 9.95 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 0.3071 0.6142 0.9213 1.2284 1.5355 SE +/- 0.00058, N = 3 1.36484 MIN: 1.33 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Crown Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.10, N = 3 74.84 MIN: 65.53 / MAX: 94.42
oneDNN Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.4073 0.8146 1.2219 1.6292 2.0365 SE +/- 0.00286, N = 3 1.81027 MIN: 1.68 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.0981 0.1962 0.2943 0.3924 0.4905 SE +/- 0.001481, N = 3 0.435839 MIN: 0.4 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer - Model: Asian Dragon Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.83, N = 3 83.05 MIN: 68.45 / MAX: 92.49
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate Xeon Platinum 8380 2P 7 14 21 28 35 SE +/- 0.03, N = 3 28.08 1. (CC) gcc options: -O3 -march=native -fopenmp
toyBrot Fractal Generator Implementation: C++ Tasks OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Tasks Xeon Platinum 8380 2P 2K 4K 6K 8K 10K SE +/- 41.26, N = 3 7936 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
Basis Universal Settings: UASTC Level 0 OpenBenchmarking.org Seconds, Fewer Is Better Basis Universal 1.13 Settings: UASTC Level 0 Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.004, N = 3 7.628 1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.105, N = 3 7.631 1. (CXX) g++ options: -O2 -lOpenCL
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein Xeon Platinum 8380 2P 7 14 21 28 35 SE +/- 0.39, N = 15 31.75 1. (CXX) g++ options: -O3 -pthread -lm
toyBrot Fractal Generator Implementation: OpenMP OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: OpenMP Xeon Platinum 8380 2P 1600 3200 4800 6400 8000 SE +/- 14.19, N = 3 7278 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10, Lossless Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.008, N = 3 7.295 1. (CXX) g++ options: -O3 -fPIC -lm
toyBrot Fractal Generator Implementation: C++ Threads OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: C++ Threads Xeon Platinum 8380 2P 1500 3000 4500 6000 7500 SE +/- 10.17, N = 3 7050 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
dav1d Video Input: Summer Nature 4K OpenBenchmarking.org FPS, More Is Better dav1d 0.9.0 Video Input: Summer Nature 4K Xeon Platinum 8380 2P 120 240 360 480 600 SE +/- 1.43, N = 3 534.79 MIN: 189.77 / MAX: 591.13 1. (CC) gcc options: -pthread -lm
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 3.13 Binary: Pathtracer ISPC - Model: Asian Dragon Xeon Platinum 8380 2P 20 40 60 80 100 SE +/- 0.61, N = 3 107.87 MIN: 88.89 / MAX: 113.17
toyBrot Fractal Generator Implementation: TBB OpenBenchmarking.org ms, Fewer Is Better toyBrot Fractal Generator 2020-11-18 Implementation: TBB Xeon Platinum 8380 2P 1500 3000 4500 6000 7500 SE +/- 85.54, N = 3 6803 1. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc
WebP2 Image Encode Encode Settings: Quality 100, Compression Effort 5 OpenBenchmarking.org Seconds, Fewer Is Better WebP2 Image Encode 20210126 Encode Settings: Quality 100, Compression Effort 5 Xeon Platinum 8380 2P 2 4 6 8 10 SE +/- 0.012, N = 3 6.242 1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lwebp -lwebpdemux
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 0.3149 0.6298 0.9447 1.2596 1.5745 SE +/- 0.00288, N = 3 1.39960 MIN: 1.23 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.2054 0.4108 0.6162 0.8216 1.027 SE +/- 0.001400, N = 3 0.912920 MIN: 0.85 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.4709 0.9418 1.4127 1.8836 2.3545 SE +/- 0.00197, N = 3 2.09290 MIN: 2.03 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Tungsten Renderer Scene: Hair OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Hair Xeon Platinum 8380 2P 1.2877 2.5754 3.8631 5.1508 6.4385 SE +/- 0.05174, N = 3 5.72330 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig Xeon Platinum 8380 2P 1.0533 2.1066 3.1599 4.2132 5.2665 SE +/- 0.012558, N = 3 4.681398 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C Xeon Platinum 8380 2P 20K 40K 60K 80K 100K SE +/- 75.32, N = 3 101071.35 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver Xeon Platinum 8380 2P 1.0688 2.1376 3.2064 4.2752 5.344 SE +/- 0.004, N = 3 4.750 1. (CXX) g++ options: -O2 -lOpenCL
SVT-HEVC Tuning: 7 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p Xeon Platinum 8380 2P 70 140 210 280 350 SE +/- 3.22, N = 5 314.07 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
KTX-Software toktx Settings: UASTC 3 OpenBenchmarking.org Seconds, Fewer Is Better KTX-Software toktx 4.0 Settings: UASTC 3 Xeon Platinum 8380 2P 1.0501 2.1002 3.1503 4.2004 5.2505 SE +/- 0.024, N = 3 4.667
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C Xeon Platinum 8380 2P 9K 18K 27K 36K 45K SE +/- 77.27, N = 3 40407.42 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
libavif avifenc Encoder Speed: 10 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 0.9.0 Encoder Speed: 10 Xeon Platinum 8380 2P 0.9758 1.9516 2.9274 3.9032 4.879 SE +/- 0.029, N = 3 4.337 1. (CXX) g++ options: -O3 -fPIC -lm
rays1bench Large Scene OpenBenchmarking.org mrays/s, More Is Better rays1bench 2020-01-09 Large Scene Xeon Platinum 8380 2P 80 160 240 320 400 SE +/- 1.67, N = 3 346.01
Primesieve 1e12 Prime Number Generation OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 7.4 1e12 Prime Number Generation Xeon Platinum 8380 2P 0.8231 1.6462 2.4693 3.2924 4.1155 SE +/- 0.027, N = 3 3.658 1. (CXX) g++ options: -O3 -lpthread
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B Xeon Platinum 8380 2P 30K 60K 90K 120K 150K SE +/- 508.08, N = 3 123085.11 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p Xeon Platinum 8380 2P 80 160 240 320 400 SE +/- 3.38, N = 3 372.57 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Xeon Platinum 8380 2P 0.5804 1.1608 1.7412 2.3216 2.902 SE +/- 0.00661669, N = 3 2.57972169 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p Xeon Platinum 8380 2P 100 200 300 400 500 SE +/- 1.17, N = 3 471.75 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU Xeon Platinum 8380 2P 0.1889 0.3778 0.5667 0.7556 0.9445 SE +/- 0.002277, N = 3 0.839584 MIN: 0.8 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU Xeon Platinum 8380 2P 0.8025 1.605 2.4075 3.21 4.0125 SE +/- 0.00463, N = 3 3.56680 MIN: 3.49 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU Xeon Platinum 8380 2P 0.043 0.086 0.129 0.172 0.215 SE +/- 0.001883, N = 3 0.191254 MIN: 0.18 1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Tungsten Renderer Scene: Non-Exponential OpenBenchmarking.org Seconds, Fewer Is Better Tungsten Renderer 0.2.2 Scene: Non-Exponential Xeon Platinum 8380 2P 0.5708 1.1416 1.7124 2.2832 2.854 SE +/- 0.00643, N = 3 2.53696 1. (CXX) g++ options: -std=c++0x -march=core2 -msse2 -msse3 -mssse3 -mno-sse4.1 -mno-sse4.2 -mno-sse4a -mno-avx -mno-fma -mno-bmi2 -mno-avx2 -mno-xop -mno-fma4 -mno-avx512f -mno-avx512vl -mno-avx512pf -mno-avx512er -mno-avx512cd -mno-avx512dq -mno-avx512bw -mno-avx512ifma -mno-avx512vbmi -fstrict-aliasing -O3 -rdynamic -lIlmImf -lIlmThread -lImath -lHalf -lIex -lz -ljpeg -lpthread -ldl
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C Xeon Platinum 8380 2P 2K 4K 6K 8K 10K SE +/- 95.81, N = 4 7792.57 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C Xeon Platinum 8380 2P 30K 60K 90K 120K 150K SE +/- 743.51, N = 3 120157.87 1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
SVT-HEVC Tuning: 10 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p Xeon Platinum 8380 2P 130 260 390 520 650 SE +/- 8.17, N = 3 588.85 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
Phoronix Test Suite v10.8.4