4 x Intel Xeon Platinum 8160 testing with a HPE ProLiant DL560 Gen10 (U34 BIOS) and llvmpipe on CentOS 7.6.1810 via the Phoronix Test Suite.
1TB-Run1 Processor: 4 x Intel Xeon Platinum 8160 (96 Cores / 192 Threads), Motherboard: HPE ProLiant DL560 Gen10 (U34 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 32 x GB DDR4-2666MT/s HPE, Disk: 429GB P824i-p MR Gen10 + 107GB P824i-p MR Gen10 + 1100GB P824i-p MR Gen10 + 36757GB P824i-p MR Gen10, Graphics: llvmpipe, Monitor: Smart Cable, Network: 4 x Intel I350 + 2 x Intel X710 for 10GbE SFP+ + 2 x Intel 10G X550T
OS: CentOS 7.6.1810, Kernel: 3.10.0-957.21.3.el7.x86_64 (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, OpenGL: 2.1 Mesa 18.0.5 (LLVM 6.0 256 bits), Compiler: GCC 4.8.5 20150623, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=genericProcessor Notes: CPU Microcode: 0x2006e05Java Notes: OpenJDK Runtime Environment (build 1.8.0_222-b10)Python Notes: Python 2.7.5 + Python 3.6.8Security Notes: l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of Load fences __user pointer sanitization + spectre_v2: Mitigation of IBRS (kernel) IBPB
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 1TB-Run1 70 140 210 280 350 SE +/- 0.80, N = 6 334.01 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 1TB-Run1 3 6 9 12 15 SE +/- 0.03, N = 6 13.36 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 1TB-Run1 90 180 270 360 450 SE +/- 0.59, N = 6 408.42 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 1TB-Run1 4 8 12 16 20 SE +/- 0.02, N = 6 16.34 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
CloverLeaf CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 1TB-Run1 13 26 39 52 65 SE +/- 0.83, N = 100 59.40 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms 1TB-Run1 0.0735 0.147 0.2205 0.294 0.3675 SE +/- 0.00162, N = 25 0.32648
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 1TB-Run1 400M 800M 1200M 1600M 2000M SE +/- 833796.71, N = 20 1915113800 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -m64 -lmpi
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d 1TB-Run1 90 180 270 360 450 SE +/- 25.18, N = 18 436.61 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -m64 -lmpi_usempi -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 1TB-Run1 0.7295 1.459 2.1885 2.918 3.6475 SE +/- 0.03923848, N = 30 3.24225036 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -m64 -lmpi_usempi -lmpi_mpifh -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 1TB-Run1 4 8 12 16 20 SE +/- 0.08, N = 30 14.57 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -m64 -lmpi_usempi -lmpi_mpifh -lmpi
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK 1TB-Run1 40K 80K 120K 160K 200K SE +/- 957.18, N = 100 206487 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish 1TB-Run1 20K 40K 60K 80K 100K SE +/- 289.73, N = 20 110611 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 1TB-Run1 16M 32M 48M 64M 80M SE +/- 392784.50, N = 100 76133780 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lrt -lz -ldl -lcrypt -lbz2
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 1TB-Run1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 42604.32, N = 100 6146050 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lrt -lz -ldl -lcrypt -lbz2
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 4 - Input: Bosphorus 4K 1TB-Run1 0.5441 1.0882 1.6323 2.1764 2.7205 SE +/- 0.008, N = 20 2.418 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 8 - Input: Bosphorus 4K 1TB-Run1 10 20 30 40 50 SE +/- 0.18, N = 35 41.84 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 12 - Input: Bosphorus 4K 1TB-Run1 20 40 60 80 100 SE +/- 0.46, N = 20 94.10 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 13 - Input: Bosphorus 4K 1TB-Run1 20 40 60 80 100 SE +/- 0.34, N = 100 101.53 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 4 - Input: Bosphorus 1080p 1TB-Run1 2 4 6 8 10 SE +/- 0.020, N = 20 6.130 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 8 - Input: Bosphorus 1080p 1TB-Run1 20 40 60 80 100 SE +/- 0.25, N = 20 96.28 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 12 - Input: Bosphorus 1080p 1TB-Run1 70 140 210 280 350 SE +/- 1.04, N = 20 312.88 1. (CXX) g++ options: -march=native -mno-avx
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.4 Encoder Mode: Preset 13 - Input: Bosphorus 1080p 1TB-Run1 70 140 210 280 350 SE +/- 0.85, N = 20 337.64 1. (CXX) g++ options: -march=native -mno-avx
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K 1TB-Run1 3 6 9 12 15 SE +/- 0.02, N = 20 9.22 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K 1TB-Run1 20 40 60 80 100 SE +/- 0.33, N = 20 92.45 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K 1TB-Run1 20 40 60 80 100 SE +/- 0.48, N = 100 110.72 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p 1TB-Run1 7 14 21 28 35 SE +/- 0.07, N = 20 30.38 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p 1TB-Run1 50 100 150 200 250 SE +/- 0.38, N = 20 215.31 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p 1TB-Run1 80 160 240 320 400 SE +/- 0.99, N = 20 360.29 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
oneDNN This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU 1TB-Run1 0.4433 0.8866 1.3299 1.7732 2.2165 SE +/- 0.00379, N = 20 1.97024 MIN: 1.46 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU 1TB-Run1 7 14 21 28 35 SE +/- 0.14, N = 100 31.19 MIN: 12.07 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 0.9587 1.9174 2.8761 3.8348 4.7935 SE +/- 0.01688, N = 100 4.26108 MIN: 3.41 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 0.2819 0.5638 0.8457 1.1276 1.4095 SE +/- 0.01966, N = 100 1.25277 MIN: 0.93 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 0.9019 1.8038 2.7057 3.6076 4.5095 SE +/- 0.00608, N = 20 4.00835 MIN: 3.61 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 0.526 1.052 1.578 2.104 2.63 SE +/- 0.00635, N = 100 2.33777 MIN: 2.04 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU 1TB-Run1 0.8609 1.7218 2.5827 3.4436 4.3045 SE +/- 0.01989, N = 100 3.82614 MIN: 3.28 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU 1TB-Run1 5 10 15 20 25 SE +/- 0.09, N = 100 18.72 MIN: 14.06 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU 1TB-Run1 0.2659 0.5318 0.7977 1.0636 1.3295 SE +/- 0.00289, N = 20 1.18163 MIN: 1.09 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 0.5143 1.0286 1.5429 2.0572 2.5715 SE +/- 0.03967, N = 100 2.28579 MIN: 1.47 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 0.2373 0.4746 0.7119 0.9492 1.1865 SE +/- 0.016991, N = 100 1.054541 MIN: 0.71 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 0.1505 0.301 0.4515 0.602 0.7525 SE +/- 0.001146, N = 20 0.668686 MIN: 0.62 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU 1TB-Run1 300 600 900 1200 1500 SE +/- 6.04, N = 20 1183.02 MIN: 1073.9 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU 1TB-Run1 130 260 390 520 650 SE +/- 1.43, N = 20 597.36 MIN: 558.68 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 300 600 900 1200 1500 SE +/- 6.02, N = 20 1199.89 MIN: 1105.11 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 0.5448 1.0896 1.6344 2.1792 2.724 SE +/- 0.00373, N = 100 2.42126 MIN: 2.16 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 1.0928 2.1856 3.2784 4.3712 5.464 SE +/- 0.02263, N = 28 4.85699 MIN: 4.44 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 0.9441 1.8882 2.8323 3.7764 4.7205 SE +/- 0.00586, N = 100 4.19586 MIN: 4.03 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 130 260 390 520 650 SE +/- 0.87, N = 100 597.54 MIN: 555.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU 1TB-Run1 0.113 0.226 0.339 0.452 0.565 SE +/- 0.001016, N = 20 0.502251 MIN: 0.36 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 300 600 900 1200 1500 SE +/- 5.07, N = 34 1187.99 MIN: 1074.42 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 130 260 390 520 650 SE +/- 0.92, N = 100 596.69 MIN: 555.79 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU 1TB-Run1 2 4 6 8 10 SE +/- 0.01507, N = 100 6.78058 MIN: 5.8 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.0 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU 1TB-Run1 2 4 6 8 10 SE +/- 0.02799, N = 20 6.02812 MIN: 5.32 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
Stress-NG OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Hash 1TB-Run1 3M 6M 9M 12M 15M SE +/- 72150.37, N = 80 12668067.86 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: MMAP 1TB-Run1 2K 4K 6K 8K 10K SE +/- 310.71, N = 100 10864.33 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: NUMA 1TB-Run1 13 26 39 52 65 SE +/- 1.31, N = 80 58.51 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Poll 1TB-Run1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 1356.27, N = 20 5837581.44 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Zlib 1TB-Run1 900 1800 2700 3600 4500 SE +/- 79.90, N = 100 4415.01 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Futex 1TB-Run1 50 100 150 200 250 SE +/- 76.38, N = 80 248.70 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
Test: MEMFD
1TB-Run1: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Mutex 1TB-Run1 200K 400K 600K 800K 1000K SE +/- 74916.71, N = 100 933053.65 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Atomic 1TB-Run1 15 30 45 60 75 SE +/- 0.55, N = 100 66.04 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Crypto 1TB-Run1 20K 40K 60K 80K 100K SE +/- 26.26, N = 20 83163.04 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Malloc 1TB-Run1 20M 40M 60M 80M 100M SE +/- 74227.52, N = 20 91315767.44 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Forking 1TB-Run1 9K 18K 27K 36K 45K SE +/- 307.02, N = 100 40359.97 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Pthread 1TB-Run1 15K 30K 45K 60K 75K SE +/- 3937.61, N = 100 71318.00 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
Test: IO_uring
1TB-Run1: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: SENDFILE 1TB-Run1 300K 600K 900K 1200K 1500K SE +/- 15306.66, N = 100 1363842.77 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: CPU Cache 1TB-Run1 8 16 24 32 40 SE +/- 7.41, N = 100 35.07 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: CPU Stress 1TB-Run1 30K 60K 90K 120K 150K SE +/- 30.91, N = 20 120051.27 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Semaphores 1TB-Run1 4M 8M 12M 16M 20M SE +/- 7878.32, N = 20 16591808.11 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Matrix Math 1TB-Run1 70K 140K 210K 280K 350K SE +/- 86.84, N = 20 332280.31 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Vector Math 1TB-Run1 40K 80K 120K 160K 200K SE +/- 2261.24, N = 80 166779.74 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Function Call 1TB-Run1 80K 160K 240K 320K 400K SE +/- 4589.77, N = 100 390289.59 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: x86_64 RdRand 1TB-Run1 200K 400K 600K 800K 1000K SE +/- 26362.36, N = 96 1051693.30 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Memory Copying 1TB-Run1 300 600 900 1200 1500 SE +/- 290.96, N = 80 1300.94 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Socket Activity 1TB-Run1 700 1400 2100 2800 3500 SE +/- 465.54, N = 100 3142.25 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Context Switching 1TB-Run1 6M 12M 18M 24M 30M SE +/- 879721.20, N = 100 29527790.45 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Glibc C String Functions 1TB-Run1 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 99455.84, N = 100 4959677.98 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: Glibc Qsort Data Sorting 1TB-Run1 200 400 600 800 1000 SE +/- 14.74, N = 100 807.08 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.15.04 Test: System V Message Passing 1TB-Run1 1000K 2000K 3000K 4000K 5000K SE +/- 268866.31, N = 100 4711689.93 1. (CC) gcc options: -std=gnu99 -O2 -lm -laio -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU-v2-v2 - Model: mobilenet-v2 1TB-Run1 8 16 24 32 40 SE +/- 2.77, N = 6 34.40 MIN: 28.61 / MAX: 4378.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU-v3-v3 - Model: mobilenet-v3 1TB-Run1 7 14 21 28 35 SE +/- 0.20, N = 6 30.50 MIN: 28.91 / MAX: 343.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: shufflenet-v2 1TB-Run1 8 16 24 32 40 SE +/- 0.19, N = 6 35.06 MIN: 33.54 / MAX: 86.85 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: mnasnet 1TB-Run1 7 14 21 28 35 SE +/- 0.14, N = 6 30.09 MIN: 27.93 / MAX: 182.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: efficientnet-b0 1TB-Run1 10 20 30 40 50 SE +/- 0.51, N = 6 42.49 MIN: 40.21 / MAX: 398.39 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: blazeface 1TB-Run1 4 8 12 16 20 SE +/- 0.10, N = 6 16.98 MIN: 16.28 / MAX: 49.9 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: googlenet 1TB-Run1 13 26 39 52 65 SE +/- 0.74, N = 6 56.37 MIN: 49.86 / MAX: 528.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: vgg16 1TB-Run1 30 60 90 120 150 SE +/- 30.15, N = 6 153.02 MIN: 41.73 / MAX: 2529 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: resnet18 1TB-Run1 7 14 21 28 35 SE +/- 1.33, N = 6 30.61 MIN: 23.87 / MAX: 483.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: alexnet 1TB-Run1 4 8 12 16 20 SE +/- 0.45, N = 6 17.23 MIN: 14.52 / MAX: 56.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: resnet50 1TB-Run1 20 40 60 80 100 SE +/- 8.44, N = 6 74.46 MIN: 50.79 / MAX: 8860.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: yolov4-tiny 1TB-Run1 13 26 39 52 65 SE +/- 2.91, N = 6 57.33 MIN: 48.4 / MAX: 3482.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: squeezenet_ssd 1TB-Run1 14 28 42 56 70 SE +/- 1.05, N = 6 63.48 MIN: 54.4 / MAX: 732.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: regnety_400m 1TB-Run1 40 80 120 160 200 SE +/- 5.81, N = 6 171.52 MIN: 156.86 / MAX: 4341.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: vision_transformer 1TB-Run1 50 100 150 200 250 SE +/- 10.15, N = 6 232.60 MIN: 191.01 / MAX: 4140.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20220729 Target: CPU - Model: FastestDet 1TB-Run1 9 18 27 36 45 SE +/- 0.32, N = 6 39.51 MIN: 37.77 / MAX: 261.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
PyBench This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times 1TB-Run1 300 600 900 1200 1500 SE +/- 1.36, N = 20 1573
PHPBench PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite 1TB-Run1 40K 80K 120K 160K 200K SE +/- 455.27, N = 6 196055
OpenCV This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: Core 1TB-Run1 50K 100K 150K 200K 250K SE +/- 3547.19, N = 18 221406 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
Test: Graph API
1TB-Run1: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: opencv: line 4: ./opencv_perf_gapi: No such file or directory
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: Stitching 1TB-Run1 70K 140K 210K 280K 350K SE +/- 2269.46, N = 18 334442 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: Features 2D 1TB-Run1 50K 100K 150K 200K 250K SE +/- 11211.58, N = 24 222884 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: Image Processing 1TB-Run1 80K 160K 240K 320K 400K SE +/- 2962.47, N = 18 370799 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: Object Detection 1TB-Run1 10K 20K 30K 40K 50K SE +/- 467.86, N = 6 45847 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.7 Test: DNN - Deep Neural Network 1TB-Run1 17K 34K 51K 68K 85K SE +/- 1179.12, N = 30 77064 1. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared
1TB-Run1 Processor: 4 x Intel Xeon Platinum 8160 (96 Cores / 192 Threads), Motherboard: HPE ProLiant DL560 Gen10 (U34 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 32 x GB DDR4-2666MT/s HPE, Disk: 429GB P824i-p MR Gen10 + 107GB P824i-p MR Gen10 + 1100GB P824i-p MR Gen10 + 36757GB P824i-p MR Gen10, Graphics: llvmpipe, Monitor: Smart Cable, Network: 4 x Intel I350 + 2 x Intel X710 for 10GbE SFP+ + 2 x Intel 10G X550T
OS: CentOS 7.6.1810, Kernel: 3.10.0-957.21.3.el7.x86_64 (x86_64), Desktop: GNOME Shell 3.28.3, Display Server: X Server 1.20.1, OpenGL: 2.1 Mesa 18.0.5 (LLVM 6.0 256 bits), Compiler: GCC 4.8.5 20150623, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: alwaysCompiler Notes: --build=x86_64-redhat-linux --disable-libgcj --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,objc,obj-c++,java,fortran,ada,go,lto --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-linker-hash-style=gnu --with-tune=genericProcessor Notes: CPU Microcode: 0x2006e05Java Notes: OpenJDK Runtime Environment (build 1.8.0_222-b10)Python Notes: Python 2.7.5 + Python 3.6.8Security Notes: l1tf: Mitigation of PTE Inversion + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of Load fences __user pointer sanitization + spectre_v2: Mitigation of IBRS (kernel) IBPB
Testing initiated at 24 March 2023 17:04 by user root.