Benchmarks by Michael Larabel for a future article.
ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 62GB, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 Changed Memory to 32GB .
Processor Change: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 70M 140M 210M 280M 350M 306921000 300334000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Scale: 26
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 70M 140M 210M 280M 350M 306039000 298742000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Scale: 26
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6000M 12000M 18000M 24000M 30000M SE +/- 18003849.28, N = 3 SE +/- 9989074.60, N = 3 26743666450 27349750547 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2000M 4000M 6000M 8000M 10000M SE +/- 2477623.36, N = 3 SE +/- 366393.93, N = 3 8602768667 8682965163 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20000M 40000M 60000M 80000M 100000M SE +/- 11744627.19, N = 3 SE +/- 16750913.82, N = 3 102705524193 103197333003 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40000M 80000M 120000M 160000M 200000M SE +/- 21084657.26, N = 3 SE +/- 274285510.34, N = 3 200634689310 201249399420 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40000M 80000M 120000M 160000M 200000M SE +/- 69554441.77, N = 3 SE +/- 66208636.68, N = 3 172018317083 172565385397 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16000M 32000M 48000M 64000M 80000M SE +/- 4988612.49, N = 3 SE +/- 13473201.11, N = 3 73016433687 73344138900 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80M 160M 240M 320M 400M SE +/- 131925.90, N = 3 SE +/- 94090.92, N = 3 356193300 371467767 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
nekRS nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: Kershaw 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 800M 1600M 2400M 3200M 4000M SE +/- 40610118.61, N = 3 SE +/- 32494022.19, N = 3 3438190000 3731113333 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenBenchmarking.org flops/rank, More Is Better nekRS 23.0 Input: TurboPipe Periodic 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1200M 2400M 3600M 4800M 6000M SE +/- 5179579.56, N = 3 SE +/- 1808731.91, N = 3 5056683333 5418963333 1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.2893 2.5786 3.8679 5.1572 6.4465 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 5.55 5.73 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 20.84 20.89 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 3.99, N = 3 SE +/- 1.01, N = 3 1343.46 1349.65 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Machine Translation EN To DE FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.20, N = 3 SE +/- 0.11, N = 3 100.27 102.78 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 500 1000 1500 2000 2500 SE +/- 2.54, N = 3 SE +/- 1.80, N = 3 2137.18 2147.01 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Person Vehicle Bike Detection FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 0.52, N = 3 SE +/- 1.87, N = 3 1215.67 1222.82 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org FPS, More Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3K 6K 9K 12K 15K SE +/- 6.65, N = 3 SE +/- 12.89, N = 3 16143.58 16261.19 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Crown 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.06, N = 3 SE +/- 0.13, N = 3 24.64 24.77 MIN: 24.27 / MAX: 25.49 MIN: 24.28 / MAX: 25.63
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 25.85 26.28 MIN: 25.63 / MAX: 26.37 MIN: 26.02 / MAX: 26.93
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.1 Binary: Pathtracer ISPC - Model: Asian Dragon Obj 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 22.01 22.32 MIN: 21.81 / MAX: 22.51 MIN: 22.14 / MAX: 22.84
Kvazaar This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 16.88 16.97 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Medium 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16 32 48 64 80 SE +/- 0.22, N = 3 SE +/- 0.18, N = 3 73.39 74.10 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 40.25 40.56 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12 24 36 48 60 SE +/- 0.11, N = 3 SE +/- 0.04, N = 3 50.95 51.36 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15 30 45 60 75 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 65.46 66.17 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Very Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.24, N = 3 SE +/- 0.21, N = 3 142.14 145.30 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Super Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 0.24, N = 3 190.35 195.23 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 0.40, N = 3 SE +/- 0.18, N = 3 253.28 259.00 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.1918 2.3836 3.5754 4.7672 5.959 SE +/- 0.009, N = 3 SE +/- 0.005, N = 3 5.281 5.297 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16 32 48 64 80 SE +/- 0.33, N = 3 SE +/- 0.37, N = 3 68.87 70.79 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.30, N = 3 SE +/- 0.64, N = 3 150.81 157.91 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.09, N = 3 SE +/- 0.63, N = 3 151.42 158.00 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 4 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 16.05 16.07 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.27, N = 3 SE +/- 0.69, N = 3 154.28 158.10 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 140 280 420 560 700 SE +/- 0.59, N = 3 SE +/- 2.53, N = 3 589.18 626.85 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 150 300 450 600 750 SE +/- 1.16, N = 3 SE +/- 0.59, N = 3 656.34 705.36 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-VP9 This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 1.25, N = 12 SE +/- 1.51, N = 12 86.57 93.80 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 2.31, N = 12 SE +/- 2.81, N = 12 326.47 351.03 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.07, N = 3 94.39 101.61 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 0.40, N = 3 SE +/- 0.47, N = 3 337.13 361.55 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 83.19 88.36 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 70 140 210 280 350 SE +/- 0.12, N = 3 SE +/- 0.60, N = 3 292.08 308.17 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
uvg266 uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3 6 9 12 15 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 12.04 12.12
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Medium 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 13 26 39 52 65 SE +/- 0.16, N = 3 SE +/- 0.04, N = 3 55.38 55.69
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 34.90 35.09
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 36.81 37.15
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 43.09 43.45
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Very Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 132.15 134.22
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Super Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.11, N = 3 SE +/- 0.03, N = 3 145.07 147.74
OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 1080p - Video Preset: Ultra Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.14, N = 3 SE +/- 0.50, N = 3 170.56 173.43
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.035, N = 3 SE +/- 0.044, N = 3 7.192 7.267 1. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 13.87 14.11 1. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Fast 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.11, N = 3 21.86 22.19 1. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 1080p - Video Preset: Faster 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10 20 30 40 50 SE +/- 0.30, N = 3 SE +/- 0.07, N = 3 41.54 42.58 1. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto
x265 This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7 14 21 28 35 SE +/- 0.09, N = 3 SE +/- 0.19, N = 3 27.63 28.30 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.23, N = 3 SE +/- 0.26, N = 3 104.83 107.94 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
AOM AV1 This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google as the AV1 Codec Library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.0833 0.1666 0.2499 0.3332 0.4165 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.37 0.37 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 10.61 10.76 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.36, N = 3 SE +/- 1.33, N = 3 95.27 99.91 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 23.14 23.92 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.89, N = 3 89.37 94.78 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.10, N = 3 101.21 106.17 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.37, N = 3 SE +/- 0.33, N = 3 101.80 108.41 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 11 Realtime - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.82, N = 3 SE +/- 0.37, N = 3 102.26 108.08 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.2588 0.5176 0.7764 1.0352 1.294 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.13 1.15 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 24.08 24.34 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 1.67, N = 3 SE +/- 1.37, N = 3 219.55 236.28 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.47, N = 3 SE +/- 0.18, N = 3 72.89 74.81 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 1.31, N = 3 SE +/- 2.30, N = 15 237.25 246.40 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 2.06, N = 15 SE +/- 2.29, N = 8 249.75 265.33 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 10 Realtime - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 5.83, N = 12 SE +/- 6.66, N = 12 237.64 255.73 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
OpenBenchmarking.org Frames Per Second, More Is Better AOM AV1 3.7 Encoder Mode: Speed 11 Realtime - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 1.11, N = 3 SE +/- 4.46, N = 15 253.21 263.76 1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm
SVT-HEVC This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.044 2.088 3.132 4.176 5.22 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.59 4.64 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.13, N = 3 SE +/- 0.04, N = 3 78.82 81.31 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.43, N = 3 SE +/- 0.20, N = 3 130.05 139.71 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 1 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.01, N = 3 18.46 18.57 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 1.26, N = 3 SE +/- 0.21, N = 3 244.68 251.89 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 100 200 300 400 500 SE +/- 0.88, N = 3 SE +/- 0.74, N = 3 444.01 468.51 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
VP9 libvpx Encoding This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.13 Speed: Speed 0 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 10.51 10.59 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.13 Speed: Speed 5 - Input: Bosphorus 4K 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.16, N = 3 SE +/- 0.19, N = 3 22.56 22.38 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.13 Speed: Speed 0 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 20.86 21.15 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
OpenBenchmarking.org Frames Per Second, More Is Better VP9 libvpx Encoding 1.13 Speed: Speed 5 - Input: Bosphorus 1080p 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10 20 30 40 50 SE +/- 0.26, N = 3 SE +/- 0.21, N = 3 41.88 42.16 1. (CXX) g++ options: -m64 -lm -lpthread -O3 -fPIC -U_FORTIFY_SOURCE -std=gnu++11
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.00510, N = 3 SE +/- 0.00392, N = 3 6.88840 7.17053 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 98.7 107.3 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 0.12, N = 3 SE +/- 0.06, N = 3 194.2 214.6 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.0 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.0743 0.1486 0.2229 0.2972 0.3715 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.32 0.33
OpenVKL OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 1.3.1 Benchmark: vklBenchmark ISPC 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 0.33, N = 3 SE +/- 0.67, N = 3 273 278 MIN: 41 / MAX: 2850 MIN: 41 / MAX: 3013
OSPRay Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3388 2.6776 4.0164 5.3552 6.694 SE +/- 0.01432, N = 3 SE +/- 0.00172, N = 3 5.93670 5.95035
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3397 2.6794 4.0191 5.3588 6.6985 SE +/- 0.00292, N = 3 SE +/- 0.00103, N = 3 5.94200 5.95429
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.05, N = 3 SE +/- 0.11, N = 3 190.48 193.20
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3433 2.6866 4.0299 5.3732 6.7165 SE +/- 0.00287, N = 3 SE +/- 0.01708, N = 3 5.93196 5.97011
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3038 2.6076 3.9114 5.2152 6.519 SE +/- 0.00168, N = 3 SE +/- 0.02945, N = 3 5.76809 5.79485
OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.00514, N = 3 SE +/- 0.00551, N = 3 6.95787 6.95533
Neural Magic DeepSparse OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 17.18 17.48
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 170 340 510 680 850 SE +/- 1.44, N = 3 SE +/- 1.17, N = 3 763.42 771.41
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 0.55, N = 3 85.85 88.66
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 500 1000 1500 2000 2500 SE +/- 2.95, N = 3 SE +/- 10.69, N = 3 2469.40 2487.34
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 0.25, N = 3 SE +/- 0.06, N = 3 218.16 219.73
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.12, N = 3 SE +/- 0.51, N = 3 97.27 98.55
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7 14 21 28 35 SE +/- 0.06, N = 3 SE +/- 0.05, N = 3 29.63 30.55
OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 0.22, N = 3 SE +/- 0.51, N = 3 369.45 368.98
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.8978 1.7956 2.6934 3.5912 4.489 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 3.98 3.99 MIN: 3.88 / MAX: 4.18 MIN: 3.89 / MAX: 4.22
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.7313 1.4626 2.1939 2.9252 3.6565 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 3.23 3.25 MIN: 1.19 / MAX: 3.74 MIN: 1.27 / MAX: 3.74
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 6.22 6.28 MIN: 5.69 / MAX: 6.45 MIN: 5.74 / MAX: 6.51
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.828 1.656 2.484 3.312 4.14 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 3.63 3.68 MIN: 1.48 / MAX: 4.15 MIN: 1.53 / MAX: 4.19
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.12, N = 8 SE +/- 0.07, N = 3 14.72 15.09 MIN: 12.97 / MAX: 15.15 MIN: 13.77 / MAX: 15.33
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.75, N = 3 SE +/- 0.79, N = 3 163.40 163.01 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.86, N = 3 SE +/- 0.24, N = 3 177.59 177.32 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14K 28K 42K 56K 70K SE +/- 379.98, N = 3 SE +/- 205.98, N = 3 59099.10 63378.31 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14K 28K 42K 56K 70K SE +/- 298.85, N = 3 SE +/- 448.66, N = 3 59325.80 63208.95 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12K 24K 36K 48K 60K SE +/- 262.98, N = 3 SE +/- 219.37, N = 3 51519.76 55747.60 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 13K 26K 39K 52K 65K SE +/- 138.78, N = 3 SE +/- 237.56, N = 3 55012.86 59447.54 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Floating Point 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12K 24K 36K 48K 60K SE +/- 69.76, N = 3 SE +/- 409.48, N = 3 51297.84 55783.21 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Floating Point 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14K 28K 42K 56K 70K SE +/- 346.66, N = 3 SE +/- 299.63, N = 3 57979.94 63287.00 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Floating Point 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 13K 26K 39K 52K 65K SE +/- 253.76, N = 3 SE +/- 123.36, N = 3 58489.12 62841.89 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Floating Point 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12K 24K 36K 48K 60K SE +/- 314.84, N = 3 SE +/- 40.10, N = 3 50907.57 54408.85 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Floating Point 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 13K 26K 39K 52K 65K SE +/- 242.87, N = 3 SE +/- 313.52, N = 3 54343.27 58976.40 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 9.06, N = 5 SE +/- 12.47, N = 5 32995.4 35875.4 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 90.97, N = 5 SE +/- 60.43, N = 5 36597.2 39645.1 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 66.38, N = 5 SE +/- 91.08, N = 5 36575.9 39523.5 1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
OpenBenchmarking.org MB/s, More Is Better Tinymembench 2018-05-28 Standard Memset 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6K 12K 18K 24K 30K SE +/- 0.77, N = 3 SE +/- 0.07, N = 3 28761.1 28761.5 1. (CC) gcc options: -O2 -lm
Zstd Compression This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 50 100 150 200 250 SE +/- 0.43, N = 3 SE +/- 0.87, N = 3 SE +/- 1.04, N = 3 210.2 224.3 224.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 500 1000 1500 2000 2500 SE +/- 24.53, N = 3 SE +/- 11.18, N = 3 SE +/- 14.71, N = 3 2484.0 2494.8 2496.7 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 5 10 15 20 25 SE +/- 0.09, N = 3 SE +/- 0.15, N = 3 SE +/- 0.09, N = 3 21.0 21.3 21.4 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 500 1000 1500 2000 2500 SE +/- 16.82, N = 3 SE +/- 22.19, N = 3 SE +/- 27.30, N = 3 2145.3 2133.3 2126.5 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 3 6 9 12 15 SE +/- 0.13, N = 4 SE +/- 0.14, N = 15 SE +/- 0.13, N = 15 11.4 11.8 11.9 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 400 800 1200 1600 2000 SE +/- 14.13, N = 4 SE +/- 4.90, N = 15 SE +/- 6.45, N = 15 2014.0 2036.4 2013.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200 400 600 800 1000 SE +/- 3.13, N = 3 SE +/- 2.61, N = 3 757.8 834.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 500 1000 1500 2000 2500 SE +/- 6.57, N = 3 SE +/- 20.46, N = 3 2383.9 2402.2 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200 400 600 800 1000 SE +/- 0.96, N = 3 SE +/- 0.35, N = 3 731.6 777.8 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 500 1000 1500 2000 2500 SE +/- 14.21, N = 3 SE +/- 5.44, N = 3 2401.6 2409.0 1. (CC) gcc options: -O3 -pthread -lz -llzma
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: Downlink Processor Benchmark 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200 400 600 800 1000 SE +/- 10.47, N = 15 SE +/- 7.52, N = 3 967.1 1092.5 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Total 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 900 1800 2700 3600 4500 SE +/- 12.46, N = 3 SE +/- 6.44, N = 3 4276.9 4409.4 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenBenchmarking.org Mbps, More Is Better srsRAN Project 23.5 Test: PUSCH Processor Benchmark, Throughput Thread 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 70 140 210 280 350 SE +/- 1.02, N = 3 SE +/- 1.96, N = 3 295.4 326.7 1. (CXX) g++ options: -march=native -mfma -O3 -fno-trapping-math -fno-math-errno -lgtest
QuantLib QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1000 2000 3000 4000 5000 SE +/- 43.54, N = 3 SE +/- 44.23, N = 3 4519.0 4655.3 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 1024 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 190.75, N = 3 SE +/- 193.50, N = 5 17331.50 18081.17 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 10.23, N = 3 SE +/- 116.99, N = 3 16985.11 18238.19 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 193.18, N = 3 SE +/- 206.95, N = 3 17212.47 17939.14 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 1024 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3K 6K 9K 12K 15K SE +/- 126.62, N = 8 SE +/- 69.58, N = 3 14368.87 15457.76 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3K 6K 9K 12K 15K SE +/- 161.65, N = 5 SE +/- 161.08, N = 15 15014.79 15412.81 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3K 6K 9K 12K 15K SE +/- 40.44, N = 3 SE +/- 140.85, N = 15 14181.10 16074.85 1. (CC) gcc options: -O3 -march=native
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 30K 60K 90K 120K 150K SE +/- 90.39, N = 3 SE +/- 317.71, N = 3 SE +/- 147.67, N = 3 130974 140348 141067 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 30K 60K 90K 120K 150K SE +/- 51.33, N = 3 SE +/- 441.76, N = 3 SE +/- 114.33, N = 3 135066 134884 136265 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
WebP Image Encode This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.24, N = 3 16.12 16.94 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.4973 0.9946 1.4919 1.9892 2.4865 SE +/- 0.02, N = 3 SE +/- 0.02, N = 4 2.21 2.18 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.188 2.376 3.564 4.752 5.94 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 5.25 5.28 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1935 0.387 0.5805 0.774 0.9675 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.86 0.83 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
ASTC Encoder ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.06, N = 3 SE +/- 0.16, N = 3 106.88 108.38 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 13.28 13.46 1. (CXX) g++ options: -O3 -flto -pthread
OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.3173 0.6346 0.9519 1.2692 1.5865 SE +/- 0.0019, N = 3 SE +/- 0.0017, N = 3 1.4013 1.4101 1. (CXX) g++ options: -O3 -flto -pthread
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.4757 0.9514 1.4271 1.9028 2.3785 SE +/- 0.003, N = 3 SE +/- 0.002, N = 3 2.018 2.114 1. (CXX) g++ options: -O3
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Read 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20M 40M 60M 80M 100M SE +/- 268039.95, N = 3 SE +/- 66438.66, N = 3 106190957 106616973 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 916.62, N = 3 SE +/- 6762.79, N = 3 775176 778312 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read While Writing 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 700K 1400K 2100K 2800K 3500K SE +/- 20061.25, N = 3 SE +/- 40225.96, N = 3 3293392 3352699 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 600K 1200K 1800K 2400K 3000K SE +/- 7025.36, N = 3 SE +/- 2942.07, N = 3 2698470 2708007 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Apache Hadoop This is a benchmark of the Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 20 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 20093.59, N = 15 SE +/- 13218.18, N = 15 829540 829370
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Open - Threads: 50 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 13980.14, N = 15 SE +/- 16677.13, N = 15 775104 836383
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 20 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16K 32K 48K 64K 80K SE +/- 491.37, N = 3 SE +/- 243.37, N = 3 75327 73585
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Create - Threads: 50 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16K 32K 48K 64K 80K SE +/- 359.76, N = 3 SE +/- 1047.23, N = 3 75041 72987
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 20 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 427.25, N = 3 SE +/- 1352.95, N = 5 121658 124066
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Delete - Threads: 50 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 1769.73, N = 3 SE +/- 1183.36, N = 15 125677 125098
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 20 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 750.99, N = 13 SE +/- 1394.69, N = 12 103260 97687
OpenBenchmarking.org Ops per sec, More Is Better Apache Hadoop 3.3.6 Operation: Rename - Threads: 50 - Files: 100000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 894.50, N = 15 SE +/- 920.50, N = 15 100776 100530
CockroachDB CockroachDB is a cloud-native, distributed SQL database for data intensive applications. This test profile uses a server-less CockroachDB configuration to test various Coackroach workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 128 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 822.13, N = 4 SE +/- 787.39, N = 3 77441.3 78166.3
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 50% Reads - Concurrency: 256 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16K 32K 48K 64K 80K SE +/- 653.39, N = 3 SE +/- 41.56, N = 3 73610.5 75257.1
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 128 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 990.54, N = 3 SE +/- 697.87, N = 3 100636.8 100798.3
OpenBenchmarking.org ops/s, More Is Better CockroachDB 22.2 Workload: KV, 95% Reads - Concurrency: 256 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 1195.50, N = 3 SE +/- 106.43, N = 3 95740.5 96705.8
Dragonflydb Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:10 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 32669.01, N = 3 SE +/- 6658.86, N = 3 5140102.15 5240237.86 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 20 - Set To Get Ratio: 1:10 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 16951.27, N = 3 SE +/- 19573.23, N = 3 5878075.04 5996163.91 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 50 - Set To Get Ratio: 1:10 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 11796.50, N = 3 SE +/- 14897.87, N = 3 5944828.03 6037214.72 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 10 - Set To Get Ratio: 1:100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.1M 2.2M 3.3M 4.4M 5.5M SE +/- 31698.82, N = 3 SE +/- 36507.60, N = 3 5253368.41 5291751.64 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 20 - Set To Get Ratio: 1:100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 27895.51, N = 3 SE +/- 8256.25, N = 3 5948955.30 5953842.77 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Dragonflydb 1.6.2 Clients Per Thread: 50 - Set To Get Ratio: 1:100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 6885.71, N = 3 SE +/- 9307.21, N = 3 6008502.76 6048110.00 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Apache IoTDB Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14M 28M 42M 56M 70M SE +/- 372112.96, N = 3 SE +/- 290322.39, N = 3 58695851 66418128
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14M 28M 42M 56M 70M SE +/- 287862.26, N = 3 SE +/- 588025.58, N = 3 58893494 65321333
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15M 30M 45M 60M 75M SE +/- 269298.08, N = 3 SE +/- 543458.77, N = 3 56171045 71717191
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16M 32M 48M 64M 80M SE +/- 153626.87, N = 3 SE +/- 433159.86, N = 3 53933029 73038986
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15M 30M 45M 60M 75M SE +/- 178797.77, N = 3 SE +/- 264558.26, N = 3 54794795 70828684
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15M 30M 45M 60M 75M SE +/- 53876.17, N = 3 SE +/- 607082.75, N = 3 55743848 72311746
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14M 28M 42M 56M 70M SE +/- 101658.25, N = 3 SE +/- 205546.22, N = 3 50504136 67043792
OpenBenchmarking.org point/sec, More Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15M 30M 45M 60M 75M SE +/- 145136.93, N = 3 SE +/- 293988.26, N = 3 48756719 68878145
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_read_write - Threads: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10K 20K 30K 40K 50K SE +/- 476.60, N = 5 SE +/- 119.99, N = 3 48524 48859
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_read_write - Threads: 64 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12K 24K 36K 48K 60K SE +/- 623.44, N = 3 SE +/- 156.68, N = 3 54459 55430
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 16 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16K 32K 48K 64K 80K SE +/- 182.25, N = 3 SE +/- 353.80, N = 3 74159 74528
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 317.87, N = 3 SE +/- 376.68, N = 3 94745 95108
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_point_select - Threads: 64 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 548.75, N = 3 SE +/- 487.30, N = 3 112896 114305
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 16 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3K 6K 9K 12K 15K SE +/- 51.10, N = 3 SE +/- 86.86, N = 3 14370 14600
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 224.65, N = 3 SE +/- 32.57, N = 3 18910 19178
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_index - Threads: 64 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5K 10K 15K 20K 25K SE +/- 207.09, N = 3 SE +/- 80.80, N = 3 22535 23008
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 16 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5K 10K 15K 20K 25K SE +/- 19.25, N = 3 SE +/- 68.95, N = 3 21705 21964
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7K 14K 21K 28K 35K SE +/- 103.92, N = 3 SE +/- 16.94, N = 3 30502 30023
OpenBenchmarking.org Queries Per Second, More Is Better TiDB Community Server 7.3 Test: oltp_update_non_index - Threads: 64 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 164.06, N = 3 SE +/- 123.60, N = 3 36798 37171
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 181.00, N = 3 SE +/- 268.82, N = 3 130443 131759 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7K 14K 21K 28K 35K SE +/- 18.77, N = 3 SE +/- 12.60, N = 3 34271 34600 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30M 60M 90M 120M 150M SE +/- 120701.19, N = 3 SE +/- 207062.26, N = 3 159204333 160593667 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 800K 1600K 2400K 3200K 4000K SE +/- 9683.89, N = 3 SE +/- 7310.57, N = 3 3667667 3698333 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt
Stargate Digital Audio Workstation Stargate is an open-source, cross-platform digital audio workstation (DAW) software package with "a unique and carefully curated experience" with scalability from old systems up through modern multi-core systems. Stargate is GPLv3 licensed and makes use of Qt5 (PyQt5) for its user-interface. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.2971 2.5942 3.8913 5.1884 6.4855 SE +/- 0.010960, N = 3 SE +/- 0.026044, N = 3 5.765053 5.728028 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 96000 - Buffer Size: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.9645 1.929 2.8935 3.858 4.8225 SE +/- 0.014209, N = 3 SE +/- 0.014676, N = 3 4.286524 4.240463 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.6458 1.2916 1.9374 2.5832 3.229 SE +/- 0.010228, N = 3 SE +/- 0.006661, N = 3 2.859208 2.870041 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 44100 - Buffer Size: 1024 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.011092, N = 3 SE +/- 0.011002, N = 3 6.025177 5.959696 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 480000 - Buffer Size: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.2741 2.5482 3.8223 5.0964 6.3705 SE +/- 0.015747, N = 3 SE +/- 0.020388, N = 3 5.662499 5.633758 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 96000 - Buffer Size: 1024 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.0256 2.0512 3.0768 4.1024 5.128 SE +/- 0.007957, N = 3 SE +/- 0.018890, N = 3 4.558103 4.526957 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 192000 - Buffer Size: 1024 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.6903 1.3806 2.0709 2.7612 3.4515 SE +/- 0.009717, N = 3 SE +/- 0.014543, N = 3 3.067990 3.050610 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
OpenBenchmarking.org Render Ratio, More Is Better Stargate Digital Audio Workstation 22.11.5 Sample Rate: 480000 - Buffer Size: 1024 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.3477 2.6954 4.0431 5.3908 6.7385 SE +/- 0.018480, N = 3 SE +/- 0.029038, N = 3 5.989850 5.962076 1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 136.48, N = 3 SE +/- 200.78, N = 3 117530.07 117324.45 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 200 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 78.59, N = 3 SE +/- 57.25, N = 3 118959.48 119443.54 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 47.99, N = 3 SE +/- 207.03, N = 3 118724.12 119149.86 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 69.20, N = 3 SE +/- 166.77, N = 3 108972.50 111611.86 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Apache HTTP Server This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 325.89, N = 3 SE +/- 179.74, N = 3 119302.97 119019.29 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 200 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 363.84, N = 3 SE +/- 441.46, N = 3 138039.75 139396.07 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 500 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 366.85, N = 3 SE +/- 376.52, N = 3 134394.82 134868.33 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better Apache HTTP Server 2.4.56 Concurrent Requests: 1000 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30K 60K 90K 120K 150K SE +/- 539.86, N = 3 SE +/- 581.22, N = 3 132624.82 133361.68 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12M 24M 36M 48M 60M SE +/- 45456.94, N = 3 SE +/- 95095.27, N = 3 54007000 52314667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16M 32M 48M 64M 80M SE +/- 201681.54, N = 3 SE +/- 507267.52, N = 3 76486333 76795000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20M 40M 60M 80M 100M SE +/- 92074.85, N = 3 SE +/- 185592.15, N = 3 103963333 105016667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30M 60M 90M 120M 150M SE +/- 1060990.10, N = 3 SE +/- 1637162.18, N = 5 151160000 153460000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40M 80M 120M 160M 200M SE +/- 612726.15, N = 3 SE +/- 491878.49, N = 3 205550000 206353333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60M 120M 180M 240M 300M SE +/- 384230.83, N = 3 SE +/- 838298.54, N = 3 277850000 279836667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 90M 180M 270M 360M 450M SE +/- 72341.78, N = 3 SE +/- 2131267.23, N = 3 403840000 404770000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 110M 220M 330M 440M 550M SE +/- 2524786.02, N = 3 SE +/- 6904733.48, N = 3 524076667 518873333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 1 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4M 8M 12M 16M 20M SE +/- 229910.85, N = 13 SE +/- 209513.87, N = 4 17666846 18032250 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 160M 320M 480M 640M 800M SE +/- 430851.87, N = 3 SE +/- 2142342.95, N = 3 763460000 766330000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200M 400M 600M 800M 1000M SE +/- 1765524.79, N = 3 SE +/- 1017289.43, N = 3 871723333 872803333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 2 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8M 16M 24M 32M 40M SE +/- 143251.53, N = 3 SE +/- 249107.96, N = 3 36886000 37017667 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 24 - Buffer Length: 256 - Filter Length: 32 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200M 400M 600M 800M 1000M SE +/- 650640.71, N = 3 SE +/- 2107130.75, N = 3 1091000000 1097400000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 24 - Buffer Length: 256 - Filter Length: 57 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200M 400M 600M 800M 1000M SE +/- 5294126.72, N = 3 SE +/- 3143246.73, N = 3 1092133333 1096800000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 4 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16M 32M 48M 64M 80M SE +/- 126786.35, N = 3 SE +/- 204671.28, N = 3 72197667 73452000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30M 60M 90M 120M 150M SE +/- 973333.33, N = 3 SE +/- 381881.31, N = 3 140806667 143190000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50M 100M 150M 200M 250M SE +/- 535630.47, N = 3 SE +/- 177982.52, N = 3 241300000 241973333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 24 - Buffer Length: 256 - Filter Length: 512 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 70M 140M 210M 280M 350M SE +/- 1171556.42, N = 3 SE +/- 653435.37, N = 3 303403333 305273333 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2K 4K 6K 8K 10K SE +/- 25.36, N = 3 SE +/- 17.28, N = 3 11571.4 11649.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 30M 60M 90M 120M 150M 121811000 121821000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Scale: 26
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 20M 40M 60M 80M 100M 92609900 91888500 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Scale: 26
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node phoronix-1U4LW-C252 exited on signal 9 (Killed).
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 52.40, N = 3 SE +/- 154.80, N = 3 36519.19 39380.09 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2K 4K 6K 8K 10K SE +/- 97.48, N = 5 SE +/- 31.61, N = 3 9658.84 9826.00 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 500 1000 1500 2000 2500 SE +/- 25.96, N = 3 SE +/- 32.06, N = 3 2321.38 2299.56 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5K 10K 15K 20K 25K SE +/- 52.90, N = 3 SE +/- 52.81, N = 3 20904.65 22527.51 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 6.25, N = 3 SE +/- 2.45, N = 3 1157.60 1282.49 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5K 10K 15K 20K 25K SE +/- 15.37, N = 3 SE +/- 10.42, N = 3 20726.61 22578.55 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 33.41, N = 3 SE +/- 17.60, N = 3 17145.14 18539.44 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 1651.14, N = 3 SE +/- 1635.92, N = 3 981009 968980 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 2781.84, N = 3 SE +/- 8224.15, N = 3 914050 915964 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 10883.31, N = 3 SE +/- 4891.02, N = 3 850511 885803 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Write 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10K 20K 30K 40K 50K SE +/- 354.32, N = 12 SE +/- 511.63, N = 3 45091 45146 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 8K 16K 24K 32K 40K SE +/- 228.04, N = 3 SE +/- 197.73, N = 3 36847 37012 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 1000 - Clients: 500 - Mode: Read Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 140K 280K 420K 560K 700K SE +/- 2269.07, N = 3 SE +/- 4638.52, N = 3 629915 660143 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200K 400K 600K 800K 1000K SE +/- 686.26, N = 3 SE +/- 7583.57, N = 3 924876 946108 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7K 14K 21K 28K 35K SE +/- 176.96, N = 3 SE +/- 466.86, N = 3 32363 34758 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60K 120K 180K 240K 300K SE +/- 36.23, N = 3 SE +/- 38.91, N = 3 292797.7 294250.7 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Apache IoTDB Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.56, N = 3 SE +/- 0.56, N = 3 78.02 67.97 MAX: 12610.39 MAX: 12602.49
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 3.01, N = 3 SE +/- 3.42, N = 3 270.50 243.35 MAX: 26705.76 MAX: 27508.98
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 1.07, N = 3 SE +/- 1.16, N = 3 132.06 102.90 MAX: 12635.71 MAX: 12639.99
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 100 200 300 400 500 SE +/- 2.38, N = 3 SE +/- 4.70, N = 3 453.03 342.45 MAX: 28182.85 MAX: 26708.4
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.32, N = 3 SE +/- 0.15, N = 3 87.10 66.56 MAX: 24898.05 MAX: 23926.91
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 70 140 210 280 350 SE +/- 0.85, N = 3 SE +/- 5.34, N = 3 329.03 249.87 MAX: 27136.22 MAX: 26714.72
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 100 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.41, N = 3 SE +/- 0.15, N = 3 153.38 114.26 MAX: 23885.16 MAX: 23920.85
OpenBenchmarking.org Average Latency, Fewer Is Better Apache IoTDB 1.2 Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 130 260 390 520 650 SE +/- 6.05, N = 3 SE +/- 2.85, N = 3 608.41 426.96 MAX: 27433.35 MAX: 28418.81
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.2354 0.4708 0.7062 0.9416 1.177 SE +/- 0.00250, N = 3 SE +/- 0.00030, N = 3 1.04621 1.03689
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Only - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1161 0.2322 0.3483 0.4644 0.5805 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.510 0.516 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1969 0.3938 0.5907 0.7876 0.9845 SE +/- 0.003, N = 3 SE +/- 0.008, N = 3 0.875 0.873 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.2646 0.5292 0.7938 1.0584 1.323 SE +/- 0.015, N = 3 SE +/- 0.006, N = 3 1.176 1.129 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 500 - Mode: Read Write - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 3 6 9 12 15 SE +/- 0.09, N = 12 SE +/- 0.12, N = 3 11.10 11.08 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5 10 15 20 25 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 21.71 21.62 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 1000 - Clients: 500 - Mode: Read Only - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1784 0.3568 0.5352 0.7136 0.892 SE +/- 0.003, N = 3 SE +/- 0.005, N = 3 0.793 0.757 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lm -licui18n -licuuc -licudata
OSPRay Studio Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1100 2200 3300 4400 5500 SE +/- 6.11, N = 3 SE +/- 0.67, N = 3 5095 5059 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1100 2200 3300 4400 5500 SE +/- 2.08, N = 3 SE +/- 6.17, N = 3 5189 5158 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1300 2600 3900 5200 6500 SE +/- 3.38, N = 3 SE +/- 7.54, N = 3 6079 6052 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 70.93, N = 3 SE +/- 248.56, N = 3 84622 84431 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40K 80K 120K 160K 200K SE +/- 243.26, N = 3 SE +/- 206.17, N = 3 166093 165182 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 49.08, N = 3 SE +/- 107.79, N = 3 86126 85850 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40K 80K 120K 160K 200K SE +/- 158.45, N = 3 SE +/- 139.48, N = 3 168418 167725 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20K 40K 60K 80K 100K SE +/- 172.60, N = 3 SE +/- 52.54, N = 3 100656 100228 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40K 80K 120K 160K 200K SE +/- 481.75, N = 3 SE +/- 621.12, N = 3 197819 197114 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 1.45, N = 3 SE +/- 1.15, N = 3 1277 1272 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 0.58, N = 3 SE +/- 1.00, N = 3 1302 1294 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 300 600 900 1200 1500 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 1526 1525 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 18.93, N = 3 SE +/- 39.18, N = 3 20423 20355 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9K 18K 27K 36K 45K SE +/- 17.46, N = 3 SE +/- 28.20, N = 3 40919 40788 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4K 8K 12K 16K 20K SE +/- 13.86, N = 3 SE +/- 32.42, N = 3 20774 20655 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9K 18K 27K 36K 45K SE +/- 17.70, N = 3 SE +/- 21.39, N = 3 41387 41366 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 5K 10K 15K 20K 25K SE +/- 24.83, N = 3 SE +/- 16.64, N = 3 24418 24373 1. (CXX) g++ options: -O3 -lm -ldl
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.11 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 10K 20K 30K 40K 50K SE +/- 59.05, N = 3 SE +/- 53.61, N = 3 48775 48638 1. (CXX) g++ options: -O3 -lm -ldl
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1946 0.3892 0.5838 0.7784 0.973 SE +/- 0.001, N = 3 SE +/- 0.007, N = 3 0.865 0.846 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 15 Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.13, N = 3 SE +/- 0.31, N = 3 24.72 23.03 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenVINO This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Detection FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200 400 600 800 1000 SE +/- 4.39, N = 3 SE +/- 2.77, N = 3 1076.71 1042.34 MIN: 692.07 / MAX: 1325.49 MIN: 618 / MAX: 1227.84 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Face Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 60 120 180 240 300 SE +/- 0.22, N = 3 SE +/- 0.25, N = 3 287.14 286.62 MIN: 272.04 / MAX: 297.56 MIN: 265.78 / MAX: 295.68 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Vehicle Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.0013 2.0026 3.0039 4.0052 5.0065 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 4.45 4.42 MIN: 2.53 / MAX: 13.88 MIN: 2.54 / MAX: 12.36 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Machine Translation EN To DE FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 13 26 39 52 65 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 59.76 58.31 MIN: 30.86 / MAX: 72.83 MIN: 44.08 / MAX: 73.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Weld Porosity Detection FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.2555 2.511 3.7665 5.022 6.2775 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 5.58 5.57 MIN: 2.9 / MAX: 12.97 MIN: 2.85 / MAX: 13.77 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Person Vehicle Bike Detection FP16 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.107 2.214 3.321 4.428 5.535 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 4.92 4.89 MIN: 3.39 / MAX: 8.99 MIN: 3.33 / MAX: 12.31 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2022.3 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.1508 0.3016 0.4524 0.6032 0.754 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 0.67 0.67 MIN: 0.33 / MAX: 5.35 MIN: 0.32 / MAX: 8.02 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared
Neural Magic DeepSparse OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 0.29, N = 3 SE +/- 0.26, N = 3 348.19 342.74
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.0149, N = 3 SE +/- 0.0118, N = 3 7.8452 7.7638
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16 32 48 64 80 SE +/- 0.07, N = 3 SE +/- 0.40, N = 3 69.86 67.63
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.5447 1.0894 1.6341 2.1788 2.7235 SE +/- 0.0029, N = 3 SE +/- 0.0103, N = 3 2.4211 2.4036
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 27.48 27.29
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 14 28 42 56 70 SE +/- 0.07, N = 3 SE +/- 0.32, N = 3 61.63 60.85
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.38, N = 3 SE +/- 0.33, N = 3 202.50 196.37
OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.5 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 16.22 16.25
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1100 2200 3300 4400 5500 5322.8 5155.9 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 18.03 16.65 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 6 12 18 24 30 26.67 26.01 25.80 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 ECC DRAM 50 100 150 200 250 210.07 193.01 192.30 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 7 14 21 28 35 SE +/- 0.29, N = 3 SE +/- 0.33, N = 3 31.87 32.22 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9 18 27 36 45 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 40.50 40.33 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.8928 1.7856 2.6784 3.5712 4.464 SE +/- 0.023, N = 3 SE +/- 0.022, N = 3 3.968 3.913 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 2 4 6 8 10 SE +/- 0.036, N = 3 SE +/- 0.002, N = 3 6.541 6.411 1. (CXX) g++ options: -O3 -fPIC -lm
OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 0.8744 1.7488 2.6232 3.4976 4.372 SE +/- 0.032, N = 3 SE +/- 0.050, N = 3 3.886 3.836 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 12 24 36 48 60 SE +/- 0.39, N = 3 SE +/- 0.34, N = 3 51.15 50.44
OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Unix Makefiles 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 90 180 270 360 450 SE +/- 0.73, N = 3 SE +/- 0.23, N = 3 419.97 405.10
Timed Mesa Compilation This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 6 12 18 24 30 SE +/- 0.11, N = 3 SE +/- 0.11, N = 3 24.12 23.38
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 70 140 210 280 350 SE +/- 0.32, N = 3 SE +/- 0.28, N = 3 323.95 314.34
Apache CouchDB This is a bulk insertion benchmark of Apache CouchDB. CouchDB is a document-oriented NoSQL database implemented in Erlang. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 100 - Inserts: 1000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 16 32 48 64 80 SE +/- 0.16, N = 3 SE +/- 0.24, N = 3 71.62 71.12 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 100 - Inserts: 3000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 0.55, N = 3 SE +/- 0.99, N = 3 236.66 231.61 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 300 - Inserts: 1000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.56, N = 3 SE +/- 0.53, N = 3 109.35 109.20 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 300 - Inserts: 3000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 1.14, N = 3 SE +/- 1.68, N = 3 367.18 358.02 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 500 - Inserts: 1000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 2.23, N = 12 SE +/- 2.33, N = 4 180.43 190.52 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
OpenBenchmarking.org Seconds, Fewer Is Better Apache CouchDB 3.3.2 Bulk Size: 500 - Inserts: 3000 - Rounds: 30 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 140 280 420 560 700 SE +/- 2.17, N = 3 SE +/- 22.88, N = 6 657.55 671.07 1. (CXX) g++ options: -std=c++17 -lmozjs-91 -lm -lei -fPIC -MMD
Timed MrBayes Analysis This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15 30 45 60 75 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 67.36 67.42 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 233.42 223.13 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 600 1200 1800 2400 3000 2679.31 2474.55 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.18, N = 3 SE +/- 0.22, N = 3 99.31 98.35
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 200 400 600 800 1000 SE +/- 0.56, N = 3 SE +/- 0.83, N = 3 1002.93 912.45
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15 30 45 60 75 SE +/- 0.08, N = 3 SE +/- 0.36, N = 3 66.57 61.85
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.54, N = 3 SE +/- 0.72, N = 3 177.66 171.75
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.37, N = 3 SE +/- 0.10, N = 3 78.54 76.91
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 80 160 240 320 400 SE +/- 0.96, N = 3 SE +/- 1.65, N = 3 346.61 326.70
GNU Octave Benchmark This test profile measures how long it takes to complete several reference GNU Octave files via octave-benchmark. GNU Octave is used for numerical computations and is an open-source alternative to MATLAB. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GNU Octave Benchmark 6.4.0 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 1.2751 2.5502 3.8253 5.1004 6.3755 SE +/- 0.046, N = 5 SE +/- 0.014, N = 5 5.667 5.265
RawTherapee RawTherapee is a cross-platform, open-source multi-threaded RAW image processing program. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 9 18 27 36 45 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 39.23 37.71 1. RawTherapee, version 5.8, command line.
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.60, N = 3 SE +/- 0.31, N = 3 181.97 171.42 1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 15 30 45 60 75 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 66.99 67.03
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 40 80 120 160 200 SE +/- 0.13, N = 3 SE +/- 0.19, N = 3 172.14 171.29
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 20 40 60 80 100 SE +/- 0.15, N = 3 SE +/- 0.09, N = 3 84.47 83.91
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 140 280 420 560 700 SE +/- 0.43, N = 3 SE +/- 0.24, N = 3 628.52 625.36
OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 50 100 150 200 250 SE +/- 0.25, N = 3 SE +/- 0.19, N = 3 207.57 206.84
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.16 Input: Li2_STO_ae 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 1.56, N = 5 SE +/- 0.33, N = 3 148.54 146.07 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.16 Input: simple-H2O 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 16.68 16.55 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.16 Input: FeCO6_b3lyp_gms 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.19, N = 3 SE +/- 0.15, N = 3 114.71 114.08 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.16 Input: FeCO6_b3lyp_gms 2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 30 60 90 120 150 SE +/- 0.87, N = 3 SE +/- 0.19, N = 3 118.39 118.08 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
ECC 2 x 32GB DDR5-4800 MTC20C2085S1EC48BA1 Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 62GB, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 16 September 2023 13:18 by user phoronix.
2 x 32GB DDR5-6000 CMK64GX5M2B6000C40 @ 4800 Processor: AMD Ryzen 9 7900X 12-Core @ 5.73GHz (12 Cores / 24 Threads), Motherboard: ASRockRack B650D4U-2L2T/BCM (2.09 BIOS), Chipset: AMD Device 14d8, Memory: 32GB, Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 0GB Virtual HDisk0 + 0GB Virtual HDisk1 + 0GB Virtual HDisk2 + 0GB Virtual HDisk3, Graphics: ASPEED, Audio: AMD Device 1640, Monitor: VA2431, Network: 2 x Intel I210 + 2 x Broadcom BCM57416 NetXtreme-E Dual-Media 10G RDMA
OS: Ubuntu 22.04, Kernel: 6.6.0-060600rc1daily20230913-generic (x86_64), Desktop: GNOME Shell 42.9, Display Server: X Server 1.21.1.4, Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xa601203Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu122.04)Python Notes: Python 3.10.12Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 18 September 2023 11:48 by user phoronix.