AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 Changed Memory to 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32 .
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 Changed Memory to 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36 .
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 Changed Memory to 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38 .
Llamafile OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.8.6 Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.053 2.106 3.159 4.212 5.265 4.68
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 150 300 450 600 750 SE +/- 49.36, N = 9 SE +/- 1.02, N = 3 SE +/- 0.28, N = 3 573.33 668.16 705.55
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 40 80 120 160 200 174.13 177.79 181.57 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Medium Mesh Size - Mesh Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 500 1000 1500 2000 2500 1909.67 1994.75 2096.57 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Medium Mesh Size - Execution Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: GhostRider - Hash Count: 1M 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 800 1600 2400 3200 4000 SE +/- 30.71, N = 9 SE +/- 36.12, N = 6 SE +/- 37.04, N = 3 SE +/- 6.72, N = 3 3699.6 3676.5 3638.6 3616.6 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Barbershop - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 100 200 300 400 500 SE +/- 0.20, N = 3 SE +/- 0.26, N = 3 SE +/- 0.27, N = 3 SE +/- 0.20, N = 3 448.96 450.71 459.12 459.19
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: allmodconfig 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 130 260 390 520 650 SE +/- 0.52, N = 3 SE +/- 0.15, N = 3 589.33 590.50
Build: allmodconfig
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as
LeelaChessZero OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: Eigen 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 50 100 150 200 250 SE +/- 1.15, N = 3 SE +/- 2.85, N = 3 SE +/- 1.45, N = 3 247 236 227 220 1. (CXX) g++ options: -flto -pthread
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 30 60 90 120 150 SE +/- 5.81, N = 15 SE +/- 0.35, N = 3 SE +/- 0.28, N = 3 SE +/- 0.28, N = 3 124.37 128.01 128.32 128.76
LeelaChessZero OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: BLAS 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 60 120 180 240 300 SE +/- 3.79, N = 3 SE +/- 2.91, N = 3 SE +/- 2.65, N = 3 264 244 226 1. (CXX) g++ options: -flto -pthread
Backend: BLAS
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 0.88, N = 3 SE +/- 1.33, N = 3 756 761 764 786 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 4.91, N = 3 SE +/- 1.20, N = 3 SE +/- 2.73, N = 3 SE +/- 3.21, N = 3 1115 1118 1124 1154 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 SE +/- 2.96, N = 3 SE +/- 3.18, N = 3 759 765 765 783 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 1.76, N = 3 SE +/- 1.20, N = 3 SE +/- 1.86, N = 3 SE +/- 0.58, N = 3 770 773 776 797 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 300 600 900 1200 1500 SE +/- 2.89, N = 3 SE +/- 4.58, N = 3 SE +/- 3.53, N = 3 SE +/- 8.11, N = 3 1291 1302 1303 1334 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 2.31, N = 3 SE +/- 3.38, N = 3 SE +/- 1.20, N = 3 SE +/- 2.31, N = 3 985 996 997 1018 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 200 400 600 800 1000 SE +/- 2.31, N = 3 SE +/- 2.31, N = 3 SE +/- 5.00, N = 3 SE +/- 3.18, N = 3 812 815 818 836 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Large 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 300 600 900 1200 1500 SE +/- 19.19, N = 3 SE +/- 9.64, N = 3 SE +/- 2.08, N = 3 SE +/- 3.18, N = 3 1516 1527 1553 1590 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 300 600 900 1200 1500 SE +/- 5.21, N = 3 SE +/- 4.98, N = 3 SE +/- 8.84, N = 3 SE +/- 9.00, N = 3 1264 1272 1273 1285 1. (CXX) g++ options: -O3 -lrt -lm
Timed LLVM Compilation This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 70 140 210 280 350 SE +/- 0.22, N = 3 SE +/- 0.12, N = 3 SE +/- 0.38, N = 2 298.05 301.63 304.62
Build System: Ninja
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 21.7.2 Time To Compile 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 80 160 240 320 400 SE +/- 0.24, N = 3 SE +/- 0.34, N = 3 341.68 345.25
Time To Compile
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 50 100 150 200 250 SE +/- 0.99, N = 3 SE +/- 0.76, N = 3 SE +/- 0.74, N = 3 239.22 242.00 247.06
Model: INIVOL and Fluid Structure Interaction Drop Container
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 110K 220K 330K 440K 550K 498966 494437 489784 486954 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
Stockfish OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish Chess Benchmark 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 11M 22M 33M 44M 55M SE +/- 491754.88, N = 15 SE +/- 166041.67, N = 3 SE +/- 515794.89, N = 15 SE +/- 410305.51, N = 3 51377778 50101396 49974210 49194155 1. Stockfish 16 by the Stockfish developers (see AUTHORS file)
Java JMH This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 20000M 40000M 60000M 80000M 100000M 92045826799.54 91554717298.43 90340365706.79 89012581592.65
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3 6 9 12 15 SE +/- 0.00741, N = 3 SE +/- 0.00584, N = 3 SE +/- 0.00142, N = 3 SE +/- 0.00125, N = 3 9.51601 9.16590 8.93560 8.79737 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 50 100 150 200 250 SE +/- 0.19, N = 3 SE +/- 0.11, N = 3 219.27 221.14
Time To Compile
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: motorBike - Execution Time 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 13 26 39 52 65 55.57 57.25 58.57 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: motorBike - Mesh Time 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 20 40 60 80 100 84.98 84.99 85.47 88.08 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 12 24 36 48 60 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 55.19 54.30 53.54 52.27
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 3K 6K 9K 12K 15K SE +/- 15.11, N = 3 SE +/- 16.93, N = 3 SE +/- 42.19, N = 3 SE +/- 18.27, N = 3 16097.00 15610.30 15531.76 14528.26 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 7.71 7.63 7.57 7.50 1. (CXX) g++ options: -O3 -lrt
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4K 8K 12K 16K 20K SE +/- 58.10, N = 3 SE +/- 144.90, N = 15 SE +/- 239.88, N = 3 SE +/- 10.56, N = 3 20850.88 20349.41 20124.94 19467.39 1. (CC) gcc options: -O3 -march=native
Build2 OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 20 40 60 80 100 SE +/- 0.16, N = 3 SE +/- 0.52, N = 3 SE +/- 0.40, N = 3 SE +/- 0.99, N = 2 76.00 77.01 77.92 80.10
GROMACS OpenBenchmarking.org Ns Per Day, More Is Better GROMACS Input: water_GMX50_bare 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 0.452 0.904 1.356 1.808 2.26 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.002, N = 3 2.009 1.996 1.927 1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3
Input: water_GMX50_bare
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:
Numpy Benchmark This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better Numpy Benchmark 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 200 400 600 800 1000 SE +/- 2.01, N = 3 SE +/- 8.61, N = 3 SE +/- 4.91, N = 3 SE +/- 1.37, N = 3 1070.61 1068.36 1064.17 1046.91
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 76.90 77.18 77.45 77.81
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 16 32 48 64 80 SE +/- 0.59, N = 3 SE +/- 0.59, N = 3 SE +/- 0.31, N = 3 69.44 69.82 72.44 73.14 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: LargeRandom 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 0.477 0.954 1.431 1.908 2.385 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 2.12 2.11 2.11 2.10 1. (CXX) g++ options: -O3 -lrt
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 16 32 48 64 80 SE +/- 0.24, N = 3 SE +/- 0.34, N = 3 SE +/- 0.16, N = 2 SE +/- 0.34, N = 3 68.92 69.56 72.08 72.20 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: TopTweet 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.17, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 13.59 13.38 13.24 12.88 1. (CXX) g++ options: -O3 -lrt
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 5K 10K 15K 20K 25K SE +/- 204.03, N = 3 SE +/- 188.50, N = 12 SE +/- 49.10, N = 3 SE +/- 56.11, N = 3 22817.90 22294.06 22247.78 21444.00 1. (CC) gcc options: -O3 -march=native
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 13 26 39 52 65 SE +/- 0.40, N = 15 SE +/- 0.36, N = 3 SE +/- 0.34, N = 3 SE +/- 0.25, N = 3 58.89 58.49 57.54 56.19 MIN: 34.79 / MAX: 62.62 MIN: 34.6 / MAX: 60.01 MIN: 53.14 / MAX: 58.34 MIN: 39.35 / MAX: 57.3
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 110 220 330 440 550 SE +/- 0.09, N = 3 SE +/- 0.10, N = 3 SE +/- 0.33, N = 3 SE +/- 0.38, N = 3 508.9 505.9 491.1 472.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 77.39 77.25 76.20 75.93 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 400 800 1200 1600 2000 SE +/- 0.64, N = 3 SE +/- 0.32, N = 3 SE +/- 1.03, N = 3 SE +/- 2.32, N = 3 1934.61 1931.14 1904.95 1898.26 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 15831.56, N = 3 SE +/- 11599.22, N = 3 SE +/- 11157.27, N = 3 SE +/- 4372.40, N = 3 5915791.34 5898565.34 5874747.73 5694123.52 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 1.6M 3.2M 4.8M 6.4M 8M SE +/- 87376.76, N = 3 SE +/- 1150.91, N = 3 SE +/- 43634.73, N = 3 SE +/- 71150.32, N = 3 7647187.48 7507615.73 7498825.19 7350932.88 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 14 28 42 56 70 SE +/- 0.12, N = 3 SE +/- 0.03, N = 3 SE +/- 0.08, N = 3 SE +/- 0.17, N = 3 62.09 62.21 63.41 64.08
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.64 8.64 8.55 8.46 MIN: 7.67 / MAX: 9.25 MIN: 7.64 / MAX: 9.25 MIN: 7.48 / MAX: 9.15 MIN: 7.49 / MAX: 9.04
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 1.0553 2.1106 3.1659 4.2212 5.2765 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 4.69 4.65 4.62 4.60 MIN: 2.15 / MAX: 5.26 MIN: 2.02 / MAX: 5.24 MIN: 2.07 / MAX: 5.22 MIN: 2.13 / MAX: 5.16
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 1.1588 2.3176 3.4764 4.6352 5.794 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 5.15 5.14 5.05 5.04 MIN: 2.38 / MAX: 5.77 MIN: 2.39 / MAX: 5.75 MIN: 2.28 / MAX: 5.66 MIN: 2.28 / MAX: 5.66
Llama.cpp OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b3067 Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 9.00 8.66 8.36 8.32 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 1.2195 2.439 3.6585 4.878 6.0975 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 5.42 5.40 5.36 5.34 MIN: 5.31 / MAX: 5.77 MIN: 5.27 / MAX: 5.72 MIN: 5.25 / MAX: 5.7 MIN: 5.23 / MAX: 5.69
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 14 28 42 56 70 SE +/- 0.14, N = 3 SE +/- 0.01, N = 2 SE +/- 0.20, N = 3 SE +/- 0.09, N = 3 58.20 60.48 60.80 64.11 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 12 24 36 48 60 SE +/- 0.10, N = 3 SE +/- 0.15, N = 3 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 53.16 53.23 53.66 54.27
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 0.7421 1.4842 2.2263 2.9684 3.7105 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 3.298 3.228 3.141 1. (CXX) g++ options: -O3 -lm
Implementation: MPI CPU - Input: water_GMX50_bare
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 6 12 18 24 30 22.58 22.82 23.14 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Small Mesh Size - Mesh Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 13K 26K 39K 52K 65K SE +/- 85.26, N = 3 SE +/- 8.10, N = 3 SE +/- 74.71, N = 3 SE +/- 54.79, N = 3 60124.08 58570.77 58570.03 55287.37 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 0.2305 0.461 0.6915 0.922 1.1525 SE +/- 0.00057, N = 3 SE +/- 0.00200, N = 3 SE +/- 0.00137, N = 2 SE +/- 0.00027, N = 3 1.02426 1.01602 0.99852 0.98084
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 6 12 18 24 30 SE +/- 0.20, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 SE +/- 0.16, N = 15 23.00 23.29 23.36 23.87 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 30 60 90 120 150 145.88 148.76 157.15 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Small Mesh Size - Execution Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: BMW27 - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 46.38 46.56 47.51 47.72
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 10 20 30 40 50 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.18, N = 3 43.25 43.42 44.90
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 60 120 180 240 300 SE +/- 0.37, N = 3 SE +/- 0.03, N = 3 SE +/- 0.53, N = 3 SE +/- 0.20, N = 3 258.6 253.3 249.7 237.1 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 30 60 90 120 150 SE +/- 0.19, N = 3 SE +/- 0.07, N = 3 SE +/- 0.26, N = 3 SE +/- 0.07, N = 3 132.2 128.0 127.0 121.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 11 22 33 44 55 SE +/- 0.30, N = 3 SE +/- 0.30, N = 3 SE +/- 0.40, N = 3 47.76 47.78 48.26
Build: defconfig
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: : internal compiler error: Segmentation fault
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 5M 10M 15M 20M 25M SE +/- 21858.13, N = 3 SE +/- 92074.85, N = 3 SE +/- 41633.32, N = 3 SE +/- 16666.67, N = 3 25676667 25666667 25660000 25136667 1. (CXX) g++ options: -fopenmp -O3 -march=native
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 5K 10K 15K 20K 25K SE +/- 72.38, N = 3 SE +/- 31.03, N = 3 SE +/- 265.89, N = 3 SE +/- 227.86, N = 15 22774.95 22539.72 22510.46 22093.99 1. (CC) gcc options: -O3 -march=native
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 0.811 1.622 2.433 3.244 4.055 SE +/- 0.03856, N = 3 SE +/- 0.02618, N = 15 SE +/- 0.02740, N = 9 SE +/- 0.02586, N = 3 3.60464 3.57755 3.50958 3.41007
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 800 1600 2400 3200 4000 SE +/- 2.85, N = 3 SE +/- 40.74, N = 3 SE +/- 36.74, N = 3 SE +/- 51.16, N = 3 3823.80 3739.16 3714.66 3712.45 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 14K 28K 42K 56K 70K SE +/- 72.30, N = 3 SE +/- 210.02, N = 3 SE +/- 130.21, N = 3 SE +/- 150.93, N = 3 64031.31 62375.43 62084.72 59920.96 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 15K 30K 45K 60K 75K SE +/- 453.50, N = 3 SE +/- 762.95, N = 5 SE +/- 299.96, N = 3 SE +/- 820.93, N = 3 70747.00 70345.53 66545.24 65797.05 1. (CC) gcc options: -O3 -march=native
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 7 14 21 28 35 SE +/- 0.39, N = 3 SE +/- 0.38, N = 3 SE +/- 0.28, N = 3 29.82 30.54 30.68 31.98 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Stress-NG OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.17.08 Test: Memory Copying 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2K 4K 6K 8K 10K SE +/- 44.94, N = 3 SE +/- 72.54, N = 3 SE +/- 39.56, N = 3 SE +/- 25.18, N = 3 10752.81 10720.31 10608.96 10608.82 1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 15K 30K 45K 60K 75K SE +/- 315.00, N = 3 SE +/- 204.48, N = 3 SE +/- 805.54, N = 4 SE +/- 119.74, N = 3 69491.42 68588.84 66822.49 65503.93 1. (CC) gcc options: -O3 -march=native
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 SE +/- 0.17, N = 3 24.96 25.19 25.45 25.69 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 14K 28K 42K 56K 70K SE +/- 108.61, N = 3 SE +/- 382.92, N = 3 SE +/- 64.38, N = 3 SE +/- 426.98, N = 3 65113.52 63967.70 62771.00 61079.93 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 400 800 1200 1600 2000 SE +/- 4.49, N = 3 SE +/- 5.23, N = 3 SE +/- 2.77, N = 3 SE +/- 5.33, N = 3 1635.34 1586.59 1584.93 1505.47 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 14K 28K 42K 56K 70K SE +/- 439.59, N = 3 SE +/- 33.42, N = 3 SE +/- 130.09, N = 3 SE +/- 156.92, N = 3 66963.03 65845.21 64832.86 62428.70 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 16K 32K 48K 64K 80K SE +/- 364.59, N = 3 SE +/- 266.14, N = 3 SE +/- 513.22, N = 3 SE +/- 220.54, N = 3 73724.88 71754.58 70445.74 69141.08 1. (CC) gcc options: -O3 -march=native
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4K 8K 12K 16K 20K SE +/- 211.85, N = 3 SE +/- 32.81, N = 3 SE +/- 36.42, N = 3 SE +/- 24.77, N = 3 20923.99 20817.57 19747.58 19534.16 1. (CC) gcc options: -O3 -march=native
Etcpak OpenBenchmarking.org Mpx/s, More Is Better Etcpak 2.0 Benchmark: Multi-Threaded - Configuration: ETC2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 160 320 480 640 800 SE +/- 1.25, N = 3 SE +/- 0.38, N = 3 SE +/- 1.36, N = 3 SE +/- 0.82, N = 3 741.78 739.74 734.53 731.23 1. (CXX) g++ options: -flto -pthread
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 36.55 36.25 36.11 35.18 MIN: 36.13 / MAX: 37.34 MIN: 35.89 / MAX: 36.94 MIN: 35.69 / MAX: 36.98 MIN: 34.83 / MAX: 36.01
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 20 40 60 80 100 SE +/- 0.63, N = 3 SE +/- 0.96, N = 4 SE +/- 0.27, N = 3 SE +/- 0.82, N = 5 86.79 85.52 84.89 81.89 MIN: 78.82 / MAX: 88.23 MIN: 68.54 / MAX: 88.53 MIN: 77.17 / MAX: 85.87 MIN: 67.48 / MAX: 84.5
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 41.72 41.49 40.94 40.07 MIN: 41.48 / MAX: 42.26 MIN: 41.24 / MAX: 42.02 MIN: 40.73 / MAX: 41.6 MIN: 39.86 / MAX: 40.55
POV-Ray OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray Trace Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 16.10 16.13 16.32 16.40 1. POV-Ray 3.7.0.10.unofficial
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Compression Rating 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 40K 80K 120K 160K 200K SE +/- 467.61, N = 3 SE +/- 94.03, N = 3 SE +/- 322.74, N = 3 203065 200810 195585 1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Compression Rating
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
Y-Cruncher OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 15.43 15.61 16.45
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Decompression Rating 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 30K 60K 90K 120K 150K SE +/- 14.15, N = 3 SE +/- 69.90, N = 3 SE +/- 33.91, N = 3 158133 157512 157005 1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Decompression Rating
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 5K 10K 15K 20K 25K SE +/- 10.38, N = 3 SE +/- 26.91, N = 3 SE +/- 40.85, N = 3 SE +/- 21.50, N = 3 23254.40 22908.58 22436.81 21328.28 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 4K 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.15, N = 3 38.30 38.27 38.07 37.09 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6K 12K 18K 24K 30K SE +/- 95.45, N = 3 SE +/- 30.67, N = 3 SE +/- 113.15, N = 3 SE +/- 141.73, N = 3 29713.55 28891.86 28743.49 27106.18 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 12.97 13.26 13.40 14.09 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 3K 6K 9K 12K 15K SE +/- 10.26, N = 3 SE +/- 41.76, N = 3 SE +/- 36.86, N = 3 SE +/- 21.24, N = 3 11908.47 11662.44 11374.51 11144.88 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.27, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 77.08 76.89 76.12 75.99 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 400 800 1200 1600 2000 SE +/- 0.54, N = 3 SE +/- 6.84, N = 3 SE +/- 0.31, N = 3 SE +/- 0.87, N = 3 1926.89 1922.13 1903.04 1899.80 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.13, N = 3 20.08 19.99 19.84 19.33 MIN: 17.96 / MAX: 20.55 MIN: 18.05 / MAX: 20.42 MIN: 17.86 / MAX: 20.21 MIN: 17.37 / MAX: 19.62
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 17.34 17.33 16.92 16.81
Y-Cruncher OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 4 6 8 10 SE +/- 0.023, N = 3 SE +/- 0.016, N = 3 SE +/- 0.006, N = 3 7.236 7.247 7.587
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6K 12K 18K 24K 30K SE +/- 34.50, N = 3 SE +/- 10.24, N = 3 SE +/- 20.77, N = 3 SE +/- 6.88, N = 3 26596.46 25715.34 25622.93 24236.21 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
LULESH LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2K 4K 6K 8K 10K SE +/- 13.71, N = 3 SE +/- 67.88, N = 3 SE +/- 19.00, N = 3 SE +/- 38.48, N = 3 10578.52 10235.72 9945.53 9832.64 1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 1080p 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 30 60 90 120 150 SE +/- 0.27, N = 3 SE +/- 0.45, N = 3 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 134.52 134.48 134.19 131.28 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 800 1600 2400 3200 4000 SE +/- 17.86, N = 3 SE +/- 53.14, N = 3 SE +/- 21.03, N = 3 SE +/- 25.12, N = 3 3741.86 3731.97 3722.56 3678.46 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 14 August 2024 14:50 by user phoronix.
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 August 2024 01:12 by user phoronix.
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 August 2024 15:50 by user phoronix.
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 August 2024 15:59 by user phoronix.