AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G:
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
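The kernel and processor notes above (Transparent Huge Pages mode, scaling governor, EPP) correspond to values that Linux normally exposes through sysfs. The following is a minimal sketch for checking them on another machine before comparing results. It assumes the standard sysfs paths; the helper names (`read_sysfs`, `selected_option`, `tuning_summary`) are hypothetical, and this is not how the Phoronix Test Suite itself collects these notes.

```python
from pathlib import Path
from typing import Dict, Optional

# Standard Linux sysfs locations; whether they exist depends on the kernel and driver in use.
THP_ENABLED = Path("/sys/kernel/mm/transparent_hugepage/enabled")
CPUFREQ_DIR = Path("/sys/devices/system/cpu/cpu0/cpufreq")


def read_sysfs(path: Path) -> Optional[str]:
    """Return the stripped contents of a sysfs file, or None if it cannot be read."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def selected_option(raw: Optional[str]) -> Optional[str]:
    """Extract the active choice from a bracketed list such as 'always [madvise] never'."""
    if raw is None:
        return None
    for token in raw.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return raw  # files holding a single value have no brackets


def tuning_summary() -> Dict[str, Optional[str]]:
    """Collect the settings that the result file reports in its notes (hypothetical helper)."""
    return {
        "transparent_hugepage": selected_option(read_sysfs(THP_ENABLED)),
        "scaling_driver": read_sysfs(CPUFREQ_DIR / "scaling_driver"),
        "scaling_governor": read_sysfs(CPUFREQ_DIR / "scaling_governor"),
        "epp": read_sysfs(CPUFREQ_DIR / "energy_performance_preference"),
    }


if __name__ == "__main__":
    for key, value in tuning_summary().items():
        print(f"{key}: {value}")
```

Entries come back as `None` when a file is missing, for example on a system that does not use a cpufreq driver with EPP support.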
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: Changed Memory to 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36.
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: Changed Memory to 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38.
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: Changed Memory to 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32.
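The four kit labels encode a transfer rate (MT/s) and a CAS latency (CL, in clock cycles). A rough sketch of the arithmetic that turns those two numbers into a first-word CAS latency in nanoseconds and a theoretical peak bandwidth follows. The `Kit` class is illustrative only, and the bandwidth figure assumes two 64-bit channels, which the result file does not state; sustained bandwidth in the tests below is a separate, lower number.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Kit:
    name: str
    mt_per_s: int     # transfer rate in MT/s, taken from the kit label
    cas_latency: int  # CL in clock cycles, taken from the kit label

    def cas_latency_ns(self) -> float:
        # DDR transfers twice per clock, so one clock cycle lasts 2000 / (MT/s) nanoseconds.
        return self.cas_latency * 2000.0 / self.mt_per_s

    def peak_bandwidth_gb_s(self, channels: int = 2, bytes_per_transfer: int = 8) -> float:
        # Assumption: dual-channel operation with 8 bytes moved per transfer per channel.
        return self.mt_per_s * bytes_per_transfer * channels / 1000.0


KITS = [
    Kit("2 x 16GB DDR5-6000 CL30", 6000, 30),
    Kit("2 x 16GB DDR5-8000 CL36", 8000, 36),
    Kit("2 x 24GB DDR5-8000 CL38", 8000, 38),
    Kit("2 x 32GB DDR5-6400 CL32", 6400, 32),
]

if __name__ == "__main__":
    for kit in KITS:
        print(f"{kit.name}: {kit.cas_latency_ns():.2f} ns CAS, "
              f"{kit.peak_bandwidth_gb_s():.1f} GB/s theoretical peak")
```

Under these assumptions the DDR5-6000 CL30 and DDR5-6400 CL32 kits share the same 10 ns CAS latency, while the two DDR5-8000 kits come in at 9 ns and 9.5 ns.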
LeelaChessZero
LeelaChessZero 0.31.1 - Backend: BLAS (Nodes Per Second, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 244 (SE +/- 2.91, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 264 (SE +/- 3.79, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 226 (SE +/- 2.65, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
1. (CXX) g++ options: -flto -pthread
LeelaChessZero 0.31.1 - Backend: Eigen (Nodes Per Second, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 236
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 227
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 247
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 220
Standard errors as listed (three for four results): SE +/- 2.85, N = 3; SE +/- 1.15, N = 3; SE +/- 1.45, N = 3
1. (CXX) g++ options: -flto -pthread
NAS Parallel Benchmarks
NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes.
NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 15610.30 (SE +/- 16.93, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 15531.76 (SE +/- 42.19, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 16097.00 (SE +/- 15.11, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14528.26 (SE +/- 18.27, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
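Each result is reported as a mean with a standard error ("SE +/- x, N = 3"). One way to read two such results against each other is to express the difference as a percentage and carry the standard errors along. The sketch below does this for the SP.C figures of the DDR5-8000 CL36 and DDR5-6000 CL30 kits. It uses first-order error propagation and assumes the runs are independent; with only three runs per result the uncertainty is a rough guide. The function name `relative_gain` is hypothetical.

```python
import math
from typing import Tuple


def relative_gain(new: float, new_se: float, base: float, base_se: float) -> Tuple[float, float]:
    """Return (percent gain of `new` over `base`, approximate standard error of that percentage)."""
    ratio = new / base
    # First-order propagation for the ratio of two independent means.
    ratio_se = ratio * math.sqrt((new_se / new) ** 2 + (base_se / base) ** 2)
    return (ratio - 1.0) * 100.0, ratio_se * 100.0


if __name__ == "__main__":
    # NPB SP.C, Total Mop/s: DDR5-8000 CL36 kit versus DDR5-6000 CL30 kit.
    gain, gain_se = relative_gain(16097.00, 15.11, 14528.26, 18.27)
    print(f"SP.C gain: {gain:.2f}% (+/- {gain_se:.2f} percentage points)")
```

For "Fewer Is Better" results such as run times, the same function can be called with the two results swapped so that a positive value still means the first kit is faster.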
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed.
Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 60.80 (SE +/- 0.20, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 60.48 (SE +/- 0.01, N = 2)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 58.20 (SE +/- 0.14, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 64.11 (SE +/- 0.09, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM
OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input.
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1994.75
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1909.67
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 2096.57
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 25622.93 (SE +/- 20.77, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 25715.34 (SE +/- 10.24, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 26596.46 (SE +/- 34.50, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 24236.21 (SE +/- 6.88, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 28891.86 (SE +/- 30.67, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 28743.49 (SE +/- 113.15, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 29713.55 (SE +/- 95.45, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 27106.18 (SE +/- 141.73, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
libxsmm
Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities.
libxsmm 2-1.17-3645 - M N K: 32 (GFLOPS/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 128.0 (SE +/- 0.07, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 127.0 (SE +/- 0.26, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 132.2 (SE +/- 0.19, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 121.0 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
libxsmm 2-1.17-3645 - M N K: 64 (GFLOPS/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 253.3 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 249.7 (SE +/- 0.53, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 258.6 (SE +/- 0.37, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 237.1 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22908.58 (SE +/- 26.91, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 22436.81 (SE +/- 40.85, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 23254.40 (SE +/- 10.38, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 21328.28 (SE +/- 21.50, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 58570.77 (SE +/- 8.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 58570.03 (SE +/- 74.71, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 60124.08 (SE +/- 85.26, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 55287.37 (SE +/- 54.79, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
NAS Parallel Benchmarks 3.4 - Test / Class: IS.D (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1584.93 (SE +/- 2.77, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1586.59 (SE +/- 5.23, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1635.34 (SE +/- 4.49, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1505.47 (SE +/- 5.33, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 13.40 (SE +/- 0.13, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 13.26 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 12.97 (SE +/- 0.08, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 14.09 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Llama.cpp
Llama.cpp b3067 - Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf (Tokens Per Second, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 8.32 (SE +/- 0.01, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 8.66 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 9.00 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.36 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
High Performance Conjugate Gradient
HPCG is the High Performance Conjugate Gradient, a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC.
High Performance Conjugate Gradient 3.1 - X Y Z: 104 104 104 - RT: 60 (GFLOP/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 8.79737 (SE +/- 0.00125, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 9.16590 (SE +/- 0.00584, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 9.51601 (SE +/- 0.00741, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 8.93560 (SE +/- 0.00142, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
libxsmm 2-1.17-3645 - M N K: 128 (GFLOPS/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 505.9 (SE +/- 0.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 491.1 (SE +/- 0.33, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 508.9 (SE +/- 0.09, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 472.0 (SE +/- 0.38, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 148.76
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 145.88
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 157.15
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
LULESH
LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics.
LULESH 2.0.3 (z/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 10235.72 (SE +/- 67.88, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 9832.64 (SE +/- 38.48, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 10578.52 (SE +/- 13.71, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 9945.53 (SE +/- 19.00, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
RAMspeed SMP
This benchmark tests the system memory (RAM) performance.
RAMspeed SMP 3.5.0 - Type: Copy - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 70747.00 (SE +/- 453.50, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 66545.24 (SE +/- 299.96, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 70345.53 (SE +/- 762.95, N = 5)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65797.05 (SE +/- 820.93, N = 3)
1. (CC) gcc options: -O3 -march=native
RAMspeed SMP 3.5.0 - Type: Triad - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 65845.21 (SE +/- 33.42, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 64832.86 (SE +/- 130.09, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 66963.03 (SE +/- 439.59, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 62428.70 (SE +/- 156.92, N = 3)
1. (CC) gcc options: -O3 -march=native
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking.
SPECFEM3D 4.1.1 - Model: Homogeneous Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 29.82
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 31.98
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 30.54
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 30.68
Standard errors as listed (three for four results): SE +/- 0.39, N = 3; SE +/- 0.38, N = 3; SE +/- 0.28, N = 3
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
MBW
This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations.
MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 20817.57 (SE +/- 32.81, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 19747.58 (SE +/- 36.42, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 20923.99 (SE +/- 211.85, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19534.16 (SE +/- 24.77, N = 3)
1. (CC) gcc options: -O3 -march=native
MBW 2018-09-08 - Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 20850.88 (SE +/- 58.10, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 20124.94 (SE +/- 239.88, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 20349.41 (SE +/- 144.90, N = 15)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 19467.39 (SE +/- 10.56, N = 3)
1. (CC) gcc options: -O3 -march=native
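The MBW figures above come from timing memory copy operations and dividing the amount copied by the elapsed time. The sketch below illustrates that general idea in Python. It is not MBW's implementation, the function name and default sizes are arbitrary choices, and interpreter overhead plus the much smaller buffer mean its numbers are not comparable to the results in this file.

```python
import time


def copy_bandwidth_mib_s(size_mib: int = 64, repeats: int = 5) -> float:
    """Time whole-buffer copies and return the best observed rate in MiB/s."""
    src = bytearray(size_mib * 1024 * 1024)
    dst = bytearray(len(src))
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        dst[:] = src  # slice assignment copies the bytes into the existing buffer
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    # Guard against a zero reading from a coarse timer.
    return size_mib / max(best, 1e-9)


if __name__ == "__main__":
    print(f"copy rate: {copy_bandwidth_mib_s():.0f} MiB/s")
```

Taking the best of several repeats reduces the influence of page faults on the first pass. A benchmark may report a mean with a standard error instead, as the results in this file do.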
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 62084.72 (SE +/- 130.21, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 62375.43 (SE +/- 210.02, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 64031.31 (SE +/- 72.30, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 59920.96 (SE +/- 150.93, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 11374.51 (SE +/- 36.86, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 11662.44 (SE +/- 41.76, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 11908.47 (SE +/- 10.26, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 11144.88 (SE +/- 21.24, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6
Y-Cruncher
Y-Cruncher 0.8.5 - Pi Digits To Calculate: 1B (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 15.43 (SE +/- 0.03, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 15.61 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 16.45 (SE +/- 0.04, N = 3)
RAMspeed SMP 3.5.0 - Type: Scale - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 71754.58 (SE +/- 266.14, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 70445.74 (SE +/- 513.22, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 73724.88 (SE +/- 364.59, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 69141.08 (SE +/- 220.54, N = 3)
1. (CC) gcc options: -O3 -march=native
RAMspeed SMP 3.5.0 - Type: Add - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 63967.70 (SE +/- 382.92, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 62771.00 (SE +/- 64.38, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 65113.52 (SE +/- 108.61, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 61079.93 (SE +/- 426.98, N = 3)
1. (CC) gcc options: -O3 -march=native
MBW 2018-09-08 - Test: Memory Copy - Array Size: 8192 MiB (MiB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 22294.06 (SE +/- 188.50, N = 12)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 21444.00 (SE +/- 56.11, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 22817.90 (SE +/- 204.03, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 22247.78 (SE +/- 49.10, N = 3)
1. (CC) gcc options: -O3 -march=native
RAMspeed SMP 3.5.0 - Type: Average - Benchmark: Integer (MB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 68588.84 (SE +/- 204.48, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 66822.49 (SE +/- 805.54, N = 4)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 69491.42 (SE +/- 315.00, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 65503.93 (SE +/- 119.74, N = 3)
1. (CC) gcc options: -O3 -march=native
PyTorch
This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark].
PyTorch 2.2.1 - Device: CPU - Batch Size: 1 - Model: ResNet-50 (batches/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 85.52 (SE +/- 0.96, N = 4; MIN: 68.54 / MAX: 88.53)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 81.89 (SE +/- 0.82, N = 5; MIN: 67.48 / MAX: 84.5)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 86.79 (SE +/- 0.63, N = 3; MIN: 78.82 / MAX: 88.23)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 84.89 (SE +/- 0.27, N = 3; MIN: 77.17 / MAX: 85.87)
NAMD
NAMD 3.0b6 - Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3.60464 (SE +/- 0.03856, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 3.50958 (SE +/- 0.02740, N = 9)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3.57755 (SE +/- 0.02618, N = 15)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.41007 (SE +/- 0.02586, N = 3)
TensorFlow
This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics.
TensorFlow 2.16.1 - Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 54.30 (SE +/- 0.03, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 53.54 (SE +/- 0.09, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 55.19 (SE +/- 0.01, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 52.27 (SE +/- 0.02, N = 3)
simdjson
simdjson 3.10 - Throughput Test: TopTweet (GB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 13.38 (SE +/- 0.17, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 12.88 (SE +/- 0.14, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 13.24 (SE +/- 0.15, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 13.59 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -lrt
Build2
Build2 0.17 - Time To Compile (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 77.01 (SE +/- 0.52, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 80.10 (SE +/- 0.99, N = 2)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 77.92 (SE +/- 0.40, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 76.00 (SE +/- 0.16, N = 3)
OpenFOAM 10 - Input: motorBike - Execution Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 57.25
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 55.57
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 58.57
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
SPECFEM3D 4.1.1 - Model: Layered Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 69.44
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 72.44
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 69.82
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 73.14
Standard errors as listed (three for four results): SE +/- 0.59, N = 3; SE +/- 0.59, N = 3; SE +/- 0.31, N = 3
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
GROMACS
The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds.
GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 3.228 (SE +/- 0.004, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 3.298 (SE +/- 0.002, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 3.141 (SE +/- 0.001, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).
1. (CXX) g++ options: -O3 -lm
XNNPACK
XNNPACK 2cd86b - Model: FP32MobileNetV3Large (us, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1527 (SE +/- 9.64, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 1590 (SE +/- 3.18, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1553 (SE +/- 2.08, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1516 (SE +/- 19.19, N = 3)
1. (CXX) g++ options: -O3 -lrt -lm
Y-Cruncher
Y-Cruncher 0.8.5 - Pi Digits To Calculate: 500M (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 7.247 (SE +/- 0.016, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 7.236 (SE +/- 0.023, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 7.587 (SE +/- 0.006, N = 3)
PyTorch 2.2.1 - Device: CPU - Batch Size: 256 - Model: ResNet-50 (batches/sec, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 58.89 (SE +/- 0.40, N = 15; MIN: 34.79 / MAX: 62.62)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 56.19 (SE +/- 0.25, N = 3; MIN: 39.35 / MAX: 57.3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 58.49 (SE +/- 0.36, N = 3; MIN: 34.6 / MAX: 60.01)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 57.54 (SE +/- 0.34, N = 3; MIN: 53.14 / MAX: 58.34)
SPECFEM3D 4.1.1 - Model: Water-layered Halfspace (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 69.56 (SE +/- 0.34, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 72.08 (SE +/- 0.16, N = 2)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 68.92 (SE +/- 0.24, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 72.20 (SE +/- 0.34, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Stockfish
Stockfish Chess Benchmark (Nodes Per Second, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 49194155 (SE +/- 410305.51, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 50101396 (SE +/- 166041.67, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 49974210 (SE +/- 515794.89, N = 15)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 51377778 (SE +/- 491754.88, N = 15)
1. Stockfish 16 by the Stockfish developers (see AUTHORS file)
NAMD
NAMD 3.0b6 - Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1.01602 (SE +/- 0.00200, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 0.99852 (SE +/- 0.00137, N = 2)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 1.02426 (SE +/- 0.00057, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 0.98084 (SE +/- 0.00027, N = 3)
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 177.79
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 174.13
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 181.57
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
GROMACS
GROMACS - Input: water_GMX50_bare (Ns Per Day, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 1.996 (SE +/- 0.000, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 2.009 (SE +/- 0.001, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 1.927 (SE +/- 0.002, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:
1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 10 20 30 40 50 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 40.94 40.07 41.72 41.49 MIN: 40.73 / MAX: 41.6 MIN: 39.86 / MAX: 40.55 MIN: 41.48 / MAX: 42.26 MIN: 41.24 / MAX: 42.02
Memcached Memcached is a high-performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.6M 3.2M 4.8M 6.4M 8M SE +/- 87376.76, N = 3 SE +/- 71150.32, N = 3 SE +/- 43634.73, N = 3 SE +/- 1150.91, N = 3 7647187.48 7350932.88 7498825.19 7507615.73 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 0.67, N = 3 SE +/- 1.33, N = 3 SE +/- 0.88, N = 3 761 786 764 756 1. (CXX) g++ options: -O3 -lrt -lm
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 8 16 24 32 40 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 36.11 35.18 36.25 36.55 MIN: 35.69 / MAX: 36.98 MIN: 34.83 / MAX: 36.01 MIN: 35.89 / MAX: 36.94 MIN: 36.13 / MAX: 37.34
Memcached Memcached is a high-performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 11599.22, N = 3 SE +/- 4372.40, N = 3 SE +/- 11157.27, N = 3 SE +/- 15831.56, N = 3 5898565.34 5694123.52 5874747.73 5915791.34 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.13, N = 3 SE +/- 0.08, N = 3 SE +/- 0.07, N = 3 20.08 19.33 19.84 19.99 MIN: 17.96 / MAX: 20.55 MIN: 17.37 / MAX: 19.62 MIN: 17.86 / MAX: 20.21 MIN: 18.05 / MAX: 20.42
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Compression Rating 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 40K 80K 120K 160K 200K SE +/- 467.61, N = 3 SE +/- 94.03, N = 3 SE +/- 322.74, N = 3 203065 200810 195585 1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Compression Rating
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 10 20 30 40 50 SE +/- 0.29, N = 3 SE +/- 0.15, N = 3 SE +/- 0.18, N = 3 43.42 43.25 44.90
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6 12 18 24 30 SE +/- 0.20, N = 3 SE +/- 0.16, N = 15 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3 23.00 23.87 23.29 23.36 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
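Each flattened chart line above carries its run statistics as repeated "SE +/- x, N = n" fragments. A minimal sketch of pulling those pairs out of such a line follows. The regular expression and the function name are assumptions based only on the text format seen in this listing, not on any Phoronix Test Suite export API.

```python
import re

# Matches fragments such as "SE +/- 0.16, N = 15"
SE_PATTERN = re.compile(r"SE \+/- ([0-9.]+), N = (\d+)")


def parse_se_pairs(line: str) -> list[tuple[float, int]]:
    """Return (standard error, run count) pairs in the order they appear."""
    return [(float(se), int(n)) for se, n in SE_PATTERN.findall(line)]


# Fragment copied from the SPECFEM3D Tomographic Model line above
sample = "SE +/- 0.20, N = 3 SE +/- 0.16, N = 15 SE +/- 0.14, N = 3 SE +/- 0.16, N = 3"
pairs = parse_se_pairs(sample)
print(pairs)
```

A run count above 3, such as the N = 15 here, shows that the test was repeated more often for that kit than for the others. Some charts list fewer SE fragments than kits, so the pairs cannot always be matched to kits by position alone.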
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: motorBike - Mesh Time 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 20 40 60 80 100 84.98 85.47 88.08 84.99 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 1.86, N = 3 SE +/- 0.58, N = 3 SE +/- 1.20, N = 3 SE +/- 1.76, N = 3 776 797 773 770 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 4.91, N = 3 SE +/- 3.21, N = 3 SE +/- 2.73, N = 3 SE +/- 1.20, N = 3 1115 1154 1124 1118 1. (CXX) g++ options: -O3 -lrt -lm
Java JMH This very basic test profile runs the stock Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 20000M 40000M 60000M 80000M 100000M 92045826799.54 90340365706.79 89012581592.65 91554717298.43
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 3.38, N = 3 SE +/- 2.31, N = 3 SE +/- 1.20, N = 3 SE +/- 2.31, N = 3 996 1018 997 985 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 300 600 900 1200 1500 SE +/- 3.53, N = 3 SE +/- 8.11, N = 3 SE +/- 4.58, N = 3 SE +/- 2.89, N = 3 1303 1334 1302 1291 1. (CXX) g++ options: -O3 -lrt -lm
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 50 100 150 200 250 SE +/- 0.99, N = 3 SE +/- 0.76, N = 3 SE +/- 0.74, N = 3 239.22 242.00 247.06
Model: INIVOL and Fluid Structure Interaction Drop Container
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 4K 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 9 18 27 36 45 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 38.30 37.09 38.07 38.27 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 14 28 42 56 70 SE +/- 0.03, N = 3 SE +/- 0.17, N = 3 SE +/- 0.08, N = 3 SE +/- 0.12, N = 3 62.21 64.08 63.41 62.09
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 0.88, N = 3 SE +/- 3.18, N = 3 SE +/- 2.96, N = 3 SE +/- 0.33, N = 3 765 783 765 759 1. (CXX) g++ options: -O3 -lrt -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that the Phoronix Test Suite also provides pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if complementary metrics are desired. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 17.34 16.81 17.33 16.92
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 5K 10K 15K 20K 25K SE +/- 265.89, N = 3 SE +/- 227.86, N = 15 SE +/- 31.03, N = 3 SE +/- 72.38, N = 3 22510.46 22093.99 22539.72 22774.95 1. (CC) gcc options: -O3 -march=native
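To illustrate what a memory-copy bandwidth figure in MiB/s measures, here is a minimal sketch that times full copies of a buffer. It is a Python analogue written for illustration only, not MBW itself (MBW is a separate native program). The function name, the 64 MiB default buffer size, and the repeat count are all assumptions. Interpreter overhead and the much smaller buffer mean its numbers are not comparable to the 4096 MiB results above.

```python
import time


def copy_bandwidth_mib_s(size_mib: int = 64, repeats: int = 5) -> float:
    """Best observed throughput, in MiB/s, of copying a size_mib buffer."""
    src = bytearray(size_mib * 1024 * 1024)
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        dst = bytes(src)  # one full copy of the source buffer
        elapsed = time.perf_counter() - start
        assert len(dst) == len(src)
        best = min(best, elapsed)
    # Guard against a zero reading from a coarse timer
    return size_mib / max(best, 1e-12)


if __name__ == "__main__":
    print(f"{copy_bandwidth_mib_s():.0f} MiB/s")
```

Taking the best of several repeats reduces the influence of one-off stalls such as page faults on the first pass.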
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 800 1600 2400 3200 4000 SE +/- 36.74, N = 3 SE +/- 51.16, N = 3 SE +/- 40.74, N = 3 SE +/- 2.85, N = 3 3714.66 3712.45 3739.16 3823.80 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 2.31, N = 3 SE +/- 3.18, N = 3 SE +/- 5.00, N = 3 SE +/- 2.31, N = 3 815 836 818 812 1. (CXX) g++ options: -O3 -lrt -lm
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6 12 18 24 30 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 SE +/- 0.17, N = 3 24.96 25.45 25.19 25.69 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: BMW27 - Compute: CPU-Only 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 11 22 33 44 55 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 SE +/- 0.10, N = 3 SE +/- 0.01, N = 3 46.56 47.51 47.72 46.38
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 4 6 8 10 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 7.63 7.50 7.57 7.71 1. (CXX) g++ options: -O3 -lrt
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 6 12 18 24 30 22.82 23.14 22.58 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Small Mesh Size - Mesh Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 1080p 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 30 60 90 120 150 SE +/- 0.45, N = 3 SE +/- 0.23, N = 3 SE +/- 0.13, N = 3 SE +/- 0.27, N = 3 134.48 131.28 134.19 134.52 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 110K 220K 330K 440K 550K 494437 489784 486954 498966 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: GhostRider - Hash Count: 1M 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 800 1600 2400 3200 4000 SE +/- 37.04, N = 3 SE +/- 36.12, N = 6 SE +/- 30.71, N = 9 SE +/- 6.72, N = 3 3638.6 3676.5 3699.6 3616.6 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Barbershop - Compute: CPU-Only 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 100 200 300 400 500 SE +/- 0.26, N = 3 SE +/- 0.20, N = 3 SE +/- 0.27, N = 3 SE +/- 0.20, N = 3 450.71 459.19 459.12 448.96
Numpy Benchmark This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better Numpy Benchmark 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 200 400 600 800 1000 SE +/- 2.01, N = 3 SE +/- 4.91, N = 3 SE +/- 1.37, N = 3 SE +/- 8.61, N = 3 1070.61 1064.17 1046.91 1068.36
Timed LLVM Compilation This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 70 140 210 280 350 SE +/- 0.22, N = 3 SE +/- 0.38, N = 2 SE +/- 0.12, N = 3 298.05 304.62 301.63
Build System: Ninja
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.1588 2.3176 3.4764 4.6352 5.794 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.14 5.05 5.04 5.15 MIN: 2.39 / MAX: 5.75 MIN: 2.28 / MAX: 5.66 MIN: 2.28 / MAX: 5.66 MIN: 2.38 / MAX: 5.77
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 5M 10M 15M 20M 25M SE +/- 41633.32, N = 3 SE +/- 16666.67, N = 3 SE +/- 92074.85, N = 3 SE +/- 21858.13, N = 3 25660000 25136667 25666667 25676667 1. (CXX) g++ options: -fopenmp -O3 -march=native
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.64 8.46 8.55 8.64 MIN: 7.67 / MAX: 9.25 MIN: 7.49 / MAX: 9.04 MIN: 7.48 / MAX: 9.15 MIN: 7.64 / MAX: 9.25
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 12 24 36 48 60 SE +/- 0.15, N = 3 SE +/- 0.06, N = 3 SE +/- 0.16, N = 3 SE +/- 0.10, N = 3 53.23 54.27 53.66 53.16
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.0553 2.1106 3.1659 4.2212 5.2765 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 4.69 4.60 4.65 4.62 MIN: 2.15 / MAX: 5.26 MIN: 2.13 / MAX: 5.16 MIN: 2.02 / MAX: 5.24 MIN: 2.07 / MAX: 5.22
miniBUDE MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 77.25 75.93 76.20 77.39 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 400 800 1200 1600 2000 SE +/- 0.32, N = 3 SE +/- 2.32, N = 3 SE +/- 1.03, N = 3 SE +/- 0.64, N = 3 1931.14 1898.26 1904.95 1934.61 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
POV-Ray OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray Trace Time 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 16.13 16.32 16.40 16.10 1. POV-Ray 3.7.0.10.unofficial
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 800 1600 2400 3200 4000 SE +/- 53.14, N = 3 SE +/- 21.03, N = 3 SE +/- 17.86, N = 3 SE +/- 25.12, N = 3 3731.97 3722.56 3741.86 3678.46 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 300 600 900 1200 1500 SE +/- 8.84, N = 3 SE +/- 9.00, N = 3 SE +/- 4.98, N = 3 SE +/- 5.21, N = 3 1273 1285 1272 1264 1. (CXX) g++ options: -O3 -lrt -lm
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.2195 2.439 3.6585 4.878 6.0975 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 5.42 5.34 5.36 5.40 MIN: 5.31 / MAX: 5.77 MIN: 5.23 / MAX: 5.69 MIN: 5.25 / MAX: 5.7 MIN: 5.27 / MAX: 5.72
Etcpak OpenBenchmarking.org Mpx/s, More Is Better Etcpak 2.0 Benchmark: Multi-Threaded - Configuration: ETC2 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 160 320 480 640 800 SE +/- 0.38, N = 3 SE +/- 0.82, N = 3 SE +/- 1.36, N = 3 SE +/- 1.25, N = 3 739.74 731.23 734.53 741.78 1. (CXX) g++ options: -flto -pthread
miniBUDE MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 20 40 60 80 100 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.27, N = 3 77.08 75.99 76.12 76.89 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 400 800 1200 1600 2000 SE +/- 0.54, N = 3 SE +/- 0.87, N = 3 SE +/- 0.31, N = 3 SE +/- 6.84, N = 3 1926.89 1899.80 1903.04 1922.13 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Stress-NG OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.17.08 Test: Memory Copying 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2K 4K 6K 8K 10K SE +/- 72.54, N = 3 SE +/- 25.18, N = 3 SE +/- 39.56, N = 3 SE +/- 44.94, N = 3 10720.31 10608.82 10608.96 10752.81 1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 20 40 60 80 100 SE +/- 0.14, N = 3 SE +/- 0.02, N = 3 SE +/- 0.12, N = 3 SE +/- 0.07, N = 3 77.18 77.81 77.45 76.90
Timed Linux Kernel Compilation
This test times how long it takes to build the Linux kernel, either in the default configuration (defconfig) for the architecture being tested or in an allmodconfig configuration that builds all possible kernel modules. Learn more via the OpenBenchmarking.org test page.
Timed Linux Kernel Compilation 6.8 - Build: defconfig (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 47.78 (SE +/- 0.30, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 48.26 (SE +/- 0.40, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 47.76 (SE +/- 0.30, N = 3)
Build: defconfig - 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: internal compiler error: Segmentation fault
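One way to judge whether the gap between two memory kits is meaningful is to compare it against the reported standard errors. The following is a minimal Python sketch, assuming the runs are independent so that the two standard errors combine in quadrature. The function name `difference_in_se_units` is illustrative only, and the inputs are the defconfig means and SE values reported above, assuming they are listed in the same order as the configuration labels.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Return (difference, combined SE, difference / combined SE)."""
    diff = mean_a - mean_b
    # Assumption: independent runs, so the standard errors add in quadrature.
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    return diff, combined, diff / combined


# Timed Linux Kernel Compilation 6.8, defconfig (seconds, fewer is better).
ddr5_8000_cl36 = (48.26, 0.40)  # 2 x 16GB DDR5-8000 CL36
ddr5_6000_cl30 = (47.76, 0.30)  # 2 x 16GB DDR5-6000 CL30

diff, combined, ratio = difference_in_se_units(*ddr5_8000_cl36, *ddr5_6000_cl30)
print(f"difference: {diff:.2f} s, combined SE: {combined:.2f} s, ratio: {ratio:.2f}")
```

Here the difference is about one combined standard error, so with N = 3 it is hard to separate from run-to-run noise.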
Timed Node.js Compilation
This test profile times how long it takes to build Node.js from source. Node.js is a JavaScript runtime built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.
Timed Node.js Compilation 21.7.2 - Time To Compile (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 341.68 (SE +/- 0.24, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 345.25 (SE +/- 0.34, N = 3)
Time To Compile - 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
simdjson
simdjson 3.10 - Throughput Test: LargeRandom (OpenBenchmarking.org; GB/s, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 2.11 (SE +/- 0.02, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 2.11 (SE +/- 0.00, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 2.10 (SE +/- 0.02, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 2.12 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -lrt
Timed Gem5 Compilation
This test times how long it takes to compile Gem5, a simulator for computer system architecture research that is widely used in industry and academia. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation 23.0.1 - Time To Compile (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 219.27 (SE +/- 0.19, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 221.14 (SE +/- 0.11, N = 3)
Time To Compile - 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.
7-Zip Compression
7-Zip Compression - Test: Decompression Rating (OpenBenchmarking.org; MIPS, More Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 157512 (SE +/- 69.90, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 157005 (SE +/- 33.91, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 158133 (SE +/- 14.15, N = 3)
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Decompression Rating - 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
Timed Linux Kernel Compilation 6.8 - Build: allmodconfig (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 589.33 (SE +/- 0.52, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 590.50 (SE +/- 0.15, N = 3)
Build: allmodconfig - 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as
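The timed-compilation tests boil down to measuring wall-clock time for a build command and treating a non-zero exit status as a failed run. The following is a rough Python sketch of that idea, not the Phoronix Test Suite's actual implementation. The helper name `time_command` and the `KERNEL_SRC` path in the comments are hypothetical, and a trivial stand-in command is used so the sketch runs without a kernel source tree.

```python
import subprocess
import sys
import time


def time_command(cmd, cwd=None):
    """Run cmd and return (elapsed seconds, exit status)."""
    start = time.perf_counter()
    completed = subprocess.run(
        cmd, cwd=cwd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return time.perf_counter() - start, completed.returncode


# Stand-in command so the sketch is runnable anywhere.
elapsed, status = time_command([sys.executable, "-c", "pass"])

# With a kernel source tree at a hypothetical KERNEL_SRC path, a defconfig
# build could be timed along these lines (not executed here):
#   time_command(["make", "defconfig"], cwd=KERNEL_SRC)
#   time_command(["make", "-j32"], cwd=KERNEL_SRC)

if status != 0:
    print(f"The command quit with a non-zero exit status ({status}).")
else:
    print(f"Finished in {elapsed:.2f} seconds")
```

A compiler crash such as the segmentation faults reported for the 2 x 24GB DDR5-8000 kit would surface here as a non-zero `status`, so no time would be recorded for that run.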
Llamafile
Llamafile 0.8.6 - Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU (OpenBenchmarking.org; Tokens Per Second, More Is Better)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 4.68
OpenRadioss 2023.09.15 - Model: Bird Strike on Windshield (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 128.01 (SE +/- 0.35, N = 3)
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: 124.37 (SE +/- 5.81, N = 15)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 128.32 (SE +/- 0.28, N = 3)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 128.76 (SE +/- 0.28, N = 3)
OpenRadioss 2023.09.15 - Model: Chrysler Neon 1M (OpenBenchmarking.org; Seconds, Fewer Is Better)
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32: 668.16 (SE +/- 1.02, N = 3)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36: 573.33 (SE +/- 49.36, N = 9)
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G: 705.55 (SE +/- 0.28, N = 3)
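Results with a large standard error and a raised run count deserve extra caution before being read as a real speedup. The Python sketch below computes the relative standard error for the Chrysler Neon 1M results and flags noisy entries. It assumes the reported values are listed in the same order as the configuration labels, and the 2% threshold and the name `relative_se_percent` are arbitrary choices for illustration.

```python
def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


# OpenRadioss 2023.09.15, Chrysler Neon 1M: (mean seconds, SE, N).
results = {
    "2 x 32GB DDR5-6400 CL32": (668.16, 1.02, 3),
    "2 x 16GB DDR5-8000 CL36": (573.33, 49.36, 9),
    "2 x 16GB DDR5-6000 CL30": (705.55, 0.28, 3),
}

THRESHOLD_PERCENT = 2.0  # assumption: an arbitrary cut-off for "noisy"

noisy = [
    name
    for name, (mean, se, _n) in results.items()
    if relative_se_percent(mean, se) > THRESHOLD_PERCENT
]

for name, (mean, se, n) in results.items():
    flag = " (noisy)" if name in noisy else ""
    print(f"{name}: {relative_se_percent(mean, se):.2f}% relative SE, N = {n}{flag}")
```

Under these assumptions only the DDR5-8000 CL36 result is flagged, at roughly 8.6% relative standard error over nine runs, so its lower mean time should be read with care.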
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 14 August 2024 14:50 by user phoronix.
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 August 2024 15:50 by user phoronix.
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 August 2024 15:59 by user phoronix.
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32
Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401a
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 August 2024 01:12 by user phoronix.