AMD Ryzen 9 9950X DDR5 memory module benchmarks by Michael Larabel for a future article.
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 Changed Memory to 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32 .
Java Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 Changed Memory to 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36 .
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 Changed Memory to 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38 .
LeelaChessZero OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: BLAS 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 60 120 180 240 300 SE +/- 2.65, N = 3 SE +/- 3.79, N = 3 SE +/- 2.91, N = 3 226 264 244 1. (CXX) g++ options: -flto -pthread
Backend: BLAS
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.31.1 Backend: Eigen 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 50 100 150 200 250 SE +/- 1.45, N = 3 SE +/- 1.15, N = 3 SE +/- 2.85, N = 3 220 247 227 236 1. (CXX) g++ options: -flto -pthread
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3K 6K 9K 12K 15K SE +/- 18.27, N = 3 SE +/- 15.11, N = 3 SE +/- 42.19, N = 3 SE +/- 16.93, N = 3 14528.26 16097.00 15531.76 15610.30 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 14 28 42 56 70 SE +/- 0.09, N = 3 SE +/- 0.14, N = 3 SE +/- 0.01, N = 2 SE +/- 0.20, N = 3 64.11 58.20 60.48 60.80 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 500 1000 1500 2000 2500 2096.57 1909.67 1994.75 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Medium Mesh Size - Execution Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] --> FOAM FATAL ERROR:
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 6K 12K 18K 24K 30K SE +/- 6.88, N = 3 SE +/- 34.50, N = 3 SE +/- 10.24, N = 3 SE +/- 20.77, N = 3 24236.21 26596.46 25715.34 25622.93 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 6K 12K 18K 24K 30K SE +/- 141.73, N = 3 SE +/- 95.45, N = 3 SE +/- 113.15, N = 3 SE +/- 30.67, N = 3 27106.18 29713.55 28743.49 28891.86 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 30 60 90 120 150 SE +/- 0.07, N = 3 SE +/- 0.19, N = 3 SE +/- 0.26, N = 3 SE +/- 0.07, N = 3 121.0 132.2 127.0 128.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 60 120 180 240 300 SE +/- 0.20, N = 3 SE +/- 0.37, N = 3 SE +/- 0.53, N = 3 SE +/- 0.03, N = 3 237.1 258.6 249.7 253.3 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 5K 10K 15K 20K 25K SE +/- 21.50, N = 3 SE +/- 10.38, N = 3 SE +/- 40.85, N = 3 SE +/- 26.91, N = 3 21328.28 23254.40 22436.81 22908.58 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 13K 26K 39K 52K 65K SE +/- 54.79, N = 3 SE +/- 85.26, N = 3 SE +/- 74.71, N = 3 SE +/- 8.10, N = 3 55287.37 60124.08 58570.03 58570.77 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 400 800 1200 1600 2000 SE +/- 5.33, N = 3 SE +/- 4.49, N = 3 SE +/- 5.23, N = 3 SE +/- 2.77, N = 3 1505.47 1635.34 1586.59 1584.93 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4 8 12 16 20 SE +/- 0.08, N = 3 SE +/- 0.08, N = 3 SE +/- 0.02, N = 3 SE +/- 0.13, N = 3 14.09 12.97 13.26 13.40 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Llama.cpp OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b3067 Model: Meta-Llama-3-8B-Instruct-Q8_0.gguf 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.36 9.00 8.66 8.32 1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -march=native -mtune=native -lopenblas
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3 6 9 12 15 SE +/- 0.00142, N = 3 SE +/- 0.00741, N = 3 SE +/- 0.00584, N = 3 SE +/- 0.00125, N = 3 8.93560 9.51601 9.16590 8.79737 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 110 220 330 440 550 SE +/- 0.38, N = 3 SE +/- 0.09, N = 3 SE +/- 0.33, N = 3 SE +/- 0.10, N = 3 472.0 508.9 491.1 505.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 30 60 90 120 150 157.15 145.88 148.76 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Small Mesh Size - Execution Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
LULESH LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org z/s, More Is Better LULESH 2.0.3 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2K 4K 6K 8K 10K SE +/- 19.00, N = 3 SE +/- 13.71, N = 3 SE +/- 38.48, N = 3 SE +/- 67.88, N = 3 9945.53 10578.52 9832.64 10235.72 1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Copy - Benchmark: Integer 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 15K 30K 45K 60K 75K SE +/- 820.93, N = 3 SE +/- 762.95, N = 5 SE +/- 299.96, N = 3 SE +/- 453.50, N = 3 65797.05 70345.53 66545.24 70747.00 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Triad - Benchmark: Integer 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 14K 28K 42K 56K 70K SE +/- 156.92, N = 3 SE +/- 439.59, N = 3 SE +/- 130.09, N = 3 SE +/- 33.42, N = 3 62428.70 66963.03 64832.86 65845.21 1. (CC) gcc options: -O3 -march=native
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 7 14 21 28 35 SE +/- 0.28, N = 3 SE +/- 0.38, N = 3 SE +/- 0.39, N = 3 30.68 30.54 31.98 29.82 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 4096 MiB 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4K 8K 12K 16K 20K SE +/- 24.77, N = 3 SE +/- 211.85, N = 3 SE +/- 36.42, N = 3 SE +/- 32.81, N = 3 19534.16 20923.99 19747.58 20817.57 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4K 8K 12K 16K 20K SE +/- 10.56, N = 3 SE +/- 144.90, N = 15 SE +/- 239.88, N = 3 SE +/- 58.10, N = 3 19467.39 20349.41 20124.94 20850.88 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 14K 28K 42K 56K 70K SE +/- 150.93, N = 3 SE +/- 72.30, N = 3 SE +/- 210.02, N = 3 SE +/- 130.21, N = 3 59920.96 64031.31 62375.43 62084.72 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3K 6K 9K 12K 15K SE +/- 21.24, N = 3 SE +/- 10.26, N = 3 SE +/- 41.76, N = 3 SE +/- 36.86, N = 3 11144.88 11908.47 11662.44 11374.51 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
Y-Cruncher OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4 8 12 16 20 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 16.45 15.61 15.43
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Scale - Benchmark: Integer 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 16K 32K 48K 64K 80K SE +/- 220.54, N = 3 SE +/- 364.59, N = 3 SE +/- 513.22, N = 3 SE +/- 266.14, N = 3 69141.08 73724.88 70445.74 71754.58 1. (CC) gcc options: -O3 -march=native
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Add - Benchmark: Integer 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 14K 28K 42K 56K 70K SE +/- 426.98, N = 3 SE +/- 108.61, N = 3 SE +/- 64.38, N = 3 SE +/- 382.92, N = 3 61079.93 65113.52 62771.00 63967.70 1. (CC) gcc options: -O3 -march=native
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 8192 MiB 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 5K 10K 15K 20K 25K SE +/- 49.10, N = 3 SE +/- 204.03, N = 3 SE +/- 56.11, N = 3 SE +/- 188.50, N = 12 22247.78 22817.90 21444.00 22294.06 1. (CC) gcc options: -O3 -march=native
RAMspeed SMP This benchmark tests the system memory (RAM) performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better RAMspeed SMP 3.5.0 Type: Average - Benchmark: Integer 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 15K 30K 45K 60K 75K SE +/- 119.74, N = 3 SE +/- 315.00, N = 3 SE +/- 805.54, N = 4 SE +/- 204.48, N = 3 65503.93 69491.42 66822.49 68588.84 1. (CC) gcc options: -O3 -march=native
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.63, N = 3 SE +/- 0.82, N = 5 SE +/- 0.96, N = 4 84.89 86.79 81.89 85.52 MIN: 77.17 / MAX: 85.87 MIN: 78.82 / MAX: 88.23 MIN: 67.48 / MAX: 84.5 MIN: 68.54 / MAX: 88.53
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 0.811 1.622 2.433 3.244 4.055 SE +/- 0.02586, N = 3 SE +/- 0.02618, N = 15 SE +/- 0.02740, N = 9 SE +/- 0.03856, N = 3 3.41007 3.57755 3.50958 3.60464
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 64 - Model: ResNet-50 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 12 24 36 48 60 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.03, N = 3 52.27 55.19 53.54 54.30
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: TopTweet 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 SE +/- 0.17, N = 3 13.59 13.24 12.88 13.38 1. (CXX) g++ options: -O3 -lrt
Build2 OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 SE +/- 0.16, N = 3 SE +/- 0.40, N = 3 SE +/- 0.99, N = 2 SE +/- 0.52, N = 3 76.00 77.92 80.10 77.01
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: motorBike - Execution Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 13 26 39 52 65 58.57 55.57 57.25 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 16 32 48 64 80 SE +/- 0.31, N = 3 SE +/- 0.59, N = 3 SE +/- 0.59, N = 3 73.14 69.82 72.44 69.44 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 0.7421 1.4842 2.2263 2.9684 3.7105 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.004, N = 3 3.141 3.298 3.228 1. (CXX) g++ options: -O3 -lm
Implementation: MPI CPU - Input: water_GMX50_bare
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: mpirun noticed that process rank 0 with PID 0 on node phoronix-System-Product-Name exited on signal 11 (Segmentation fault).
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Large 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 300 600 900 1200 1500 SE +/- 19.19, N = 3 SE +/- 2.08, N = 3 SE +/- 3.18, N = 3 SE +/- 9.64, N = 3 1516 1553 1590 1527 1. (CXX) g++ options: -O3 -lrt -lm
Y-Cruncher OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 4 6 8 10 SE +/- 0.006, N = 3 SE +/- 0.023, N = 3 SE +/- 0.016, N = 3 7.587 7.236 7.247
PyTorch This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org batches/sec, More Is Better PyTorch 2.2.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 13 26 39 52 65 SE +/- 0.34, N = 3 SE +/- 0.36, N = 3 SE +/- 0.25, N = 3 SE +/- 0.40, N = 15 57.54 58.49 56.19 58.89 MIN: 53.14 / MAX: 58.34 MIN: 34.6 / MAX: 60.01 MIN: 39.35 / MAX: 57.3 MIN: 34.79 / MAX: 62.62
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 16 32 48 64 80 SE +/- 0.34, N = 3 SE +/- 0.24, N = 3 SE +/- 0.16, N = 2 SE +/- 0.34, N = 3 72.20 68.92 72.08 69.56 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Stockfish OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish Chess Benchmark 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 11M 22M 33M 44M 55M SE +/- 491754.88, N = 15 SE +/- 515794.89, N = 15 SE +/- 166041.67, N = 3 SE +/- 410305.51, N = 3 51377778 49974210 50101396 49194155 1. Stockfish 16 by the Stockfish developers (see AUTHORS file)
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 0.2305 0.461 0.6915 0.922 1.1525 SE +/- 0.00027, N = 3 SE +/- 0.00057, N = 3 SE +/- 0.00137, N = 2 SE +/- 0.00200, N = 3 0.98084 1.02426 0.99852 1.01602
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 40 80 120 160 200 181.57 174.13 177.79 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Medium Mesh Size - Mesh Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: This is not a fatal error but might cause some unexpected behaviour.
GROMACS OpenBenchmarking.org Ns Per Day, More Is Better GROMACS Input: water_GMX50_bare 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 0.452 0.904 1.356 1.808 2.26 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 1.927 2.009 1.996 1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3
Input: water_GMX50_bare
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Fatal error:
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 10 20 30 40 50 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 41.49 41.72 40.07 40.94 MIN: 41.24 / MAX: 42.02 MIN: 41.48 / MAX: 42.26 MIN: 39.86 / MAX: 40.55 MIN: 40.73 / MAX: 41.6
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 1.6M 3.2M 4.8M 6.4M 8M SE +/- 1150.91, N = 3 SE +/- 43634.73, N = 3 SE +/- 71150.32, N = 3 SE +/- 87376.76, N = 3 7507615.73 7498825.19 7350932.88 7647187.48 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 0.88, N = 3 SE +/- 1.33, N = 3 SE +/- 0.67, N = 3 756 764 786 761 1. (CXX) g++ options: -O3 -lrt -lm
Embree Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 8 16 24 32 40 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 36.55 36.25 35.18 36.11 MIN: 36.13 / MAX: 37.34 MIN: 35.89 / MAX: 36.94 MIN: 34.83 / MAX: 36.01 MIN: 35.69 / MAX: 36.98
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 15831.56, N = 3 SE +/- 11157.27, N = 3 SE +/- 4372.40, N = 3 SE +/- 11599.22, N = 3 5915791.34 5874747.73 5694123.52 5898565.34 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.08, N = 3 SE +/- 0.13, N = 3 SE +/- 0.15, N = 3 19.99 19.84 19.33 20.08 MIN: 18.05 / MAX: 20.42 MIN: 17.86 / MAX: 20.21 MIN: 17.37 / MAX: 19.62 MIN: 17.96 / MAX: 20.55
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Compression Rating 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 40K 80K 120K 160K 200K SE +/- 322.74, N = 3 SE +/- 94.03, N = 3 SE +/- 467.61, N = 3 195585 200810 203065 1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Compression Rating
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 10 20 30 40 50 SE +/- 0.18, N = 3 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 44.90 43.25 43.42
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 6 12 18 24 30 SE +/- 0.16, N = 3 SE +/- 0.14, N = 3 SE +/- 0.16, N = 15 SE +/- 0.20, N = 3 23.36 23.29 23.87 23.00 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: motorBike - Mesh Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 84.99 88.08 85.47 84.98 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 1.76, N = 3 SE +/- 1.20, N = 3 SE +/- 0.58, N = 3 SE +/- 1.86, N = 3 770 773 797 776 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV3Large 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 1.20, N = 3 SE +/- 2.73, N = 3 SE +/- 3.21, N = 3 SE +/- 4.91, N = 3 1118 1124 1154 1115 1. (CXX) g++ options: -O3 -lrt -lm
Java JMH This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ops/s, More Is Better Java JMH Throughput 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20000M 40000M 60000M 80000M 100000M 91554717298.43 89012581592.65 90340365706.79 92045826799.54
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 2.31, N = 3 SE +/- 1.20, N = 3 SE +/- 2.31, N = 3 SE +/- 3.38, N = 3 985 997 1018 996 1. (CXX) g++ options: -O3 -lrt -lm
OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP16MobileNetV3Large 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 300 600 900 1200 1500 SE +/- 2.89, N = 3 SE +/- 4.58, N = 3 SE +/- 8.11, N = 3 SE +/- 3.53, N = 3 1291 1302 1334 1303 1. (CXX) g++ options: -O3 -lrt -lm
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 50 100 150 200 250 SE +/- 0.74, N = 3 SE +/- 0.76, N = 3 SE +/- 0.99, N = 3 247.06 242.00 239.22
Model: INIVOL and Fluid Structure Interaction Drop Container
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test run did not produce a result. E: ** ERROR: FILE fsi_drop_container_0000_0001.rst NOT FOUND
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 4K 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.15, N = 3 SE +/- 0.15, N = 3 38.27 38.07 37.09 38.30 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 14 28 42 56 70 SE +/- 0.12, N = 3 SE +/- 0.08, N = 3 SE +/- 0.17, N = 3 SE +/- 0.03, N = 3 62.09 63.41 64.08 62.21
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: QU8MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 0.33, N = 3 SE +/- 2.96, N = 3 SE +/- 3.18, N = 3 SE +/- 0.88, N = 3 759 765 783 765 1. (CXX) g++ options: -O3 -lrt -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 1 - Model: ResNet-50 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 3 16.92 17.33 16.81 17.34
MBW This is a basic/simple memory (RAM) bandwidth benchmark for memory copy operations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better MBW 2018-09-08 Test: Memory Copy - Array Size: 4096 MiB 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 5K 10K 15K 20K 25K SE +/- 72.38, N = 3 SE +/- 31.03, N = 3 SE +/- 227.86, N = 15 SE +/- 265.89, N = 3 22774.95 22539.72 22093.99 22510.46 1. (CC) gcc options: -O3 -march=native
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 800 1600 2400 3200 4000 SE +/- 2.85, N = 3 SE +/- 40.74, N = 3 SE +/- 51.16, N = 3 SE +/- 36.74, N = 3 3823.80 3739.16 3712.45 3714.66 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV3Small 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 2.31, N = 3 SE +/- 5.00, N = 3 SE +/- 3.18, N = 3 SE +/- 2.31, N = 3 812 818 836 815 1. (CXX) g++ options: -O3 -lrt -lm
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 6 12 18 24 30 SE +/- 0.17, N = 3 SE +/- 0.05, N = 3 SE +/- 0.10, N = 3 SE +/- 0.12, N = 3 25.69 25.19 25.45 24.96 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: BMW27 - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 11 22 33 44 55 SE +/- 0.01, N = 3 SE +/- 0.10, N = 3 SE +/- 0.02, N = 3 SE +/- 0.09, N = 3 46.38 47.72 47.51 46.56
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: Kostya 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 4 6 8 10 SE +/- 0.05, N = 3 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 SE +/- 0.06, N = 3 7.71 7.57 7.50 7.63 1. (CXX) g++ options: -O3 -lrt
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 6 12 18 24 30 22.58 23.14 22.82 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Input: drivaerFastback, Small Mesh Size - Mesh Time
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: [13] #0 Foam::error::printStack(Foam::Ostream&) at ??:?
x265 OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 1080p 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 30 60 90 120 150 SE +/- 0.27, N = 3 SE +/- 0.13, N = 3 SE +/- 0.23, N = 3 SE +/- 0.45, N = 3 134.52 134.19 131.28 134.48 1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org VGR Performance Metric, More Is Better BRL-CAD 7.38.2 VGR Performance Metric 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 110K 220K 330K 440K 550K 498966 486954 489784 494437 1. (CXX) g++ options: -std=c++17 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lnetpbm -lregex_brl -lz_brl -lassimp -ldl -lm -ltk8.6
Xmrig Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org H/s, More Is Better Xmrig 6.21 Variant: GhostRider - Hash Count: 1M 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 800 1600 2400 3200 4000 SE +/- 6.72, N = 3 SE +/- 30.71, N = 9 SE +/- 36.12, N = 6 SE +/- 37.04, N = 3 3616.6 3699.6 3676.5 3638.6 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Barbershop - Compute: CPU-Only 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 100 200 300 400 500 SE +/- 0.20, N = 3 SE +/- 0.27, N = 3 SE +/- 0.20, N = 3 SE +/- 0.26, N = 3 448.96 459.12 459.19 450.71
Numpy Benchmark This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Score, More Is Better Numpy Benchmark 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 200 400 600 800 1000 SE +/- 8.61, N = 3 SE +/- 1.37, N = 3 SE +/- 4.91, N = 3 SE +/- 2.01, N = 3 1068.36 1046.91 1064.17 1070.61
Timed LLVM Compilation This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 70 140 210 280 350 SE +/- 0.12, N = 3 SE +/- 0.38, N = 2 SE +/- 0.22, N = 3 301.63 304.62 298.05
Build System: Ninja
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: /usr/include/c++/13/bits/vector.tcc:445:7: internal compiler error: Segmentation fault
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 1.1588 2.3176 3.4764 4.6352 5.794 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 5.15 5.04 5.05 5.14 MIN: 2.38 / MAX: 5.77 MIN: 2.28 / MAX: 5.66 MIN: 2.28 / MAX: 5.66 MIN: 2.39 / MAX: 5.75
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 5M 10M 15M 20M 25M SE +/- 21858.13, N = 3 SE +/- 92074.85, N = 3 SE +/- 16666.67, N = 3 SE +/- 41633.32, N = 3 25676667 25666667 25136667 25660000 1. (CXX) g++ options: -fopenmp -O3 -march=native
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.64 8.55 8.46 8.64 MIN: 7.64 / MAX: 9.25 MIN: 7.48 / MAX: 9.15 MIN: 7.49 / MAX: 9.04 MIN: 7.67 / MAX: 9.25
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 12 24 36 48 60 SE +/- 0.10, N = 3 SE +/- 0.16, N = 3 SE +/- 0.06, N = 3 SE +/- 0.15, N = 3 53.16 53.66 54.27 53.23
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 1.0553 2.1106 3.1659 4.2212 5.2765 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 4.62 4.65 4.60 4.69 MIN: 2.07 / MAX: 5.22 MIN: 2.02 / MAX: 5.24 MIN: 2.13 / MAX: 5.16 MIN: 2.15 / MAX: 5.26
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 SE +/- 0.03, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 77.39 76.20 75.93 77.25 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 400 800 1200 1600 2000 SE +/- 0.64, N = 3 SE +/- 1.03, N = 3 SE +/- 2.32, N = 3 SE +/- 0.32, N = 3 1934.61 1904.95 1898.26 1931.14 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
POV-Ray OpenBenchmarking.org Seconds, Fewer Is Better POV-Ray Trace Time 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 4 8 12 16 20 SE +/- 0.03, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 16.10 16.40 16.32 16.13 1. POV-Ray 3.7.0.10.unofficial
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 800 1600 2400 3200 4000 SE +/- 25.12, N = 3 SE +/- 17.86, N = 3 SE +/- 21.03, N = 3 SE +/- 53.14, N = 3 3678.46 3741.86 3722.56 3731.97 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6
XNNPACK OpenBenchmarking.org us, Fewer Is Better XNNPACK 2cd86b Model: FP32MobileNetV2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 300 600 900 1200 1500 SE +/- 5.21, N = 3 SE +/- 4.98, N = 3 SE +/- 9.00, N = 3 SE +/- 8.84, N = 3 1264 1272 1285 1273 1. (CXX) g++ options: -O3 -lrt -lm
LuxCoreRender LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 1.2195 2.439 3.6585 4.878 6.0975 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 5.40 5.36 5.34 5.42 MIN: 5.27 / MAX: 5.72 MIN: 5.25 / MAX: 5.7 MIN: 5.23 / MAX: 5.69 MIN: 5.31 / MAX: 5.77
Etcpak OpenBenchmarking.org Mpx/s, More Is Better Etcpak 2.0 Benchmark: Multi-Threaded - Configuration: ETC2 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 160 320 480 640 800 SE +/- 1.25, N = 3 SE +/- 1.36, N = 3 SE +/- 0.82, N = 3 SE +/- 0.38, N = 3 741.78 734.53 731.23 739.74 1. (CXX) g++ options: -flto -pthread
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 76.89 76.12 75.99 77.08 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 400 800 1200 1600 2000 SE +/- 6.84, N = 3 SE +/- 0.31, N = 3 SE +/- 0.87, N = 3 SE +/- 0.54, N = 3 1922.13 1903.04 1899.80 1926.89 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Stress-NG OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.17.08 Test: Memory Copying 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 2K 4K 6K 8K 10K SE +/- 44.94, N = 3 SE +/- 39.56, N = 3 SE +/- 25.18, N = 3 SE +/- 72.54, N = 3 10752.81 10608.96 10608.82 10720.31 1. (CXX) g++ options: -lm -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lgmp -lgbm -lmpfr -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -U_FORTIFY_SOURCE
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 20 40 60 80 100 SE +/- 0.07, N = 3 SE +/- 0.12, N = 3 SE +/- 0.02, N = 3 SE +/- 0.14, N = 3 76.90 77.45 77.81 77.18
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 11 22 33 44 55 SE +/- 0.30, N = 3 SE +/- 0.40, N = 3 SE +/- 0.30, N = 3 47.76 48.26 47.78
Build: defconfig
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: : internal compiler error: Segmentation fault
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 21.7.2 Time To Compile 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 80 160 240 320 400 SE +/- 0.34, N = 3 SE +/- 0.24, N = 3 345.25 341.68
Time To Compile
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
simdjson OpenBenchmarking.org GB/s, More Is Better simdjson 3.10 Throughput Test: LargeRandom 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 0.477 0.954 1.431 1.908 2.385 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 SE +/- 0.02, N = 3 2.12 2.10 2.11 2.11 1. (CXX) g++ options: -O3 -lrt
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 50 100 150 200 250 SE +/- 0.11, N = 3 SE +/- 0.19, N = 3 221.14 219.27
Time To Compile
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: Warning: Protocol buffer compiler (protoc) not found.
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Decompression Rating 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 30K 60K 90K 120K 150K SE +/- 14.15, N = 3 SE +/- 33.91, N = 3 SE +/- 69.90, N = 3 158133 157005 157512 1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20
Test: Decompression Rating
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status.
Timed Linux Kernel Compilation This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: allmodconfig 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 130 260 390 520 650 SE +/- 0.15, N = 3 SE +/- 0.52, N = 3 590.50 589.33
Build: allmodconfig
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38: The test quit with a non-zero exit status. E: gcc: internal compiler error: Segmentation fault signal terminated program as
Llamafile OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.8.6 Test: Meta-Llama-3-8B-Instruct.F16 - Acceleration: CPU 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 1.053 2.106 3.159 4.212 5.265 4.68
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 30 60 90 120 150 SE +/- 0.28, N = 3 SE +/- 0.28, N = 3 SE +/- 5.81, N = 15 SE +/- 0.35, N = 3 128.76 128.32 124.37 128.01
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M 2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G 2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 150 300 450 600 750 SE +/- 0.28, N = 3 SE +/- 49.36, N = 9 SE +/- 1.02, N = 3 705.55 573.33 668.16
2 x 16GB DDR5-6000 CL30 F5-6000J3038F16G Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aPython Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 14 August 2024 14:50 by user phoronix.
2 x 32GB DDR5-6400 CL32 CMK64GX5M2B6400C32 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 32GB DDR5-6400MT/s Corsair CMK64GX5M2B6400C32, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 August 2024 01:12 by user phoronix.
2 x 16GB DDR5-8000 CL36 CMH32GX5M2X8000C36 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DDR5-8000MT/s Corsair CMH32GX5M2X8000C36, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 20 August 2024 15:50 by user phoronix.
2 x 24GB DDR5-8000 CL38 CMP48GX5M2X8000C38 Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (2204 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 24GB DDR5-8000MT/s Corsair CMP48GX5M2X8000C38, Disk: 2000GB Corsair MP700 PRO, Graphics: AMD Radeon RX 7900 GRE 16GB, Audio: AMD Navi 31 HDMI/DP, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6E
OS: Ubuntu 24.04, Kernel: 6.10.0-phx (x86_64), Desktop: GNOME Shell 46.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 24.2~git2406040600.8112d4~oibaf~n (git-8112d44 2024-06-04 noble-oibaf-ppa) (LLVM 17.0.6 DRM 3.57), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance) - CPU Microcode: 0xb40401aJava Notes: OpenJDK Runtime Environment (build 21.0.3+9-Ubuntu-1ubuntu1)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 21 August 2024 15:59 by user phoronix.