Granite Rapids MRDIMM vs. DDR5 Benchmarks

Benchmarks for a future article. 2 x Intel Xeon 6980P testing with a Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2410037-NE-2410039NE61&sro&grr.

Granite Rapids MRDIMM vs. DDR5 BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen Resolution24 x DDR5-640024 x MRDIMM 888002 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads)Intel AvenueCity v0.01 (BHSDCRB1.IPC.0035.D44.2408292336 BIOS)Intel Ice Lake IEH1520GB960GB SAMSUNG MZ1L2960HCJR-00A07 + 2 x 3201GB KIOXIA KCMYXVUG3T20ASPEEDIntel I210 + 2 x Intel 10-Gigabit X540-AT2Ubuntu 24.046.8.0-22-generic (x86_64)GCC 13.2.0ext41920x12002 x 1920GB KIOXIA KCD8XPUG1T92 + 960GB SAMSUNG MZ1L2960HCJR-00A076.10.0-phx (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-OiuXZC/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0Java Details- OpenJDK Runtime Environment (build 21.0.3-ea+7-Ubuntu-1build1)Python Details- Python 3.12.2Security Details- 24 x DDR5-6400: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 24 x MRDIMM 88800: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

Granite Rapids MRDIMM vs. DDR5 Benchmarksgromacs: MPI CPU - water_GMX50_barepgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 800 - Read Onlycassandra: Writeslibxsmm: 128hpcg: 144 144 144 - 60stream: Copybuild-llvm: Unix Makefilestinymembench: Standard Memsettinymembench: Standard Memcpyhpcg: 104 104 104 - 60java-jmh: Throughputlulesh: build-nodejs: Time To Compilepgbench: 100 - 800 - Read Write - Average Latencypgbench: 100 - 800 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencypgbench: 100 - 1000 - Read Writebuild-linux-kernel: allmodconfigbuild-linux-kernel: defconfigopenradioss: Chrysler Neon 1Mpennant: leblancbigmbw: Memory Copy - 8192 MiBincompact3d: X3D-benchmarking input.i3dmbw: Memory Copy, Fixed Block Size - 8192 MiBopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timebuild-llvm: Ninjanpb: EP.Dspecfem3d: Water-layered Halfspacespecfem3d: Layered Halfspacespecfem3d: Homogeneous Halfspacespecfem3d: Tomographic Modelspecfem3d: Mount St. Helensamg: openfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timenpb: SP.Bnpb: BT.Cpennant: sedovbignpb: IS.Dnpb: LU.Cincompact3d: input.i3d 193 Cells Per Directionnpb: CG.Cnpb: MG.Clibxsmm: 64stream: Triadstream: Addstream: Scale24 x DDR5-640024 x MRDIMM 888001.5826374581.59550395082480128.043870313.730044.815023.6130.83656.3111420773.0781368689.1114219.82286.15282448683.00780.046631157.0874232963.429.5660905757.5765941415.73545181218.99996528.90647349232.51759415.2113871.96735889.762.89296635113647.23365424.17893252.7887756.1848431.933.3062.1214756211.596509160870637398.5169.491861560.8198.67730001.015008.9170.946790759705068.81124013.27138.98955.3721445272.69213757131.83923.43160.821.34276815316.41369.80645248766.21371.936334137.8487176.60032324.729.2166930667.1135535615.4177613214.3799271793.955489179860655300018.38920823.416256373559.46804396.266.47267115638.56769386.672.62023497118207.17449470.324501.4879963.6937751.9952124.6OpenBenchmarking.org

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bare24 x MRDIMM 88800816243240SE +/- 0.08, N = 233.311. (CXX) g++ options: -O3 -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency24 x DDR5-640024 x MRDIMM 888000.47720.95441.43161.90882.386SE +/- 0.042, N = 12SE +/- 0.060, N = 121.5822.1211. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Only24 x DDR5-640024 x MRDIMM 88800140K280K420K560K700KSE +/- 18280.50, N = 12SE +/- 13155.72, N = 126374584756211. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency24 x DDR5-640024 x MRDIMM 888000.35910.71821.07731.43641.7955SE +/- 0.037, N = 9SE +/- 0.058, N = 121.5951.5961. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Only24 x DDR5-640024 x MRDIMM 88800110K220K330K440K550KSE +/- 12217.99, N = 9SE +/- 19798.84, N = 125039505091601. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 5.0Test: Writes24 x DDR5-640024 x MRDIMM 8880020K40K60K80K100KSE +/- 1732.33, N = 9SE +/- 1099.82, N = 128248087063

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 12824 x MRDIMM 8880016003200480064008000SE +/- 660.65, N = 67398.51. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 6024 x DDR5-640024 x MRDIMM 888004080120160200SE +/- 0.22, N = 3SE +/- 0.12, N = 3128.04169.491. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Stream

Type: Copy

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Copy24 x DDR5-640024 x MRDIMM 88800200K400K600K800K1000KSE +/- 5160.21, N = 25SE +/- 7612.22, N = 25870313.7861560.81. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix Makefiles24 x MRDIMM 888004080120160200SE +/- 1.12, N = 3198.68

Tinymembench

Standard Memset

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memset24 x DDR5-640024 x MRDIMM 888006K12K18K24K30KSE +/- 3.48, N = 3SE +/- 19.60, N = 330044.830001.01. (CC) gcc options: -O2 -lm

Tinymembench

Standard Memcpy

OpenBenchmarking.orgMB/s, More Is BetterTinymembench 2018-05-28Standard Memcpy24 x DDR5-640024 x MRDIMM 888003K6K9K12K15KSE +/- 0.23, N = 3SE +/- 10.01, N = 315023.615008.91. (CC) gcc options: -O2 -lm

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 6024 x DDR5-640024 x MRDIMM 888004080120160200SE +/- 0.48, N = 3SE +/- 0.19, N = 3130.84170.951. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Java JMH

Throughput

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughput24 x MRDIMM 88800200000M400000M600000M800000M1000000M790759705068.81

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.324 x MRDIMM 8880030K60K90K120K150KSE +/- 801.06, N = 15124013.271. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compile24 x MRDIMM 88800306090120150SE +/- 1.02, N = 3138.99

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency24 x DDR5-640024 x MRDIMM 888001326395265SE +/- 0.18, N = 3SE +/- 0.67, N = 356.3155.371. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Write24 x DDR5-640024 x MRDIMM 888003K6K9K12K15KSE +/- 44.90, N = 3SE +/- 174.94, N = 314207144521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency24 x DDR5-640024 x MRDIMM 888001632486480SE +/- 0.55, N = 3SE +/- 0.11, N = 373.0872.691. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Write24 x DDR5-640024 x MRDIMM 888003K6K9K12K15KSE +/- 104.01, N = 3SE +/- 20.41, N = 313686137571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfig24 x MRDIMM 88800306090120150SE +/- 0.82, N = 3131.84

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfig24 x MRDIMM 88800612182430SE +/- 0.15, N = 1523.43

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1M24 x DDR5-640024 x MRDIMM 8880020406080100SE +/- 0.10, N = 3SE +/- 0.29, N = 389.1160.82

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig24 x MRDIMM 888000.30210.60420.90631.20841.5105SE +/- 0.011876, N = 151.3427681. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

MBW

Test: Memory Copy - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy - Array Size: 8192 MiB24 x DDR5-640024 x MRDIMM 888003K6K9K12K15KSE +/- 119.22, N = 8SE +/- 26.98, N = 314219.8215316.411. (CC) gcc options: -O3 -march=native

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d24 x DDR5-640024 x MRDIMM 8880020406080100SE +/- 0.38, N = 3SE +/- 0.04, N = 386.1569.811. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

MBW

Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB

OpenBenchmarking.orgMiB/s, More Is BetterMBW 2018-09-08Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB24 x DDR5-640024 x MRDIMM 888002K4K6K8K10KSE +/- 35.08, N = 3SE +/- 34.71, N = 38683.018766.211. (CC) gcc options: -O3 -march=native

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time24 x DDR5-640024 x MRDIMM 888002040608010080.0571.941. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time24 x DDR5-640024 x MRDIMM 88800306090120150157.09137.851. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja24 x MRDIMM 8880020406080100SE +/- 0.18, N = 376.60

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D24 x DDR5-640024 x MRDIMM 888007K14K21K28K35KSE +/- 90.50, N = 3SE +/- 1574.53, N = 1332963.4232324.721. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered Halfspace24 x DDR5-640024 x MRDIMM 888003691215SE +/- 0.071811125, N = 3SE +/- 0.111895654, N = 49.5660905759.2166930661. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered Halfspace24 x DDR5-640024 x MRDIMM 88800246810SE +/- 0.059546245, N = 3SE +/- 0.043473328, N = 37.5765941417.1135535611. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous Halfspace24 x DDR5-640024 x MRDIMM 888001.29052.5813.87155.1626.4525SE +/- 0.044695936, N = 3SE +/- 0.013586641, N = 35.7354518125.4177613211. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic Model24 x MRDIMM 888000.98551.9712.95653.9424.9275SE +/- 0.009660353, N = 34.3799271791. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. Helens24 x MRDIMM 888000.891.782.673.564.45SE +/- 0.025428571, N = 33.9554891791. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.224 x MRDIMM 888002000M4000M6000M8000M10000MSE +/- 17997194.71, N = 386065530001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time24 x DDR5-640024 x MRDIMM 8880051015202519.0018.391. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time24 x DDR5-640024 x MRDIMM 8880071421283528.9123.421. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B24 x DDR5-640024 x MRDIMM 8880080K160K240K320K400KSE +/- 2373.46, N = 3SE +/- 3986.08, N = 4349232.51373559.461. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C24 x DDR5-640024 x MRDIMM 88800200K400K600K800K1000KSE +/- 553.04, N = 3SE +/- 1537.19, N = 3759415.21804396.261. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig24 x MRDIMM 88800246810SE +/- 0.009571, N = 36.4726711. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D24 x DDR5-640024 x MRDIMM 888003K6K9K12K15KSE +/- 60.96, N = 3SE +/- 36.61, N = 313871.9615638.561. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C24 x DDR5-640024 x MRDIMM 88800160K320K480K640K800KSE +/- 786.48, N = 3SE +/- 2025.53, N = 3735889.76769386.671. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction24 x DDR5-640024 x MRDIMM 888000.65091.30181.95272.60363.2545SE +/- 0.00797243, N = 3SE +/- 0.02984810, N = 32.892966352.620234971. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C24 x DDR5-640024 x MRDIMM 8880030K60K90K120K150KSE +/- 1173.69, N = 3SE +/- 1589.02, N = 3113647.23118207.171. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C24 x DDR5-640024 x MRDIMM 88800100K200K300K400K500KSE +/- 966.97, N = 3SE +/- 1588.36, N = 3365424.17449470.321. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 6424 x MRDIMM 8880010002000300040005000SE +/- 244.26, N = 154501.41. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

System Power Consumption Monitor

Phoronix Test Suite System Monitoring

OpenBenchmarking.orgWattsSystem Power Consumption MonitorPhoronix Test Suite System Monitoring24 x MRDIMM 888002004006008001000Min: 183.7 / Avg: 465.09 / Max: 1108.2

Pennant

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800207455805OpenBenchmarking.orgWatts, Fewer Is BetterPennant 1.0.1System Power Consumption Monitor2004006008001000

Pennant

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800200394681OpenBenchmarking.orgWatts, Fewer Is BetterPennant 1.0.1System Power Consumption Monitor2004006008001000

Algebraic Multi-Grid Benchmark

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800194564954OpenBenchmarking.orgWatts, Fewer Is BetterAlgebraic Multi-Grid Benchmark 1.2System Power Consumption Monitor2004006008001000

libxsmm

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800188.8310.1387.9OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor100200300400500

libxsmm

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800203466879OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

Java JMH

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800206766882OpenBenchmarking.orgWatts, Fewer Is BetterJava JMHSystem Power Consumption Monitor2004006008001000

GROMACS

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800184310809OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2024System Power Consumption Monitor2004006008001000

Timed Node.js Compilation

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800207445834OpenBenchmarking.orgWatts, Fewer Is BetterTimed Node.js Compilation 21.7.2System Power Consumption Monitor2004006008001000

Timed LLVM Compilation

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800192359827OpenBenchmarking.orgWatts, Fewer Is BetterTimed LLVM Compilation 16.0System Power Consumption Monitor2004006008001000

Timed LLVM Compilation

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800203470858OpenBenchmarking.orgWatts, Fewer Is BetterTimed LLVM Compilation 16.0System Power Consumption Monitor2004006008001000

Timed Linux Kernel Compilation

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204555811OpenBenchmarking.orgWatts, Fewer Is BetterTimed Linux Kernel Compilation 6.8System Power Consumption Monitor2004006008001000

Timed Linux Kernel Compilation

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800198366848OpenBenchmarking.orgWatts, Fewer Is BetterTimed Linux Kernel Compilation 6.8System Power Consumption Monitor2004006008001000

LULESH

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800198496911OpenBenchmarking.orgWatts, Fewer Is BetterLULESH 2.0.3System Power Consumption Monitor2004006008001000

SPECFEM3D

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800198402790OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800205429801OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800201421756OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204470805OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204422804OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

OpenRadioss

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800209534864OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor2004006008001000

OpenFOAM

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800221652934OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

OpenFOAM

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800282523764OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 888002028071051OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800205424852OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800199.6357.7502.0OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor130260390520650

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800201.5368.8500.7OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor130260390520650

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800202.8367.3498.6OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor130260390520650

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800220.3386.6564.3OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor140280420560700

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800205399760OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor2004006008001000

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204435782OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor2004006008001000

NAS Parallel Benchmarks

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800203476900OpenBenchmarking.orgWatts, Fewer Is BetterNAS Parallel Benchmarks 3.4System Power Consumption Monitor2004006008001000

PostgreSQL

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800201.1483.3564.2OpenBenchmarking.orgWatts, Fewer Is BetterPostgreSQL 17System Power Consumption Monitor140280420560700

PostgreSQL

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204.9344.0378.0OpenBenchmarking.orgWatts, Fewer Is BetterPostgreSQL 17System Power Consumption Monitor100200300400500

PostgreSQL

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800197.7488.2566.6OpenBenchmarking.orgWatts, Fewer Is BetterPostgreSQL 17System Power Consumption Monitor140280420560700

PostgreSQL

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800199.5339.2373.5OpenBenchmarking.orgWatts, Fewer Is BetterPostgreSQL 17System Power Consumption Monitor100200300400500

Apache Cassandra

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800217.7372.0431.6OpenBenchmarking.orgWatts, Fewer Is BetterApache Cassandra 5.0System Power Consumption Monitor110220330440550

Tinymembench

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800213.8286.2394.6OpenBenchmarking.orgWatts, Fewer Is BetterTinymembench 2018-05-28System Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 8880035410401107OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 888002239601108OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

MBW

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800200.8289.6295.1OpenBenchmarking.orgWatts, Fewer Is BetterMBW 2018-09-08System Power Consumption Monitor70140210280350

MBW

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800204.6291.3297.2OpenBenchmarking.orgWatts, Fewer Is BetterMBW 2018-09-08System Power Consumption Monitor70140210280350

Stream

Type: Triad

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Triad24 x DDR5-640024 x MRDIMM 88800200K400K600K800K1000KSE +/- 18029.90, N = 5SE +/- 16674.48, N = 5893252.7879963.61. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Add

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Add24 x DDR5-640024 x MRDIMM 88800200K400K600K800K1000KSE +/- 19594.57, N = 5SE +/- 13677.37, N = 5887756.1937751.91. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Scale

OpenBenchmarking.orgMB/s, More Is BetterStream 2013-01-17Type: Scale24 x DDR5-640024 x MRDIMM 88800200K400K600K800K1000KSE +/- 18666.81, N = 5SE +/- 15910.14, N = 5848431.9952124.61. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

System Power Consumption Monitor

MinAvgMax24 x MRDIMM 88800198585924OpenBenchmarking.orgWatts, Fewer Is BetterStream 2013-01-17System Power Consumption Monitor2004006008001000


Phoronix Test Suite v10.8.5