AMD EPYC 9575F HPC Tuning Recommendations

Benchmarks for a future article by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2411294-NE-AMDEPYC9526&grs.

AMD EPYC 9575F HPC Tuning RecommendationsProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionDefaultHPC Tuning RecommendationsAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.12.0-rc7-linux-pm-next-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116Python Details- Python 3.12.7Security Details- Default: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - HPC Tuning Recommendations: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 9575F HPC Tuning Recommendationsxnnpack: FP16MobileNetV3Smallxnnpack: FP32MobileNetV3Largexnnpack: FP16MobileNetV3Largexnnpack: FP32MobileNetV2libxsmm: 32libxsmm: 64libxsmm: 128quicksilver: CORAL2 P2quicksilver: CTS2quicksilver: CORAL2 P1openfoam: drivaerFastback, Large Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timelibxsmm: 256openradioss: Bumper Beamhpcg: 160 160 160 - 60openfoam: drivaerFastback, Small Mesh Size - Mesh Timeheffte: c2c - FFTW - float - 256openfoam: drivaerFastback, Large Mesh Size - Mesh Timenamd: STMV with 1,066,628 Atomsopenradioss: Chrysler Neon 1Mincompact3d: input.i3d 193 Cells Per Directionmt-dgemm: Sustained Floating-Point Ratespecfem3d: Layered Halfspacespecfem3d: Mount St. Helensheffte: r2c - FFTW - float - 256graph500: 26graph500: 26graph500: 26qmcpack: FeCO6_b3lyp_gmsspecfem3d: Tomographic Modelopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timespecfem3d: Homogeneous Halfspaceopenradioss: Bird Strike on Windshieldgraph500: 26qmcpack: simple-H2Oqmcpack: LiH_ae_MSDamg: specfem3d: Water-layered Halfspaceqmcpack: Li2_STO_aeopenradioss: INIVOL and Fluid Structure Interaction Drop Containernwchem: C240 Buckyballlammps: 20k Atomscp2k: Fayalite-FISTqmcpack: O_ae_pyscf_UHFlaghos: Sedov Blast Wave, ube_922_hex.mesheasywave: e2Asean Grid + BengkuluSept2007 Source - 1200gromacs: MPI CPU - water_GMX50_bareopenradioss: Cell Phone Drop Testeasywave: e2Asean Grid + BengkuluSept2007 Source - 2400cp2k: H20-256openfoam: drivaerFastback, Small Mesh Size - Execution Timecp2k: H20-64laghos: Triple Point Problemqmcpack: H4_aeopenradioss: Rubber O-Ring Seal Installationheffte: r2c - FFTW - float - 128gpaw: Carbon Nanotubexnnpack: QS8MobileNetV2xnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV1xnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV1heffte: c2c - FFTW - float - 128incompact3d: X3D-benchmarking input.i3dhpcg: 144 144 144 - 60DefaultHPC Tuning Recommendations5449725870464758948.71799.13358.02466666726510000366100008380.5323283.11872422.766.0940.348118.974323178.295603.694923.72199125.147.955934055183.43969016.5481654826.206598110374.9021335290000130556000060015300069.2567.31000888884.1193269.00769683082.7747037800019.29552.358319083825015.78184475172.84190.691249.153.20962.271142.31567.0920.81414.53317.7651.643137.80224.57934414.182295.588.70238.44332.01327.85750324681246753982429223.464325.05256841.09402842395938562651548.41060.62007.91502000017513333276533336426.0644218.127753098.952.7349.090815.967165210.419517.584033.24773110.257.012523174633.21771718.4870962496.901913424411.0721461890000142009000064694200073.9946.87288497479.4680328.51437893178.2749571100020.22754.872306456475016.4192799175.75394.311298.251.42064.352146.58551.1221.26514.22918.0752.469135.94524.25671414.308293.038.75438.33332.91327.83726172553148929411686235.079245.67947551.0171OpenBenchmarking.org

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3SmallDefaultHPC Tuning Recommendations12002400360048006000SE +/- 65.16, N = 3SE +/- 15.58, N = 10544928421. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3LargeDefaultHPC Tuning Recommendations16003200480064008000SE +/- 43.97, N = 3SE +/- 23.40, N = 10725839591. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3LargeDefaultHPC Tuning Recommendations15003000450060007500SE +/- 11.89, N = 3SE +/- 55.41, N = 10704638561. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV2DefaultHPC Tuning Recommendations10002000300040005000SE +/- 24.31, N = 3SE +/- 49.78, N = 10475826511. (CXX) g++ options: -O3 -lrt -lm

libxsmm

M N K: 32

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32DefaultHPC Tuning Recommendations2004006008001000SE +/- 0.78, N = 6SE +/- 6.50, N = 15948.7548.41. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 64

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64DefaultHPC Tuning Recommendations400800120016002000SE +/- 2.83, N = 6SE +/- 8.66, N = 151799.11060.61. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

M N K: 128

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 128DefaultHPC Tuning Recommendations7001400210028003500SE +/- 10.68, N = 3SE +/- 24.99, N = 43358.02007.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

Quicksilver

Input: CORAL2 P2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P2DefaultHPC Tuning Recommendations5M10M15M20M25MSE +/- 29627.31, N = 3SE +/- 11547.01, N = 324666667150200001. (CXX) g++ options: -fopenmp -O3 -march=native

Quicksilver

Input: CTS2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CTS2DefaultHPC Tuning Recommendations6M12M18M24M30MSE +/- 60827.63, N = 3SE +/- 110503.90, N = 326510000175133331. (CXX) g++ options: -fopenmp -O3 -march=native

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P1DefaultHPC Tuning Recommendations8M16M24M32M40MSE +/- 80000.00, N = 3SE +/- 100388.14, N = 336610000276533331. (CXX) g++ options: -fopenmp -O3 -march=native

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Execution TimeDefaultHPC Tuning Recommendations2K4K6K8K10K8380.536426.061. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeDefaultHPC Tuning Recommendations60120180240300283.12218.131. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

libxsmm

M N K: 256

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256DefaultHPC Tuning Recommendations7001400210028003500SE +/- 4.19, N = 3SE +/- 23.61, N = 32422.73098.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamDefaultHPC Tuning Recommendations1530456075SE +/- 0.61, N = 15SE +/- 0.19, N = 366.0952.73

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 60DefaultHPC Tuning Recommendations1122334455SE +/- 0.27, N = 3SE +/- 0.11, N = 340.3549.091. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations51015202518.9715.971. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256DefaultHPC Tuning Recommendations50100150200250SE +/- 2.31, N = 15SE +/- 0.34, N = 13178.30210.421. (CXX) g++ options: -O3

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations130260390520650603.69517.581. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

NAMD

Input: STMV with 1,066,628 Atoms

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: STMV with 1,066,628 AtomsDefaultHPC Tuning Recommendations0.83741.67482.51223.34964.187SE +/- 0.00564, N = 4SE +/- 0.00833, N = 33.721993.24773

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MDefaultHPC Tuning Recommendations306090120150SE +/- 1.75, N = 12SE +/- 0.10, N = 3125.14110.25

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionDefaultHPC Tuning Recommendations246810SE +/- 0.05610512, N = 5SE +/- 0.02216234, N = 67.955934057.012523171. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateDefaultHPC Tuning Recommendations11002200330044005500SE +/- 5.20, N = 5SE +/- 7.97, N = 55183.444633.221. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Layered HalfspaceDefaultHPC Tuning Recommendations510152025SE +/- 0.18, N = 3SE +/- 0.04, N = 316.5518.491. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Mount St. HelensDefaultHPC Tuning Recommendations246810SE +/- 0.051223682, N = 9SE +/- 0.013455303, N = 56.2065981106.9019134241. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256DefaultHPC Tuning Recommendations90180270360450SE +/- 2.51, N = 13SE +/- 2.78, N = 15374.90411.071. (CXX) g++ options: -O3

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations300M600M900M1200M1500M133529000014618900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations300M600M900M1200M1500M130556000014200900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations140M280M420M560M700M6001530006469420001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: FeCO6_b3lyp_gmsDefaultHPC Tuning Recommendations1632486480SE +/- 0.12, N = 3SE +/- 0.07, N = 369.2673.991. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic ModelDefaultHPC Tuning Recommendations246810SE +/- 0.069507248, N = 6SE +/- 0.017950948, N = 57.3100088886.8728849741. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations2040608010084.1279.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous HalfspaceDefaultHPC Tuning Recommendations3691215SE +/- 0.067842113, N = 5SE +/- 0.018784174, N = 59.0076968308.5143789311. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldDefaultHPC Tuning Recommendations20406080100SE +/- 0.12, N = 3SE +/- 0.26, N = 382.7778.27

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations110M220M330M440M550M4703780004957110001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: simple-H2ODefaultHPC Tuning Recommendations510152025SE +/- 0.14, N = 12SE +/- 0.01, N = 319.3020.231. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

QMCPACK

Input: LiH_ae_MSD

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: LiH_ae_MSDDefaultHPC Tuning Recommendations1224364860SE +/- 0.14, N = 3SE +/- 0.15, N = 352.3654.871. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2DefaultHPC Tuning Recommendations700M1400M2100M2800M3500MSE +/- 11683916.89, N = 4SE +/- 21054535.14, N = 431908382503064564750

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Water-layered HalfspaceDefaultHPC Tuning Recommendations48121620SE +/- 0.14, N = 3SE +/- 0.09, N = 315.7816.421. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

QMCPACK

Input: Li2_STO_ae

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: Li2_STO_aeDefaultHPC Tuning Recommendations20406080100SE +/- 0.21, N = 3SE +/- 0.16, N = 372.8475.751. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop ContainerDefaultHPC Tuning Recommendations20406080100SE +/- 0.35, N = 3SE +/- 0.16, N = 390.6994.31

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.2.3Input: C240 BuckyballDefaultHPC Tuning Recommendations300600900120015001249.11298.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lfcidump -lgwmol -lga -larmci -lpeigs -l64to32 -llapack -lopenblas -lpthread -lrt -lcomex -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -ffast-math -std=legacy -fdefault-integer-8 -O0

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsDefaultHPC Tuning Recommendations1224364860SE +/- 0.14, N = 3SE +/- 0.16, N = 353.2151.421. (CXX) g++ options: -O3 -lm -ldl

CP2K Molecular Dynamics

Input: Fayalite-FIST

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2024.3Input: Fayalite-FISTDefaultHPC Tuning Recommendations1428425670SE +/- 0.10, N = 3SE +/- 0.06, N = 362.2764.351. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

QMCPACK

Input: O_ae_pyscf_UHF

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: O_ae_pyscf_UHFDefaultHPC Tuning Recommendations306090120150SE +/- 1.70, N = 3SE +/- 0.88, N = 3142.31146.581. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshDefaultHPC Tuning Recommendations120240360480600SE +/- 3.42, N = 3SE +/- 5.64, N = 3567.09551.121. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

easyWave

Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200DefaultHPC Tuning Recommendations510152025SE +/- 0.23, N = 3SE +/- 0.16, N = 320.8121.271. (CXX) g++ options: -O3 -fopenmp

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareDefaultHPC Tuning Recommendations48121620SE +/- 0.00, N = 3SE +/- 0.01, N = 314.5314.231. (CXX) g++ options: -O3 -lm

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestDefaultHPC Tuning Recommendations48121620SE +/- 0.16, N = 3SE +/- 0.02, N = 317.7618.07

easyWave

Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400DefaultHPC Tuning Recommendations1224364860SE +/- 0.13, N = 3SE +/- 0.49, N = 1551.6452.471. (CXX) g++ options: -O3 -fopenmp

CP2K Molecular Dynamics

Input: H20-256

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2024.3Input: H20-256DefaultHPC Tuning Recommendations306090120150SE +/- 0.29, N = 3SE +/- 0.47, N = 3137.80135.951. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeDefaultHPC Tuning Recommendations61218243024.5824.261. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

CP2K Molecular Dynamics

Input: H20-64

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2024.3Input: H20-64DefaultHPC Tuning Recommendations48121620SE +/- 0.12, N = 4SE +/- 0.05, N = 414.1814.311. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point ProblemDefaultHPC Tuning Recommendations60120180240300SE +/- 1.56, N = 3SE +/- 2.26, N = 3295.58293.031. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

QMCPACK

Input: H4_ae

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: H4_aeDefaultHPC Tuning Recommendations246810SE +/- 0.079, N = 15SE +/- 0.010, N = 58.7028.7541. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationDefaultHPC Tuning Recommendations918273645SE +/- 0.41, N = 4SE +/- 0.02, N = 338.4438.33

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128DefaultHPC Tuning Recommendations70140210280350SE +/- 1.37, N = 14SE +/- 3.79, N = 15332.01332.911. (CXX) g++ options: -O3

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeDefaultHPC Tuning Recommendations714212835SE +/- 0.17, N = 3SE +/- 0.07, N = 327.8627.841. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

NWChem

System Power Consumption Monitor

MinAvgMaxDefault100602654HPC Tuning Recommendations98530586OpenBenchmarking.orgWatts, Fewer Is BetterNWChem 7.2.3System Power Consumption Monitor2004006008001000

NWChem

CPU Power Consumption Monitor

MinAvgMaxDefault83.7391.1402.0HPC Tuning Recommendations2.0339.3355.6OpenBenchmarking.orgWatts, Fewer Is BetterNWChem 7.2.3CPU Power Consumption Monitor110220330440550

Laghos

System Power Consumption Monitor

MinAvgMaxDefault100.2529.5599.0HPC Tuning Recommendations97.8456.8515.3OpenBenchmarking.orgWatts, Fewer Is BetterLaghos 3.1System Power Consumption Monitor160320480640800

Laghos

CPU Power Consumption Monitor

MinAvgMaxDefault139.0359.9401.8HPC Tuning Recommendations0.5291.6342.2OpenBenchmarking.orgWatts, Fewer Is BetterLaghos 3.1CPU Power Consumption Monitor110220330440550

Laghos

System Power Consumption Monitor

MinAvgMaxDefault99.0545.1595.4HPC Tuning Recommendations97.3461.1504.9OpenBenchmarking.orgWatts, Fewer Is BetterLaghos 3.1System Power Consumption Monitor160320480640800

Laghos

CPU Power Consumption Monitor

MinAvgMaxDefault81.7371.9401.8HPC Tuning Recommendations0.8298.2336.5OpenBenchmarking.orgWatts, Fewer Is BetterLaghos 3.1CPU Power Consumption Monitor110220330440550

Quicksilver

System Power Consumption Monitor

MinAvgMaxDefault98.8459.3479.5HPC Tuning Recommendations97.3385.5396.7OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818System Power Consumption Monitor120240360480600

Quicksilver

CPU Power Consumption Monitor

MinAvgMaxDefault0.4303.6316.5HPC Tuning Recommendations109.1253.0259.3OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818CPU Power Consumption Monitor80160240320400

Quicksilver

System Power Consumption Monitor

MinAvgMaxDefault97.5436.2452.5HPC Tuning Recommendations96.3335.1341.2OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818System Power Consumption Monitor120240360480600

Quicksilver

CPU Power Consumption Monitor

MinAvgMaxDefault72.5293.5302.9HPC Tuning Recommendations2.2213.7219.3OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818CPU Power Consumption Monitor80160240320400

Quicksilver

System Power Consumption Monitor

MinAvgMaxDefault100.0419.6484.9HPC Tuning Recommendations98.3373.7417.6OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818System Power Consumption Monitor120240360480600

Quicksilver

CPU Power Consumption Monitor

MinAvgMaxDefault0.1278.0325.5HPC Tuning Recommendations0.7238.8274.3OpenBenchmarking.orgWatts, Fewer Is BetterQuicksilver 20230818CPU Power Consumption Monitor80160240320400

libxsmm

System Power Consumption Monitor

MinAvgMaxDefault103486644HPC Tuning Recommendations98387556OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

libxsmm

CPU Power Consumption Monitor

MinAvgMaxDefault107.8296.6387.2HPC Tuning Recommendations1.9236.7327.5OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645CPU Power Consumption Monitor100200300400500

libxsmm

System Power Consumption Monitor

MinAvgMaxDefault99361702HPC Tuning Recommendations100317504OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

libxsmm

CPU Power Consumption Monitor

MinAvgMaxDefault79.0206.8336.1HPC Tuning Recommendations89.2184.8285.0OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645CPU Power Consumption Monitor80160240320400

libxsmm

System Power Consumption Monitor

MinAvgMaxDefault99417697HPC Tuning Recommendations98421605OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

libxsmm

CPU Power Consumption Monitor

MinAvgMaxDefault108.7264.5402.1HPC Tuning Recommendations0.6234.2337.2OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645CPU Power Consumption Monitor110220330440550

libxsmm

System Power Consumption Monitor

MinAvgMaxDefault96396685HPC Tuning Recommendations96401595OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

libxsmm

CPU Power Consumption Monitor

MinAvgMaxDefault105.8250.2402.2HPC Tuning Recommendations1.5224.4328.6OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645CPU Power Consumption Monitor110220330440550

XNNPACK

System Power Consumption Monitor

MinAvgMaxDefault94.9438.7489.6HPC Tuning Recommendations97.7353.6403.1OpenBenchmarking.orgWatts, Fewer Is BetterXNNPACK b7b048System Power Consumption Monitor130260390520650

XNNPACK

CPU Power Consumption Monitor

MinAvgMaxDefault126.0294.5327.8HPC Tuning Recommendations1.9224.7253.8OpenBenchmarking.orgWatts, Fewer Is BetterXNNPACK b7b048CPU Power Consumption Monitor80160240320400

XNNPACK

Model: QS8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: QS8MobileNetV2DefaultHPC Tuning Recommendations11002200330044005500SE +/- 15.76, N = 3SE +/- 75.96, N = 10503226171. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV2DefaultHPC Tuning Recommendations10002000300040005000SE +/- 14.43, N = 3SE +/- 73.59, N = 10468125531. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV1DefaultHPC Tuning Recommendations5001000150020002500SE +/- 1.45, N = 3SE +/- 75.71, N = 10246714891. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3SmallDefaultHPC Tuning Recommendations12002400360048006000SE +/- 13.20, N = 3SE +/- 99.39, N = 10539829411. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV1DefaultHPC Tuning Recommendations5001000150020002500SE +/- 5.13, N = 3SE +/- 170.15, N = 10242916861. (CXX) g++ options: -O3 -lrt -lm

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault96.5140.7338.4HPC Tuning Recommendations95.2130.9273.3OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor80160240320400

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault57.071.982.1HPC Tuning Recommendations1.759.582.0OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault95.4105.9168.3HPC Tuning Recommendations95.2128.0236.1OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor60120180240300

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault56.761.671.7HPC Tuning Recommendations1.454.672.9OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128DefaultHPC Tuning Recommendations50100150200250SE +/- 0.74, N = 14SE +/- 3.65, N = 15223.46235.081. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault97.1122.4223.2HPC Tuning Recommendations95.8110.6193.0OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor60120180240300

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault0.860.967.4HPC Tuning Recommendations1.257.877.6OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault98.6122.6264.0HPC Tuning Recommendations98.0128.7243.3OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor70140210280350

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault0.658.375.3HPC Tuning Recommendations1.054.873.7OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

GPAW

System Power Consumption Monitor

MinAvgMaxDefault105585660HPC Tuning Recommendations105526605OpenBenchmarking.orgWatts, Fewer Is BetterGPAW 23.6System Power Consumption Monitor2004006008001000

GPAW

CPU Power Consumption Monitor

MinAvgMaxDefault89.8350.3401.7HPC Tuning Recommendations2.7292.4361.6OpenBenchmarking.orgWatts, Fewer Is BetterGPAW 23.6CPU Power Consumption Monitor110220330440550

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMaxDefault106642685HPC Tuning Recommendations101581613OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

Xcompact3d Incompact3d

CPU Power Consumption Monitor

MinAvgMaxDefault0.1387.2401.6HPC Tuning Recommendations1.0333.4344.1OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11CPU Power Consumption Monitor110220330440550

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dDefaultHPC Tuning Recommendations70140210280350SE +/- 7.62, N = 9SE +/- 3.22, N = 9325.05245.681. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMaxDefault105436674HPC Tuning Recommendations104332604OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

Xcompact3d Incompact3d

CPU Power Consumption Monitor

MinAvgMaxDefault138.1269.9401.8HPC Tuning Recommendations0.0209.2345.9OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11CPU Power Consumption Monitor110220330440550

OpenFOAM

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption MonitorDefaultHPC Tuning Recommendations120240360480600Min: 109.8 / Avg: 642.84 / Max: 689.8Min: 104 / Avg: 599.41 / Max: 635.5

OpenFOAM

CPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption MonitorDefaultHPC Tuning Recommendations70140210280350Min: 54.24 / Avg: 392.08 / Max: 401.87Min: 76.13 / Avg: 350.87 / Max: 357.15

OpenFOAM

System Power Consumption Monitor

MinAvgMaxDefault101638679HPC Tuning Recommendations97589629OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

OpenFOAM

CPU Power Consumption Monitor

MinAvgMaxDefault53.0386.9401.6HPC Tuning Recommendations1.8339.6360.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption Monitor110220330440550

OpenFOAM

System Power Consumption Monitor

MinAvgMaxDefault106555638HPC Tuning Recommendations104481577OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

OpenFOAM

CPU Power Consumption Monitor

MinAvgMaxDefault0.6302.0401.3HPC Tuning Recommendations0.3277.4356.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault103603683HPC Tuning Recommendations99546616OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor2004006008001000

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault50.9356.6402.2HPC Tuning Recommendations1.4308.0354.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.8573.0621.3HPC Tuning Recommendations98.6507.4545.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault56.9369.5401.9HPC Tuning Recommendations4.8314.4347.8OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.6542.4592.6HPC Tuning Recommendations98.9475.2518.6OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault0.7356.9393.1HPC Tuning Recommendations56.2308.7341.2OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault100.3457.4611.9HPC Tuning Recommendations98.1405.0536.5OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault56.8286.3399.5HPC Tuning Recommendations4.0233.8346.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.9546.0607.6HPC Tuning Recommendations99.2467.1534.7OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault50.3347.4399.7HPC Tuning Recommendations56.5301.4345.7OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault107.5572.9612.9HPC Tuning Recommendations97.9494.3544.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault2.1363.4399.6HPC Tuning Recommendations5.2308.6349.2OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault97.8409.9612.7HPC Tuning Recommendations97.1345.2505.6OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor160320480640800

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault100.1257.1401.9HPC Tuning Recommendations3.6200.8332.3OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault99387629HPC Tuning Recommendations98325520OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault66.2242.1374.8HPC Tuning Recommendations2.0200.4334.9OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor100200300400500

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault98.2419.3614.1HPC Tuning Recommendations96.9356.2508.1OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor160320480640800

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault1.6245.0401.9HPC Tuning Recommendations0.0210.7333.2OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault99503643HPC Tuning Recommendations97427539OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault106.0316.5402.1HPC Tuning Recommendations5.6231.0339.9OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault99517655HPC Tuning Recommendations95442540OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor2004006008001000

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault85.0319.7402.1HPC Tuning Recommendations3.2254.3339.5OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

easyWave

System Power Consumption Monitor

MinAvgMaxDefault102.8309.3342.8HPC Tuning Recommendations95.4275.7353.2OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34System Power Consumption Monitor100200300400500

easyWave

CPU Power Consumption Monitor

MinAvgMaxDefault0.5167.2192.5HPC Tuning Recommendations3.6150.4185.3OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34CPU Power Consumption Monitor50100150200250

easyWave

System Power Consumption Monitor

MinAvgMaxDefault101.7282.5387.8HPC Tuning Recommendations101.6250.7312.6OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34System Power Consumption Monitor100200300400500

easyWave

CPU Power Consumption Monitor

MinAvgMaxDefault55.9166.9218.7HPC Tuning Recommendations2.0130.9167.2OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34CPU Power Consumption Monitor60120180240300

CP2K Molecular Dynamics

System Power Consumption Monitor

MinAvgMaxDefault102626653HPC Tuning Recommendations99552582OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3System Power Consumption Monitor2004006008001000

CP2K Molecular Dynamics

CPU Power Consumption Monitor

MinAvgMaxDefault2.9382.1400.9HPC Tuning Recommendations0.7328.5346.7OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3CPU Power Consumption Monitor110220330440550

CP2K Molecular Dynamics

System Power Consumption Monitor

MinAvgMaxDefault99504628HPC Tuning Recommendations101470551OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3System Power Consumption Monitor2004006008001000

CP2K Molecular Dynamics

CPU Power Consumption Monitor

MinAvgMaxDefault84.6316.9401.9HPC Tuning Recommendations7.1244.8346.1OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3CPU Power Consumption Monitor110220330440550

CP2K Molecular Dynamics

System Power Consumption Monitor

MinAvgMaxDefault102.4569.2609.0HPC Tuning Recommendations97.5509.7539.3OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3System Power Consumption Monitor160320480640800

CP2K Molecular Dynamics

CPU Power Consumption Monitor

MinAvgMaxDefault128.4373.7400.0HPC Tuning Recommendations7.0317.6347.9OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3CPU Power Consumption Monitor110220330440550

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault98.1422.7584.2HPC Tuning Recommendations99.0360.1507.2OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor160320480640800

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault85.1277.2395.1HPC Tuning Recommendations4.5220.5335.3OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor110220330440550

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault99.3549.5584.9HPC Tuning Recommendations99.9476.2516.8OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor160320480640800

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault135.4358.6382.5HPC Tuning Recommendations6.1304.3337.1OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor100200300400500

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault100.8562.8584.9HPC Tuning Recommendations98.6502.0517.6OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor160320480640800

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault84.7380.5396.3HPC Tuning Recommendations2.8332.3347.9OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor110220330440550

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault100.3536.4573.8HPC Tuning Recommendations98.7460.4489.9OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor140280420560700

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault2.1347.0379.6HPC Tuning Recommendations1.7298.1322.8OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor100200300400500

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault98.0547.7589.6HPC Tuning Recommendations98.2484.1517.6OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor160320480640800

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault103.6373.4398.0HPC Tuning Recommendations89.6324.2346.2OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor110220330440550

QMCPACK

System Power Consumption Monitor

MinAvgMaxDefault99.0506.2592.7HPC Tuning Recommendations100.9412.1484.5OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1System Power Consumption Monitor160320480640800

QMCPACK

CPU Power Consumption Monitor

MinAvgMaxDefault82.3333.2401.8HPC Tuning Recommendations7.7246.9327.1OpenBenchmarking.orgWatts, Fewer Is BetterQMCPACK 3.17.1CPU Power Consumption Monitor110220330440550

Algebraic Multi-Grid Benchmark

System Power Consumption Monitor

MinAvgMaxDefault103508679HPC Tuning Recommendations100422582OpenBenchmarking.orgWatts, Fewer Is BetterAlgebraic Multi-Grid Benchmark 1.2System Power Consumption Monitor2004006008001000

Algebraic Multi-Grid Benchmark

CPU Power Consumption Monitor

MinAvgMaxDefault0.7261.0399.4HPC Tuning Recommendations4.4220.6335.1OpenBenchmarking.orgWatts, Fewer Is BetterAlgebraic Multi-Grid Benchmark 1.2CPU Power Consumption Monitor110220330440550

ACES DGEMM

System Power Consumption Monitor

MinAvgMaxDefault105.2427.8566.3HPC Tuning Recommendations102.7339.1437.5OpenBenchmarking.orgWatts, Fewer Is BetterACES DGEMM 1.0System Power Consumption Monitor140280420560700

ACES DGEMM

CPU Power Consumption Monitor

MinAvgMaxDefault83.9245.6341.5HPC Tuning Recommendations1.5181.3266.5OpenBenchmarking.orgWatts, Fewer Is BetterACES DGEMM 1.0CPU Power Consumption Monitor80160240320400

Graph500

System Power Consumption Monitor

MinAvgMaxDefault109636653HPC Tuning Recommendations108564576OpenBenchmarking.orgWatts, Fewer Is BetterGraph500 3.0System Power Consumption Monitor2004006008001000

Graph500

CPU Power Consumption Monitor

MinAvgMaxDefault137.4396.7401.7HPC Tuning Recommendations8.0337.5349.4OpenBenchmarking.orgWatts, Fewer Is BetterGraph500 3.0CPU Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMaxDefault106637679HPC Tuning Recommendations109580607OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

High Performance Conjugate Gradient

CPU Power Consumption Monitor

MinAvgMaxDefault84.2387.0402.1HPC Tuning Recommendations6.1337.3347.9OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1CPU Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMaxDefault101639679HPC Tuning Recommendations100578611OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

High Performance Conjugate Gradient

CPU Power Consumption Monitor

MinAvgMaxDefault83.7387.3402.2HPC Tuning Recommendations25.4336.6347.9OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1CPU Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60DefaultHPC Tuning Recommendations1224364860SE +/- 0.58, N = 9SE +/- 1.04, N = 1241.0951.021. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

LAMMPS Molecular Dynamics Simulator

System Power Consumption Monitor

MinAvgMaxDefault96.4588.0601.4HPC Tuning Recommendations97.3524.0539.4OpenBenchmarking.orgWatts, Fewer Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022System Power Consumption Monitor160320480640800

LAMMPS Molecular Dynamics Simulator

CPU Power Consumption Monitor

MinAvgMaxDefault84.2390.0401.9HPC Tuning Recommendations79.8342.7356.6OpenBenchmarking.orgWatts, Fewer Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022CPU Power Consumption Monitor110220330440550

NAMD

System Power Consumption Monitor

MinAvgMaxDefault99374648HPC Tuning Recommendations99379566OpenBenchmarking.orgWatts, Fewer Is BetterNAMD 3.0System Power Consumption Monitor2004006008001000

NAMD

CPU Power Consumption Monitor

MinAvgMaxDefault50.8256.9392.7HPC Tuning Recommendations56.8218.9347.3OpenBenchmarking.orgWatts, Fewer Is BetterNAMD 3.0CPU Power Consumption Monitor110220330440550

GROMACS

System Power Consumption Monitor

MinAvgMaxDefault96505637HPC Tuning Recommendations98443562OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2024System Power Consumption Monitor2004006008001000

GROMACS

CPU Power Consumption Monitor

MinAvgMaxDefault36.9298.3401.9HPC Tuning Recommendations36.9248.8346.5OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2024CPU Power Consumption Monitor110220330440550


Phoronix Test Suite v10.8.5