AMD EPYC 9575F HPC Tuning Guide

Benchmarks for a future article by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2411294-NE-AMDEPYC9542.

AMD EPYC 9575F HPC Tuning GuideProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionDefaultHPC Tuning RecommendationsAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.12.0-rc7-linux-pm-next-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116Python Details- Python 3.12.7Security Details- Default: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - HPC Tuning Recommendations: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 9575F HPC Tuning Guidegromacs: MPI CPU - water_GMX50_barehpcg: 144 144 144 - 60hpcg: 160 160 160 - 60graph500: 26graph500: 26graph500: 26graph500: 26cp2k: H20-256easywave: e2Asean Grid + BengkuluSept2007 Source - 1200easywave: e2Asean Grid + BengkuluSept2007 Source - 2400specfem3d: Homogeneous Halfspacespecfem3d: Tomographic Modelopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: Cell Phone Drop Testopenradioss: Bumper Beamopenradioss: INIVOL and Fluid Structure Interaction Drop Containeropenradioss: Chrysler Neon 1Mopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Large Mesh Size - Mesh Timeopenfoam: drivaerFastback, Large Mesh Size - Execution Timeincompact3d: input.i3d 193 Cells Per Directionincompact3d: X3D-benchmarking input.i3dgpaw: Carbon Nanotubeheffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float - 256heffte: c2c - FFTW - float - 128heffte: c2c - FFTW - float - 256xnnpack: FP32MobileNetV1xnnpack: FP32MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP16MobileNetV1xnnpack: FP16MobileNetV2xnnpack: FP16MobileNetV3Largexnnpack: FP16MobileNetV3Smallxnnpack: QS8MobileNetV2libxsmm: 256nwchem: C240 BuckyballDefaultHPC Tuning Recommendations14.53341.094040.348113055600001335290000470378000600153000137.80220.81451.6439.0076968307.31000888882.7738.4417.7666.0990.69125.1418.97432324.57934484.119326283.1187603.694928380.53237.95593405325.05256827.857332.013374.902223.464178.2952429475872585398246746817046544950322422.71249.114.22951.017149.090814200900001461890000495711000646942000135.94521.26552.4698.5143789316.87288497478.2738.3318.0752.7394.31110.2515.96716524.25671479.468032218.12775517.584036426.06447.01252317245.67947527.837332.913411.072235.079210.4191686265139592941148925533856284226173098.91298.2OpenBenchmarking.org

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareDefaultHPC Tuning Recommendations48121620SE +/- 0.00, N = 3SE +/- 0.01, N = 314.5314.231. (CXX) g++ options: -O3 -lm

GROMACS

CPU Power Consumption Monitor

MinAvgMaxDefault36.9298.3401.9HPC Tuning Recommendations36.9248.8346.5OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2024CPU Power Consumption Monitor110220330440550

GROMACS

System Power Consumption Monitor

MinAvgMaxDefault96505637HPC Tuning Recommendations98443562OpenBenchmarking.orgWatts, Fewer Is BetterGROMACS 2024System Power Consumption Monitor2004006008001000

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60DefaultHPC Tuning Recommendations1224364860SE +/- 0.58, N = 9SE +/- 1.04, N = 1241.0951.021. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

CPU Power Consumption Monitor

MinAvgMaxDefault83.7387.3402.2HPC Tuning Recommendations25.4336.6347.9OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1CPU Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMaxDefault101639679HPC Tuning Recommendations100578611OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 60DefaultHPC Tuning Recommendations1122334455SE +/- 0.27, N = 3SE +/- 0.11, N = 340.3549.091. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

CPU Power Consumption Monitor

MinAvgMaxDefault84.2387.0402.1HPC Tuning Recommendations6.1337.3347.9OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1CPU Power Consumption Monitor110220330440550

High Performance Conjugate Gradient

System Power Consumption Monitor

MinAvgMaxDefault106637679HPC Tuning Recommendations109580607OpenBenchmarking.orgWatts, Fewer Is BetterHigh Performance Conjugate Gradient 3.1System Power Consumption Monitor2004006008001000

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations300M600M900M1200M1500M130556000014200900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations300M600M900M1200M1500M133529000014618900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations110M220M330M440M550M4703780004957110001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26DefaultHPC Tuning Recommendations140M280M420M560M700M6001530006469420001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

CPU Power Consumption Monitor

MinAvgMaxDefault137.4396.7401.7HPC Tuning Recommendations8.0337.5349.4OpenBenchmarking.orgWatts, Fewer Is BetterGraph500 3.0CPU Power Consumption Monitor110220330440550

Graph500

System Power Consumption Monitor

MinAvgMaxDefault109636653HPC Tuning Recommendations108564576OpenBenchmarking.orgWatts, Fewer Is BetterGraph500 3.0System Power Consumption Monitor2004006008001000

CP2K Molecular Dynamics

Input: H20-256

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 2024.3Input: H20-256DefaultHPC Tuning Recommendations306090120150SE +/- 0.29, N = 3SE +/- 0.47, N = 3137.80135.951. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

CP2K Molecular Dynamics

CPU Power Consumption Monitor

MinAvgMaxDefault2.9382.1400.9HPC Tuning Recommendations0.7328.5346.7OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3CPU Power Consumption Monitor110220330440550

CP2K Molecular Dynamics

System Power Consumption Monitor

MinAvgMaxDefault102626653HPC Tuning Recommendations99552582OpenBenchmarking.orgWatts, Fewer Is BetterCP2K Molecular Dynamics 2024.3System Power Consumption Monitor2004006008001000

easyWave

Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200DefaultHPC Tuning Recommendations510152025SE +/- 0.23, N = 3SE +/- 0.16, N = 320.8121.271. (CXX) g++ options: -O3 -fopenmp

easyWave

CPU Power Consumption Monitor

MinAvgMaxDefault55.9166.9218.7HPC Tuning Recommendations2.0130.9167.2OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34CPU Power Consumption Monitor60120180240300

easyWave

System Power Consumption Monitor

MinAvgMaxDefault101.7282.5387.8HPC Tuning Recommendations101.6250.7312.6OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34System Power Consumption Monitor100200300400500

easyWave

Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400DefaultHPC Tuning Recommendations1224364860SE +/- 0.13, N = 3SE +/- 0.49, N = 1551.6452.471. (CXX) g++ options: -O3 -fopenmp

easyWave

CPU Power Consumption Monitor

MinAvgMaxDefault0.5167.2192.5HPC Tuning Recommendations3.6150.4185.3OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34CPU Power Consumption Monitor50100150200250

easyWave

System Power Consumption Monitor

MinAvgMaxDefault102.8309.3342.8HPC Tuning Recommendations95.4275.7353.2OpenBenchmarking.orgWatts, Fewer Is BettereasyWave r34System Power Consumption Monitor100200300400500

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Homogeneous HalfspaceDefaultHPC Tuning Recommendations3691215SE +/- 0.067842113, N = 5SE +/- 0.018784174, N = 59.0076968308.5143789311. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault1.6245.0401.9HPC Tuning Recommendations0.0210.7333.2OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault98.2419.3614.1HPC Tuning Recommendations96.9356.2508.1OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor160320480640800

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.1.1Model: Tomographic ModelDefaultHPC Tuning Recommendations246810SE +/- 0.069507248, N = 6SE +/- 0.017950948, N = 57.3100088886.8728849741. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

CPU Power Consumption Monitor

MinAvgMaxDefault100.1257.1401.9HPC Tuning Recommendations3.6200.8332.3OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1CPU Power Consumption Monitor110220330440550

SPECFEM3D

System Power Consumption Monitor

MinAvgMaxDefault97.8409.9612.7HPC Tuning Recommendations97.1345.2505.6OpenBenchmarking.orgWatts, Fewer Is BetterSPECFEM3D 4.1.1System Power Consumption Monitor160320480640800

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldDefaultHPC Tuning Recommendations20406080100SE +/- 0.12, N = 3SE +/- 0.26, N = 382.7778.27

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault2.1363.4399.6HPC Tuning Recommendations5.2308.6349.2OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault107.5572.9612.9HPC Tuning Recommendations97.9494.3544.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

Model: Rubber O-Ring Seal Installation

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationDefaultHPC Tuning Recommendations918273645SE +/- 0.41, N = 4SE +/- 0.02, N = 338.4438.33

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault50.3347.4399.7HPC Tuning Recommendations56.5301.4345.7OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.9546.0607.6HPC Tuning Recommendations99.2467.1534.7OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

Model: Cell Phone Drop Test

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestDefaultHPC Tuning Recommendations48121620SE +/- 0.16, N = 3SE +/- 0.02, N = 317.7618.07

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault56.8286.3399.5HPC Tuning Recommendations4.0233.8346.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault100.3457.4611.9HPC Tuning Recommendations98.1405.0536.5OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamDefaultHPC Tuning Recommendations1530456075SE +/- 0.61, N = 15SE +/- 0.19, N = 366.0952.73

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault0.7356.9393.1HPC Tuning Recommendations56.2308.7341.2OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.6542.4592.6HPC Tuning Recommendations98.9475.2518.6OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop ContainerDefaultHPC Tuning Recommendations20406080100SE +/- 0.35, N = 3SE +/- 0.16, N = 390.6994.31

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault56.9369.5401.9HPC Tuning Recommendations4.8314.4347.8OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault98.8573.0621.3HPC Tuning Recommendations98.6507.4545.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor160320480640800

OpenRadioss

Model: Chrysler Neon 1M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MDefaultHPC Tuning Recommendations306090120150SE +/- 1.75, N = 12SE +/- 0.10, N = 3125.14110.25

OpenRadioss

CPU Power Consumption Monitor

MinAvgMaxDefault50.9356.6402.2HPC Tuning Recommendations1.4308.0354.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15CPU Power Consumption Monitor110220330440550

OpenRadioss

System Power Consumption Monitor

MinAvgMaxDefault103603683HPC Tuning Recommendations99546616OpenBenchmarking.orgWatts, Fewer Is BetterOpenRadioss 2023.09.15System Power Consumption Monitor2004006008001000

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations51015202518.9715.971. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeDefaultHPC Tuning Recommendations61218243024.5824.261. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

CPU Power Consumption Monitor

MinAvgMaxDefault0.6302.0401.3HPC Tuning Recommendations0.3277.4356.0OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption Monitor110220330440550

OpenFOAM

System Power Consumption Monitor

MinAvgMaxDefault106555638HPC Tuning Recommendations104481577OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations2040608010084.1279.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeDefaultHPC Tuning Recommendations60120180240300283.12218.131. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

CPU Power Consumption Monitor

MinAvgMaxDefault53.0386.9401.6HPC Tuning Recommendations1.8339.6360.1OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption Monitor110220330440550

OpenFOAM

System Power Consumption Monitor

MinAvgMaxDefault101638679HPC Tuning Recommendations97589629OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption Monitor2004006008001000

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Mesh TimeDefaultHPC Tuning Recommendations130260390520650603.69517.581. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Large Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Large Mesh Size - Execution TimeDefaultHPC Tuning Recommendations2K4K6K8K10K8380.536426.061. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

CPU Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10CPU Power Consumption MonitorDefaultHPC Tuning Recommendations70140210280350Min: 54.24 / Avg: 392.08 / Max: 401.87Min: 76.13 / Avg: 350.87 / Max: 357.15

OpenFOAM

System Power Consumption Monitor

OpenBenchmarking.orgWatts, Fewer Is BetterOpenFOAM 10System Power Consumption MonitorDefaultHPC Tuning Recommendations120240360480600Min: 109.8 / Avg: 642.84 / Max: 689.8Min: 104 / Avg: 599.41 / Max: 635.5

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionDefaultHPC Tuning Recommendations246810SE +/- 0.05610512, N = 5SE +/- 0.02216234, N = 67.955934057.012523171. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

CPU Power Consumption Monitor

MinAvgMaxDefault138.1269.9401.8HPC Tuning Recommendations0.0209.2345.9OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11CPU Power Consumption Monitor110220330440550

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMaxDefault105436674HPC Tuning Recommendations104332604OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dDefaultHPC Tuning Recommendations70140210280350SE +/- 7.62, N = 9SE +/- 3.22, N = 9325.05245.681. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

CPU Power Consumption Monitor

MinAvgMaxDefault0.1387.2401.6HPC Tuning Recommendations1.0333.4344.1OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11CPU Power Consumption Monitor110220330440550

Xcompact3d Incompact3d

System Power Consumption Monitor

MinAvgMaxDefault106642685HPC Tuning Recommendations101581613OpenBenchmarking.orgWatts, Fewer Is BetterXcompact3d Incompact3d 2021-03-11System Power Consumption Monitor2004006008001000

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeDefaultHPC Tuning Recommendations714212835SE +/- 0.17, N = 3SE +/- 0.07, N = 327.8627.841. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

GPAW

CPU Power Consumption Monitor

MinAvgMaxDefault89.8350.3401.7HPC Tuning Recommendations2.7292.4361.6OpenBenchmarking.orgWatts, Fewer Is BetterGPAW 23.6CPU Power Consumption Monitor110220330440550

GPAW

System Power Consumption Monitor

MinAvgMaxDefault105585660HPC Tuning Recommendations105526605OpenBenchmarking.orgWatts, Fewer Is BetterGPAW 23.6System Power Consumption Monitor2004006008001000

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128DefaultHPC Tuning Recommendations70140210280350SE +/- 1.37, N = 14SE +/- 3.79, N = 15332.01332.911. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault0.658.375.3HPC Tuning Recommendations1.054.873.7OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault98.6122.6264.0HPC Tuning Recommendations98.0128.7243.3OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor70140210280350

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256DefaultHPC Tuning Recommendations90180270360450SE +/- 2.51, N = 13SE +/- 2.78, N = 15374.90411.071. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault0.860.967.4HPC Tuning Recommendations1.257.877.6OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault97.1122.4223.2HPC Tuning Recommendations95.8110.6193.0OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor60120180240300

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128DefaultHPC Tuning Recommendations50100150200250SE +/- 0.74, N = 14SE +/- 3.65, N = 15223.46235.081. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault56.761.671.7HPC Tuning Recommendations1.454.672.9OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault95.4105.9168.3HPC Tuning Recommendations95.2128.0236.1OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor60120180240300

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256DefaultHPC Tuning Recommendations50100150200250SE +/- 2.31, N = 15SE +/- 0.34, N = 13178.30210.421. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

CPU Power Consumption Monitor

MinAvgMaxDefault57.071.982.1HPC Tuning Recommendations1.759.582.0OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4CPU Power Consumption Monitor20406080100

HeFFTe - Highly Efficient FFT for Exascale

System Power Consumption Monitor

MinAvgMaxDefault96.5140.7338.4HPC Tuning Recommendations95.2130.9273.3OpenBenchmarking.orgWatts, Fewer Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4System Power Consumption Monitor80160240320400

XNNPACK

Model: FP32MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV1DefaultHPC Tuning Recommendations5001000150020002500SE +/- 5.13, N = 3SE +/- 170.15, N = 10242916861. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV2DefaultHPC Tuning Recommendations10002000300040005000SE +/- 24.31, N = 3SE +/- 49.78, N = 10475826511. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3LargeDefaultHPC Tuning Recommendations16003200480064008000SE +/- 43.97, N = 3SE +/- 23.40, N = 10725839591. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP32MobileNetV3SmallDefaultHPC Tuning Recommendations12002400360048006000SE +/- 13.20, N = 3SE +/- 99.39, N = 10539829411. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV1

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV1DefaultHPC Tuning Recommendations5001000150020002500SE +/- 1.45, N = 3SE +/- 75.71, N = 10246714891. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV2DefaultHPC Tuning Recommendations10002000300040005000SE +/- 14.43, N = 3SE +/- 73.59, N = 10468125531. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3LargeDefaultHPC Tuning Recommendations15003000450060007500SE +/- 11.89, N = 3SE +/- 55.41, N = 10704638561. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: FP16MobileNetV3SmallDefaultHPC Tuning Recommendations12002400360048006000SE +/- 65.16, N = 3SE +/- 15.58, N = 10544928421. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QS8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK b7b048Model: QS8MobileNetV2DefaultHPC Tuning Recommendations11002200330044005500SE +/- 15.76, N = 3SE +/- 75.96, N = 10503226171. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

CPU Power Consumption Monitor

MinAvgMaxDefault126.0294.5327.8HPC Tuning Recommendations1.9224.7253.8OpenBenchmarking.orgWatts, Fewer Is BetterXNNPACK b7b048CPU Power Consumption Monitor80160240320400

XNNPACK

System Power Consumption Monitor

MinAvgMaxDefault94.9438.7489.6HPC Tuning Recommendations97.7353.6403.1OpenBenchmarking.orgWatts, Fewer Is BetterXNNPACK b7b048System Power Consumption Monitor130260390520650

libxsmm

M N K: 256

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 256DefaultHPC Tuning Recommendations7001400210028003500SE +/- 4.19, N = 3SE +/- 23.61, N = 32422.73098.91. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2

libxsmm

CPU Power Consumption Monitor

MinAvgMaxDefault107.8296.6387.2HPC Tuning Recommendations1.9236.7327.5OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645CPU Power Consumption Monitor100200300400500

libxsmm

System Power Consumption Monitor

MinAvgMaxDefault103486644HPC Tuning Recommendations98387556OpenBenchmarking.orgWatts, Fewer Is Betterlibxsmm 2-1.17-3645System Power Consumption Monitor2004006008001000

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.2.3Input: C240 BuckyballDefaultHPC Tuning Recommendations300600900120015001249.11298.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lfcidump -lgwmol -lga -larmci -lpeigs -l64to32 -llapack -lopenblas -lpthread -lrt -lcomex -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -ffast-math -std=legacy -fdefault-integer-8 -O0

NWChem

CPU Power Consumption Monitor

MinAvgMaxDefault83.7391.1402.0HPC Tuning Recommendations2.0339.3355.6OpenBenchmarking.orgWatts, Fewer Is BetterNWChem 7.2.3CPU Power Consumption Monitor110220330440550

NWChem

System Power Consumption Monitor

MinAvgMaxDefault100602654HPC Tuning Recommendations98530586OpenBenchmarking.orgWatts, Fewer Is BetterNWChem 7.2.3System Power Consumption Monitor2004006008001000


Phoronix Test Suite v10.8.5