mpitest

AMD Ryzen Threadripper PRO 3995WX 64-Cores testing with a GIGABYTE WRX80-SU8 N/A (WRX80SU8-F2 BIOS) and Gigabyte NVIDIA GeForce RTX 3080 Ti 12GB on Ubuntu 24.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501206-NE-MPITEST7573
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
AMD Ryzen Threadripper PRO 3995WX 64-Cores
January 20
  8 Hours, 55 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


mpitestOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen Threadripper PRO 3995WX 64-Cores @ 2.70GHz (64 Cores / 128 Threads)GIGABYTE WRX80-SU8 N/A (WRX80SU8-F2 BIOS)AMD Starship/Matisse4 x 16GB DDR4-2133MT/s CM4X16GC3200C16K2E1000GB CT1000P3SSD8 + 1000GB KINGSTON SNV3S1000GGigabyte NVIDIA GeForce RTX 3080 Ti 12GBAMD Starship/Matisse2 x LG TV2 x Realtek RTL8111/8168/8211/8411 + 2 x Intel I210 + 2 x Intel X550 + Intel Wi-Fi 6 AX200Ubuntu 24.046.8.0-51-generic (x86_64)GNOME Shell 46.0X Server 1.21.1.11NVIDIAOpenCL 3.0 CUDA 12.4.131GCC 13.3.0 + CUDA 12.6ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenCLCompilerFile-SystemScreen ResolutionMpitest BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0x830107c- Python 3.12.4- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

mpitesthpcg: 104 104 104 - 60hpcg: 104 104 104 - 1800npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Chpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthminife: Smallpennant: sedovbigpennant: leblancbigmrbayes: Primate Phylogeny Analysisincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionmocassin: Gas HII40mocassin: Dust 2D tau100.0lammps: 20k Atomslammps: Rhodopsin Proteinaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddingintel-mpi: IMB-P2P PingPongintel-mpi: IMB-MPI1 Exchangeintel-mpi: IMB-MPI1 Exchangeintel-mpi: IMB-MPI1 PingPongintel-mpi: IMB-MPI1 Sendrecvintel-mpi: IMB-MPI1 Sendrecvgromacs: MPI CPU - water_GMX50_baregromacs: NVIDIA CUDA GPU - water_GMX50_baregpaw: Carbon NanotubeAMD Ryzen Threadripper PRO 3995WX 64-Cores5.599166.3322649906.825819.635140.815322.8421478.66853.8849703.319608.9730075.2016955.27152.5333311.193477.253582.541220.568270.192711.141250.716539246.2666809.8417.351165.925810131.46113.486003259.395554819.249145.59724.59821.83811744.3910871.70253666682490.11557.432411.511859.20292.173.17123.801136.787OpenBenchmarking.org

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60AMD Ryzen Threadripper PRO 3995WX 64-Cores1.25982.51963.77945.03926.299SE +/- 0.17243, N = 95.599161. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

X Y Z: 144 144 144 - RT: 60

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

X Y Z: 160 160 160 - RT: 60

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

X Y Z: 192 192 192 - RT: 60

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 1800AMD Ryzen Threadripper PRO 3995WX 64-Cores246810SE +/- 0.00769, N = 36.332261. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

X Y Z: 144 144 144 - RT: 1800

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

X Y Z: 160 160 160 - RT: 1800

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

X Y Z: 192 192 192 - RT: 1800

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CAMD Ryzen Threadripper PRO 3995WX 64-Cores11K22K33K44K55KSE +/- 287.94, N = 349906.821. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CAMD Ryzen Threadripper PRO 3995WX 64-Cores12002400360048006000SE +/- 70.34, N = 45819.631. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CAMD Ryzen Threadripper PRO 3995WX 64-Cores11002200330044005500SE +/- 57.80, N = 35140.811. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DAMD Ryzen Threadripper PRO 3995WX 64-Cores11002200330044005500SE +/- 13.91, N = 35322.841. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CAMD Ryzen Threadripper PRO 3995WX 64-Cores5K10K15K20K25KSE +/- 53.71, N = 321478.661. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DAMD Ryzen Threadripper PRO 3995WX 64-Cores2004006008001000SE +/- 5.18, N = 3853.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CAMD Ryzen Threadripper PRO 3995WX 64-Cores11K22K33K44K55KSE +/- 171.34, N = 349703.31. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CAMD Ryzen Threadripper PRO 3995WX 64-Cores4K8K12K16K20KSE +/- 58.72, N = 319608.971. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BAMD Ryzen Threadripper PRO 3995WX 64-Cores6K12K18K24K30KSE +/- 331.56, N = 430075.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CAMD Ryzen Threadripper PRO 3995WX 64-Cores4K8K12K16K20KSE +/- 70.76, N = 316955.271. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLAMD Ryzen Threadripper PRO 3995WX 64-Cores306090120150SE +/- 0.48, N = 3152.531. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteAMD Ryzen Threadripper PRO 3995WX 64-Cores3691215SE +/- 0.01, N = 311.191. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMAMD Ryzen Threadripper PRO 3995WX 64-Cores246810SE +/- 0.08554, N = 37.253581. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransAMD Ryzen Threadripper PRO 3995WX 64-Cores0.57181.14361.71542.28722.859SE +/- 0.01461, N = 32.541221. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadAMD Ryzen Threadripper PRO 3995WX 64-Cores0.12790.25580.38370.51160.6395SE +/- 0.00036, N = 30.568271. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessAMD Ryzen Threadripper PRO 3995WX 64-Cores0.04340.08680.13020.17360.217SE +/- 0.00082, N = 30.192711. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyAMD Ryzen Threadripper PRO 3995WX 64-Cores0.25680.51360.77041.02721.284SE +/- 0.00848, N = 31.141251. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthAMD Ryzen Threadripper PRO 3995WX 64-Cores0.16120.32240.48360.64480.806SE +/- 0.01836, N = 30.716531. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthAMD Ryzen Threadripper PRO 3995WX 64-Cores2K4K6K8K10KSE +/- 9.69, N = 39246.271. (CC) gcc options: -lblas -lm -lmpi -fomit-frame-pointer -funroll-loops2. OpenBLAS + Open MPI 4.1.6

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallAMD Ryzen Threadripper PRO 3995WX 64-Cores15003000450060007500SE +/- 73.24, N = 56809.841. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigAMD Ryzen Threadripper PRO 3995WX 64-Cores48121620SE +/- 0.22, N = 417.351. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigAMD Ryzen Threadripper PRO 3995WX 64-Cores1.33332.66663.99995.33326.6665SE +/- 0.074192, N = 155.9258101. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisAMD Ryzen Threadripper PRO 3995WX 64-Cores306090120150SE +/- 0.17, N = 3131.461. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

Input: H4_ae

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Input: Li2_STO_ae

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Input: LiH_ae_MSD

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Input: simple-H2O

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Input: O_ae_pyscf_UHF

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Input: FeCO6_b3lyp_gms

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun was unable to launch the specified application as it could not access

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

Input: X3D-benchmarking input.i3d

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun noticed that process rank 63 with PID 0 on node WRX80-3995WX exited on signal 9 (Killed).

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per DirectionAMD Ryzen Threadripper PRO 3995WX 64-Cores3691215SE +/- 0.10, N = 313.491. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per DirectionAMD Ryzen Threadripper PRO 3995WX 64-Cores1326395265SE +/- 0.13, N = 359.401. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Monte Carlo Simulations of Ionised Nebulae

Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Gas HII40AMD Ryzen Threadripper PRO 3995WX 64-Cores510152025SE +/- 0.08, N = 319.251. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Dust 2D tau100.0AMD Ryzen Threadripper PRO 3995WX 64-Cores306090120150SE +/- 0.98, N = 3145.601. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsAMD Ryzen Threadripper PRO 3995WX 64-Cores612182430SE +/- 0.32, N = 324.601. (CXX) g++ options: -O3 -lm -ldl

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin ProteinAMD Ryzen Threadripper PRO 3995WX 64-Cores510152025SE +/- 0.24, N = 521.841. (CXX) g++ options: -O3 -lm -ldl

Open FMM Nero2D

This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples. Learn more via the OpenBenchmarking.org test page.

AMD Ryzen Threadripper PRO 3995WX 64-Cores: The test quit with a non-zero exit status. E: mpirun noticed that process rank 7 with PID 0 on node WRX80-3995WX exited on signal 11 (Segmentation fault).

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - DegriddingAMD Ryzen Threadripper PRO 3995WX 64-Cores3K6K9K12K15KSE +/- 430.92, N = 1211744.391. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - GriddingAMD Ryzen Threadripper PRO 3995WX 64-Cores2K4K6K8K10KSE +/- 392.31, N = 1210871.701. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Intel MPI Benchmarks

Intel MPI Benchmarks for stressing MPI implementations. At this point the test profile aggregates results for some common MPI functionality. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Msg/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-P2P PingPongAMD Ryzen Threadripper PRO 3995WX 64-Cores5M10M15M20M25MSE +/- 223826.40, N = 325366668MIN: 2089 / MAX: 678551081. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeAMD Ryzen Threadripper PRO 3995WX 64-Cores5001000150020002500SE +/- 29.82, N = 32490.11MAX: 9884.841. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 ExchangeAMD Ryzen Threadripper PRO 3995WX 64-Cores120240360480600SE +/- 8.21, N = 3557.43MIN: 0.97 / MAX: 22328.951. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 PingPongAMD Ryzen Threadripper PRO 3995WX 64-Cores5001000150020002500SE +/- 10.55, N = 32411.51MIN: 4.38 / MAX: 7065.211. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgAverage Mbytes/sec, More Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvAMD Ryzen Threadripper PRO 3995WX 64-Cores400800120016002000SE +/- 6.66, N = 31859.20MAX: 6882.761. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

OpenBenchmarking.orgAverage usec, Fewer Is BetterIntel MPI Benchmarks 2019.3Test: IMB-MPI1 SendrecvAMD Ryzen Threadripper PRO 3995WX 64-Cores60120180240300SE +/- 2.93, N = 3292.17MIN: 0.5 / MAX: 9957.711. (CXX) g++ options: -O0 -pedantic -fopenmp -lmpi_cxx -lmpi

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareAMD Ryzen Threadripper PRO 3995WX 64-Cores0.71351.4272.14052.8543.5675SE +/- 0.036, N = 33.1711. (CXX) g++ options: -O3 -lm

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: NVIDIA CUDA GPU - Input: water_GMX50_bareAMD Ryzen Threadripper PRO 3995WX 64-Cores612182430SE +/- 0.01, N = 323.801. (CXX) g++ options: -O3 -lm

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon NanotubeAMD Ryzen Threadripper PRO 3995WX 64-Cores306090120150SE +/- 0.41, N = 3136.791. (CC) gcc options: -pthread -shared -lxc -lblas -lmpi -fno-strict-overflow -O2 -fPIC -isystem -UNDEBUG -std=c99

42 Results Shown

High Performance Conjugate Gradient:
  104 104 104 - 60
  104 104 104 - 1800
NAS Parallel Benchmarks:
  BT.C
  CG.C
  EP.C
  EP.D
  FT.C
  IS.D
  LU.C
  MG.C
  SP.B
  SP.C
HPC Challenge:
  G-HPL
  G-Ffte
  EP-DGEMM
  G-Ptrans
  EP-STREAM Triad
  G-Rand Access
  Rand Ring Latency
  Rand Ring Bandwidth
  Max Ping Pong Bandwidth
miniFE
Pennant:
  sedovbig
  leblancbig
Timed MrBayes Analysis
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
Monte Carlo Simulations of Ionised Nebulae:
  Gas HII40
  Dust 2D tau100.0
LAMMPS Molecular Dynamics Simulator:
  20k Atoms
  Rhodopsin Protein
ASKAP:
  tConvolve MPI - Degridding
  tConvolve MPI - Gridding
Intel MPI Benchmarks:
  IMB-P2P PingPong
  IMB-MPI1 Exchange
  IMB-MPI1 Exchange
  IMB-MPI1 PingPong
  IMB-MPI1 Sendrecv
  IMB-MPI1 Sendrecv
GROMACS:
  MPI CPU - water_GMX50_bare
  NVIDIA CUDA GPU - water_GMX50_bare
GPAW