2023-12-10-0133 AMD EPYC 7402P 24-Core testing with a Supermicro H11SSL-i v2.00 (2.1 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2312102-NE-20231210012&grr .
2023-12-10-0133 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenCL Vulkan Compiler File-System Screen Resolution AMD EPYC 7402P 24-Core AMD EPYC 7402P 24-Core @ 2.80GHz (24 Cores / 48 Threads) Supermicro H11SSL-i v2.00 (2.1 BIOS) AMD Starship/Matisse 128GB 1000GB Western Digital WDS100T2B0C-00PXH0 + 4 x 2048GB ADATA LEGEND 710 + 11756GB HUH721212AL5204 + 4GB USB DISK 2.0 + 3 x 3001GB Western Digital WD30EFRX-68E + 18000GB Seagate ST18000NM000J-2T + 2000GB CT2000BX500SSD1 AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 AMD Navi 10 HDMI Audio 4 x Intel X710 for 10GbE SFP+ + 2 x Intel I210 Ubuntu 22.04 5.15.0-89-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.4 OpenCL 2.1 AMD-APP (3590.0) 1.3.238 GCC 11.4.0 ext4 1920x1080 OpenBenchmarking.org - Transparent Huge Pages: madvise - NVM_CD_FLAGS= - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830107a - Python 3.10.12 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2023-12-10-0133 clomp: Static OMP Speedup minibude: OpenMP - BM2 minibude: OpenMP - BM2 namd: ATPase Simulation - 327,506 Atoms minibude: OpenMP - BM1 minibude: OpenMP - BM1 pennant: sedovbig amg: pennant: leblancbig ffte: N=256, 3D Complex FFT Routine AMD EPYC 7402P 24-Core 37.4 26.407 660.182 0.99209 26.070 651.739 28.08422 628017667 17.81729 111909.39684401 OpenBenchmarking.org
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup AMD EPYC 7402P 24-Core 9 18 27 36 45 SE +/- 4.56, N = 15 37.4 1. (CC) gcc options: -fopenmp -O3 -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 AMD EPYC 7402P 24-Core 6 12 18 24 30 SE +/- 0.07, N = 3 26.41 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 AMD EPYC 7402P 24-Core 140 280 420 560 700 SE +/- 1.71, N = 3 660.18 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms AMD EPYC 7402P 24-Core 0.2232 0.4464 0.6696 0.8928 1.116 SE +/- 0.01109, N = 5 0.99209
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 AMD EPYC 7402P 24-Core 6 12 18 24 30 SE +/- 0.33, N = 3 26.07 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 AMD EPYC 7402P 24-Core 140 280 420 560 700 SE +/- 8.32, N = 3 651.74 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig AMD EPYC 7402P 24-Core 7 14 21 28 35 SE +/- 0.28, N = 3 28.08 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 AMD EPYC 7402P 24-Core 130M 260M 390M 520M 650M SE +/- 5200498.23, N = 3 628017667 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig AMD EPYC 7402P 24-Core 4 8 12 16 20 SE +/- 0.01, N = 3 17.82 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine AMD EPYC 7402P 24-Core 20K 40K 60K 80K 100K SE +/- 1289.32, N = 3 111909.40 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Phoronix Test Suite v10.8.5