2023-12-08-0937 AMD EPYC 7402P 24-Core testing with a Supermicro H11SSL-i v2.00 (2.1 BIOS) and AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2312092-NE-20231208019&grw .
2023-12-08-0937 Processor Motherboard Chipset Memory Disk Graphics Audio Network OS Kernel Desktop Display Server OpenCL Vulkan Compiler File-System Screen Resolution AMD EPYC 7402P 24-Core AMD EPYC 7402P 24-Core @ 2.80GHz (24 Cores / 48 Threads) Supermicro H11SSL-i v2.00 (2.1 BIOS) AMD Starship/Matisse 128GB 1000GB Western Digital WDS100T2B0C-00PXH0 + 4 x 2048GB ADATA LEGEND 710 + 11756GB HUH721212AL5204 + 4GB USB DISK 2.0 + 3 x 3001GB Western Digital WD30EFRX-68E + 18000GB Seagate ST18000NM000J-2T + 2000GB CT2000BX500SSD1 AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 AMD Navi 10 HDMI Audio 4 x Intel X710 for 10GbE SFP+ + 2 x Intel I210 Ubuntu 22.04 5.15.0-89-generic (x86_64) GNOME Shell 42.9 X Server 1.21.1.4 OpenCL 2.1 AMD-APP (3590.0) 1.3.238 GCC 11.4.0 ext4 1024x768 OpenBenchmarking.org - Transparent Huge Pages: madvise - NVM_CD_FLAGS= - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0x830107a - Python 3.10.12 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT enabled with STIBP protection + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
2023-12-08-0937 clomp: Static OMP Speedup minibude: OpenMP - BM1 minibude: OpenMP - BM1 minibude: OpenMP - BM2 minibude: OpenMP - BM2 openradioss: Bumper Beam openradioss: Ford Taurus 10M openradioss: Chrysler Neon 1M openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container relion: Basic - CPU namd: ATPase Simulation - 327,506 Atoms pennant: sedovbig pennant: leblancbig amg: ffte: N=256, 3D Complex FFT Routine incompact3d: X3D-benchmarking input.i3d incompact3d: input.i3d 129 Cells Per Direction incompact3d: input.i3d 193 Cells Per Direction AMD EPYC 7402P 24-Core 45.2 654.548 26.182 666.798 26.672 113.39 31918.04 521.28 58.45 182.86 98.45 294.81 901.772 1.33338 27.74788 17.75843 646332533 102768.870311836 974.654093 10.10870203 39.5189705 OpenBenchmarking.org
CLOMP Static OMP Speedup OpenBenchmarking.org Speedup, More Is Better CLOMP 1.2 Static OMP Speedup AMD EPYC 7402P 24-Core 10 20 30 40 50 SE +/- 3.67, N = 12 45.2 1. (CC) gcc options: -fopenmp -O3 -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 AMD EPYC 7402P 24-Core 140 280 420 560 700 SE +/- 0.68, N = 3 654.55 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 AMD EPYC 7402P 24-Core 6 12 18 24 30 SE +/- 0.03, N = 3 26.18 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 AMD EPYC 7402P 24-Core 140 280 420 560 700 SE +/- 1.42, N = 3 666.80 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 AMD EPYC 7402P 24-Core 6 12 18 24 30 SE +/- 0.06, N = 3 26.67 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam AMD EPYC 7402P 24-Core 30 60 90 120 150 SE +/- 0.71, N = 3 113.39
OpenRadioss Model: Ford Taurus 10M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Ford Taurus 10M AMD EPYC 7402P 24-Core 7K 14K 21K 28K 35K SE +/- 43.06, N = 3 31918.04
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M AMD EPYC 7402P 24-Core 110 220 330 440 550 SE +/- 2.05, N = 3 521.28
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test AMD EPYC 7402P 24-Core 13 26 39 52 65 SE +/- 0.49, N = 15 58.45
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield AMD EPYC 7402P 24-Core 40 80 120 160 200 SE +/- 1.26, N = 3 182.86
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation AMD EPYC 7402P 24-Core 20 40 60 80 100 SE +/- 0.74, N = 15 98.45
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container AMD EPYC 7402P 24-Core 60 120 180 240 300 SE +/- 2.22, N = 3 294.81
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 4.0.1 Test: Basic - Device: CPU AMD EPYC 7402P 24-Core 200 400 600 800 1000 SE +/- 6.65, N = 3 901.77 1. (CXX) g++ options: -fopenmp -std=c++11 -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms AMD EPYC 7402P 24-Core 0.3 0.6 0.9 1.2 1.5 SE +/- 0.24961, N = 15 1.33338
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig AMD EPYC 7402P 24-Core 7 14 21 28 35 SE +/- 0.33, N = 3 27.75 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig AMD EPYC 7402P 24-Core 4 8 12 16 20 SE +/- 0.06, N = 3 17.76 1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 AMD EPYC 7402P 24-Core 140M 280M 420M 560M 700M SE +/- 1357994.34, N = 3 646332533 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
FFTE N=256, 3D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 N=256, 3D Complex FFT Routine AMD EPYC 7402P 24-Core 20K 40K 60K 80K 100K SE +/- 6855.65, N = 12 102768.87 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d AMD EPYC 7402P 24-Core 200 400 600 800 1000 SE +/- 2.42, N = 3 974.65 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction AMD EPYC 7402P 24-Core 3 6 9 12 15 SE +/- 0.12, N = 15 10.11 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction AMD EPYC 7402P 24-Core 9 18 27 36 45 SE +/- 0.16, N = 3 39.52 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Phoronix Test Suite v10.8.5