kernel-scientific AMD Ryzen 7 1700 Eight-Core testing with a Gigabyte AB350M-Gaming 3-CF (F51g BIOS) and Sapphire AMD Radeon HD 4550 on Ubuntu 20.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2301315-HYDR-KERNELS26&grs .
kernel-scientific Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Vulkan Compiler File-System Screen Resolution default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 AMD Ryzen 7 1700 Eight-Core @ 3.50GHz (8 Cores / 16 Threads) Gigabyte AB350M-Gaming 3-CF (F51g BIOS) AMD 17h 8GB 1000GB Western Digital WD10EFRX-68J + 1000GB Hitachi HDT72101 + 32GB TS32GMTS800 + 1000GB SAMSUNG HD103UJ Sapphire AMD Radeon HD 4550 512MB AMD RV710/730 PLG2488H Realtek RTL8111/8168/8411 Ubuntu 20.04 5.15.0-58-generic (x86_64) 1.1.182 GCC 9.4.0 zfs 1920x1080 1000GB Western Digital WD10EFRX-68J + 1000GB Hitachi HDT72101 + 32GB TS32GMTS800 Sapphire AMD Radeon HD 4550 5.15.30-hdr (x86_64) 1024x768 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-Av3uEd/gcc-9-9.4.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8001138 Python Details - Python 3.8.10 Security Details - default: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Mitigation of untrained return thunk; SMT vulnerable + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - default mitigations=off: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Vulnerable + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 5.15.30-hdr: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Not affected + tsx_async_abort: Not affected - 5.15.30-hdr vol preempt: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Not affected + tsx_async_abort: Not affected - 5.15.30-hdr 250Hz: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Not affected + tsx_async_abort: Not affected - 5.15.30-hdr vol 250Hz: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Not affected + tsx_async_abort: Not affected - vol 250Hz x86-64v2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Not affected + tsx_async_abort: Not affected
kernel-scientific hpcc: G-HPL hmmer: Pfam Database Search fftw: Stock - 2D FFT Size 32 ffte: N=256, 1D Complex FFT Routine lammps: 20k Atoms mocassin: Dust 2D tau100.0 lammps: Rhodopsin Protein fftw: Float + SSE - 2D FFT Size 32 incompact3d: input.i3d 129 Cells Per Direction kripke: mt-dgemm: Sustained Floating-Point Rate himeno: Poisson Pressure Solver mrbayes: Primate Phylogeny Analysis cp2k: Fayalite-FIST incompact3d: input.i3d 193 Cells Per Direction gromacs: MPI CPU - water_GMX50_bare arrayfire: BLAS CPU fftw: Stock - 1D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 4096 pennant: leblancbig pennant: sedovbig fftw: Float + SSE - 1D FFT Size 32 fftw: Stock - 2D FFT Size 4096 cloverleaf: Lagrangian-Eulerian Hydrodynamics fftw: Float + SSE - 2D FFT Size 4096 minife: Small dolfyn: Computational Fluid Dynamics amg: namd: ATPase Simulation - 327,506 Atoms fftw: Stock - 1D FFT Size 32 minife: Large default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 64.00567 159.230 8690.4 36873.342192894 4.695 280 4.593 28344 38.0383034 29927647 1.353413 3826.508282 154.227 682.008 138.638310 0.596 196.104 6756.9 27921 66.78679 91.99171 10719 4997.3 139.59 13450 5869.06 21.446 269121233 2.73229 6958.1 63.57920 157.855 8193.3 37510.483470889 4.698 282 4.608 28364 37.8776042 30653590 1.377947 3867.533008 152.742 683.831 136.512207 0.608 195.339 6735.2 28028 66.44555 92.25545 10816 4974.8 139.18 13490 5876.58 21.406 269542133 2.72922 6942.3 52.69553 173.577 8100.1 39372.658435520 4.415 266 4.361 27134 39.3337796 30606070 1.403760 3893.117655 149.756 697.237 139.082275 0.601 199.895 6767.4 27822 65.79164 91.75040 10714.7 4990.6 138.53 13465 5897.06 21.507 270086767 2.72996 5979.5 53.97165 172.179 8191.0 38625.218859172 4.515 267 4.401 27069 38.6797460 31049327 1.362534 3943.783194 149.733 689.832 140.136485 0.606 199.763 6762.0 27703 66.68654 92.10770 10809 5011.7 139.02 13464 5892.33 21.469 270080267 2.72960 6246.8 63.13570 162.980 8575.1 37639.900103740 4.606 269 4.535 28351 37.9741058 30569070 1.363374 3890.163040 152.943 678.514 136.358022 0.609 197.638 6633.7 27901 66.28735 93.08412 10776 4977.3 139.09 13425 5881.06 21.434 269554167 2.73050 6923.4 62.02053 162.895 8689.1 37252.652189444 4.567 269 4.574 28369 38.3258909 30253350 1.392794 3897.190434 150.491 677.524 138.483266 0.610 198.713 6751.6 27866 66.61243 92.29351 10725 4991.5 139.03 13489 5880.69 21.430 269489033 2.73172 6817.2 62.16950 163.035 8635.2 36828.226400456 4.547 269 4.485 28322 38.3521869 30340393 1.361155 3872.402634 150.821 678.203 136.655497 0.606 199.432 6759.4 27546 66.29798 92.04811 10737 4967.0 139.25 13436 5878.30 21.425 268936133 2.73133 7101.6 OpenBenchmarking.org
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 14 28 42 56 70 SE +/- 0.34, N = 3 SE +/- 0.48, N = 3 SE +/- 0.49, N = 3 SE +/- 0.59, N = 4 SE +/- 0.22, N = 3 SE +/- 0.75, N = 4 SE +/- 0.67, N = 3 64.01 63.58 52.70 53.97 63.14 62.02 62.17 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops 2. ATLAS + Open MPI 4.0.3
Timed HMMer Search Pfam Database Search OpenBenchmarking.org Seconds, Fewer Is Better Timed HMMer Search 3.3.2 Pfam Database Search default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 40 80 120 160 200 SE +/- 1.65, N = 12 SE +/- 0.45, N = 3 SE +/- 0.46, N = 3 SE +/- 0.35, N = 3 SE +/- 0.54, N = 3 SE +/- 0.25, N = 3 SE +/- 0.40, N = 3 159.23 157.86 173.58 172.18 162.98 162.90 163.04 1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm -lmpi
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 2K 4K 6K 8K 10K SE +/- 109.62, N = 15 SE +/- 3.16, N = 3 SE +/- 96.85, N = 4 SE +/- 16.36, N = 3 SE +/- 111.85, N = 15 SE +/- 111.55, N = 15 SE +/- 110.45, N = 15 8690.4 8193.3 8100.1 8191.0 8575.1 8689.1 8635.2 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTE Test: N=256, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 Test: N=256, 1D Complex FFT Routine default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 8K 16K 24K 32K 40K SE +/- 298.77, N = 3 SE +/- 304.60, N = 9 SE +/- 1.90, N = 3 SE +/- 374.35, N = 3 SE +/- 436.81, N = 4 SE +/- 287.08, N = 3 SE +/- 240.02, N = 3 36873.34 37510.48 39372.66 38625.22 37639.90 37252.65 36828.23 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: 20k Atoms default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1.0571 2.1142 3.1713 4.2284 5.2855 SE +/- 0.043, N = 7 SE +/- 0.006, N = 3 SE +/- 0.070, N = 9 SE +/- 0.011, N = 3 SE +/- 0.008, N = 3 SE +/- 0.043, N = 7 SE +/- 0.009, N = 3 4.695 4.698 4.415 4.515 4.606 4.567 4.547 1. (CXX) g++ options: -O3 -pthread -lm -ldl
Monte Carlo Simulations of Ionised Nebulae Input: Dust 2D tau100.0 OpenBenchmarking.org Seconds, Fewer Is Better Monte Carlo Simulations of Ionised Nebulae 2019-03-24 Input: Dust 2D tau100.0 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 60 120 180 240 300 SE +/- 1.15, N = 3 SE +/- 1.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 280 282 266 267 269 269 269 1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O3 -O2 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: Rhodopsin Protein default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1.0368 2.0736 3.1104 4.1472 5.184 SE +/- 0.036, N = 12 SE +/- 0.040, N = 3 SE +/- 0.053, N = 4 SE +/- 0.042, N = 3 SE +/- 0.047, N = 3 SE +/- 0.050, N = 3 SE +/- 0.036, N = 9 4.593 4.608 4.361 4.401 4.535 4.574 4.485 1. (CXX) g++ options: -O3 -pthread -lm -ldl
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 6K 12K 18K 24K 30K SE +/- 37.33, N = 3 SE +/- 18.48, N = 3 SE +/- 494.14, N = 15 SE +/- 498.90, N = 15 SE +/- 13.30, N = 3 SE +/- 15.77, N = 3 SE +/- 24.91, N = 3 28344 28364 27134 27069 28351 28369 28322 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 9 18 27 36 45 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 SE +/- 0.25, N = 3 SE +/- 0.42, N = 3 SE +/- 0.09, N = 3 SE +/- 0.33, N = 3 SE +/- 0.39, N = 3 38.04 37.88 39.33 38.68 37.97 38.33 38.35 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 7M 14M 21M 28M 35M SE +/- 311858.03, N = 3 SE +/- 286459.37, N = 3 SE +/- 212260.85, N = 12 SE +/- 271455.19, N = 3 SE +/- 327750.54, N = 3 SE +/- 253208.05, N = 3 SE +/- 80864.85, N = 3 29927647 30653590 30606070 31049327 30569070 30253350 30340393 1. (CXX) g++ options: -O3 -fopenmp
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 0.3158 0.6316 0.9474 1.2632 1.579 SE +/- 0.012458, N = 3 SE +/- 0.019193, N = 3 SE +/- 0.006679, N = 3 SE +/- 0.003795, N = 3 SE +/- 0.009891, N = 3 SE +/- 0.010683, N = 12 SE +/- 0.010264, N = 10 1.353413 1.377947 1.403760 1.362534 1.363374 1.392794 1.361155 1. (CC) gcc options: -O3 -march=native -fopenmp
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 800 1600 2400 3200 4000 SE +/- 32.49, N = 3 SE +/- 26.40, N = 15 SE +/- 55.25, N = 3 SE +/- 14.00, N = 3 SE +/- 35.66, N = 7 SE +/- 4.29, N = 3 SE +/- 47.61, N = 3 3826.51 3867.53 3893.12 3943.78 3890.16 3897.19 3872.40 1. (CC) gcc options: -O3 -mavx2
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 30 60 90 120 150 SE +/- 0.74, N = 3 SE +/- 0.80, N = 3 SE +/- 0.27, N = 3 SE +/- 0.56, N = 3 SE +/- 1.93, N = 3 SE +/- 0.73, N = 3 SE +/- 1.47, N = 3 154.23 152.74 149.76 149.73 152.94 150.49 150.82 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -lm
CP2K Molecular Dynamics Input: Fayalite-FIST OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.2 Input: Fayalite-FIST default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 150 300 450 600 750 682.01 683.83 697.24 689.83 678.51 677.52 678.20
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 30 60 90 120 150 SE +/- 1.71, N = 3 SE +/- 1.17, N = 3 SE +/- 0.06, N = 3 SE +/- 0.52, N = 3 SE +/- 1.09, N = 3 SE +/- 1.61, N = 3 SE +/- 1.42, N = 3 138.64 136.51 139.08 140.14 136.36 138.48 136.66 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 0.1373 0.2746 0.4119 0.5492 0.6865 SE +/- 0.007, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 0.596 0.608 0.601 0.606 0.609 0.610 0.606 1. (CXX) g++ options: -O3 -pthread
ArrayFire Test: BLAS CPU OpenBenchmarking.org GFLOPS, More Is Better ArrayFire 3.7 Test: BLAS CPU default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 40 80 120 160 200 SE +/- 1.09, N = 3 SE +/- 0.26, N = 3 SE +/- 1.91, N = 3 SE +/- 2.31, N = 3 SE +/- 1.83, N = 3 SE +/- 1.95, N = 3 SE +/- 0.52, N = 3 196.10 195.34 199.90 199.76 197.64 198.71 199.43 1. (CXX) g++ options: -rdynamic
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1500 3000 4500 6000 7500 SE +/- 8.27, N = 3 SE +/- 11.52, N = 3 SE +/- 4.37, N = 3 SE +/- 8.50, N = 3 SE +/- 75.82, N = 15 SE +/- 1.27, N = 3 SE +/- 8.71, N = 3 6756.9 6735.2 6767.4 6762.0 6633.7 6751.6 6759.4 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 6K 12K 18K 24K 30K SE +/- 70.30, N = 3 SE +/- 129.82, N = 3 SE +/- 43.59, N = 3 SE +/- 305.62, N = 3 SE +/- 49.21, N = 3 SE +/- 67.49, N = 3 SE +/- 138.87, N = 3 27921 28028 27822 27703 27901 27866 27546 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 15 30 45 60 75 SE +/- 0.36, N = 3 SE +/- 0.25, N = 3 SE +/- 0.12, N = 3 SE +/- 0.37, N = 3 SE +/- 0.15, N = 3 SE +/- 0.14, N = 3 SE +/- 0.21, N = 3 66.79 66.45 65.79 66.69 66.29 66.61 66.30 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 0.47, N = 3 SE +/- 0.36, N = 3 SE +/- 0.37, N = 3 SE +/- 0.88, N = 3 SE +/- 0.08, N = 3 SE +/- 0.73, N = 3 91.99 92.26 91.75 92.11 93.08 92.29 92.05 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 2K 4K 6K 8K 10K SE +/- 4.58, N = 3 SE +/- 14.17, N = 3 SE +/- 66.80, N = 13 SE +/- 23.03, N = 3 SE +/- 25.85, N = 3 SE +/- 12.98, N = 3 SE +/- 47.64, N = 3 10719.0 10816.0 10714.7 10809.0 10776.0 10725.0 10737.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1100 2200 3300 4400 5500 SE +/- 16.35, N = 3 SE +/- 24.04, N = 3 SE +/- 21.71, N = 3 SE +/- 16.01, N = 3 SE +/- 18.78, N = 3 SE +/- 14.03, N = 3 SE +/- 2.97, N = 3 4997.3 4974.8 4990.6 5011.7 4977.3 4991.5 4967.0 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 30 60 90 120 150 SE +/- 0.08, N = 3 SE +/- 0.19, N = 3 SE +/- 0.10, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.07, N = 3 SE +/- 0.14, N = 3 139.59 139.18 138.53 139.02 139.09 139.03 139.25 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 3K 6K 9K 12K 15K SE +/- 10.93, N = 3 SE +/- 42.79, N = 3 SE +/- 28.02, N = 3 SE +/- 126.98, N = 3 SE +/- 26.19, N = 3 SE +/- 29.45, N = 3 SE +/- 26.35, N = 3 13450 13490 13465 13464 13425 13489 13436 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1300 2600 3900 5200 6500 SE +/- 7.34, N = 3 SE +/- 5.81, N = 3 SE +/- 6.03, N = 3 SE +/- 5.13, N = 3 SE +/- 5.92, N = 3 SE +/- 3.64, N = 3 SE +/- 4.63, N = 3 5869.06 5876.58 5897.06 5892.33 5881.06 5880.69 5878.30 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.15, N = 13 SE +/- 0.25, N = 4 SE +/- 0.21, N = 6 SE +/- 0.22, N = 6 SE +/- 0.20, N = 6 SE +/- 0.22, N = 5 21.45 21.41 21.51 21.47 21.43 21.43 21.43
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 60M 120M 180M 240M 300M SE +/- 334412.09, N = 3 SE +/- 655205.65, N = 3 SE +/- 200238.99, N = 3 SE +/- 497247.06, N = 3 SE +/- 402172.93, N = 3 SE +/- 79377.94, N = 3 SE +/- 356240.65, N = 3 269121233 269542133 270086767 270080267 269554167 269489033 268936133 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 0.6148 1.2296 1.8444 2.4592 3.074 SE +/- 0.00051, N = 3 SE +/- 0.00012, N = 3 SE +/- 0.00060, N = 3 SE +/- 0.00204, N = 3 SE +/- 0.00112, N = 3 SE +/- 0.00079, N = 3 SE +/- 0.00078, N = 3 2.73229 2.72922 2.72996 2.72960 2.73050 2.73172 2.73133
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 default default mitigations=off 5.15.30-hdr 5.15.30-hdr vol preempt 5.15.30-hdr 250Hz 5.15.30-hdr vol 250Hz vol 250Hz x86-64v2 1500 3000 4500 6000 7500 SE +/- 103.41, N = 15 SE +/- 102.34, N = 15 SE +/- 322.28, N = 12 SE +/- 303.53, N = 15 SE +/- 121.64, N = 15 SE +/- 150.54, N = 15 SE +/- 98.37, N = 15 6958.1 6942.3 5979.5 6246.8 6923.4 6817.2 7101.6 1. (CC) gcc options: -pthread -O3 -fomit-frame-pointer -mtune=native -malign-double -fstrict-aliasing -fno-schedule-insns -ffast-math -lm
Phoronix Test Suite v10.8.4