Benchmarks by Michael Larabel for a future article..
HEX Mode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
SNC3 - Default Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel BIRCHSTREAM (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Intel Xeon 6900P - SNC vs. HEX Clustering Mode OpenBenchmarking.org Phoronix Test Suite 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads) Intel BIRCHSTREAM (BHSDCRB1.IPC.0035.D44.2408292336 BIOS) Intel Ice Lake IEH 1520GB 960GB SAMSUNG MZ1L2960HCJR-00A07 ASPEED Intel I210 + 2 x Intel 10-Gigabit X540-AT2 Ubuntu 24.04 6.8.0-45-generic (x86_64) GCC 13.2.0 ext4 1920x1200 Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Compiler File-System Screen Resolution Intel Xeon 6900P - SNC Vs. HEX Clustering Mode Benchmarks System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0 - OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04) - Python 3.12.3 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
HEX Mode vs. SNC3 - Default Comparison Phoronix Test Suite Baseline +13.9% +13.9% +27.8% +27.8% +41.7% +41.7% +55.6% +55.6% 55.7% 52.9% 47.2% 34.8% 31.2% 29.5% 22.6% 15.1% 14.7% 14.3% 12.5% 11.6% 10.8% 10.3% 10% 9.9% 9.1% 8.8% 8.6% 8.3% 8.1% 7.2% 6.4% 5.7% 5.2% 5.1% 5% 4.9% 4.7% 4.3% 4.1% 4% 3.8% 3.1% 3% 2.8% 2.7% 2.3% 2% 256 H2 Database Engine allmodconfig Writes 46% tConvolve MPI - Gridding Streams 32.3% tConvolve MPI - Degridding 26 S.w.1.0.6.A 29.4% CPU - 512 - ResNet-50 23.4% Ninja A.w.3.5.A 18.8% 100 - 1000 - Read Only 17.7% defconfig 26 26 e.G.B.S - 1200 100 - 1000 - Read Only - Average Latency 12% B.S.o.W CPU - Numpy - 4194304 - Equation of State OpenMP - Euclidean Cluster 10.4% i.i.1.C.P.D Basic - CPU 32 S.F.P.R 9% Unix Makefiles i.i.1.C.P.D 26 Chrysler Neon 1M C240 Buckyball 160 160 160 - 60 144 144 144 - 60 OpenMP - BM2 5.7% OpenMP - BM2 5.7% B.B.D.F R.O.R.S.I OpenMP - NDT Mapping 64 Preset 8 - Bosphorus 4K 4.7% X.b.i.i 104 104 104 - 60 128 C.P.D.T Apache Xalan XSLT 3.8% Chess Benchmark Preset 13 - Beauty 4K 10-bit 3.3% Apache Tomcat 3.2% OpenMP - Points2Image Layered Halfspace I.a.F.S.I.D.C BMW27 - CPU-Only Preset 8 - Beauty 4K 10-bit 2.5% Carbon Nanotube Tomographic Model libxsmm DaCapo Benchmark Timed Linux Kernel Compilation Apache Cassandra ASKAP PETSc ASKAP Graph500 NAMD TensorFlow Timed LLVM Compilation NAMD PostgreSQL Timed Linux Kernel Compilation Graph500 Graph500 easyWave PostgreSQL OpenRadioss PyHPC Benchmarks Darmstadt Automotive Parallel Heterogeneous Suite Algebraic Multi-Grid Benchmark Xcompact3d Incompact3d RELION libxsmm ACES DGEMM Timed LLVM Compilation Xcompact3d Incompact3d Graph500 OpenRadioss NWChem High Performance Conjugate Gradient High Performance Conjugate Gradient miniBUDE miniBUDE DaCapo Benchmark OpenRadioss Darmstadt Automotive Parallel Heterogeneous Suite libxsmm SVT-AV1 Xcompact3d Incompact3d High Performance Conjugate Gradient libxsmm OpenRadioss DaCapo Benchmark Stockfish SVT-AV1 DaCapo Benchmark Darmstadt Automotive Parallel Heterogeneous Suite SPECFEM3D OpenRadioss Blender SVT-AV1 GPAW SPECFEM3D HEX Mode SNC3 - Default
Intel Xeon 6900P - SNC vs. HEX Clustering Mode hpcg: 104 104 104 - 60 hpcg: 144 144 144 - 60 hpcg: 160 160 160 - 60 minibude: OpenMP - BM2 minibude: OpenMP - BM2 namd: ATPase with 327,506 Atoms namd: STMV with 1,066,628 Atoms amg: libxsmm: 128 libxsmm: 256 libxsmm: 32 libxsmm: 64 nwchem: C240 Buckyball incompact3d: X3D-benchmarking input.i3d incompact3d: input.i3d 129 Cells Per Direction incompact3d: input.i3d 193 Cells Per Direction openradioss: Bumper Beam openradioss: Chrysler Neon 1M openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container relion: Basic - CPU specfem3d: Mount St. Helens specfem3d: Layered Halfspace specfem3d: Tomographic Model specfem3d: Homogeneous Halfspace specfem3d: Water-layered Halfspace lammps: 20k Atoms lammps: Rhodopsin Protein dacapobench: Jython dacapobench: Apache Kafka dacapobench: Apache Tomcat dacapobench: Apache Xalan XSLT dacapobench: H2 Database Engine dacapobench: BioJava Biological Data Framework byte: Dhrystone 2 byte: System Call byte: Whetstone Double easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 svt-av1: Preset 3 - Bosphorus 4K svt-av1: Preset 5 - Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 5 - Beauty 4K 10-bit svt-av1: Preset 8 - Beauty 4K 10-bit svt-av1: Preset 13 - Beauty 4K 10-bit mt-dgemm: Sustained Floating-Point Rate compress-7zip: Compression Rating compress-7zip: Decompression Rating stockfish: Chess Benchmark build-linux-kernel: defconfig build-linux-kernel: allmodconfig build-llvm: Ninja build-llvm: Unix Makefiles askap: tConvolve MPI - Degridding askap: tConvolve MPI - Gridding graph500: 26 graph500: 26 graph500: 26 graph500: 26 gromacs: MPI CPU - water_GMX50_bare daphne: OpenMP - NDT Mapping daphne: OpenMP - Points2Image daphne: OpenMP - Euclidean Cluster pgbench: 100 - 1000 - Read Only pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Write pgbench: 100 - 1000 - Read Write - Average Latency tensorflow: CPU - 512 - ResNet-50 gpaw: Carbon Nanotube blender: BMW27 - CPU-Only blender: Junkshop - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only petsc: Streams cassandra: Writes pyhpc: CPU - Numpy - 4194304 - Equation of State pyhpc: CPU - Numpy - 4194304 - Isoneutral Mixing HEX Mode SNC3 - Default 168.784 161.916 159.835 7093.931 283.757 4.49970 2.55596 7708395000 7862.2 2706.0 3214.1 5554.0 1779.7 72.1088104 0.903198322 2.82190266 110.68 66.46 40.73 172.45 231.10 93.13 114.397 4.018974773 7.422688518 4.448644552 5.513972044 9.321282291 92.752 70.474 3568 5995 8933 2500 17178 5791 18804694499.0 910572120.1 3728535.6 59.596 140.438 9.106 31.193 66.992 195.989 5.684 8.191 13.410 3865.658800 911774 1377979 566141415 26.983 193.114 94.061 214.439 82064.2 92985.2 1530330000 1954670000 673326000 964107000 32.137 547.43 4243.69 676.38 638580 1.645 13656 73.226 214.73 88.866 7.69 10.89 17.51 10.49 68.40 21.54 595124.2316 147125 1.713 1.931 176.009 171.205 170.002 6713.241 268.530 3.78755 1.97540 8504201333 8182.9 4213.9 3507.4 5826.5 1660.6 68.8912379 0.831954996 2.56438231 110.55 61.48 39.18 154.56 219.85 90.63 104.051 3.995710366 7.207569259 4.361234856 5.433174239 9.243850981 94.052 71.402 3562 6098 9223 2595 11238 5503 18658833934.3 897496230.9 3722311.4 52.965 141.905 9.155 30.856 63.967 193.486 5.773 7.991 12.979 3545.814232 897725 1394947 587474317 23.443 131.149 76.730 197.129 107661 125342 1748790000 2116240000 871769000 1105950000 32.592 574.84 4375.31 612.85 542652 1.843 13506 74.043 173.99 86.877 7.49 10.85 17.55 10.43 68.00 21.28 449771.2850 100742 1.546 1.907 OpenBenchmarking.org
miniBUDE MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 SNC3 - Default HEX Mode 1500 3000 4500 6000 7500 SE +/- 151.76, N = 12 SE +/- 66.01, N = 6 6713.24 7093.93 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 SNC3 - Default HEX Mode 60 120 180 240 300 SE +/- 6.07, N = 12 SE +/- 2.64, N = 6 268.53 283.76 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
NAMD OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: ATPase with 327,506 Atoms SNC3 - Default HEX Mode 1.0124 2.0248 3.0372 4.0496 5.062 SE +/- 0.01928, N = 3 SE +/- 0.05577, N = 15 3.78755 4.49970
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: STMV with 1,066,628 Atoms SNC3 - Default HEX Mode 0.5751 1.1502 1.7253 2.3004 2.8755 SE +/- 0.01818, N = 13 SE +/- 0.06201, N = 15 1.97540 2.55596
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 SNC3 - Default HEX Mode 2000M 4000M 6000M 8000M 10000M SE +/- 11711915.09, N = 3 SE +/- 20523024.36, N = 3 8504201333 7708395000 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 SNC3 - Default HEX Mode 2K 4K 6K 8K 10K SE +/- 103.51, N = 3 SE +/- 151.59, N = 9 8182.9 7862.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 256 SNC3 - Default HEX Mode 900 1800 2700 3600 4500 SE +/- 124.41, N = 15 SE +/- 36.38, N = 3 4213.9 2706.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 SNC3 - Default HEX Mode 800 1600 2400 3200 4000 SE +/- 58.49, N = 12 SE +/- 68.51, N = 12 3507.4 3214.1 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 SNC3 - Default HEX Mode 1200 2400 3600 4800 6000 SE +/- 63.47, N = 15 SE +/- 68.57, N = 15 5826.5 5554.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball SNC3 - Default HEX Mode 400 800 1200 1600 2000 1660.6 1779.7 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d SNC3 - Default HEX Mode 16 32 48 64 80 SE +/- 0.42, N = 3 SE +/- 0.09, N = 3 68.89 72.11 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction SNC3 - Default HEX Mode 0.2032 0.4064 0.6096 0.8128 1.016 SE +/- 0.005913486, N = 15 SE +/- 0.011942737, N = 3 0.831954996 0.903198322 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction SNC3 - Default HEX Mode 0.6349 1.2698 1.9047 2.5396 3.1745 SE +/- 0.01327750, N = 3 SE +/- 0.02990610, N = 15 2.56438231 2.82190266 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam SNC3 - Default HEX Mode 20 40 60 80 100 SE +/- 0.43, N = 3 SE +/- 0.36, N = 3 110.55 110.68
RELION RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better RELION 4.0.1 Test: Basic - Device: CPU SNC3 - Default HEX Mode 30 60 90 120 150 SE +/- 0.54, N = 3 SE +/- 0.83, N = 3 104.05 114.40 1. (CXX) g++ options: -fopenmp -std=c++11 -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Mount St. Helens SNC3 - Default HEX Mode 0.9043 1.8086 2.7129 3.6172 4.5215 SE +/- 0.037097693, N = 7 SE +/- 0.026444752, N = 3 3.995710366 4.018974773 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Layered Halfspace SNC3 - Default HEX Mode 2 4 6 8 10 SE +/- 0.060537089, N = 3 SE +/- 0.036865236, N = 3 7.207569259 7.422688518 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Tomographic Model SNC3 - Default HEX Mode 1.0009 2.0018 3.0027 4.0036 5.0045 SE +/- 0.014525946, N = 3 SE +/- 0.014802330, N = 3 4.361234856 4.448644552 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Homogeneous Halfspace SNC3 - Default HEX Mode 1.2406 2.4812 3.7218 4.9624 6.203 SE +/- 0.012290725, N = 3 SE +/- 0.007527831, N = 3 5.433174239 5.513972044 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.1.1 Model: Water-layered Halfspace SNC3 - Default HEX Mode 3 6 9 12 15 SE +/- 0.054952663, N = 3 SE +/- 0.016038732, N = 3 9.243850981 9.321282291 1. (F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
BYTE Unix Benchmark OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: Dhrystone 2 SNC3 - Default HEX Mode 4000M 8000M 12000M 16000M 20000M SE +/- 22131931.33, N = 3 SE +/- 9501831.17, N = 3 18658833934.3 18804694499.0 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: System Call SNC3 - Default HEX Mode 200M 400M 600M 800M 1000M SE +/- 219845.26, N = 3 SE +/- 144616.99, N = 3 897496230.9 910572120.1 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
OpenBenchmarking.org MWIPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: Whetstone Double SNC3 - Default HEX Mode 800K 1600K 2400K 3200K 4000K SE +/- 382.28, N = 3 SE +/- 113.17, N = 3 3722311.4 3728535.6 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 SNC3 - Default HEX Mode 13 26 39 52 65 SE +/- 0.59, N = 5 SE +/- 0.84, N = 12 52.97 59.60 1. (CXX) g++ options: -O3 -fopenmp
OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 SNC3 - Default HEX Mode 30 60 90 120 150 SE +/- 1.82, N = 12 SE +/- 1.63, N = 12 141.91 140.44 1. (CXX) g++ options: -O3 -fopenmp
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 3 - Input: Bosphorus 4K SNC3 - Default HEX Mode 3 6 9 12 15 SE +/- 0.011, N = 3 SE +/- 0.031, N = 3 9.155 9.106 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 5 - Input: Bosphorus 4K SNC3 - Default HEX Mode 7 14 21 28 35 SE +/- 0.35, N = 3 SE +/- 0.19, N = 3 30.86 31.19 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 8 - Input: Bosphorus 4K SNC3 - Default HEX Mode 15 30 45 60 75 SE +/- 0.65, N = 3 SE +/- 0.74, N = 3 63.97 66.99 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 13 - Input: Bosphorus 4K SNC3 - Default HEX Mode 40 80 120 160 200 SE +/- 0.39, N = 3 SE +/- 4.03, N = 12 193.49 195.99 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit SNC3 - Default HEX Mode 1.2989 2.5978 3.8967 5.1956 6.4945 SE +/- 0.006, N = 3 SE +/- 0.018, N = 3 5.773 5.684 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit SNC3 - Default HEX Mode 2 4 6 8 10 SE +/- 0.055, N = 3 SE +/- 0.009, N = 3 7.991 8.191 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.2 Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit SNC3 - Default HEX Mode 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.00, N = 3 12.98 13.41 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
ACES DGEMM OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate SNC3 - Default HEX Mode 800 1600 2400 3200 4000 SE +/- 6.73, N = 3 SE +/- 5.73, N = 3 3545.81 3865.66 1. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas
7-Zip Compression OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Compression Rating SNC3 - Default HEX Mode 200K 400K 600K 800K 1000K SE +/- 8001.79, N = 3 SE +/- 5282.93, N = 3 897725 911774 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Decompression Rating SNC3 - Default HEX Mode 300K 600K 900K 1200K 1500K SE +/- 10392.40, N = 3 SE +/- 5512.25, N = 3 1394947 1377979 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Stockfish OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 17 Chess Benchmark SNC3 - Default HEX Mode 130M 260M 390M 520M 650M SE +/- 16310733.41, N = 6 SE +/- 10082654.90, N = 9 587474317 566141415 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Degridding SNC3 - Default HEX Mode 20K 40K 60K 80K 100K SE +/- 797.06, N = 3 SE +/- 704.03, N = 3 107661.0 82064.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
OpenBenchmarking.org Mpix/sec, More Is Better ASKAP 1.0 Test: tConvolve MPI - Gridding SNC3 - Default HEX Mode 30K 60K 90K 120K 150K SE +/- 1080.26, N = 3 SE +/- 1249.70, N = 3 125342.0 92985.2 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 SNC3 - Default HEX Mode 400M 800M 1200M 1600M 2000M 1748790000 1530330000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 SNC3 - Default HEX Mode 500M 1000M 1500M 2000M 2500M 2116240000 1954670000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 SNC3 - Default HEX Mode 200M 400M 600M 800M 1000M 871769000 673326000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 SNC3 - Default HEX Mode 200M 400M 600M 800M 1000M 1105950000 964107000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare SNC3 - Default HEX Mode 8 16 24 32 40 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 32.59 32.14 1. (CXX) g++ options: -O3 -lm
PostgreSQL This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only SNC3 - Default HEX Mode 140K 280K 420K 560K 700K SE +/- 6739.41, N = 3 SE +/- 42784.99, N = 12 542652 638580 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency SNC3 - Default HEX Mode 0.4147 0.8294 1.2441 1.6588 2.0735 SE +/- 0.023, N = 3 SE +/- 0.108, N = 12 1.843 1.645 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write SNC3 - Default HEX Mode 3K 6K 9K 12K 15K SE +/- 37.41, N = 3 SE +/- 2.00, N = 3 13506 13656 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency SNC3 - Default HEX Mode 16 32 48 64 80 SE +/- 0.20, N = 3 SE +/- 0.01, N = 3 74.04 73.23 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 512 - Model: ResNet-50 SNC3 - Default HEX Mode 50 100 150 200 250 SE +/- 1.54, N = 3 SE +/- 2.28, N = 4 173.99 214.73
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube SNC3 - Default HEX Mode 20 40 60 80 100 SE +/- 0.72, N = 3 SE +/- 0.28, N = 3 86.88 88.87 1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Blender OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: BMW27 - Compute: CPU-Only SNC3 - Default HEX Mode 2 4 6 8 10 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 7.49 7.69
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Junkshop - Compute: CPU-Only SNC3 - Default HEX Mode 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.06, N = 3 10.85 10.89
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Classroom - Compute: CPU-Only SNC3 - Default HEX Mode 4 8 12 16 20 SE +/- 0.14, N = 3 SE +/- 0.04, N = 3 17.55 17.51
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Fishy Cat - Compute: CPU-Only SNC3 - Default HEX Mode 3 6 9 12 15 SE +/- 0.12, N = 3 SE +/- 0.10, N = 3 10.43 10.49
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Barbershop - Compute: CPU-Only SNC3 - Default HEX Mode 15 30 45 60 75 SE +/- 0.33, N = 3 SE +/- 0.18, N = 3 68.00 68.40
OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.2 Blend File: Pabellon Barcelona - Compute: CPU-Only SNC3 - Default HEX Mode 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.03, N = 3 21.28 21.54
PETSc PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better PETSc 3.19 Test: Streams SNC3 - Default HEX Mode 130K 260K 390K 520K 650K SE +/- 5879.52, N = 4 SE +/- 1960.36, N = 3 449771.29 595124.23 1. (CC) gcc options: -fPIC -O3 -O2 -lpthread -lm
PyHPC Benchmarks PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State SNC3 - Default HEX Mode 0.3854 0.7708 1.1562 1.5416 1.927 SE +/- 0.008, N = 3 SE +/- 0.012, N = 3 1.546 1.713
OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing SNC3 - Default HEX Mode 0.4345 0.869 1.3035 1.738 2.1725 SE +/- 0.020, N = 3 SE +/- 0.011, N = 3 1.907 1.931
HEX Mode Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 24 September 2024 11:15 by user intel.
SNC3 - Default Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel BIRCHSTREAM (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)Python Notes: Python 3.12.3Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 24 September 2024 23:52 by user intel.