Benchmarks by Michael Larabel for a future article.
HEX Mode
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
SNC3 - Default Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel BIRCHSTREAM (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
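The kernel notes list the Transparent Huge Pages setting as madvise. As a minimal sketch of how that setting could be checked on a comparable Linux system, the snippet below reads the standard sysfs file, where the active mode is shown in square brackets (for example "always [madvise] never"). The helper name active_thp_mode is hypothetical and is not part of the Phoronix Test Suite.

```python
from pathlib import Path

# Standard Linux sysfs location for the THP "enabled" setting.
THP_ENABLED = Path("/sys/kernel/mm/transparent_hugepage/enabled")


def active_thp_mode(text):
    """Return the bracketed option from a string such as 'always [madvise] never'.

    Returns None when no option is bracketed.
    """
    for token in text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return None


# Only touch sysfs when the file exists (it will not on non-Linux systems).
if THP_ENABLED.exists():
    print("Transparent Huge Pages:", active_thp_mode(THP_ENABLED.read_text()))
```

On a system configured like the one tested here, the reported mode would be expected to be madvise.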
[Chart: HEX Mode vs. SNC3 - Default Comparison (Phoronix Test Suite). Per-test percentage difference between the two configurations relative to the baseline, with labeled differences ranging from 2% to 55.7%.]
Intel Xeon 6900P - SNC vs. HEX Clustering Mode: results overview
Test | HEX Mode | SNC3 - Default
libxsmm: 32 | 3214.1 | 3507.4
dacapobench: Jython | 3568 | 3562
dacapobench: Apache Kafka | 5995 | 6098
dacapobench: Apache Tomcat | 8933 | 9223
dacapobench: Apache Xalan XSLT | 2500 | 2595
libxsmm: 64 | 5554.0 | 5826.5
dacapobench: H2 Database Engine | 17178 | 11238
dacapobench: BioJava Biological Data Framework | 5791 | 5503
daphne: OpenMP - NDT Mapping | 547.43 | 574.84
daphne: OpenMP - Points2Image | 4243.69 | 4375.31
daphne: OpenMP - Euclidean Cluster | 676.38 | 612.85
graph500: 26 (bfs median_TEPS) | 1530330000 | 1748790000
graph500: 26 (bfs max_TEPS) | 1954670000 | 2116240000
graph500: 26 (sssp median_TEPS) | 673326000 | 871769000
libxsmm: 256 | 2706.0 | 4213.9
libxsmm: 128 | 7862.2 | 8182.9
graph500: 26 (sssp max_TEPS) | 964107000 | 1105950000
minibude: OpenMP - BM2 (GFInst/s) | 7093.931 | 6713.241
minibude: OpenMP - BM2 (Billion Interactions/s) | 283.757 | 268.530
openradioss: Bumper Beam | 110.68 | 110.55
openradioss: Chrysler Neon 1M | 66.46 | 61.48
specfem3d: Mount St. Helens | 4.018974773 | 3.995710366
openradioss: Cell Phone Drop Test | 40.73 | 39.18
openradioss: Bird Strike on Windshield | 172.45 | 154.56
specfem3d: Homogeneous Halfspace | 5.513972044 | 5.433174239
specfem3d: Water-layered Halfspace | 9.321282291 | 9.243850981
specfem3d: Tomographic Model | 4.448644552 | 4.361234856
specfem3d: Layered Halfspace | 7.422688518 | 7.207569259
openradioss: Rubber O-Ring Seal Installation | 231.10 | 219.85
openradioss: INIVOL and Fluid Structure Interaction Drop Container | 93.13 | 90.63
relion: Basic - CPU | 114.397 | 104.051
tensorflow: CPU - 512 - ResNet-50 | 214.73 | 173.99
gromacs: MPI CPU - water_GMX50_bare | 32.137 | 32.592
lammps: 20k Atoms | 92.752 | 94.052
lammps: Rhodopsin Protein | 70.474 | 71.402
hpcg: 104 104 104 - 60 | 168.784 | 176.009
hpcg: 144 144 144 - 60 | 161.916 | 171.205
hpcg: 160 160 160 - 60 | 159.835 | 170.002
namd: ATPase with 327,506 Atoms | 4.49970 | 3.78755
namd: STMV with 1,066,628 Atoms | 2.55596 | 1.97540
askap: tConvolve MPI - Degridding | 82064.2 | 107661
askap: tConvolve MPI - Gridding | 92985.2 | 125342
easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 | 140.438 | 141.905
easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 | 59.596 | 52.965
mt-dgemm: Sustained Floating-Point Rate | 3865.658800 | 3545.814232
amg | 7708395000 | 8504201333
pyhpc: CPU - Numpy - 4194304 - Equation of State | 1.713 | 1.546
pyhpc: CPU - Numpy - 4194304 - Isoneutral Mixing | 1.931 | 1.907
nwchem: C240 Buckyball | 1779.7 | 1660.6
incompact3d: X3D-benchmarking input.i3d | 72.1088104 | 68.8912379
incompact3d: input.i3d 129 Cells Per Direction | 0.903198322 | 0.831954996
incompact3d: input.i3d 193 Cells Per Direction | 2.82190266 | 2.56438231
gpaw: Carbon Nanotube | 88.866 | 86.877
stockfish: Chess Benchmark | 566141415 | 587474317
compress-7zip: Compression Rating | 911774 | 897725
compress-7zip: Decompression Rating | 1377979 | 1394947
build-llvm: Ninja | 94.061 | 76.730
build-llvm: Unix Makefiles | 214.439 | 197.129
build-linux-kernel: defconfig | 26.983 | 23.443
build-linux-kernel: allmodconfig | 193.114 | 131.149
svt-av1: Preset 3 - Bosphorus 4K | 9.106 | 9.155
svt-av1: Preset 5 - Bosphorus 4K | 31.193 | 30.856
svt-av1: Preset 8 - Bosphorus 4K | 66.992 | 63.967
svt-av1: Preset 13 - Bosphorus 4K | 195.989 | 193.486
svt-av1: Preset 5 - Beauty 4K 10-bit | 5.684 | 5.773
svt-av1: Preset 8 - Beauty 4K 10-bit | 8.191 | 7.991
svt-av1: Preset 13 - Beauty 4K 10-bit | 13.410 | 12.979
petsc: Streams | 595124.2316 | 449771.2850
blender: BMW27 - CPU-Only | 7.69 | 7.49
blender: Junkshop - CPU-Only | 10.89 | 10.85
blender: Classroom - CPU-Only | 17.51 | 17.55
blender: Fishy Cat - CPU-Only | 10.49 | 10.43
blender: Barbershop - CPU-Only | 68.40 | 68.00
blender: Pabellon Barcelona - CPU-Only | 21.54 | 21.28
cassandra: Writes | 147125 | 100742
pgbench: 100 - 1000 - Read Only | 638580 | 542652
pgbench: 100 - 1000 - Read Only - Average Latency | 1.645 | 1.843
pgbench: 100 - 1000 - Read Write | 13656 | 13506
pgbench: 100 - 1000 - Read Write - Average Latency | 73.226 | 74.043
byte: Dhrystone 2 | 18804694499.0 | 18658833934.3
byte: System Call | 910572120.1 | 897496230.9
byte: Whetstone Double | 3728535.6 | 3722311.4
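As a rough sketch of how the percentage gaps in the comparison can be derived from the raw HEX Mode and SNC3 - Default figures, the snippet below divides the better result by the worse one and accounts for whether a test is "More Is Better" or "Fewer Is Better". The function name leading_config is hypothetical, and the formula is an assumption about how the percentages are computed, although it does reproduce the 55.7% (libxsmm 256), 32.3% (PETSc Streams) and 9.9% (RELION) labels.

```python
def leading_config(hex_value, snc3_value, higher_is_better):
    """Return (name of the faster configuration, its lead in percent).

    For "Fewer Is Better" results (e.g. seconds) the values are inverted
    first, so a larger score always means a better result.
    """
    if higher_is_better:
        hex_score, snc3_score = hex_value, snc3_value
    else:
        hex_score, snc3_score = 1.0 / hex_value, 1.0 / snc3_value

    if hex_score >= snc3_score:
        return "HEX Mode", (hex_score / snc3_score - 1.0) * 100.0
    return "SNC3 - Default", (snc3_score / hex_score - 1.0) * 100.0


# (test, HEX Mode value, SNC3 - Default value, higher_is_better)
rows = [
    ("libxsmm: 256 (GFLOPS/s)", 2706.0, 4213.9, True),
    ("petsc: Streams (MB/s)", 595124.2316, 449771.2850, True),
    ("relion: Basic - CPU (seconds)", 114.397, 104.051, False),
]

for name, hex_value, snc3_value, higher_is_better in rows:
    winner, lead = leading_config(hex_value, snc3_value, higher_is_better)
    print(f"{name}: {winner} ahead by {lead:.1f}%")
```

Inverting the "Fewer Is Better" values keeps a single code path, so run times and throughput figures can be compared with the same ratio.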
libxsmm Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.
libxsmm 2-1.17-3645, M N K: 32 (GFLOPS/s, more is better): HEX Mode 3214.1 (SE +/- 68.51, N = 12); SNC3 - Default 3507.4 (SE +/- 58.49, N = 12).
libxsmm 2-1.17-3645, M N K: 64 (GFLOPS/s, more is better): HEX Mode 5554.0 (SE +/- 68.57, N = 15); SNC3 - Default 5826.5 (SE +/- 63.47, N = 15).
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
Graph500 3.0, Scale: 26 (bfs median_TEPS, more is better): HEX Mode 1530330000; SNC3 - Default 1748790000.
Graph500 3.0, Scale: 26 (bfs max_TEPS, more is better): HEX Mode 1954670000; SNC3 - Default 2116240000.
Graph500 3.0, Scale: 26 (sssp median_TEPS, more is better): HEX Mode 673326000; SNC3 - Default 871769000.
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
libxsmm 2-1.17-3645, M N K: 256 (GFLOPS/s, more is better): HEX Mode 2706.0 (SE +/- 36.38, N = 3); SNC3 - Default 4213.9 (SE +/- 124.41, N = 15).
libxsmm 2-1.17-3645, M N K: 128 (GFLOPS/s, more is better): HEX Mode 7862.2 (SE +/- 151.59, N = 9); SNC3 - Default 8182.9 (SE +/- 103.51, N = 3).
(CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Graph500 3.0, Scale: 26 (sssp max_TEPS, more is better): HEX Mode 964107000; SNC3 - Default 1105950000.
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
miniBUDE MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (GFInst/s, more is better): HEX Mode 7093.93 (SE +/- 66.01, N = 6); SNC3 - Default 6713.24 (SE +/- 151.76, N = 12).
miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, more is better): HEX Mode 283.76 (SE +/- 2.64, N = 6); SNC3 - Default 268.53 (SE +/- 6.07, N = 12).
(CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenRadioss OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenRadioss 2023.09.15, Model: Bumper Beam (seconds, fewer is better): HEX Mode 110.68 (SE +/- 0.36, N = 3); SNC3 - Default 110.55 (SE +/- 0.43, N = 3).
SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.
SPECFEM3D 4.1.1, Model: Mount St. Helens (seconds, fewer is better): HEX Mode 4.018974773 (SE +/- 0.026444752, N = 3); SNC3 - Default 3.995710366 (SE +/- 0.037097693, N = 7).
(F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss 2023.09.15, Model: Cell Phone Drop Test (seconds, fewer is better): HEX Mode 40.73 (SE +/- 0.29, N = 3); SNC3 - Default 39.18 (SE +/- 0.06, N = 3).
SPECFEM3D 4.1.1, Model: Homogeneous Halfspace (seconds, fewer is better): HEX Mode 5.513972044 (SE +/- 0.007527831, N = 3); SNC3 - Default 5.433174239 (SE +/- 0.012290725, N = 3).
SPECFEM3D 4.1.1, Model: Water-layered Halfspace (seconds, fewer is better): HEX Mode 9.321282291 (SE +/- 0.016038732, N = 3); SNC3 - Default 9.243850981 (SE +/- 0.054952663, N = 3).
SPECFEM3D 4.1.1, Model: Tomographic Model (seconds, fewer is better): HEX Mode 4.448644552 (SE +/- 0.014802330, N = 3); SNC3 - Default 4.361234856 (SE +/- 0.014525946, N = 3).
SPECFEM3D 4.1.1, Model: Layered Halfspace (seconds, fewer is better): HEX Mode 7.422688518 (SE +/- 0.036865236, N = 3); SNC3 - Default 7.207569259 (SE +/- 0.060537089, N = 3).
(F9X) gfortran options: -O2 -fopenmp -std=f2008 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenRadioss 2023.09.15, Model: Rubber O-Ring Seal Installation (seconds, fewer is better): HEX Mode 231.10 (SE +/- 6.59, N = 9); SNC3 - Default 219.85 (SE +/- 3.87, N = 12).
RELION RELION - REgularised LIkelihood OptimisatioN - is a stand-alone computer program for Maximum A Posteriori refinement of (multiple) 3D reconstructions or 2D class averages in cryo-electron microscopy (cryo-EM). It is developed in the research group of Sjors Scheres at the MRC Laboratory of Molecular Biology. Learn more via the OpenBenchmarking.org test page.
RELION 4.0.1, Test: Basic - Device: CPU (seconds, fewer is better): HEX Mode 114.40 (SE +/- 0.83, N = 3); SNC3 - Default 104.05 (SE +/- 0.54, N = 3).
(CXX) g++ options: -fopenmp -std=c++11 -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.16.1, Device: CPU - Batch Size: 512 - Model: ResNet-50 (images/sec, more is better): HEX Mode 214.73 (SE +/- 2.28, N = 4); SNC3 - Default 173.99 (SE +/- 1.54, N = 3).
GROMACS This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
GROMACS 2024, Implementation: MPI CPU - Input: water_GMX50_bare (ns per day, more is better): HEX Mode 32.14 (SE +/- 0.05, N = 3); SNC3 - Default 32.59 (SE +/- 0.10, N = 3).
(CXX) g++ options: -O3 -lm
NAMD 3.0, Input: ATPase with 327,506 Atoms (ns/day, more is better): HEX Mode 4.49970 (SE +/- 0.05577, N = 15); SNC3 - Default 3.78755 (SE +/- 0.01928, N = 3).
NAMD 3.0, Input: STMV with 1,066,628 Atoms (ns/day, more is better): HEX Mode 2.55596 (SE +/- 0.06201, N = 15); SNC3 - Default 1.97540 (SE +/- 0.01818, N = 13).
ASKAP ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve), along with some previous ASKAP benchmarks included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.
ASKAP 1.0, Test: tConvolve MPI - Degridding (Mpix/sec, more is better): HEX Mode 82064.2 (SE +/- 704.03, N = 3); SNC3 - Default 107661.0 (SE +/- 797.06, N = 3).
ASKAP 1.0, Test: tConvolve MPI - Gridding (Mpix/sec, more is better): HEX Mode 92985.2 (SE +/- 1249.70, N = 3); SNC3 - Default 125342.0 (SE +/- 1080.26, N = 3).
(CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
easyWave The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.
easyWave r34, Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 (seconds, fewer is better): HEX Mode 140.44 (SE +/- 1.63, N = 12); SNC3 - Default 141.91 (SE +/- 1.82, N = 12).
easyWave r34, Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 (seconds, fewer is better): HEX Mode 59.60 (SE +/- 0.84, N = 12); SNC3 - Default 52.97 (SE +/- 0.59, N = 5).
(CXX) g++ options: -O3 -fopenmp
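Each result is reported with a standard error (SE), which helps separate real differences from run-to-run noise. The sketch below expresses the gap between the two configurations as a multiple of their combined standard error, using the easyWave (Time: 2400) and NAMD (ATPase) figures from this article. It assumes the runs are independent, and the function name gap_in_standard_errors is hypothetical; this is a rough heuristic, not a formal significance test and not something the Phoronix Test Suite itself reports.

```python
import math


def gap_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error.

    Assumes the two sets of runs are independent, so their standard
    errors combine as the square root of the sum of squares.
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# easyWave, Time: 2400 (seconds): the gap is smaller than the combined SE.
easywave_gap = gap_in_standard_errors(140.44, 1.63, 141.91, 1.82)

# NAMD, ATPase with 327,506 Atoms (ns/day): the gap is many times the combined SE.
namd_gap = gap_in_standard_errors(4.49970, 0.05577, 3.78755, 0.01928)

print(f"easyWave 2400: {easywave_gap:.2f} combined standard errors")
print(f"NAMD ATPase:   {namd_gap:.2f} combined standard errors")
```

A gap well below about two combined standard errors, as with easyWave at Time: 2400, suggests the two modes are effectively tied on that test, while the NAMD gap is far outside the noise.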
ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, more is better): HEX Mode 3865.66 (SE +/- 5.73, N = 3); SNC3 - Default 3545.81 (SE +/- 6.73, N = 3).
(CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, more is better): HEX Mode 7708395000 (SE +/- 20523024.36, N = 3); SNC3 - Default 8504201333 (SE +/- 11711915.09, N = 3).
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
PyHPC Benchmarks PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.
PyHPC Benchmarks 3.0, Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State (seconds, fewer is better): HEX Mode 1.713 (SE +/- 0.012, N = 3); SNC3 - Default 1.546 (SE +/- 0.008, N = 3).
PyHPC Benchmarks 3.0, Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing (seconds, fewer is better): HEX Mode 1.931 (SE +/- 0.011, N = 3); SNC3 - Default 1.907 (SE +/- 0.020, N = 3).
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
NWChem 7.0.2, Input: C240 Buckyball (seconds, fewer is better): HEX Mode 1779.7; SNC3 - Default 1660.6.
(F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d 2021-03-11, Input: X3D-benchmarking input.i3d (seconds, fewer is better): HEX Mode 72.11 (SE +/- 0.09, N = 3); SNC3 - Default 68.89 (SE +/- 0.42, N = 3).
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (seconds, fewer is better): HEX Mode 0.903198322 (SE +/- 0.011942737, N = 3); SNC3 - Default 0.831954996 (SE +/- 0.005913486, N = 15).
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (seconds, fewer is better): HEX Mode 2.82190266 (SE +/- 0.02990610, N = 15); SNC3 - Default 2.56438231 (SE +/- 0.01327750, N = 3).
(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
GPAW 23.6, Input: Carbon Nanotube (seconds, fewer is better): HEX Mode 88.87 (SE +/- 0.28, N = 3); SNC3 - Default 86.88 (SE +/- 0.72, N = 3).
(CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Stockfish 17, Chess Benchmark (nodes per second, more is better): HEX Mode 566141415 (SE +/- 10082654.90, N = 9); SNC3 - Default 587474317 (SE +/- 16310733.41, N = 6).
(CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
7-Zip Compression 24.05, Test: Compression Rating (MIPS, more is better): HEX Mode 911774 (SE +/- 5282.93, N = 3); SNC3 - Default 897725 (SE +/- 8001.79, N = 3).
7-Zip Compression 24.05, Test: Decompression Rating (MIPS, more is better): HEX Mode 1377979 (SE +/- 5512.25, N = 3); SNC3 - Default 1394947 (SE +/- 10392.40, N = 3).
(CXX) g++ options: -lpthread -ldl -O2 -fPIC
SVT-AV1 2.2, Encoder Mode: Preset 3 - Input: Bosphorus 4K (frames per second, more is better): HEX Mode 9.106 (SE +/- 0.031, N = 3); SNC3 - Default 9.155 (SE +/- 0.011, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 5 - Input: Bosphorus 4K (frames per second, more is better): HEX Mode 31.19 (SE +/- 0.19, N = 3); SNC3 - Default 30.86 (SE +/- 0.35, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 8 - Input: Bosphorus 4K (frames per second, more is better): HEX Mode 66.99 (SE +/- 0.74, N = 3); SNC3 - Default 63.97 (SE +/- 0.65, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 13 - Input: Bosphorus 4K (frames per second, more is better): HEX Mode 195.99 (SE +/- 4.03, N = 12); SNC3 - Default 193.49 (SE +/- 0.39, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit (frames per second, more is better): HEX Mode 5.684 (SE +/- 0.018, N = 3); SNC3 - Default 5.773 (SE +/- 0.006, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit (frames per second, more is better): HEX Mode 8.191 (SE +/- 0.009, N = 3); SNC3 - Default 7.991 (SE +/- 0.055, N = 3).
SVT-AV1 2.2, Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit (frames per second, more is better): HEX Mode 13.41 (SE +/- 0.00, N = 3); SNC3 - Default 12.98 (SE +/- 0.02, N = 3).
(CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
PETSc

PETSc, the Portable, Extensible Toolkit for Scientific Computation, is for the scalable (parallel) solution of scientific applications modeled by partial differential equations. This test profile runs the PETSc "make streams" benchmark and records the throughput rate when all available cores are utilized for the MPI Streams build.
PETSc 3.19 - Test: Streams (MB/s, More Is Better)
HEX Mode: 595124.23 (SE +/- 1960.36, N = 3)
SNC3 - Default: 449771.29 (SE +/- 5879.52, N = 4)
1. (CC) gcc options: -fPIC -O3 -O2 -lpthread -lm
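As a rough aid for reading the gap between the two PETSc Streams figures above (595124.23 and 449771.29 MB/s, taken in the order the configurations are listed: HEX Mode, then SNC3 - Default), the following Python sketch computes the relative difference and expresses each standard error as a percentage of its mean. The function names are hypothetical helpers introduced here for illustration; they are not part of the Phoronix Test Suite.

```python
def relative_gain_percent(a: float, b: float) -> float:
    """Percent by which result a exceeds result b."""
    return (a / b - 1.0) * 100.0


def relative_se_percent(mean: float, se: float) -> float:
    """Standard error expressed as a percentage of the mean."""
    return se / mean * 100.0


# Values copied from the PETSc 3.19 Streams result (MB/s, more is better).
# Assumption: values follow the listed configuration order.
hex_mode = {"mean": 595124.23, "se": 1960.36}
snc3_default = {"mean": 449771.29, "se": 5879.52}

gain = relative_gain_percent(hex_mode["mean"], snc3_default["mean"])
print(f"First-listed result is {gain:.1f}% higher than the second")

for label, result in (("HEX Mode", hex_mode), ("SNC3 - Default", snc3_default)):
    pct = relative_se_percent(result["mean"], result["se"])
    print(f"{label}: SE is {pct:.2f}% of the mean")
```

The gap works out to roughly 32%, far larger than either standard error, so run-to-run noise alone is unlikely to explain it. The same two helpers apply to any other result pair on this page; for "Fewer Is Better" tests the ratio should be read the other way around.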
Blender

Blender 4.2 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 7.69 (SE +/- 0.05, N = 3)
SNC3 - Default: 7.49 (SE +/- 0.02, N = 3)

Blender 4.2 - Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 10.89 (SE +/- 0.06, N = 3)
SNC3 - Default: 10.85 (SE +/- 0.03, N = 3)

Blender 4.2 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 17.51 (SE +/- 0.04, N = 3)
SNC3 - Default: 17.55 (SE +/- 0.14, N = 3)

Blender 4.2 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 10.49 (SE +/- 0.10, N = 3)
SNC3 - Default: 10.43 (SE +/- 0.12, N = 3)

Blender 4.2 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 68.40 (SE +/- 0.18, N = 3)
SNC3 - Default: 68.00 (SE +/- 0.33, N = 3)

Blender 4.2 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
HEX Mode: 21.54 (SE +/- 0.03, N = 3)
SNC3 - Default: 21.28 (SE +/- 0.05, N = 3)
PostgreSQL

This is a benchmark of PostgreSQL that uses the integrated pgbench tool to drive the database workloads.
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only (TPS, More Is Better)
HEX Mode: 638580 (SE +/- 42784.99, N = 12)
SNC3 - Default: 542652 (SE +/- 6739.41, N = 3)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
HEX Mode: 1.645 (SE +/- 0.108, N = 12)
SNC3 - Default: 1.843 (SE +/- 0.023, N = 3)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better)
HEX Mode: 13656 (SE +/- 2.00, N = 3)
SNC3 - Default: 13506 (SE +/- 37.41, N = 3)

PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
HEX Mode: 73.23 (SE +/- 0.01, N = 3)
SNC3 - Default: 74.04 (SE +/- 0.20, N = 3)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
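The PostgreSQL throughput and latency figures above are two views of the same runs. With a fixed number of concurrent clients, Little's law suggests that average latency should be close to clients divided by transactions per second. The Python sketch below applies that check to the four reported pairs, taking the values in the listed order (HEX Mode, then SNC3 - Default). It is only a consistency check under that assumption, not a description of how pgbench or the test profile derives its numbers, and `expected_latency_ms` is a hypothetical helper name.

```python
def expected_latency_ms(clients: int, tps: float) -> float:
    """Average latency in ms implied by Little's law: clients / throughput."""
    return clients / tps * 1000.0


CLIENTS = 1000  # from the test description: "Clients: 1000"

# (mode, configuration, reported TPS, reported average latency in ms)
reported = [
    ("Read Only", "HEX Mode", 638580, 1.645),
    ("Read Only", "SNC3 - Default", 542652, 1.843),
    ("Read Write", "HEX Mode", 13656, 73.23),
    ("Read Write", "SNC3 - Default", 13506, 74.04),
]

for mode, config, tps, latency in reported:
    implied = expected_latency_ms(CLIENTS, tps)
    print(f"{mode:10s} {config:15s} reported {latency:7.3f} ms, "
          f"implied {implied:7.3f} ms")
```

Three of the four pairs agree to within rounding. The HEX Mode read-only pair does not (about 1.57 ms implied against 1.645 ms reported); that result was averaged over 12 runs with a large standard error, and the mean of per-run latencies need not equal the latency implied by the mean throughput.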
BYTE Unix Benchmark

BYTE Unix Benchmark 5.1.3-git - Computational Test: Dhrystone 2 (LPS, More Is Better)
HEX Mode: 18804694499.0 (SE +/- 9501831.17, N = 3)
SNC3 - Default: 18658833934.3 (SE +/- 22131931.33, N = 3)

BYTE Unix Benchmark 5.1.3-git - Computational Test: System Call (LPS, More Is Better)
HEX Mode: 910572120.1 (SE +/- 144616.99, N = 3)
SNC3 - Default: 897496230.9 (SE +/- 219845.26, N = 3)

BYTE Unix Benchmark 5.1.3-git - Computational Test: Whetstone Double (MWIPS, More Is Better)
HEX Mode: 3728535.6 (SE +/- 113.17, N = 3)
SNC3 - Default: 3722311.4 (SE +/- 382.28, N = 3)

1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
HEX Mode
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 24 September 2024 11:15 by user intel.
SNC3 - Default
Processor: 2 x Intel Xeon 6980P @ 3.90GHz (256 Cores / 512 Threads), Motherboard: Intel BIRCHSTREAM (BHSDCRB1.IPC.0035.D44.2408292336 BIOS), Chipset: Intel Ice Lake IEH, Memory: 1520GB, Disk: 960GB SAMSUNG MZ1L2960HCJR-00A07, Graphics: ASPEED, Network: Intel I210 + 2 x Intel 10-Gigabit X540-AT2
OS: Ubuntu 24.04, Kernel: 6.8.0-45-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x10002f0
Java Notes: OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)
Python Notes: Python 3.12.3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 24 September 2024 23:52 by user intel.