12700k-hpc-avx512-performance-profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt

Kernel Notes: Transparent Huge Pages: madvise
Environment Notes: CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3
Python Notes: Python 2.7.18 + Python 3.9.5
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Intel Core i7-12700K

Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads), Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS), Chipset: Intel Device 7aa7, Memory: 32GB, Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro, Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz), Audio: Realtek ALC897, Monitor: LG HDR WQHD, Network: Intel I225-V

OS: Pop 21.04, Kernel: 5.15.5-76051505-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0), OpenCL: OpenCL 2.2 AMD-APP (3361.0), Vulkan: 1.2.185, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 3440x1440

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

HPC Challenge

NAS Parallel Benchmarks

HPC Challenge

NAS Parallel Benchmarks

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

HPC Challenge

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the SSMP (OpenMP) version of cp2k. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

HPC Challenge

Parboil

NAS Parallel Benchmarks

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

HPC Challenge

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

Parboil

NAS Parallel Benchmarks

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

HPC Challenge

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Problem Size: Medium

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Intel Core i7-12700K: The test quit with a non-zero exit status.

Problem Size: Large

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Intel Core i7-12700K: The test quit with a non-zero exit status.

Parboil

Test: OpenMP MRI-Q

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

Intel Core i7-12700K: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

12700k-hpc-avx512-performance-profiling

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt

Intel Core i7-12700K

HPC Challenge

FFTW

NAS Parallel Benchmarks

Pennant

miniFE

NAS Parallel Benchmarks

Pennant

NAS Parallel Benchmarks

NAMD

HPC Challenge

NAS Parallel Benchmarks

HPC Challenge

NAS Parallel Benchmarks

FFTW

Parboil

CloverLeaf

HPC Challenge

CP2K Molecular Dynamics

Algebraic Multi-Grid Benchmark

HPC Challenge

Parboil

NAS Parallel Benchmarks

Nebular Empirical Analysis Tool

FFTW

HPC Challenge

High Performance Conjugate Gradient

FFTW

Parboil

NAS Parallel Benchmarks

FFTW

Timed MrBayes Analysis

Dolfyn

LeelaChessZero

HPC Challenge

FFTE

FFTW

miniFE

Parboil

HPC Challenge

44 Results Shown

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt

Intel Core i7-12700K