12700k-hpc-avx512-performance-profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112119-TJ-12700KHPC25
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

BLAS (Basic Linear Algebra Sub-Routine) Tests 2 Tests
C/C++ Compiler Tests 2 Tests
CPU Massive 10 Tests
Fortran Tests 7 Tests
HPC - High Performance Computing 16 Tests
Linear Algebra 2 Tests
Molecular Dynamics 6 Tests
MPI Benchmarks 6 Tests
Multi-Core 5 Tests
OpenMPI Tests 10 Tests
Programmer / Developer System Benchmarks 2 Tests
Scientific Computing 12 Tests
Server CPU Tests 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt
December 10 2021
  3 Hours, 31 Minutes
Intel Core i7-12700K
December 10 2021
  3 Hours, 54 Minutes
Invert Hiding All Results Option
  3 Hours, 42 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


12700k-hpc-avx512-performance-profilingOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)Intel Device 7aa732GB500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 ProGigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)Realtek ALC897LG HDR WQHDIntel I225-VPop 21.045.15.5-76051505-generic (x86_64)GNOME Shell 3.38.4X Server 1.20.114.6 Mesa 21.2.2 (LLVM 12.0.0)OpenCL 2.2 AMD-APP (3361.0)1.2.185GCC 11.1.0ext43440x1440ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution12700k-hpc-avx512-performance-profiling BenchmarksSystem Logs- Transparent Huge Pages: madvise- CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3- Python 2.7.18 + Python 3.9.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt vs. Intel Core i7-12700K ComparisonPhoronix Test SuiteBaseline+3.6%+3.6%+7.2%+7.2%+10.8%+10.8%9.9%4.5%3.5%2.6%2.1%R.R.B14.2%EP-DGEMM11.4%O.M.G10.7%Float + SSE - 2D FFT Size 4096EP.C6.7%leblancbig6.2%Small6%R.R.L5.5%SP.B5.5%sedovbig5.4%MG.C5.3%ATPase Simulation - 327,506 Atoms5%G-Ptrans5%SP.C5%G-Ffte4.5%Float + SSE - 2D FFT Size 32FT.C4.4%Stock - 2D FFT Size 4096OpenMP LBM3.1%L.E.H2.8%EP-STREAM Triad2.8%Fayalite-FIST2.4%G-Rand Access2.1%OpenMP StencilLU.C2.1%HPC ChallengeHPC ChallengeParboilFFTWNAS Parallel BenchmarksPennantminiFEHPC ChallengeNAS Parallel BenchmarksPennantNAS Parallel BenchmarksNAMDHPC ChallengeNAS Parallel BenchmarksHPC ChallengeFFTWNAS Parallel BenchmarksFFTWParboilCloverLeafHPC ChallengeCP2K Molecular DynamicsAlgebraic Multi-Grid BenchmarkHPC ChallengeParboilNAS Parallel Benchmarks12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K

12700k-hpc-avx512-performance-profilinghpcc: G-HPLnpb: EP.Dlczero: BLASnpb: SP.Cfftw: Float + SSE - 2D FFT Size 4096hpcg: parboil: OpenMP MRI Griddingminife: Smallfftw: Stock - 2D FFT Size 4096cloverleaf: Lagrangian-Eulerian Hydrodynamicsnpb: BT.Ccp2k: Fayalite-FISTparboil: OpenMP LBMmrbayes: Primate Phylogeny Analysispennant: sedovbignpb: LU.Cnamd: ATPase Simulation - 327,506 Atomsnpb: IS.Dnpb: SP.Bpennant: leblancbigneat: npb: EP.Cnpb: FT.Cnpb: CG.Cfftw: Float + SSE - 1D FFT Size 4096parboil: OpenMP Stencilamg: fftw: Float + SSE - 2D FFT Size 32npb: MG.Cdolfyn: Computational Fluid Dynamicsfftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32parboil: OpenMP CUTCPfftw: Float + SSE - 1D FFT Size 32ffte: N=256, 1D Complex FFT Routinehpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Ffte12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K100.893671799.619306894.32397615.9341143.9842316404.2713307127.2322646.01358.719113.19777772.30766.1036834915.851.16838993.717514.2745.7914226.6001953.1917792.967783.6610257315.3684173048557007902616859.6310.9481852522769231473.2057513218131619.9313.659570.240530.052055.150313.8997422.766238.3292599.831231826.049316568.74437135.8743848.6988616040.1913768130.8422496.45349.608116.68098972.47669.7047434212.341.22675975.627125.6648.6104226.9811831.2017038.597661.1810356015.0560692976449008256216013.4910.9601834322464230313.1778513209155882.65161618231619.6033.204220.253870.050975.010523.7155420.437077.96832OpenBenchmarking.org

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20406080100SE +/- 0.82, N = 3SE +/- 0.26, N = 399.83100.891. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K400800120016002000SE +/- 27.45, N = 15SE +/- 26.34, N = 151799.611826.041. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLAS12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K2004006008001000SE +/- 9.13, N = 3SE +/- 8.74, N = 39309311. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt15003000450060007500SE +/- 13.30, N = 3SE +/- 5.69, N = 36568.746894.321. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 409612700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K9K18K27K36K45KSE +/- 204.18, N = 3SE +/- 229.21, N = 339761437131. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1.33522.67044.00565.34086.676SE +/- 0.01365, N = 3SE +/- 0.00086, N = 35.874385.934111. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1122334455SE +/- 0.84, N = 15SE +/- 0.50, N = 348.7043.981. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt14002800420056007000SE +/- 49.96, N = 15SE +/- 12.96, N = 36040.196404.271. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 409612700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K3K6K9K12K15KSE +/- 18.37, N = 3SE +/- 170.25, N = 413307137681. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt306090120150SE +/- 0.16, N = 3SE +/- 0.02, N = 3130.84127.231. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 44.89, N = 3SE +/- 19.61, N = 322496.4522646.011. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the SSMP (OpenMP) version of cp2k. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.2Input: Fayalite-FIST12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K80160240320400358.72349.61

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt306090120150SE +/- 0.78, N = 3SE +/- 0.01, N = 3116.68113.201. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1632486480SE +/- 0.16, N = 3SE +/- 0.44, N = 372.4872.311. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1632486480SE +/- 0.51, N = 3SE +/- 0.17, N = 369.7066.101. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 86.62, N = 3SE +/- 78.61, N = 334212.3434915.851. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.2760.5520.8281.1041.38SE +/- 0.00077, N = 3SE +/- 0.00439, N = 31.226751.16838

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2004006008001000SE +/- 4.70, N = 3SE +/- 0.34, N = 3975.62993.711. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt16003200480064008000SE +/- 27.67, N = 3SE +/- 4.56, N = 37125.667514.271. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1122334455SE +/- 0.22, N = 3SE +/- 0.06, N = 348.6145.791. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.3Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt612182430SE +/- 0.01, N = 3SE +/- 0.09, N = 326.9826.601. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt400800120016002000SE +/- 28.20, N = 15SE +/- 18.12, N = 151831.201953.191. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 117.45, N = 3SE +/- 28.43, N = 317038.5917792.961. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2K4K6K8K10KSE +/- 5.60, N = 3SE +/- 5.12, N = 37661.187783.661. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 409612700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K20K40K60K80K100KSE +/- 738.30, N = 12SE +/- 1383.85, N = 31025731035601. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Problem Size: Medium

Intel Core i7-12700K: The test quit with a non-zero exit status.

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP Stencil12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K48121620SE +/- 0.16, N = 3SE +/- 0.01, N = 315.3715.061. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt70M140M210M280M350MSE +/- 402307.41, N = 3SE +/- 27694.10, N = 32976449003048557001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 3212700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K20K40K60K80K100KSE +/- 2332.92, N = 15SE +/- 619.05, N = 1179026825621. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 74.83, N = 3SE +/- 10.22, N = 316013.4916859.631. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt3691215SE +/- 0.04, N = 3SE +/- 0.02, N = 310.9610.95

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 171.90, N = 3SE +/- 112.58, N = 318343185251. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 294.36, N = 3SE +/- 9.02, N = 322464227691. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 150.35, N = 3SE +/- 120.03, N = 323031231471. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCP12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.72131.44262.16392.88523.6065SE +/- 0.042404, N = 3SE +/- 0.028089, N = 33.2057513.1778511. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 67.42, N = 3SE +/- 8.29, N = 332091321811. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0Test: N=256, 1D Complex FFT RoutineIntel Core i7-12700K12K24K36K48K60KSE +/- 339.88, N = 355882.651. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Problem Size: Large

Intel Core i7-12700K: The test quit with a non-zero exit status.

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

Test: OpenMP MRI-Q

Intel Core i7-12700K: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 106.25, N = 3SE +/- 44.84, N = 331619.6031619.931. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.82341.64682.47023.29364.117SE +/- 0.13946, N = 3SE +/- 0.01551, N = 33.204223.659571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.05710.11420.17130.22840.2855SE +/- 0.01103, N = 3SE +/- 0.00071, N = 30.253870.240531. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.01170.02340.03510.04680.0585SE +/- 0.00034, N = 3SE +/- 0.00039, N = 30.050970.052051. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1.15882.31763.47644.63525.794SE +/- 0.01798, N = 3SE +/- 0.00183, N = 35.010525.150311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.87741.75482.63223.50964.387SE +/- 0.04763, N = 3SE +/- 0.00899, N = 33.715543.899741. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt510152025SE +/- 0.41, N = 3SE +/- 0.07, N = 320.4422.771. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt246810SE +/- 0.05761, N = 3SE +/- 0.01478, N = 37.968328.329251. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0