12700k-hpc-avx512-performance-profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112119-TJ-12700KHPC25
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt
December 10 2021
  3 Hours, 31 Minutes
Intel Core i7-12700K
December 10 2021
  3 Hours, 54 Minutes
Invert Behavior (Only Show Selected Data)
  3 Hours, 42 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


12700k-hpc-avx512-performance-profilingOpenBenchmarking.orgPhoronix Test SuiteIntel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)Intel Device 7aa732GB500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 ProGigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)Realtek ALC897LG HDR WQHDIntel I225-VPop 21.045.15.5-76051505-generic (x86_64)GNOME Shell 3.38.4X Server 1.20.114.6 Mesa 21.2.2 (LLVM 12.0.0)OpenCL 2.2 AMD-APP (3361.0)1.2.185GCC 11.1.0ext43440x1440ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution12700k-hpc-avx512-performance-profiling BenchmarksSystem Logs- Transparent Huge Pages: madvise- CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3- Python 2.7.18 + Python 3.9.5- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt vs. Intel Core i7-12700K ComparisonPhoronix Test SuiteBaseline+3.6%+3.6%+7.2%+7.2%+10.8%+10.8%9.9%4.5%3.5%2.6%2.1%R.R.B14.2%EP-DGEMM11.4%O.M.G10.7%Float + SSE - 2D FFT Size 4096EP.C6.7%leblancbig6.2%Small6%R.R.L5.5%SP.B5.5%sedovbig5.4%MG.C5.3%ATPase Simulation - 327,506 Atoms5%G-Ptrans5%SP.C5%G-Ffte4.5%Float + SSE - 2D FFT Size 32FT.C4.4%Stock - 2D FFT Size 4096OpenMP LBM3.1%L.E.H2.8%EP-STREAM Triad2.8%Fayalite-FIST2.4%G-Rand Access2.1%OpenMP StencilLU.C2.1%HPC ChallengeHPC ChallengeParboilFFTWNAS Parallel BenchmarksPennantminiFEHPC ChallengeNAS Parallel BenchmarksPennantNAS Parallel BenchmarksNAMDHPC ChallengeNAS Parallel BenchmarksHPC ChallengeFFTWNAS Parallel BenchmarksFFTWParboilCloverLeafHPC ChallengeCP2K Molecular DynamicsAlgebraic Multi-Grid BenchmarkHPC ChallengeParboilNAS Parallel Benchmarks12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K

12700k-hpc-avx512-performance-profilinghpcc: EP-DGEMMfftw: Float + SSE - 2D FFT Size 4096npb: EP.Cpennant: leblancbigminife: Smallnpb: SP.Bpennant: sedovbignpb: MG.Cnamd: ATPase Simulation - 327,506 Atomshpcc: G-Ptransnpb: SP.Chpcc: G-Fftenpb: FT.Cfftw: Stock - 2D FFT Size 4096parboil: OpenMP LBMcloverleaf: Lagrangian-Eulerian Hydrodynamicshpcc: EP-STREAM Triadcp2k: Fayalite-FISTamg: hpcc: G-Rand Accessparboil: OpenMP Stencilnpb: LU.Cnpb: IS.Dnpb: CG.Cnpb: EP.Dneat: fftw: Stock - 1D FFT Size 32hpcc: G-HPLhpcg: fftw: Stock - 1D FFT Size 4096fftw: Float + SSE - 1D FFT Size 4096parboil: OpenMP CUTCPnpb: BT.Cfftw: Stock - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 32mrbayes: Primate Phylogeny Analysisdolfyn: Computational Fluid Dynamicslczero: BLAShpcc: Max Ping Pong Bandwidthffte: N=256, 1D Complex FFT Routinefftw: Float + SSE - 2D FFT Size 32parboil: OpenMP MRI Griddinghpcc: Rand Ring Bandwidthhpcc: Rand Ring Latency12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K22.76623397611953.1945.791426404.277514.2766.1036816859.631.168383.899746894.328.3292517792.9613307113.197777127.235.15031358.7193048557000.0520515.36841734915.85993.717783.661799.6126.60022769100.893675.93411185251025733.20575122646.01231473218172.30710.94893031619.9317902643.9842313.659570.2405320.43707437131831.2048.610426040.197125.6669.7047416013.491.226753.715546568.747.9683217038.5913768116.680989130.845.01052349.6082976449000.0509715.05606934212.34975.627661.181826.0426.9812246499.831235.87438183431035603.17785122496.45230313209172.47610.96093131619.60355882.6516161828256248.6988613.204220.25387OpenBenchmarking.org

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt510152025SE +/- 0.41, N = 3SE +/- 0.07, N = 320.4422.771. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt9K18K27K36K45KSE +/- 229.21, N = 3SE +/- 204.18, N = 343713397611. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt400800120016002000SE +/- 28.20, N = 15SE +/- 18.12, N = 151831.201953.191. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1122334455SE +/- 0.22, N = 3SE +/- 0.06, N = 348.6145.791. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt14002800420056007000SE +/- 49.96, N = 15SE +/- 12.96, N = 36040.196404.271. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt16003200480064008000SE +/- 27.67, N = 3SE +/- 4.56, N = 37125.667514.271. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1632486480SE +/- 0.51, N = 3SE +/- 0.17, N = 369.7066.101. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 74.83, N = 3SE +/- 10.22, N = 316013.4916859.631. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.2760.5520.8281.1041.38SE +/- 0.00077, N = 3SE +/- 0.00439, N = 31.226751.16838

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.87741.75482.63223.50964.387SE +/- 0.04763, N = 3SE +/- 0.00899, N = 33.715543.899741. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt15003000450060007500SE +/- 13.30, N = 3SE +/- 5.69, N = 36568.746894.321. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt246810SE +/- 0.05761, N = 3SE +/- 0.01478, N = 37.968328.329251. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 117.45, N = 3SE +/- 28.43, N = 317038.5917792.961. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt3K6K9K12K15KSE +/- 170.25, N = 4SE +/- 18.37, N = 313768133071. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt306090120150SE +/- 0.78, N = 3SE +/- 0.01, N = 3116.68113.201. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt306090120150SE +/- 0.16, N = 3SE +/- 0.02, N = 3130.84127.231. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1.15882.31763.47644.63525.794SE +/- 0.01798, N = 3SE +/- 0.00183, N = 35.010525.150311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the SSMP (OpenMP) version of cp2k. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.2Input: Fayalite-FISTIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt80160240320400349.61358.72

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt70M140M210M280M350MSE +/- 402307.41, N = 3SE +/- 27694.10, N = 32976449003048557001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.01170.02340.03510.04680.0585SE +/- 0.00034, N = 3SE +/- 0.00039, N = 30.050970.052051. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt48121620SE +/- 0.01, N = 3SE +/- 0.16, N = 315.0615.371. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 86.62, N = 3SE +/- 78.61, N = 334212.3434915.851. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2004006008001000SE +/- 4.70, N = 3SE +/- 0.34, N = 3975.62993.711. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2K4K6K8K10KSE +/- 5.60, N = 3SE +/- 5.12, N = 37661.187783.661. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt400800120016002000SE +/- 26.34, N = 15SE +/- 27.45, N = 151826.041799.611. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.3Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt612182430SE +/- 0.01, N = 3SE +/- 0.09, N = 326.9826.601. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 294.36, N = 3SE +/- 9.02, N = 322464227691. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20406080100SE +/- 0.82, N = 3SE +/- 0.26, N = 399.83100.891. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1.33522.67044.00565.34086.676SE +/- 0.01365, N = 3SE +/- 0.00086, N = 35.874385.934111. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt4K8K12K16K20KSE +/- 171.90, N = 3SE +/- 112.58, N = 318343185251. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20K40K60K80K100KSE +/- 1383.85, N = 3SE +/- 738.30, N = 121035601025731. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.72131.44262.16392.88523.6065SE +/- 0.028089, N = 3SE +/- 0.042404, N = 33.1778513.2057511. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 44.89, N = 3SE +/- 19.61, N = 322496.4522646.011. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt5K10K15K20K25KSE +/- 150.35, N = 3SE +/- 120.03, N = 323031231471. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 67.42, N = 3SE +/- 8.29, N = 332091321811. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Timed MrBayes Analysis

This test performs a bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1632486480SE +/- 0.16, N = 3SE +/- 0.44, N = 372.4872.311. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt3691215SE +/- 0.04, N = 3SE +/- 0.02, N = 310.9610.95

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2004006008001000SE +/- 8.74, N = 3SE +/- 9.13, N = 39319301. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt7K14K21K28K35KSE +/- 106.25, N = 3SE +/- 44.84, N = 331619.6031619.931. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0Test: N=256, 1D Complex FFT RoutineIntel Core i7-12700K12K24K36K48K60KSE +/- 339.88, N = 355882.651. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20K40K60K80K100KSE +/- 619.05, N = 11SE +/- 2332.92, N = 1582562790261. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

Problem Size: Medium

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Intel Core i7-12700K: The test quit with a non-zero exit status.

Problem Size: Large

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: The test quit with a non-zero exit status.

Intel Core i7-12700K: The test quit with a non-zero exit status.

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt1122334455SE +/- 0.84, N = 15SE +/- 0.50, N = 348.7043.981. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Test: OpenMP MRI-Q

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

Intel Core i7-12700K: main.c:(.text.startup+0x20c): undefined reference to `ComputeQCPU'

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.82341.64682.47023.29364.117SE +/- 0.13946, N = 3SE +/- 0.01551, N = 33.204223.659571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.05710.11420.17130.22840.2855SE +/- 0.01103, N = 3SE +/- 0.00071, N = 30.253870.240531. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0