12700k-hpc-avx512-performance-profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112119-TJ-12700KHPC25&sor.

12700k-hpc-avx512-performance-profilingProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen Resolution12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700KIntel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)Intel Device 7aa732GB500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 ProGigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)Realtek ALC897LG HDR WQHDIntel I225-VPop 21.045.15.5-76051505-generic (x86_64)GNOME Shell 3.38.4X Server 1.20.114.6 Mesa 21.2.2 (LLVM 12.0.0)OpenCL 2.2 AMD-APP (3361.0)1.2.185GCC 11.1.0ext43440x1440OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" Compiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3Python Details- Python 2.7.18 + Python 3.9.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

12700k-hpc-avx512-performance-profilinghpcg: npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Bnpb: SP.Chpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthlczero: BLASparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingminife: Smallcloverleaf: Lagrangian-Eulerian Hydrodynamicscp2k: Fayalite-FISTnamd: ATPase Simulation - 327,506 Atomsdolfyn: Computational Fluid Dynamicsneat: amg: ffte: N=256, 1D Complex FFT Routinefftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 2D FFT Size 32fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 4096pennant: sedovbigpennant: leblancbigmrbayes: Primate Phylogeny Analysis12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K5.9341122646.017783.661953.191799.6117792.96993.7134915.8516859.637514.276894.32100.893678.3292522.766233.899745.150310.052050.240533.6595731619.931930113.1977773.20575115.36841743.9842316404.27127.23358.7191.1683810.94826.6003048557002276923147185251330732181790261025733976166.1036845.7914272.3075.8743822496.457661.181831.201826.0417038.59975.6234212.3416013.497125.666568.7499.831237.9683220.437073.715545.010520.050970.253873.2042231619.603931116.6809893.17785115.05606948.6988616040.19130.84349.6081.2267510.96026.98129764490055882.6516161822246423031183431376832091825621035604371369.7047448.6104272.476OpenBenchmarking.org

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.112700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1.33522.67044.00565.34086.676SE +/- 0.00086, N = 3SE +/- 0.01365, N = 35.934115.874381. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K5K10K15K20K25KSE +/- 19.61, N = 3SE +/- 44.89, N = 322646.0122496.451. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K2K4K6K8K10KSE +/- 5.12, N = 3SE +/- 5.60, N = 37783.667661.181. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K400800120016002000SE +/- 18.12, N = 15SE +/- 28.20, N = 151953.191831.201. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt400800120016002000SE +/- 26.34, N = 15SE +/- 27.45, N = 151826.041799.611. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K4K8K12K16K20KSE +/- 28.43, N = 3SE +/- 117.45, N = 317792.9617038.591. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K2004006008001000SE +/- 0.34, N = 3SE +/- 4.70, N = 3993.71975.621. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K7K14K21K28K35KSE +/- 78.61, N = 3SE +/- 86.62, N = 334915.8534212.341. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K4K8K12K16K20KSE +/- 10.22, N = 3SE +/- 74.83, N = 316859.6316013.491. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: SP.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K16003200480064008000SE +/- 4.56, N = 3SE +/- 27.67, N = 37514.277125.661. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K15003000450060007500SE +/- 5.69, N = 3SE +/- 13.30, N = 36894.326568.741. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K20406080100SE +/- 0.26, N = 3SE +/- 0.82, N = 3100.8999.831. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K246810SE +/- 0.01478, N = 3SE +/- 0.05761, N = 38.329257.968321. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K510152025SE +/- 0.07, N = 3SE +/- 0.41, N = 322.7720.441. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.87741.75482.63223.50964.387SE +/- 0.00899, N = 3SE +/- 0.04763, N = 33.899743.715541. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1.15882.31763.47644.63525.794SE +/- 0.00183, N = 3SE +/- 0.01798, N = 35.150315.010521. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.01170.02340.03510.04680.0585SE +/- 0.00039, N = 3SE +/- 0.00034, N = 30.052050.050971. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.05710.11420.17130.22840.2855SE +/- 0.00071, N = 3SE +/- 0.01103, N = 30.240530.253871. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.82341.64682.47023.29364.117SE +/- 0.01551, N = 3SE +/- 0.13946, N = 33.659573.204221. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K7K14K21K28K35KSE +/- 44.84, N = 3SE +/- 106.25, N = 331619.9331619.601. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt2004006008001000SE +/- 8.74, N = 3SE +/- 9.13, N = 39319301. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBM12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K306090120150SE +/- 0.01, N = 3SE +/- 0.78, N = 3113.20116.681. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt0.72131.44262.16392.88523.6065SE +/- 0.028089, N = 3SE +/- 0.042404, N = 33.1778513.2057511. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt48121620SE +/- 0.01, N = 3SE +/- 0.16, N = 315.0615.371. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI Gridding12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1122334455SE +/- 0.50, N = 3SE +/- 0.84, N = 1543.9848.701. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Small12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K14002800420056007000SE +/- 12.96, N = 3SE +/- 49.96, N = 156404.276040.191. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K306090120150SE +/- 0.02, N = 3SE +/- 0.16, N = 3127.23130.841. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

CP2K Molecular Dynamics

Input: Fayalite-FIST

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.2Input: Fayalite-FISTIntel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt80160240320400349.61358.72

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K0.2760.5520.8281.1041.38SE +/- 0.00439, N = 3SE +/- 0.00077, N = 31.168381.22675

Dolfyn

Computational Fluid Dynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid Dynamics12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K3691215SE +/- 0.02, N = 3SE +/- 0.04, N = 310.9510.96

Nebular Empirical Analysis Tool

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.312700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K612182430SE +/- 0.09, N = 3SE +/- 0.01, N = 326.6026.981. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.212700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K70M140M210M280M350MSE +/- 27694.10, N = 3SE +/- 402307.41, N = 33048557002976449001. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FFTE

Test: N=256, 1D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0Test: N=256, 1D Complex FFT RoutineIntel Core i7-12700K12K24K36K48K60KSE +/- 339.88, N = 355882.651. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 3212700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K5K10K15K20K25KSE +/- 9.02, N = 3SE +/- 294.36, N = 322769224641. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Stock - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 3212700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K5K10K15K20K25KSE +/- 120.03, N = 3SE +/- 150.35, N = 323147230311. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 409612700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K4K8K12K16K20KSE +/- 112.58, N = 3SE +/- 171.90, N = 318525183431. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt3K6K9K12K15KSE +/- 170.25, N = 4SE +/- 18.37, N = 313768133071. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 3212700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K7K14K21K28K35KSE +/- 8.29, N = 3SE +/- 67.42, N = 332181320911. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20K40K60K80K100KSE +/- 619.05, N = 11SE +/- 2332.92, N = 1582562790261. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt20K40K60K80K100KSE +/- 1383.85, N = 3SE +/- 738.30, N = 121035601025731. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Intel Core i7-12700K12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt9K18K27K36K45KSE +/- 229.21, N = 3SE +/- 204.18, N = 343713397611. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1632486480SE +/- 0.17, N = 3SE +/- 0.51, N = 366.1069.701. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1122334455SE +/- 0.06, N = 3SE +/- 0.22, N = 345.7948.611. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xtIntel Core i7-12700K1632486480SE +/- 0.44, N = 3SE +/- 0.16, N = 372.3172.481. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline


Phoronix Test Suite v10.8.5