12700k-hpc-avx512-performance-profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112119-TJ-12700KHPC25&gru&rdt .
Result identifiers: 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt; Intel Core i7-12700K

Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)
Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)
Chipset: Intel Device 7aa7
Memory: 32GB
Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro
Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)
Audio: Realtek ALC897
Monitor: LG HDR WQHD
Network: Intel I225-V
OS: Pop 21.04
Kernel: 5.15.5-76051505-generic (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server 1.20.11
OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0)
OpenCL: OpenCL 2.2 AMD-APP (3361.0)
Vulkan: 1.2.185
Compiler: GCC 11.1.0
File-System: ext4
Screen Resolution: 3440x1440

Kernel Details - Transparent Huge Pages: madvise
Environment Details - CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3
Python Details - Python 2.7.18 + Python 3.9.5
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
Result overview
A = 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt
B = Intel Core i7-12700K

Test                                            A               B
minife: Small                                   6404.27         6040.19
amg                                             304855700       297644900
hpcc: G-Ptrans                                  3.89974         3.71554
hpcc: EP-STREAM Triad                           5.15031         5.01052
hpcc: Rand Ring Bandwidth                       3.65957         3.20422
hpcg                                            5.93411         5.87438
hpcc: G-HPL                                     100.89367       99.83123
hpcc: G-Ffte                                    8.32925         7.96832
hpcc: EP-DGEMM                                  22.76623        20.43707
hpcc: G-Rand Access                             0.05205         0.05097
hpcc: Max Ping Pong Bandwidth                   31619.931       31619.603
ffte: N=256, 1D Complex FFT Routine             -               55882.651616182
fftw: Stock - 1D FFT Size 32                    22769           22464
fftw: Stock - 2D FFT Size 32                    23147           23031
fftw: Stock - 1D FFT Size 4096                  18525           18343
fftw: Stock - 2D FFT Size 4096                  13307           13768
fftw: Float + SSE - 1D FFT Size 32              32181           32091
fftw: Float + SSE - 2D FFT Size 32              79026           82562
fftw: Float + SSE - 1D FFT Size 4096            102573          103560
fftw: Float + SSE - 2D FFT Size 4096            39761           43713
lczero: BLAS                                    930             931
npb: BT.C                                       22646.01        22496.45
npb: CG.C                                       7783.66         7661.18
npb: EP.C                                       1953.19         1831.20
npb: EP.D                                       1799.61         1826.04
npb: FT.C                                       17792.96        17038.59
npb: IS.D                                       993.71          975.62
npb: LU.C                                       34915.85        34212.34
npb: MG.C                                       16859.63        16013.49
npb: SP.B                                       7514.27         7125.66
npb: SP.C                                       6894.32         6568.74
namd: ATPase Simulation - 327,506 Atoms         1.16838         1.22675
pennant: sedovbig                               66.10368        69.70474
pennant: leblancbig                             45.79142        48.61042
parboil: OpenMP LBM                             113.197777      116.680989
parboil: OpenMP CUTCP                           3.205751        3.177851
parboil: OpenMP Stencil                         15.368417       15.056069
parboil: OpenMP MRI Gridding                    43.984231       48.698861
cloverleaf: Lagrangian-Eulerian Hydrodynamics   127.23          130.84
cp2k: Fayalite-FIST                             358.719         349.608
dolfyn: Computational Fluid Dynamics            10.948          10.960
neat                                            26.600          26.981
mrbayes: Primate Phylogeny Analysis             72.307          72.476
hpcc: Rand Ring Latency                         0.24053         0.25387
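The two result columns above can be compared as a relative difference. The following is a minimal sketch, not part of the Phoronix Test Suite: the function name `relative_gain` and its `higher_is_better` flag are assumptions made for illustration. It uses the miniFE and NAMD values from the overview and inverts the ratio for tests reported as Fewer Is Better.

```python
def relative_gain(first: float, second: float, higher_is_better: bool = True) -> float:
    """Return how much better `first` is than `second`, in percent.

    Positive means `first` wins; negative means `second` wins.
    `relative_gain` is a hypothetical helper, not a Phoronix Test Suite API.
    """
    if first <= 0 or second <= 0:
        raise ValueError("benchmark results must be positive")
    # For "Fewer Is Better" metrics a smaller value wins, so invert the ratio.
    ratio = first / second if higher_is_better else second / first
    return (ratio - 1.0) * 100.0


# Values copied from the result overview (first identifier vs. second identifier).
minife_gain = relative_gain(6404.27, 6040.19)  # CG Mflops, More Is Better
namd_gain = relative_gain(1.16838, 1.22675, higher_is_better=False)  # days/ns, Fewer Is Better

print(f"miniFE Small: {minife_gain:+.2f}%")
print(f"NAMD ATPase:  {namd_gain:+.2f}%")
```

Keeping the direction of the metric as an explicit argument avoids silently misreading timing results, where the smaller number is the better one.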
miniFE 2.2, Problem Size: Small (CG Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 6404.27 (SE +/- 12.96, N = 3)
  Intel Core i7-12700K: 6040.19 (SE +/- 49.96, N = 15)
  1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 304855700 (SE +/- 27694.10, N = 3)
  Intel Core i7-12700K: 297644900 (SE +/- 402307.41, N = 3)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
HPC Challenge 1.5.0, Test / Class: G-Ptrans (GB/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.89974 (SE +/- 0.00899, N = 3)
  Intel Core i7-12700K: 3.71554 (SE +/- 0.04763, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: EP-STREAM Triad (GB/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 5.15031 (SE +/- 0.00183, N = 3)
  Intel Core i7-12700K: 5.01052 (SE +/- 0.01798, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: Random Ring Bandwidth (GB/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.65957 (SE +/- 0.01551, N = 3)
  Intel Core i7-12700K: 3.20422 (SE +/- 0.13946, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 5.93411 (SE +/- 0.00086, N = 3)
  Intel Core i7-12700K: 5.87438 (SE +/- 0.01365, N = 3)
  1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
HPC Challenge 1.5.0, Test / Class: G-HPL (GFLOPS, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 100.89 (SE +/- 0.26, N = 3)
  Intel Core i7-12700K: 99.83 (SE +/- 0.82, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: G-Ffte (GFLOPS, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 8.32925 (SE +/- 0.01478, N = 3)
  Intel Core i7-12700K: 7.96832 (SE +/- 0.05761, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: EP-DGEMM (GFLOPS, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22.77 (SE +/- 0.07, N = 3)
  Intel Core i7-12700K: 20.44 (SE +/- 0.41, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: G-Random Access (GUP/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 0.05205 (SE +/- 0.00039, N = 3)
  Intel Core i7-12700K: 0.05097 (SE +/- 0.00034, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
HPC Challenge 1.5.0, Test / Class: Max Ping Pong Bandwidth (MB/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 31619.93 (SE +/- 44.84, N = 3)
  Intel Core i7-12700K: 31619.60 (SE +/- 106.25, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
FFTE 7.0, Test: N=256, 1D Complex FFT Routine (MFLOPS, More Is Better)
  Intel Core i7-12700K: 55882.65 (SE +/- 339.88, N = 3)
  1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 32 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22769 (SE +/- 9.02, N = 3)
  Intel Core i7-12700K: 22464 (SE +/- 294.36, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 32 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 23147 (SE +/- 120.03, N = 3)
  Intel Core i7-12700K: 23031 (SE +/- 150.35, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Stock - Size: 1D FFT Size 4096 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 18525 (SE +/- 112.58, N = 3)
  Intel Core i7-12700K: 18343 (SE +/- 171.90, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 13307 (SE +/- 18.37, N = 3)
  Intel Core i7-12700K: 13768 (SE +/- 170.25, N = 4)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Float + SSE - Size: 1D FFT Size 32 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 32181 (SE +/- 8.29, N = 3)
  Intel Core i7-12700K: 32091 (SE +/- 67.42, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 32 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 79026 (SE +/- 2332.92, N = 15)
  Intel Core i7-12700K: 82562 (SE +/- 619.05, N = 11)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Float + SSE - Size: 1D FFT Size 4096 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 102573 (SE +/- 738.30, N = 12)
  Intel Core i7-12700K: 103560 (SE +/- 1383.85, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW 3.3.6, Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 39761 (SE +/- 204.18, N = 3)
  Intel Core i7-12700K: 43713 (SE +/- 229.21, N = 3)
  1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
LeelaChessZero 0.28, Backend: BLAS (Nodes Per Second, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 930 (SE +/- 9.13, N = 3)
  Intel Core i7-12700K: 931 (SE +/- 8.74, N = 3)
  1. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread
NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22646.01 (SE +/- 19.61, N = 3)
  Intel Core i7-12700K: 22496.45 (SE +/- 44.89, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 7783.66 (SE +/- 5.12, N = 3)
  Intel Core i7-12700K: 7661.18 (SE +/- 5.60, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: EP.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1953.19 (SE +/- 18.12, N = 15)
  Intel Core i7-12700K: 1831.20 (SE +/- 28.20, N = 15)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1799.61 (SE +/- 27.45, N = 15)
  Intel Core i7-12700K: 1826.04 (SE +/- 26.34, N = 15)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 17792.96 (SE +/- 28.43, N = 3)
  Intel Core i7-12700K: 17038.59 (SE +/- 117.45, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 993.71 (SE +/- 0.34, N = 3)
  Intel Core i7-12700K: 975.62 (SE +/- 4.70, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 34915.85 (SE +/- 78.61, N = 3)
  Intel Core i7-12700K: 34212.34 (SE +/- 86.62, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 16859.63 (SE +/- 10.22, N = 3)
  Intel Core i7-12700K: 16013.49 (SE +/- 74.83, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: SP.B (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 7514.27 (SE +/- 4.56, N = 3)
  Intel Core i7-12700K: 7125.66 (SE +/- 27.67, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 6894.32 (SE +/- 5.69, N = 3)
  Intel Core i7-12700K: 6568.74 (SE +/- 13.30, N = 3)
  1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
  2. Open MPI 4.1.0
NAMD 2.14, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1.16838 (SE +/- 0.00439, N = 3)
  Intel Core i7-12700K: 1.22675 (SE +/- 0.00077, N = 3)
Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 66.10 (SE +/- 0.17, N = 3)
  Intel Core i7-12700K: 69.70 (SE +/- 0.51, N = 3)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 45.79 (SE +/- 0.06, N = 3)
  Intel Core i7-12700K: 48.61 (SE +/- 0.22, N = 3)
  1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Parboil 2.5, Test: OpenMP LBM (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 113.20 (SE +/- 0.01, N = 3)
  Intel Core i7-12700K: 116.68 (SE +/- 0.78, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP CUTCP (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.205751 (SE +/- 0.042404, N = 3)
  Intel Core i7-12700K: 3.177851 (SE +/- 0.028089, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP Stencil (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 15.37 (SE +/- 0.16, N = 3)
  Intel Core i7-12700K: 15.06 (SE +/- 0.01, N = 3)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Parboil 2.5, Test: OpenMP MRI Gridding (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 43.98 (SE +/- 0.50, N = 3)
  Intel Core i7-12700K: 48.70 (SE +/- 0.84, N = 15)
  1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
CloverLeaf, Lagrangian-Eulerian Hydrodynamics (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 127.23 (SE +/- 0.02, N = 3)
  Intel Core i7-12700K: 130.84 (SE +/- 0.16, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CP2K Molecular Dynamics 8.2, Input: Fayalite-FIST (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 358.72
  Intel Core i7-12700K: 349.61
Dolfyn 0.527, Computational Fluid Dynamics (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 10.95 (SE +/- 0.02, N = 3)
  Intel Core i7-12700K: 10.96 (SE +/- 0.04, N = 3)
Nebular Empirical Analysis Tool 2.3 (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 26.60 (SE +/- 0.09, N = 3)
  Intel Core i7-12700K: 26.98 (SE +/- 0.01, N = 3)
  1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 72.31 (SE +/- 0.44, N = 3)
  Intel Core i7-12700K: 72.48 (SE +/- 0.16, N = 3)
  1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline
HPC Challenge 1.5.0, Test / Class: Random Ring Latency (usecs, Fewer Is Better)
  12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 0.24053 (SE +/- 0.00071, N = 3)
  Intel Core i7-12700K: 0.25387 (SE +/- 0.01103, N = 3)
  1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
  2. OpenBLAS + Open MPI 4.1.0
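Most results above carry a standard error (SE) and a run count (N). As a rough sketch, and not something the Phoronix Test Suite itself reports, the gap between two means can be expressed in units of their combined standard error, sqrt(SE_a^2 + SE_b^2), to gauge whether a difference exceeds run-to-run noise. The function name `gap_in_se` is hypothetical, and with N as small as 3 the ratio is only a rule of thumb, not a significance test.

```python
import math


def gap_in_se(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of two means divided by their combined standard error.

    `gap_in_se` is a hypothetical helper for illustration only.
    """
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0:
        raise ValueError("combined standard error is zero")
    return abs(mean_a - mean_b) / combined


# Values copied from the LeelaChessZero (BLAS) and NAS Parallel Benchmarks SP.C results.
lczero = gap_in_se(930, 9.13, 931, 8.74)
sp_c = gap_in_se(6894.32, 5.69, 6568.74, 13.30)

print(f"LeelaChessZero BLAS: {lczero:.2f} combined SEs apart")
print(f"NPB SP.C:            {sp_c:.2f} combined SEs apart")
```

A ratio well below 1, as for LeelaChessZero, suggests the two configurations cannot be told apart in that test, while a ratio far above 1, as for SP.C, points to a repeatable difference.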
Phoronix Test Suite v10.8.5