12700k-hpc-avx512-performance-profiling
Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112119-TJ-12700KHPC25&grs&rdt .
12700k-hpc-avx512-performance-profiling: system details
Result identifiers: 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt; Intel Core i7-12700K

Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)
Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)
Chipset: Intel Device 7aa7
Memory: 32GB
Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro
Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)
Audio: Realtek ALC897
Monitor: LG HDR WQHD
Network: Intel I225-V
OS: Pop 21.04
Kernel: 5.15.5-76051505-generic (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server 1.20.11
OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0)
OpenCL: OpenCL 2.2 AMD-APP (3361.0)
Vulkan: 1.2.185
Compiler: GCC 11.1.0
File-System: ext4
Screen Resolution: 3440x1440

Kernel Details - Transparent Huge Pages: madvise
Environment Details - CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3
Python Details - Python 2.7.18 + Python 3.9.5
Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
12700k-hpc-avx512-performance-profiling: results overview

Test | 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt | Intel Core i7-12700K
hpcc: EP-DGEMM | 22.76623 | 20.43707
fftw: Float + SSE - 2D FFT Size 4096 | 39761 | 43713
npb: EP.C | 1953.19 | 1831.20
pennant: leblancbig | 45.79142 | 48.61042
minife: Small | 6404.27 | 6040.19
npb: SP.B | 7514.27 | 7125.66
pennant: sedovbig | 66.10368 | 69.70474
npb: MG.C | 16859.63 | 16013.49
namd: ATPase Simulation - 327,506 Atoms | 1.16838 | 1.22675
hpcc: G-Ptrans | 3.89974 | 3.71554
npb: SP.C | 6894.32 | 6568.74
hpcc: G-Ffte | 8.32925 | 7.96832
npb: FT.C | 17792.96 | 17038.59
fftw: Stock - 2D FFT Size 4096 | 13307 | 13768
parboil: OpenMP LBM | 113.197777 | 116.680989
cloverleaf: Lagrangian-Eulerian Hydrodynamics | 127.23 | 130.84
hpcc: EP-STREAM Triad | 5.15031 | 5.01052
cp2k: Fayalite-FIST | 358.719 | 349.608
amg: | 304855700 | 297644900
hpcc: G-Rand Access | 0.05205 | 0.05097
parboil: OpenMP Stencil | 15.368417 | 15.056069
npb: LU.C | 34915.85 | 34212.34
npb: IS.D | 993.71 | 975.62
npb: CG.C | 7783.66 | 7661.18
npb: EP.D | 1799.61 | 1826.04
neat: | 26.600 | 26.981
fftw: Stock - 1D FFT Size 32 | 22769 | 22464
hpcc: G-HPL | 100.89367 | 99.83123
hpcg: | 5.93411 | 5.87438
fftw: Stock - 1D FFT Size 4096 | 18525 | 18343
fftw: Float + SSE - 1D FFT Size 4096 | 102573 | 103560
parboil: OpenMP CUTCP | 3.205751 | 3.177851
npb: BT.C | 22646.01 | 22496.45
fftw: Stock - 2D FFT Size 32 | 23147 | 23031
fftw: Float + SSE - 1D FFT Size 32 | 32181 | 32091
mrbayes: Primate Phylogeny Analysis | 72.307 | 72.476
dolfyn: Computational Fluid Dynamics | 10.948 | 10.960
lczero: BLAS | 930 | 931
hpcc: Max Ping Pong Bandwidth | 31619.931 | 31619.603
ffte: N=256, 1D Complex FFT Routine | | 55882.651616182
fftw: Float + SSE - 2D FFT Size 32 | 79026 | 82562
parboil: OpenMP MRI Gridding | 43.984231 | 48.698861
hpcc: Rand Ring Bandwidth | 3.65957 | 3.20422
hpcc: Rand Ring Latency | 0.24053 | 0.25387
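The overview mixes "More Is Better" and "Fewer Is Better" metrics, so raw differences between the two columns are easy to misread. As a minimal sketch (the helper name `relative_gain` and the choice of three sample rows are illustrative assumptions, not part of the Phoronix Test Suite), the relative advantage of the AVX-512 build over the Intel Core i7-12700K run can be computed like this, using values copied from the overview:

```python
def relative_gain(avx512, baseline, higher_is_better):
    """Percentage advantage of the AVX-512 build over the baseline run.

    A positive result means the AVX-512 build did better, regardless of
    whether the metric is "More Is Better" or "Fewer Is Better".
    (Hypothetical helper for illustration only.)
    """
    if higher_is_better:
        ratio = avx512 / baseline
    else:
        # For time-like metrics a smaller number is the better one.
        ratio = baseline / avx512
    return (ratio - 1.0) * 100.0


# (test, AVX-512 build value, baseline value, higher_is_better)
samples = [
    ("hpcc: EP-DGEMM (GFLOPS)", 22.76623, 20.43707, True),
    ("fftw: Float + SSE - 2D FFT Size 4096 (Mflops)", 39761, 43713, True),
    ("pennant: leblancbig (seconds)", 45.79142, 48.61042, False),
]

for name, avx512, baseline, higher_is_better in samples:
    gain = relative_gain(avx512, baseline, higher_is_better)
    print(f"{name}: {gain:+.1f}%")
```

Under this convention the sign always reads the same way, which makes it straightforward to scan the whole table for regressions such as the FFTW 2D FFT Size 4096 case.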
HPC Challenge 1.5.0 - Test / Class: EP-DGEMM
GFLOPS, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22.77 (SE +/- 0.07, N = 3)
Intel Core i7-12700K: 20.44 (SE +/- 0.41, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 39761 (SE +/- 204.18, N = 3)
Intel Core i7-12700K: 43713 (SE +/- 229.21, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1953.19 (SE +/- 18.12, N = 15)
Intel Core i7-12700K: 1831.20 (SE +/- 28.20, N = 15)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

Pennant 1.0.1 - Test: leblancbig
Hydro Cycle Time - Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 45.79 (SE +/- 0.06, N = 3)
Intel Core i7-12700K: 48.61 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

miniFE 2.2 - Problem Size: Small
CG Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 6404.27 (SE +/- 12.96, N = 3)
Intel Core i7-12700K: 6040.19 (SE +/- 49.96, N = 15)
1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 7514.27 (SE +/- 4.56, N = 3)
Intel Core i7-12700K: 7125.66 (SE +/- 27.67, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

Pennant 1.0.1 - Test: sedovbig
Hydro Cycle Time - Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 66.10 (SE +/- 0.17, N = 3)
Intel Core i7-12700K: 69.70 (SE +/- 0.51, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 16859.63 (SE +/- 10.22, N = 3)
Intel Core i7-12700K: 16013.49 (SE +/- 74.83, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

NAMD 2.14 - ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1.16838 (SE +/- 0.00439, N = 3)
Intel Core i7-12700K: 1.22675 (SE +/- 0.00077, N = 3)

HPC Challenge 1.5.0 - Test / Class: G-Ptrans
GB/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.89974 (SE +/- 0.00899, N = 3)
Intel Core i7-12700K: 3.71554 (SE +/- 0.04763, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

NAS Parallel Benchmarks 3.4 - Test / Class: SP.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 6894.32 (SE +/- 5.69, N = 3)
Intel Core i7-12700K: 6568.74 (SE +/- 13.30, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

HPC Challenge 1.5.0 - Test / Class: G-Ffte
GFLOPS, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 8.32925 (SE +/- 0.01478, N = 3)
Intel Core i7-12700K: 7.96832 (SE +/- 0.05761, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 17792.96 (SE +/- 28.43, N = 3)
Intel Core i7-12700K: 17038.59 (SE +/- 117.45, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 13307 (SE +/- 18.37, N = 3)
Intel Core i7-12700K: 13768 (SE +/- 170.25, N = 4)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil 2.5 - Test: OpenMP LBM
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 113.20 (SE +/- 0.01, N = 3)
Intel Core i7-12700K: 116.68 (SE +/- 0.78, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

CloverLeaf - Lagrangian-Eulerian Hydrodynamics
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 127.23 (SE +/- 0.02, N = 3)
Intel Core i7-12700K: 130.84 (SE +/- 0.16, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

HPC Challenge 1.5.0 - Test / Class: EP-STREAM Triad
GB/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 5.15031 (SE +/- 0.00183, N = 3)
Intel Core i7-12700K: 5.01052 (SE +/- 0.01798, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

CP2K Molecular Dynamics 8.2 - Input: Fayalite-FIST
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 358.72
Intel Core i7-12700K: 349.61

Algebraic Multi-Grid Benchmark 1.2
Figure Of Merit, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 304855700 (SE +/- 27694.10, N = 3)
Intel Core i7-12700K: 297644900 (SE +/- 402307.41, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

HPC Challenge 1.5.0 - Test / Class: G-Random Access
GUP/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 0.05205 (SE +/- 0.00039, N = 3)
Intel Core i7-12700K: 0.05097 (SE +/- 0.00034, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

Parboil 2.5 - Test: OpenMP Stencil
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 15.37 (SE +/- 0.16, N = 3)
Intel Core i7-12700K: 15.06 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 34915.85 (SE +/- 78.61, N = 3)
Intel Core i7-12700K: 34212.34 (SE +/- 86.62, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

NAS Parallel Benchmarks 3.4 - Test / Class: IS.D
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 993.71 (SE +/- 0.34, N = 3)
Intel Core i7-12700K: 975.62 (SE +/- 4.70, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 7783.66 (SE +/- 5.12, N = 3)
Intel Core i7-12700K: 7661.18 (SE +/- 5.60, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 1799.61 (SE +/- 27.45, N = 15)
Intel Core i7-12700K: 1826.04 (SE +/- 26.34, N = 15)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

Nebular Empirical Analysis Tool 2.3
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 26.60 (SE +/- 0.09, N = 3)
Intel Core i7-12700K: 26.98 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 32
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22769 (SE +/- 9.02, N = 3)
Intel Core i7-12700K: 22464 (SE +/- 294.36, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

HPC Challenge 1.5.0 - Test / Class: G-HPL
GFLOPS, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 100.89 (SE +/- 0.26, N = 3)
Intel Core i7-12700K: 99.83 (SE +/- 0.82, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

High Performance Conjugate Gradient 3.1
GFLOP/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 5.93411 (SE +/- 0.00086, N = 3)
Intel Core i7-12700K: 5.87438 (SE +/- 0.01365, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 18525 (SE +/- 112.58, N = 3)
Intel Core i7-12700K: 18343 (SE +/- 171.90, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 4096
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 102573 (SE +/- 738.30, N = 12)
Intel Core i7-12700K: 103560 (SE +/- 1383.85, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil 2.5 - Test: OpenMP CUTCP
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.205751 (SE +/- 0.042404, N = 3)
Intel Core i7-12700K: 3.177851 (SE +/- 0.028089, N = 3)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C
Total Mop/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 22646.01 (SE +/- 19.61, N = 3)
Intel Core i7-12700K: 22496.45 (SE +/- 44.89, N = 3)
1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.1.0

FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 32
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 23147 (SE +/- 120.03, N = 3)
Intel Core i7-12700K: 23031 (SE +/- 150.35, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 32
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 32181 (SE +/- 8.29, N = 3)
Intel Core i7-12700K: 32091 (SE +/- 67.42, N = 3)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 72.31 (SE +/- 0.44, N = 3)
Intel Core i7-12700K: 72.48 (SE +/- 0.16, N = 3)
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline

Dolfyn 0.527 - Computational Fluid Dynamics
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 10.95 (SE +/- 0.02, N = 3)
Intel Core i7-12700K: 10.96 (SE +/- 0.04, N = 3)

LeelaChessZero 0.28 - Backend: BLAS
Nodes Per Second, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 930 (SE +/- 9.13, N = 3)
Intel Core i7-12700K: 931 (SE +/- 8.74, N = 3)
1. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread

HPC Challenge 1.5.0 - Test / Class: Max Ping Pong Bandwidth
MB/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 31619.93 (SE +/- 44.84, N = 3)
Intel Core i7-12700K: 31619.60 (SE +/- 106.25, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

FFTE 7.0 - Test: N=256, 1D Complex FFT Routine
MFLOPS, More Is Better
Intel Core i7-12700K: 55882.65 (SE +/- 339.88, N = 3)
1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 32
Mflops, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 79026 (SE +/- 2332.92, N = 15)
Intel Core i7-12700K: 82562 (SE +/- 619.05, N = 11)
1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm

Parboil 2.5 - Test: OpenMP MRI Gridding
Seconds, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 43.98 (SE +/- 0.50, N = 3)
Intel Core i7-12700K: 48.70 (SE +/- 0.84, N = 15)
1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

HPC Challenge 1.5.0 - Test / Class: Random Ring Bandwidth
GB/s, More Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 3.65957 (SE +/- 0.01551, N = 3)
Intel Core i7-12700K: 3.20422 (SE +/- 0.13946, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

HPC Challenge 1.5.0 - Test / Class: Random Ring Latency
usecs, Fewer Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: 0.24053 (SE +/- 0.00071, N = 3)
Intel Core i7-12700K: 0.25387 (SE +/- 0.01103, N = 3)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops
2. OpenBLAS + Open MPI 4.1.0

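Each result above carries a standard error (SE), and several pairs differ by less than their run-to-run noise. The sketch below is one rough way to gauge that: it expresses the gap between two means in units of the combined standard error. The helper name `gap_in_se_units` and the threshold of 2 are illustrative assumptions, and with N = 3 runs this is only a coarse screen, not a formal significance test.

```python
import math


def gap_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by their combined SE.

    The combined SE assumes the two runs are independent, so the
    variances add: sqrt(se_a**2 + se_b**2).
    (Hypothetical helper for illustration only.)
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# (test, AVX-512 build mean, SE, baseline mean, SE), copied from the results
pairs = [
    ("HPC Challenge EP-DGEMM", 22.77, 0.07, 20.44, 0.41),
    ("HPC Challenge Max Ping Pong Bandwidth", 31619.93, 44.84, 31619.60, 106.25),
    ("LeelaChessZero BLAS", 930, 9.13, 931, 8.74),
]

THRESHOLD = 2.0  # assumed rule of thumb, not a Phoronix Test Suite setting

for name, mean_a, se_a, mean_b, se_b in pairs:
    gap = gap_in_se_units(mean_a, se_a, mean_b, se_b)
    verdict = "likely a real difference" if gap > THRESHOLD else "within noise"
    print(f"{name}: {gap:.2f} combined SEs ({verdict})")
```

Read this way, the EP-DGEMM gap stands well clear of the noise, while the Max Ping Pong Bandwidth and LeelaChessZero results are effectively ties.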
Phoronix Test Suite v10.8.5