12700k-hpc-avx512-performance-profiling Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112119-TJ-12700KHPC25&grr&sro .
12700k-hpc-avx512-performance-profiling Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads) MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) Intel Device 7aa7 32GB 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz) Realtek ALC897 LG HDR WQHD Intel I225-V Pop 21.04 5.15.5-76051505-generic (x86_64) GNOME Shell 3.38.4 X Server 1.20.11 4.6 Mesa 21.2.2 (LLVM 12.0.0) OpenCL 2.2 AMD-APP (3361.0) 1.2.185 GCC 11.1.0 ext4 3440x1440 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3 Python Details - Python 2.7.18 + Python 3.9.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
12700k-hpc-avx512-performance-profiling hpcc: G-HPL npb: EP.D lczero: BLAS npb: SP.C fftw: Float + SSE - 2D FFT Size 4096 hpcg: parboil: OpenMP MRI Gridding minife: Small fftw: Stock - 2D FFT Size 4096 cloverleaf: Lagrangian-Eulerian Hydrodynamics npb: BT.C cp2k: Fayalite-FIST parboil: OpenMP LBM mrbayes: Primate Phylogeny Analysis pennant: sedovbig npb: LU.C namd: ATPase Simulation - 327,506 Atoms npb: IS.D npb: SP.B pennant: leblancbig neat: npb: EP.C npb: FT.C npb: CG.C fftw: Float + SSE - 1D FFT Size 4096 parboil: OpenMP Stencil amg: fftw: Float + SSE - 2D FFT Size 32 npb: MG.C dolfyn: Computational Fluid Dynamics fftw: Stock - 1D FFT Size 4096 fftw: Stock - 1D FFT Size 32 fftw: Stock - 2D FFT Size 32 parboil: OpenMP CUTCP fftw: Float + SSE - 1D FFT Size 32 ffte: N=256, 1D Complex FFT Routine hpcc: Max Ping Pong Bandwidth hpcc: Rand Ring Bandwidth hpcc: Rand Ring Latency hpcc: G-Rand Access hpcc: EP-STREAM Triad hpcc: G-Ptrans hpcc: EP-DGEMM hpcc: G-Ffte 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 100.89367 1799.61 930 6894.32 39761 5.93411 43.984231 6404.27 13307 127.23 22646.01 358.719 113.197777 72.307 66.10368 34915.85 1.16838 993.71 7514.27 45.79142 26.600 1953.19 17792.96 7783.66 102573 15.368417 304855700 79026 16859.63 10.948 18525 22769 23147 3.205751 32181 31619.931 3.65957 0.24053 0.05205 5.15031 3.89974 22.76623 8.32925 99.83123 1826.04 931 6568.74 43713 5.87438 48.698861 6040.19 13768 130.84 22496.45 349.608 116.680989 72.476 69.70474 34212.34 1.22675 975.62 7125.66 48.61042 26.981 1831.20 17038.59 7661.18 103560 15.056069 297644900 82562 16013.49 10.960 18343 22464 23031 3.177851 32091 55882.651616182 31619.603 3.20422 0.25387 0.05097 5.01052 3.71554 20.43707 7.96832 OpenBenchmarking.org
HPC Challenge Test / Class: G-HPL OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-HPL 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 20 40 60 80 100 SE +/- 0.26, N = 3 SE +/- 0.82, N = 3 100.89 99.83 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: EP.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.D 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 400 800 1200 1600 2000 SE +/- 27.45, N = 15 SE +/- 26.34, N = 15 1799.61 1826.04 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.28 Backend: BLAS 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 200 400 600 800 1000 SE +/- 9.13, N = 3 SE +/- 8.74, N = 3 930 931 1. (CXX) g++ options: -flto -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -pthread
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 1500 3000 4500 6000 7500 SE +/- 5.69, N = 3 SE +/- 13.30, N = 3 6894.32 6568.74 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 9K 18K 27K 36K 45K SE +/- 204.18, N = 3 SE +/- 229.21, N = 3 39761 43713 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
High Performance Conjugate Gradient OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 1.3352 2.6704 4.0056 5.3408 6.676 SE +/- 0.00086, N = 3 SE +/- 0.01365, N = 3 5.93411 5.87438 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
Parboil Test: OpenMP MRI Gridding OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP MRI Gridding 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 11 22 33 44 55 SE +/- 0.50, N = 3 SE +/- 0.84, N = 15 43.98 48.70 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 1400 2800 4200 5600 7000 SE +/- 12.96, N = 3 SE +/- 49.96, N = 15 6404.27 6040.19 1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 3K 6K 9K 12K 15K SE +/- 18.37, N = 3 SE +/- 170.25, N = 4 13307 13768 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 30 60 90 120 150 SE +/- 0.02, N = 3 SE +/- 0.16, N = 3 127.23 130.84 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 5K 10K 15K 20K 25K SE +/- 19.61, N = 3 SE +/- 44.89, N = 3 22646.01 22496.45 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
CP2K Molecular Dynamics Input: Fayalite-FIST OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.2 Input: Fayalite-FIST 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 80 160 240 320 400 358.72 349.61
Parboil Test: OpenMP LBM OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP LBM 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 30 60 90 120 150 SE +/- 0.01, N = 3 SE +/- 0.78, N = 3 113.20 116.68 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Timed MrBayes Analysis Primate Phylogeny Analysis OpenBenchmarking.org Seconds, Fewer Is Better Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 16 32 48 64 80 SE +/- 0.44, N = 3 SE +/- 0.16, N = 3 72.31 72.48 1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm -O3 -std=c99 -pedantic -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm -lreadline
Pennant Test: sedovbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: sedovbig 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 16 32 48 64 80 SE +/- 0.17, N = 3 SE +/- 0.51, N = 3 66.10 69.70 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 7K 14K 21K 28K 35K SE +/- 78.61, N = 3 SE +/- 86.62, N = 3 34915.85 34212.34 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.276 0.552 0.828 1.104 1.38 SE +/- 0.00439, N = 3 SE +/- 0.00077, N = 3 1.16838 1.22675
NAS Parallel Benchmarks Test / Class: IS.D OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: IS.D 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 200 400 600 800 1000 SE +/- 0.34, N = 3 SE +/- 4.70, N = 3 993.71 975.62 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 1600 3200 4800 6400 8000 SE +/- 4.56, N = 3 SE +/- 27.67, N = 3 7514.27 7125.66 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Pennant Test: leblancbig OpenBenchmarking.org Hydro Cycle Time - Seconds, Fewer Is Better Pennant 1.0.1 Test: leblancbig 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 11 22 33 44 55 SE +/- 0.06, N = 3 SE +/- 0.22, N = 3 45.79 48.61 1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
Nebular Empirical Analysis Tool OpenBenchmarking.org Seconds, Fewer Is Better Nebular Empirical Analysis Tool 2.3 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 6 12 18 24 30 SE +/- 0.09, N = 3 SE +/- 0.01, N = 3 26.60 26.98 1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 400 800 1200 1600 2000 SE +/- 18.12, N = 15 SE +/- 28.20, N = 15 1953.19 1831.20 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 4K 8K 12K 16K 20K SE +/- 28.43, N = 3 SE +/- 117.45, N = 3 17792.96 17038.59 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 2K 4K 6K 8K 10K SE +/- 5.12, N = 3 SE +/- 5.60, N = 3 7783.66 7661.18 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 20K 40K 60K 80K 100K SE +/- 738.30, N = 12 SE +/- 1383.85, N = 3 102573 103560 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
Parboil Test: OpenMP Stencil OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP Stencil 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 4 8 12 16 20 SE +/- 0.16, N = 3 SE +/- 0.01, N = 3 15.37 15.06 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
Algebraic Multi-Grid Benchmark OpenBenchmarking.org Figure Of Merit, More Is Better Algebraic Multi-Grid Benchmark 1.2 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 70M 140M 210M 280M 350M SE +/- 27694.10, N = 3 SE +/- 402307.41, N = 3 304855700 297644900 1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 20K 40K 60K 80K 100K SE +/- 2332.92, N = 15 SE +/- 619.05, N = 11 79026 82562 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 4K 8K 12K 16K 20K SE +/- 10.22, N = 3 SE +/- 74.83, N = 3 16859.63 16013.49 1. (F9X) gfortran options: -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent_core -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.1.0
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 3 6 9 12 15 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 10.95 10.96
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 4K 8K 12K 16K 20K SE +/- 112.58, N = 3 SE +/- 171.90, N = 3 18525 18343 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 5K 10K 15K 20K 25K SE +/- 9.02, N = 3 SE +/- 294.36, N = 3 22769 22464 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 5K 10K 15K 20K 25K SE +/- 120.03, N = 3 SE +/- 150.35, N = 3 23147 23031 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
Parboil Test: OpenMP CUTCP OpenBenchmarking.org Seconds, Fewer Is Better Parboil 2.5 Test: OpenMP CUTCP 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.7213 1.4426 2.1639 2.8852 3.6065 SE +/- 0.042404, N = 3 SE +/- 0.028089, N = 3 3.205751 3.177851 1. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 7K 14K 21K 28K 35K SE +/- 8.29, N = 3 SE +/- 67.42, N = 3 32181 32091 1. (CC) gcc options: -pthread -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -lm
FFTE Test: N=256, 1D Complex FFT Routine OpenBenchmarking.org MFLOPS, More Is Better FFTE 7.0 Test: N=256, 1D Complex FFT Routine Intel Core i7-12700K 12K 24K 36K 48K 60K SE +/- 339.88, N = 3 55882.65 1. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
HPC Challenge Test / Class: Max Ping Pong Bandwidth OpenBenchmarking.org MB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Max Ping Pong Bandwidth 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 7K 14K 21K 28K 35K SE +/- 44.84, N = 3 SE +/- 106.25, N = 3 31619.93 31619.60 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: Random Ring Bandwidth OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Bandwidth 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.8234 1.6468 2.4702 3.2936 4.117 SE +/- 0.01551, N = 3 SE +/- 0.13946, N = 3 3.65957 3.20422 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: Random Ring Latency OpenBenchmarking.org usecs, Fewer Is Better HPC Challenge 1.5.0 Test / Class: Random Ring Latency 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.0571 0.1142 0.1713 0.2284 0.2855 SE +/- 0.00071, N = 3 SE +/- 0.01103, N = 3 0.24053 0.25387 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: G-Random Access OpenBenchmarking.org GUP/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Random Access 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.0117 0.0234 0.0351 0.0468 0.0585 SE +/- 0.00039, N = 3 SE +/- 0.00034, N = 3 0.05205 0.05097 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: EP-STREAM Triad OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: EP-STREAM Triad 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 1.1588 2.3176 3.4764 4.6352 5.794 SE +/- 0.00183, N = 3 SE +/- 0.01798, N = 3 5.15031 5.01052 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: G-Ptrans OpenBenchmarking.org GB/s, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ptrans 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 0.8774 1.7548 2.6322 3.5096 4.387 SE +/- 0.00899, N = 3 SE +/- 0.04763, N = 3 3.89974 3.71554 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: EP-DGEMM OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: EP-DGEMM 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.41, N = 3 22.77 20.44 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
HPC Challenge Test / Class: G-Ffte OpenBenchmarking.org GFLOPS, More Is Better HPC Challenge 1.5.0 Test / Class: G-Ffte 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt Intel Core i7-12700K 2 4 6 8 10 SE +/- 0.01478, N = 3 SE +/- 0.05761, N = 3 8.32925 7.96832 1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -O3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -funroll-loops 2. OpenBLAS + Open MPI 4.1.0
Phoronix Test Suite v10.8.5