12700k HPC+OpenCL AVX512 performance profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112125-TJ-12700KHPC62
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Bioinformatics 5 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 5 Tests
C++ Boost Tests 4 Tests
C/C++ Compiler Tests 6 Tests
CPU Massive 13 Tests
Creator Workloads 4 Tests
HPC - High Performance Computing 32 Tests
LAPACK (Linear Algebra Pack) Tests 3 Tests
Linear Algebra 3 Tests
Machine Learning 9 Tests
Molecular Dynamics 7 Tests
MPI Benchmarks 7 Tests
Multi-Core 9 Tests
NVIDIA GPU Compute 6 Tests
OpenCL 4 Tests
OpenMPI Tests 15 Tests
Programmer / Developer System Benchmarks 3 Tests
Python Tests 3 Tests
Scientific Computing 17 Tests
Server CPU Tests 5 Tests
Single-Threaded 3 Tests
Speech 2 Tests
Telephony 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt
December 09 2021
  10 Hours, 32 Minutes
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt
December 11 2021
  8 Hours, 28 Minutes
Invert Hiding All Results Option
  9 Hours, 30 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


12700k HPC+OpenCL AVX512 performance profiling Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite. ,,"12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt","12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt" Processor,,Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads),Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads) Motherboard,,MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS),MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) Chipset,,Intel Device 7aa7,Intel Device 7aa7 Memory,,32GB,32GB Disk,,500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro,500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 300GB Western Digital WD3000GLFS-0 + 128GB HP SSD S700 Pro Graphics,,Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz),Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz) Audio,,Realtek ALC897,Realtek ALC897 Monitor,,LG HDR WQHD,LG HDR WQHD Network,,Intel I225-V,Intel I225-V OS,,Pop 21.04,Pop 21.04 Kernel,,5.15.5-76051505-generic (x86_64),5.15.5-76051505-generic (x86_64) Desktop,,GNOME Shell 3.38.4,GNOME Shell 3.38.4 Display Server,,X Server 1.20.11,X Server 1.20.11 OpenGL,,4.6 Mesa 21.2.2 (LLVM 12.0.0),4.6 Mesa 21.2.2 (LLVM 12.0.0) OpenCL,,OpenCL 2.2 AMD-APP (3361.0),OpenCL 2.2 AMD-APP (3361.0) Vulkan,,1.2.185,1.2.185 Compiler,,GCC 11.1.0,GCC 11.1.0 File-System,,ext4,ext4 Screen Resolution,,3440x1440,3440x1440 ,,"12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt","12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt" "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: S3D (GFLOPS)",HIB,125.078,125.070 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Triad (GB/s)",HIB,12.5997,12.3131 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: FFT SP (GFLOPS)",HIB,680.883,682.017 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: MD5 Hash (GHash/s)",HIB,9.3041,9.3179 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Reduction (GB/s)",HIB,254.126,254.407 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: GEMM SGEMM_N (GFLOPS)",HIB,1841.75,1859.40 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS)",HIB,8376637,9031599 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Download (GB/s)",HIB,20.1487,19.9782 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Bus Speed Readback (GB/s)",HIB,20.3871,21.0509 "SHOC Scalable HeterOgeneous Computing - Target: OpenCL - Benchmark: Texture Read Bandwidth (GB/s)",HIB,349.340,349.012 "cl-mem - Benchmark: Copy (GB/s)",HIB,198.4,195.2 "cl-mem - Benchmark: Read (GB/s)",HIB,263.6,261.2 "cl-mem - Benchmark: Write (GB/s)",HIB,255.3,248.4 "HPL Linpack - (GFLOPS)",HIB,97.554,100.593 "LeelaChessZero - Backend: BLAS (Nodes/s)",HIB,906,959 "Parboil - Test: OpenMP LBM (sec)",LIB,114.068817,114.096662 "Parboil - Test: OpenMP CUTCP (sec)",LIB,3.177141,3.183109 "Parboil - Test: OpenMP Stencil (sec)",LIB,15.011457,14.980375 "Parboil - Test: OpenMP MRI Gridding (sec)",LIB,49.150711,48.567082 "miniFE - Problem Size: Small (CG Mflops)",HIB,6411.43,6407.12 "CP2K Molecular Dynamics - Input: Fayalite-FIST (sec)",LIB,398.345,372.744 "NAMD - ATPase Simulation - 327,506 Atoms (days/ns)",LIB,1.16249,1.17871 "Algebraic Multi-Grid Benchmark - (Figure Of Merit)",HIB,303975100,302617633 "FFTW - Build: Stock - Size: 1D FFT Size 32 (Mflops)",HIB,22774,22777 "FFTW - Build: Stock - Size: 2D FFT Size 32 (Mflops)",HIB,22948,22827 "FFTW - Build: Stock - Size: 1D FFT Size 4096 (Mflops)",HIB,18293,18502 "FFTW - Build: Stock - Size: 2D FFT Size 4096 (Mflops)",HIB,13770,14020 "FFTW - Build: Float + SSE - Size: 1D FFT Size 32 (Mflops)",HIB,32180,31560 "FFTW - Build: Float + SSE - Size: 2D FFT Size 32 (Mflops)",HIB,80496,82515 "FFTW - Build: Float + SSE - Size: 1D FFT Size 4096 (Mflops)",HIB,103960,104417 "FFTW - Build: Float + SSE - Size: 2D FFT Size 4096 (Mflops)",HIB,43900,42935 "Pennant - Test: sedovbig (Hydro Cycle Time - sec)",LIB,69.89434,67.77358 "Pennant - Test: leblancbig (Hydro Cycle Time - sec)",LIB,49.93021,47.38740 "Timed MrBayes Analysis - Primate Phylogeny Analysis (sec)",LIB,73.794,74.339 "QMCPACK - Input: simple-H2O (Execution Time - sec)",LIB,17.950,18.126 "Timed HMMer Search - Pfam Database Search (sec)",LIB,82.482,82.610 "Timed MAFFT Alignment - Multiple Sequence Alignment - LSU RNA (sec)",LIB,7.703,7.523 "OpenFOAM - Input: Motorbike 30M (sec)",LIB,137.71,135.71 "OpenFOAM - Input: Motorbike 60M (sec)",LIB,867.20,864.00 "RELION - Test: Basic - Device: CPU (sec)",LIB,1684.702,1656.713 "LULESH - (z/s)",HIB,6872.8297,6878.6229 "ArrayFire - Test: BLAS CPU (GFLOPS)",HIB,1205.09,1207.87 "ACES DGEMM - Sustained Floating-Point Rate (GFLOP/s)",HIB,4.875598,5.016124 "Himeno Benchmark - Poisson Pressure Solver (MFLOPS)",HIB,9471.738369,9554.056801 "oneDNN - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms)",LIB,2.58637,2.57885 "oneDNN - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms)",LIB,8.78268,8.89450 "oneDNN - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,0.685132,0.690873 "oneDNN - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,1.93347,1.92539 "oneDNN - Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,2.44545,2.46474 "oneDNN - Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,3.56837,3.52788 "oneDNN - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms)",LIB,13.3957,13.4115 "oneDNN - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms)",LIB,6.27787,6.60245 "oneDNN - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms)",LIB,4.13169,4.23668 "oneDNN - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,13.2836,13.4080 "oneDNN - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,0.872683,0.878588 "oneDNN - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,1.06856,1.07696 "oneDNN - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms)",LIB,2540.28,2532.45 "oneDNN - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms)",LIB,1388.34,1341.61 "oneDNN - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,2596.32,2560.48 "oneDNN - Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,6.77334,6.34910 "oneDNN - Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,7.43204,6.72455 "oneDNN - Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,4.67789,4.67293 "oneDNN - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,1334.31,1328.42 "oneDNN - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms)",LIB,2.33613,2.33206 "oneDNN - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,2585.15,2578.54 "oneDNN - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,1337.47,1339.83 "oneDNN - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms)",LIB,0.565699,0.550588 "oneDNN - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU (ms)",LIB,1.18583,1.18705 "Numpy Benchmark - (Score)",HIB,618.63,612.18 "DeepSpeech - Acceleration: CPU (sec)",LIB,48.85145,48.96052 "R Benchmark - (sec)",LIB,0.1044,0.1050 "RNNoise - (sec)",LIB,16.509,17.119 "ASKAP - Test: tConvolve MT - Gridding (Million Grid Points/sec)",HIB,1245.64,1248.93 "ASKAP - Test: tConvolve MT - Degridding (Million Grid Points/sec)",HIB,2054.71,2054.38 "ASKAP - Test: tConvolve MPI - Degridding (Mpix/sec)",HIB,4859.18,4889.74 "ASKAP - Test: tConvolve MPI - Gridding (Mpix/sec)",HIB,5046.07,5046.07 "ASKAP - Test: tConvolve OpenMP - Gridding (Million Grid Points/sec)",HIB,1866.30,1906.52 "ASKAP - Test: tConvolve OpenMP - Degridding (Million Grid Points/sec)",HIB,3614.48,3599.35 "ASKAP - Test: Hogbom Clean OpenMP (Iterations/sec)",HIB,267.389,269.303 "Intel MPI Benchmarks - Test: IMB-P2P PingPong (Msg/sec)",HIB,8628982,8513496 "Intel MPI Benchmarks - Test: IMB-MPI1 Exchange (Mbytes/sec)",HIB,15189.76,15023.98 "Intel MPI Benchmarks - Test: IMB-MPI1 Exchange (usec)",LIB,109.01,108.21 "Intel MPI Benchmarks - Test: IMB-MPI1 PingPong (Mbytes/sec)",HIB,10004.46,10294.25 "Intel MPI Benchmarks - Test: IMB-MPI1 Sendrecv (Mbytes/sec)",HIB,12273.54,12471.89 "Intel MPI Benchmarks - Test: IMB-MPI1 Sendrecv (usec)",LIB,53.50,52.66 "GROMACS - Implementation: MPI CPU - Input: water_GMX50_bare (Ns/Day)",HIB,1.180,1.186 "Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: NDT Mapping (Test Cases/min)",HIB,1033.44,1033.74 "Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Points2Image (Test Cases/min)",HIB,35022.145431097,36114.808332258 "Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Euclidean Cluster (Test Cases/min)",HIB,1671.51,1667.43 "TensorFlow Lite - Model: SqueezeNet (us)",LIB,145830,145241 "TensorFlow Lite - Model: Inception V4 (us)",LIB,2080110,2082823 "TensorFlow Lite - Model: NASNet Mobile (us)",LIB,124859,124941 "TensorFlow Lite - Model: Mobilenet Float (us)",LIB,97253.9,97559.5 "TensorFlow Lite - Model: Mobilenet Quant (us)",LIB,98345.3,98287.0 "TensorFlow Lite - Model: Inception ResNet V2 (us)",LIB,1878730,1881663 "Darktable - Test: Boat - Acceleration: OpenCL (sec)",LIB,3.971,4.151 "Darktable - Test: Masskrug - Acceleration: OpenCL (sec)",LIB,3.827,3.818 "Darktable - Test: Server Rack - Acceleration: OpenCL (sec)",LIB,0.133,0.131 "Darktable - Test: Server Room - Acceleration: OpenCL (sec)",LIB,3.012,2.999 "GNU Octave Benchmark - (sec)",LIB,5.080,5.064 "Caffe - Model: AlexNet - Acceleration: CPU - Iterations: 100 (ms)",LIB,23122,23659 "Caffe - Model: AlexNet - Acceleration: CPU - Iterations: 200 (ms)",LIB,46624,47045 "Caffe - Model: AlexNet - Acceleration: CPU - Iterations: 1000 (ms)",LIB,247624,238435 "Caffe - Model: GoogleNet - Acceleration: CPU - Iterations: 100 (ms)",LIB,79674,67852 "Caffe - Model: GoogleNet - Acceleration: CPU - Iterations: 200 (ms)",LIB,157272,136187 "Caffe - Model: GoogleNet - Acceleration: CPU - Iterations: 1000 (ms)",LIB,726541,680177