12700k HPC+OpenCL AVX512 performance profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt:

  Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads), Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS), Chipset: Intel Device 7aa7, Memory: 32GB, Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro, Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz), Audio: Realtek ALC897, Monitor: LG HDR WQHD, Network: Intel I225-V

  OS: Pop 21.04, Kernel: 5.15.5-76051505-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0), OpenCL: OpenCL 2.2 AMD-APP (3361.0), Vulkan: 1.2.185, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 3440x1440

12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt:

  Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads), Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS), Chipset: Intel Device 7aa7, Memory: 32GB, Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 300GB Western Digital WD3000GLFS-0 + 128GB HP SSD S700 Pro, Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz), Audio: Realtek ALC897, Monitor: LG HDR WQHD, Network: Intel I225-V

  OS: Pop 21.04, Kernel: 5.15.5-76051505-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0), OpenCL: OpenCL 2.2 AMD-APP (3361.0), Vulkan: 1.2.185, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 3440x1440

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: S3D
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 125.08 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 125.07 |==============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Triad
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 12.60 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 12.31 |===============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: FFT SP
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 680.88 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 682.02 |==============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: MD5 Hash
GHash/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 9.3041 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9.3179 |==============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 254.13 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 254.41 |==============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: GEMM SGEMM_N
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1841.75 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1859.40 |=============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Max SP Flops
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8376637 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9031599 |=============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Download
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 20.15 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 19.98 |===============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Bus Speed Readback
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 20.39 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 21.05 |===============

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Texture Read Bandwidth
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 349.34 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 349.01 |==============

cl-mem 2017-01-13
Benchmark: Copy
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 198.4 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 195.2 |===============

cl-mem 2017-01-13
Benchmark: Read
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 263.6 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 261.2 |===============

cl-mem 2017-01-13
Benchmark: Write
GB/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 255.3 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 248.4 |===============

HPL Linpack 2.3
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 97.55 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 100.59 |==============

LeelaChessZero 0.28
Backend: BLAS
Nodes Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 906 |================
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 959 |=================

Parboil 2.5
Test: OpenMP LBM
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 114.07 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 114.10 |==============

Parboil 2.5
Test: OpenMP CUTCP
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.177141 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.183109 |============

Parboil 2.5
Test: OpenMP Stencil
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 15.01 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 14.98 |===============

Parboil 2.5
Test: OpenMP MRI Gridding
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 49.15 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 48.57 |===============

miniFE 2.2
Problem Size: Small
CG Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6411.43 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6407.12 |=============

CP2K Molecular Dynamics 8.2
Input: Fayalite-FIST
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 398.35 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 372.74 |=============

NAMD 2.14
ATPase Simulation - 327,506 Atoms
days/ns < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.16249 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.17871 |=============

Algebraic Multi-Grid Benchmark 1.2
Figure Of Merit > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 303975100 |===========
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 302617633 |===========

FFTW 3.3.6
Build: Stock - Size: 1D FFT Size 32
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 22774 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 22777 |===============

FFTW 3.3.6
Build: Stock - Size: 2D FFT Size 32
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 22948 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 22827 |===============

FFTW 3.3.6
Build: Stock - Size: 1D FFT Size 4096
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 18293 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 18502 |===============

FFTW 3.3.6
Build: Stock - Size: 2D FFT Size 4096
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13770 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 14020 |===============

FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 32
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 32180 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 31560 |===============

FFTW 3.3.6
Build: Float + SSE - Size: 2D FFT Size 32
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 80496 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 82515 |===============

FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 4096
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 103960 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 104417 |==============

FFTW 3.3.6
Build: Float + SSE - Size: 2D FFT Size 4096
Mflops > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 43900 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 42935 |===============

Pennant 1.0.1
Test: sedovbig
Hydro Cycle Time - Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 69.89 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 67.77 |===============

Pennant 1.0.1
Test: leblancbig
Hydro Cycle Time - Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 49.93 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 47.39 |==============

Timed MrBayes Analysis 3.2.7
Primate Phylogeny Analysis
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 73.79 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 74.34 |===============

QMCPACK 3.11
Input: simple-H2O
Total Execution Time - Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 17.95 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 18.13 |===============

Timed HMMer Search 3.3.2
Pfam Database Search
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 82.48 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 82.61 |===============

Timed MAFFT Alignment 7.471
Multiple Sequence Alignment - LSU RNA
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 7.703 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 7.523 |===============

OpenFOAM 8
Input: Motorbike 30M
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 137.71 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 135.71 |==============

OpenFOAM 8
Input: Motorbike 60M
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 867.20 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 864.00 |==============

RELION 3.1.1
Test: Basic - Device: CPU
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1684.70 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1656.71 |=============

LULESH 2.0.3
z/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6872.83 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6878.62 |=============

ArrayFire 3.7
Test: BLAS CPU
GFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1205.09 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1207.87 |=============

ACES DGEMM 1.0
Sustained Floating-Point Rate
GFLOP/s > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.875598 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5.016124 |============

Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 9471.74 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9554.06 |=============

oneDNN 2.1.2
Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.58637 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.57885 |=============

oneDNN 2.1.2
Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8.78268 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 8.89450 |=============

oneDNN 2.1.2
Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.685132 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.690873 |============

oneDNN 2.1.2
Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.93347 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.92539 |=============

oneDNN 2.1.2
Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.44545 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.46474 |=============

oneDNN 2.1.2
Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.56837 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.52788 |=============

oneDNN 2.1.2
Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13.40 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 13.41 |===============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6.27787 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.60245 |=============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.13169 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.23668 |=============

oneDNN 2.1.2
Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13.28 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 13.41 |===============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.872683 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.878588 |============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.06856 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.07696 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2540.28 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2532.45 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1388.34 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1341.61 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2596.32 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2560.48 |=============

oneDNN 2.1.2
Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6.77334 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.34910 |============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 7.43204 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.72455 |============

oneDNN 2.1.2
Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.67789 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.67293 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1334.31 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1328.42 |=============

oneDNN 2.1.2
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.33613 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.33206 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2585.15 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2578.54 |=============

oneDNN 2.1.2
Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1337.47 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1339.83 |=============

oneDNN 2.1.2
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.565699 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.550588 |============

oneDNN 2.1.2
Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU
ms < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.18583 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.18705 |=============

Numpy Benchmark
Score > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 618.63 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 612.18 |==============

DeepSpeech 0.6
Acceleration: CPU
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 48.85 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 48.96 |===============

R Benchmark
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.1044 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.1050 |==============

RNNoise 2020-06-28
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 16.51 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 17.12 |===============

ASKAP 1.0
Test: tConvolve MT - Gridding
Million Grid Points Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1245.64 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1248.93 |=============

ASKAP 1.0
Test: tConvolve MT - Degridding
Million Grid Points Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2054.71 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2054.38 |=============

ASKAP 1.0
Test: tConvolve MPI - Degridding
Mpix/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4859.18 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4889.74 |=============

ASKAP 1.0
Test: tConvolve MPI - Gridding
Mpix/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 5046.07 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5046.07 |=============

ASKAP 1.0
Test: tConvolve OpenMP - Gridding
Million Grid Points Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1866.30 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1906.52 |=============

ASKAP 1.0
Test: tConvolve OpenMP - Degridding
Million Grid Points Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3614.48 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3599.35 |=============

ASKAP 1.0
Test: Hogbom Clean OpenMP
Iterations Per Second > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 267.39 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 269.30 |==============

Intel MPI Benchmarks 2019.3
Test: IMB-P2P PingPong
Average Msg/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8628982 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 8513496 |=============

Intel MPI Benchmarks 2019.3
Test: IMB-MPI1 Exchange
Average Mbytes/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 15189.76 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 15023.98 |============

Intel MPI Benchmarks 2019.3
Test: IMB-MPI1 Exchange
Average usec < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 109.01 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 108.21 |==============

Intel MPI Benchmarks 2019.3
Test: IMB-MPI1 PingPong
Average Mbytes/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 10004.46 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 10294.25 |============

Intel MPI Benchmarks 2019.3
Test: IMB-MPI1 Sendrecv
Average Mbytes/sec > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 12273.54 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 12471.89 |============

Intel MPI Benchmarks 2019.3
Test: IMB-MPI1 Sendrecv
Average usec < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 53.50 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 52.66 |===============

GROMACS 2021.2
Implementation: MPI CPU - Input: water_GMX50_bare
Ns Per Day > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.180 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.186 |===============

Darmstadt Automotive Parallel Heterogeneous Suite
Backend: OpenMP - Kernel: NDT Mapping
Test Cases Per Minute > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1033.44 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1033.74 |=============

Darmstadt Automotive Parallel Heterogeneous Suite
Backend: OpenMP - Kernel: Points2Image
Test Cases Per Minute > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 35022.15 |============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 36114.81 |============

Darmstadt Automotive Parallel Heterogeneous Suite
Backend: OpenMP - Kernel: Euclidean Cluster
Test Cases Per Minute > Higher Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1671.51 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1667.43 |=============

TensorFlow Lite 2020-08-23
Model: SqueezeNet
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 145830 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 145241 |==============

TensorFlow Lite 2020-08-23
Model: Inception V4
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2080110 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2082823 |=============

TensorFlow Lite 2020-08-23
Model: NASNet Mobile
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 124859 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 124941 |==============

TensorFlow Lite 2020-08-23
Model: Mobilenet Float
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 97253.9 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 97559.5 |=============

TensorFlow Lite 2020-08-23
Model: Mobilenet Quant
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 98345.3 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 98287.0 |=============

TensorFlow Lite 2020-08-23
Model: Inception ResNet V2
Microseconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1878730 |=============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1881663 |=============

Darktable 3.4.1
Test: Boat - Acceleration: OpenCL
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.971 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.151 |===============

Darktable 3.4.1
Test: Masskrug - Acceleration: OpenCL
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.827 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.818 |===============

Darktable 3.4.1
Test: Server Rack - Acceleration: OpenCL
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.133 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.131 |===============

Darktable 3.4.1
Test: Server Room - Acceleration: OpenCL
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.012 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.999 |===============

GNU Octave Benchmark 6.1.1~hg.2021.01.26
Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 5.080 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5.064 |===============

Caffe 2020-02-13
Model: AlexNet - Acceleration: CPU - Iterations: 100
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 23122 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 23659 |===============

Caffe 2020-02-13
Model: AlexNet - Acceleration: CPU - Iterations: 200
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 46624 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 47045 |===============

Caffe 2020-02-13
Model: AlexNet - Acceleration: CPU - Iterations: 1000
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 247624 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 238435 |=============

Caffe 2020-02-13
Model: GoogleNet - Acceleration: CPU - Iterations: 100
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 79674 |===============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 67852 |=============

Caffe 2020-02-13
Model: GoogleNet - Acceleration: CPU - Iterations: 200
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 157272 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 136187 |============

Caffe 2020-02-13
Model: GoogleNet - Acceleration: CPU - Iterations: 1000
Milli-Seconds < Lower Is Better
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 726541 |==============
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 680177 |=============
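
Because the results mix "Higher Is Better" and "Lower Is Better" metrics, comparing the two builds requires flipping the sign for time-based tests. The following is a minimal sketch of that arithmetic, not part of the Phoronix Test Suite output: the helper `relative_gain` and the `RESULTS` list are hypothetical names introduced for illustration, and the four values are copied from the results above.

```python
def relative_gain(baseline, candidate, higher_is_better):
    """Percent improvement of candidate over baseline.

    A positive value means the candidate build did better, regardless of
    whether the metric is a rate (higher is better) or a time (lower is better).
    """
    if higher_is_better:
        return (candidate - baseline) / baseline * 100.0
    return (baseline - candidate) / baseline * 100.0


# Hypothetical hand-picked subset of the results listed above:
# (test, march=sapphirerapids value, march=native + AVX512 value, higher_is_better)
RESULTS = [
    ("HPL Linpack 2.3 (GFLOPS)", 97.55, 100.59, True),
    ("CP2K 8.2 Fayalite-FIST (Seconds)", 398.35, 372.74, False),
    ("Caffe GoogleNet, 100 iterations (Milli-Seconds)", 79674, 67852, False),
    ("RNNoise 2020-06-28 (Seconds)", 16.51, 17.12, False),
]


def summarize(results):
    """Return (test name, percent gain of the native build) pairs."""
    return [
        (name, relative_gain(base, cand, hib))
        for name, base, cand, hib in results
    ]


if __name__ == "__main__":
    for name, pct in summarize(RESULTS):
        print(f"{name}: {pct:+.2f}% (march=native + AVX512 vs march=sapphirerapids)")
```

A single-run difference of a percent or two may fall within run-to-run variance, so a comparison like this is only a first-pass reading of the numbers.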