12700k HPC+OpenCL AVX512 performance profiling

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112125-TJ-12700KHPC62
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Bioinformatics 5 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 5 Tests
C++ Boost Tests 4 Tests
C/C++ Compiler Tests 6 Tests
CPU Massive 13 Tests
Creator Workloads 4 Tests
HPC - High Performance Computing 32 Tests
LAPACK (Linear Algebra Pack) Tests 3 Tests
Linear Algebra 3 Tests
Machine Learning 9 Tests
Molecular Dynamics 7 Tests
MPI Benchmarks 7 Tests
Multi-Core 9 Tests
NVIDIA GPU Compute 6 Tests
OpenCL 4 Tests
OpenMPI Tests 15 Tests
Programmer / Developer System Benchmarks 3 Tests
Python Tests 3 Tests
Scientific Computing 17 Tests
Server CPU Tests 5 Tests
Single-Threaded 3 Tests
Speech 2 Tests
Telephony 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt
December 09 2021
  10 Hours, 32 Minutes
12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt
December 11 2021
  8 Hours, 28 Minutes
Invert Hiding All Results Option
  9 Hours, 30 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


12700k HPC+OpenCL AVX512 performance profiling Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite. 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt: Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads), Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS), Chipset: Intel Device 7aa7, Memory: 32GB, Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro, Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz), Audio: Realtek ALC897, Monitor: LG HDR WQHD, Network: Intel I225-V OS: Pop 21.04, Kernel: 5.15.5-76051505-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0), OpenCL: OpenCL 2.2 AMD-APP (3361.0), Vulkan: 1.2.185, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 3440x1440 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt: Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads), Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS), Chipset: Intel Device 7aa7, Memory: 32GB, Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 300GB Western Digital WD3000GLFS-0 + 128GB HP SSD S700 Pro, Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz), Audio: Realtek ALC897, Monitor: LG HDR WQHD, Network: Intel I225-V OS: Pop 21.04, Kernel: 5.15.5-76051505-generic (x86_64), Desktop: GNOME Shell 3.38.4, Display Server: X Server 1.20.11, OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0), OpenCL: OpenCL 2.2 AMD-APP (3361.0), Vulkan: 1.2.185, Compiler: GCC 11.1.0, File-System: ext4, Screen Resolution: 3440x1440 Darktable 3.4.1 Test: Boat - Acceleration: OpenCL Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.971 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.151 |=============== Darktable 3.4.1 Test: Masskrug - Acceleration: OpenCL Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.827 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.818 |=============== Darktable 3.4.1 Test: Server Rack - Acceleration: OpenCL Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.133 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.131 |=============== Darktable 3.4.1 Test: Server Room - Acceleration: OpenCL Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.012 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.999 |=============== Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping Test Cases Per Minute > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1033.44 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1033.74 |============= Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image Test Cases Per Minute > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 35022.15 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 36114.81 |============ Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Euclidean Cluster Test Cases Per Minute > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1671.51 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1667.43 |============= HPL Linpack 2.3 GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 97.55 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 100.59 |============== RELION 3.1.1 Test: Basic - Device: CPU Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1684.70 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1656.71 |============= FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 22774 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 22777 |=============== FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 22948 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 22827 |=============== FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 18293 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 18502 |=============== FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13770 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 14020 |=============== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 32180 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 31560 |=============== FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 80496 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 82515 |=============== FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 103960 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 104417 |============== FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 43900 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 42935 |=============== Timed HMMer Search 3.3.2 Pfam Database Search Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 82.48 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 82.61 |=============== Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 7.703 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 7.523 |=============== Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 73.79 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 74.34 |=============== Himeno Benchmark 3.0 Poisson Pressure Solver MFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 9471.74 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9554.06 |============= LeelaChessZero 0.28 Backend: BLAS Nodes Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 906 |================ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 959 |================= R Benchmark Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.1044 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.1050 |============== Numpy Benchmark Score > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 618.63 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 612.18 |============== GNU Octave Benchmark 6.1.1~hg.2021.01.26 Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 5.080 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5.064 |=============== DeepSpeech 0.6 Acceleration: CPU Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 48.85 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 48.96 |=============== RNNoise 2020-06-28 Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 16.51 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 17.12 |=============== TensorFlow Lite 2020-08-23 Model: SqueezeNet Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 145830 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 145241 |============== TensorFlow Lite 2020-08-23 Model: Inception V4 Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2080110 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2082823 |============= TensorFlow Lite 2020-08-23 Model: NASNet Mobile Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 124859 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 124941 |============== TensorFlow Lite 2020-08-23 Model: Mobilenet Float Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 97253.9 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 97559.5 |============= TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 98345.3 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 98287.0 |============= TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Microseconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1878730 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1881663 |============= Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 100 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 23122 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 23659 |=============== Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 200 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 46624 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 47045 |=============== Caffe 2020-02-13 Model: AlexNet - Acceleration: CPU - Iterations: 1000 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 247624 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 238435 |============= Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 100 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 79674 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 67852 |============= Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 200 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 157272 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 136187 |============ Caffe 2020-02-13 Model: GoogleNet - Acceleration: CPU - Iterations: 1000 Milli-Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 726541 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 680177 |============= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 125.08 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 125.07 |============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 12.60 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 12.31 |=============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 680.88 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 682.02 |============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: MD5 Hash GHash/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 9.3041 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9.3179 |============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 254.13 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 254.41 |============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1841.75 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1859.40 |============= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8376637 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 9031599 |============= SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 20.15 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 19.98 |=============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 20.39 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 21.05 |=============== SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Texture Read Bandwidth GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 349.34 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 349.01 |============== GROMACS 2021.2 Implementation: MPI CPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.180 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.186 |=============== Parboil 2.5 Test: OpenMP LBM Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 114.07 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 114.10 |============== Parboil 2.5 Test: OpenMP CUTCP Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.177141 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.183109 |============ Parboil 2.5 Test: OpenMP Stencil Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 15.01 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 14.98 |=============== Parboil 2.5 Test: OpenMP MRI Gridding Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 49.15 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 48.57 |=============== NAMD 2.14 ATPase Simulation - 327,506 Atoms days/ns < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.16249 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.17871 |============= oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.58637 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.57885 |============= oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8.78268 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 8.89450 |============= oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.685132 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.690873 |============ oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.93347 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.92539 |============= oneDNN 2.1.2 Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.44545 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.46474 |============= oneDNN 2.1.2 Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3.56837 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3.52788 |============= oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13.40 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 13.41 |=============== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6.27787 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.60245 |============= oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.13169 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.23668 |============= oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 13.28 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 13.41 |=============== oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.872683 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.878588 |============ oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.06856 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.07696 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2540.28 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2532.45 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1388.34 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1341.61 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2596.32 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2560.48 |============= oneDNN 2.1.2 Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6.77334 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.34910 |============ oneDNN 2.1.2 Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 7.43204 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6.72455 |============ oneDNN 2.1.2 Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.67789 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4.67293 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1334.31 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1328.42 |============= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2.33613 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2.33206 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2585.15 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2578.54 |============= oneDNN 2.1.2 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1337.47 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1339.83 |============= oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 0.565699 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 0.550588 |============ oneDNN 2.1.2 Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU ms < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1.18583 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1.18705 |============= ASKAP 1.0 Test: tConvolve MT - Gridding Million Grid Points Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1245.64 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1248.93 |============= ASKAP 1.0 Test: tConvolve MT - Degridding Million Grid Points Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 2054.71 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 2054.38 |============= ASKAP 1.0 Test: tConvolve MPI - Degridding Mpix/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4859.18 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 4889.74 |============= ASKAP 1.0 Test: tConvolve MPI - Gridding Mpix/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 5046.07 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5046.07 |============= ASKAP 1.0 Test: tConvolve OpenMP - Gridding Million Grid Points Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1866.30 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1906.52 |============= ASKAP 1.0 Test: tConvolve OpenMP - Degridding Million Grid Points Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 3614.48 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 3599.35 |============= ASKAP 1.0 Test: Hogbom Clean OpenMP Iterations Per Second > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 267.39 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 269.30 |============== Intel MPI Benchmarks 2019.3 Test: IMB-P2P PingPong Average Msg/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 8628982 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 8513496 |============= Intel MPI Benchmarks 2019.3 Test: IMB-MPI1 Exchange Average Mbytes/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 15189.76 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 15023.98 |============ Intel MPI Benchmarks 2019.3 Test: IMB-MPI1 Exchange Average usec < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 109.01 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 108.21 |============== Intel MPI Benchmarks 2019.3 Test: IMB-MPI1 PingPong Average Mbytes/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 10004.46 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 10294.25 |============ Intel MPI Benchmarks 2019.3 Test: IMB-MPI1 Sendrecv Average Mbytes/sec > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 12273.54 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 12471.89 |============ Intel MPI Benchmarks 2019.3 Test: IMB-MPI1 Sendrecv Average usec < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 53.50 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 52.66 |=============== ArrayFire 3.7 Test: BLAS CPU GFLOPS > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 1205.09 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 1207.87 |============= ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 4.875598 |============ 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 5.016124 |============ Pennant 1.0.1 Test: sedovbig Hydro Cycle Time - Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 69.89 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 67.77 |=============== Pennant 1.0.1 Test: leblancbig Hydro Cycle Time - Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 49.93 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 47.39 |============== Algebraic Multi-Grid Benchmark 1.2 Figure Of Merit > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 303975100 |=========== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 302617633 |=========== LULESH 2.0.3 z/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6872.83 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6878.62 |============= OpenFOAM 8 Input: Motorbike 30M Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 137.71 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 135.71 |============== OpenFOAM 8 Input: Motorbike 60M Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 867.20 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 864.00 |============== QMCPACK 3.11 Input: simple-H2O Total Execution Time - Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 17.95 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 18.13 |=============== miniFE 2.2 Problem Size: Small CG Mflops > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 6411.43 |============= 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 6407.12 |============= CP2K Molecular Dynamics 8.2 Input: Fayalite-FIST Seconds < Lower Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 398.35 |============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 372.74 |============= cl-mem 2017-01-13 Benchmark: Copy GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 198.4 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 195.2 |=============== cl-mem 2017-01-13 Benchmark: Read GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 263.6 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 261.2 |=============== cl-mem 2017-01-13 Benchmark: Write GB/s > Higher Is Better 12700k AVX512 march=sapphirerapids gcc 11.1 rx 5600xt .. 255.3 |=============== 12700k AVX512 march=native + AVX512 gcc 11.1 rx 5600xt . 248.4 |===============