HPC benchmark- POSSIBLE BAD DATA
Intel Core i7-12700K testing with an MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112081-TJ-2112076TJ12&sor&grs .
System Details

Configurations tested:
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt

Values listed first in the export:
- Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)
- Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)
- Chipset: Intel Device 7aa7
- Memory: 32GB
- Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro
- Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 (1650/750MHz)
- Audio: Realtek ALC897
- Monitor: LG HDR WQHD
- Network: Intel I225-V
- OS: Pop 21.04
- Kernel: 5.15.5-76051505-generic (x86_64)
- Desktop: GNOME Shell 3.38.4
- Display Server: X Server 1.20.11
- OpenGL: 4.6 Mesa 21.3.0-devel (LLVM 12.0.1)
- OpenCL: OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 2.2 AMD-APP (3361.0)
- Vulkan: 1.2.182
- Compiler: GCC 10.3.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910 + CUDA 10.2
- File-System: ext4
- Screen Resolution: 3440x1440

Further values listed for later configurations, in exported order:
- Processor: Intel Core i7-12700K @ 6.30GHz (12 Cores / 20 Threads)
- Processor: Intel Core i7-12700K @ 6.50GHz (8 Cores / 16 Threads)
- Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)
- OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0)
- OpenCL: OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX
- Compiler: GCC 11.1.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect" CFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect"
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"

Compiler Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt, Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt, and Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt (identical for all three): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: NONE / errors=remount-ro,noatime,rw / Block Size: 4096

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0x15
- Thermald 2.4.3

Graphics Details
- GLAMOR
- BAR1 / Visible vRAM Size: 6128 MB

Python Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.9.5

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected
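The Environment Details differ between configurations only in their compiler flags. As a minimal sketch, the flags one configuration adds over another can be listed by splitting the CFLAGS strings. The helper name `extra_flags` and the variable names are hypothetical and are not part of the Phoronix Test Suite; the flag strings are copied from the Environment Details.

```python
def extra_flags(flags: str, baseline: str) -> list[str]:
    """Return the flags in `flags` that do not occur in `baseline`, in order."""
    baseline_set = set(baseline.split())
    return [flag for flag in flags.split() if flag not in baseline_set]


# CFLAGS values copied from the Environment Details of this result file.
AVX512 = (
    "-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd "
    "-mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 "
    "-mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect"
)
NO_AVX512 = "-O3 -march=native"
SAPPHIRE = "-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"

# Flags the AVX512 run adds on top of the plain -march=native run.
print(extra_flags(AVX512, NO_AVX512))
# Flags the sapphirerapids run uses that the plain run does not.
print(extra_flags(SAPPHIRE, NO_AVX512))
```

This only compares the recorded strings; it says nothing about which instructions the compiler actually emitted for a given benchmark.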
Result Overview

Each configuration's values below are listed in the same order as the tests.

Tests: pyhpc: CPU - PyTorch - 65536 - Equation of State; pyhpc: CPU - PyTorch - 1048576 - Equation of State; pyhpc: CPU - PyTorch - 262144 - Equation of State; pyhpc: CPU - PyTorch - 16384 - Isoneutral Mixing; pyhpc: CPU - PyTorch - 65536 - Isoneutral Mixing; mlpack: scikit_qda; tnn: CPU - MobileNet v2; pyhpc: CPU - PyTorch - 262144 - Isoneutral Mixing; mlpack: scikit_linearridgeregression; pyhpc: CPU - PyTorch - 1048576 - Isoneutral Mixing; scikit-learn:; pyhpc: CPU - PyTorch - 4194304 - Isoneutral Mixing; numpy:; tensorflow-lite: SqueezeNet; tensorflow-lite: Inception V4; deepspeech: CPU; pyhpc: CPU - Aesara - 16384 - Isoneutral Mixing; namd: ATPase Simulation - 327,506 Atoms; pyhpc: CPU - JAX - 4194304 - Equation of State; fftw: Float + SSE - 1D FFT Size 4096; shoc: OpenCL - Reduction; tensorflow-lite: Inception ResNet V2; pyhpc: CPU - JAX - 1048576 - Equation of State; tensorflow-lite: Mobilenet Float; askap: tConvolve OpenMP - Degridding; tensorflow-lite: Mobilenet Quant; askap: tConvolve OpenMP - Gridding; pyhpc: CPU - JAX - 4194304 - Isoneutral Mixing; ecp-candle: P1B2; plaidml: No - Inference - VGG16 - CPU; ncnn: Vulkan GPU - mobilenet; onnx: fcn-resnet101-11 - CPU; ecp-candle: P3B1; minife: Small; ecp-candle: P3B2; shoc: OpenCL - FFT SP; pyhpc: CPU - Numpy - 65536 - Isoneutral Mixing; shoc: OpenCL - S3D; pyhpc: CPU - Numpy - 262144 - Isoneutral Mixing; onnx: yolov4 - CPU; onnx: shufflenet-v2-10 - CPU; pyhpc: CPU - Numba - 262144 - Equation of State; pyhpc: CPU - Numpy - 1048576 - Isoneutral Mixing; himeno: Poisson Pressure Solver; mt-dgemm: Sustained Floating-Point Rate; pyhpc: CPU - Aesara - 262144 - Equation of State; cp2k: Fayalite-FIST; pyhpc: CPU - Numba - 65536 - Isoneutral Mixing; rnnoise:; fftw: Stock - 2D FFT Size 4096; fftw: Float + SSE - 1D FFT Size 32; fftw: Float + SSE - 2D FFT Size 4096; askap: tConvolve MT - Gridding; kripke:; mafft: Multiple Sequence Alignment - LSU RNA; pyhpc: CPU - Aesara - 4194304 - Equation of State; pyhpc: CPU - Aesara - 65536 - Isoneutral Mixing; pyhpc: CPU - Numba - 1048576 - Equation of State; onnx: super-resolution-10 - CPU; tnn: CPU - SqueezeNet v1.1; pyhpc: CPU - Numba - 4194304 - Equation of State; pyhpc: CPU - Aesara - 1048576 - Equation of State; octave-benchmark:; dolfyn: Computational Fluid Dynamics; pyhpc: CPU - Numba - 1048576 - Isoneutral Mixing; tnn: CPU - SqueezeNet v2; pyhpc: CPU - Numba - 4194304 - Isoneutral Mixing; pyhpc: CPU - Numpy - 262144 - Equation of State; pyhpc: CPU - Numpy - 1048576 - Equation of State; fftw: Stock - 2D FFT Size 32; pyhpc: CPU - Aesara - 4194304 - Isoneutral Mixing; askap: tConvolve MT - Degridding; fftw: Stock - 1D FFT Size 4096; pyhpc: CPU - Aesara - 1048576 - Isoneutral Mixing; pyhpc: CPU - Numba - 262144 - Isoneutral Mixing; pyhpc: CPU - Numpy - 4194304 - Equation of State; fftw: Stock - 1D FFT Size 32; pyhpc: CPU - TensorFlow - 4194304 - Equation of State; pyhpc: CPU - Aesara - 262144 - Isoneutral Mixing; pyhpc: CPU - Aesara - 65536 - Equation of State; pyhpc: CPU - Aesara - 16384 - Equation of State; pyhpc: CPU - Numpy - 16384 - Equation of State; pyhpc: CPU - Numba - 65536 - Equation of State; pyhpc: CPU - Numba - 16384 - Equation of State; pyhpc: CPU - JAX - 16384 - Isoneutral Mixing; opencv: DNN - Deep Neural Network; pyhpc: CPU - TensorFlow - 1048576 - Equation of State; pyhpc: CPU - TensorFlow - 262144 - Equation of State; pyhpc: CPU - TensorFlow - 65536 - Equation of State; pyhpc: CPU - PyTorch - 4194304 - Equation of State; pyhpc: CPU - Numpy - 4194304 - Isoneutral Mixing; pyhpc: CPU - Numpy - 65536 - Equation of State; pyhpc: CPU - Numpy - 16384 - Isoneutral Mixing; pyhpc: CPU - Numba - 16384 - Isoneutral Mixing; pyhpc: CPU - JAX - 1048576 - Isoneutral Mixing; pyhpc: CPU - JAX - 262144 - Isoneutral Mixing; pyhpc: CPU - JAX - 262144 - Equation of State; pyhpc: CPU - JAX - 65536 - Isoneutral Mixing; mlpack: scikit_svm; mlpack: scikit_ica; plaidml: No - Inference - ResNet 50 - CPU; tnn: CPU - DenseNet; ncnn: Vulkan GPU - regnety_400m; ncnn: Vulkan GPU - squeezenet_ssd; ncnn: Vulkan GPU - yolov4-tiny; ncnn: Vulkan GPU - resnet50; ncnn: Vulkan GPU - alexnet; ncnn: Vulkan GPU - resnet18; ncnn: Vulkan GPU - vgg16; ncnn: Vulkan GPU - googlenet; ncnn: Vulkan GPU - blazeface; ncnn: Vulkan GPU - efficientnet-b0; ncnn: Vulkan GPU - mnasnet; ncnn: Vulkan GPU - shufflenet-v2; ncnn: Vulkan GPU-v3-v3 - mobilenet-v3; ncnn: Vulkan GPU-v2-v2 - mobilenet-v2; ncnn: CPU - regnety_400m; ncnn: CPU - squeezenet_ssd; ncnn: CPU - yolov4-tiny; ncnn: CPU - resnet50; ncnn: CPU - alexnet; ncnn: CPU - resnet18; ncnn: CPU - vgg16; ncnn: CPU - googlenet; ncnn: CPU - blazeface; ncnn: CPU - efficientnet-b0; ncnn: CPU - mnasnet; ncnn: CPU - shufflenet-v2; ncnn: CPU-v3-v3 - mobilenet-v3; ncnn: CPU-v2-v2 - mobilenet-v2; ncnn: CPU - mobilenet; mnn: inception-v3; mnn: mobilenet-v1-1.0; mnn: MobileNetV2_224; mnn: SqueezeNetV1.0; mnn: resnet-v2-50; mnn: squeezenetv1.1; mnn: mobilenetV3; tensorflow-lite: NASNet Mobile; daphne: OpenMP - Euclidean Cluster; daphne: OpenMP - Points2Image; daphne: OpenCL - Points2Image; daphne: OpenMP - NDT Mapping; askap: Hogbom Clean OpenMP; rbenchmark:; fftw: Float + SSE - 2D FFT Size 32; neat:; cloverleaf: Lagrangian-Eulerian Hydrodynamics; shoc: OpenCL - Bus Speed Readback; shoc: OpenCL - Bus Speed Download; shoc: OpenCL - Max SP Flops; shoc: OpenCL - GEMM SGEMM_N; shoc: OpenCL - Triad

Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 0.021 0.005 0.003 0.013 95.97 224.710 0.052 1.53 0.285 4.883 1.332 565.28 141138 2024680 49.07625 0.004 1.26329 0.030 100155 21.2083 1837950 0.008 95569.3 2812.63 99378.6 1484.83 0.637 31.787 22.29 6.86 74 794.258 5458.44 403.484 87.6173 0.021 12.2422 0.090 469 40688 0.009 0.416 10224.564685 4.723792 0.011 382.83 0.012 17.513 12945 30059 40789 1255.80 76868693 7.915 0.175 0.017 0.035 4662 235.526 0.143 0.044 5.166 11.158 0.202 45.898 0.847 0.053 0.222 22796 1.299 2004.44 18379 0.293 0.043 1.327 22739 0.095 0.061 0.003 0.001 0.002 0.002 0.001 0.002 66637 0.020 0.005 0.002 0.090 1.861 0.013 0.004 0.003 0.134 0.028 0.001 0.011 6.95 12.39 8.52 2719.820 4.28 4.63 8.16 6.64 6.69 3.04 17.58 5.41 1.69 7.72 3.40 2.68 4.02 3.19 5.55 13.41 15.34 16.38 9.29 10.37 37.73 9.00 1.02 3.37 2.12 2.39 2.17 2.38 9.07 23.277 1.749 1.892 3.590 16.694 2.264 1.113 126721 1615.65 32570.157710805 15525.664613677 1008.65 253.548 0.1070 79668 26.980 186.74 31.3126 30.5438 1667232 284.989 15.6282

Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 0.015 0.004 0.003 0.013 96.46 262.514 0.054 1.64 0.280 5.069 1.336 558.72 159253 2213513 54.54775 0.003 1.14364 0.028 80667 17.9097 1801520 0.007 93959.4 3423.09 97487.4 1826.95 0.644 28.405 19.23 7.40 68 918.197 6321.61 401.56 98.1365 0.024 13.6588 0.097 460 42897 0.009 0.454 9116.455509 4.909597 0.011 403.729 0.011 16.792 13568 30047 42516 1231.73 72234187 8.013 0.178 0.016 0.036 4417 237.601 0.144 0.044 5.108 11.042 0.201 46.588 0.845 0.052 0.222 23477 1.301 1966.15 18279 0.290 0.043 1.325 23117 0.094 0.060 0.003 0.001 0.002 0.002 0.001 0.002 91035 0.020 0.004 0.001 0.069 2.021 0.013 0.005 0.002 0.134 0.027 0.001 0.010 7.19 12.02 8.32 2854.854 5.83 4.73 10.03 7.46 6.84 3.96 19.80 7.09 2.71 9.34 4.27 3.65 4.76 4.05 7.44 15.89 17.33 19.41 10.64 13.24 43.97 10.43 1.21 4.35 2.72 3.16 2.74 2.98 10.74 25.231 2.054 2.217 4.060 18.461 2.559 1.237 154285 1355.37 25089.993604089 15404.277700861 993.75 243.796 0.1096 75886 26.677 191.33 38.4318 38.5258 1542874 254.597 18.3795

Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 0.055 0.013 0.009 0.031 109.65 486.558 0.111 2.29 0.499 8.193 2.212 422.78 109893 1564193 65.95657 0.004 0.94957 0.037 78573 16.8136 1413363 0.009 74706.9 2873.34 78810.5 1584.97 0.779 28.693 19.57 6.72 80 887.14 5932.11 409.434 101.122 0.024 13.9616 0.101 520 45034 0.009 0.459 9209.748838 5.205018 0.011 398.301 0.011 16.524 13494 29780 43336 1171.04 76195000 8.183 0.179 0.016 0.036 4579 230.459 0.144 0.044 5.069 11.057 0.201 45.500 0.837 0.053 0.227 23622 1.299 1970.57 17876 0.290 0.043 1.343 23240 0.093 0.061 0.003 0.001 0.002 0.002 0.001 0.002 52247 0.02 0.004 0.001 0.223 2.131 0.014 0.005 0.002 0.170 0.034 0.001 0.013 13.06 14.58 8.41 3987.715 3.71 4.30 8.14 6.11 6.41 2.34 16.96 5.10 1.43 7.65 2.72 2.07 3.64 2.61 6.80 14.85 17.62 20.01 11.23 11.74 40.61 10.34 1.21 4.47 2.88 2.67 2.75 3.10 11.06 26.687 2.613 2.565 4.640 22.622 2.865 1.261 89096.6 1439.63 26580.090189963 15539.073739801 821.38 255.322 0.1097 75032 25.676 138.33 36.0449 35.1906 1581543 228.412 18.4348

Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 0.021 0.005 0.003 0.013 46.60 216.871 0.052 3.11 0.281 6.167 1.325 615.24 143667 2022477 48.78499 0.004 1.18910 0.029 102333 21.8802 1783857 0.008 93377.4 3550.08 98208.8 1853.31 0.635 26.149 23.08 6.17 74 790.264 6218.41 353.81 99.8771 0.021 13.7011 0.089 468 39909 0.008 0.409 9558.655773 4.965331 0.01 368.132 0.011 16.103 14069 32202 43777 1256.30 77469363 7.667 0.168 0.016 0.034 4593 225.758 0.137 0.042 4.950 10.693 0.194 44.744 0.815 0.051 0.219 22861 1.256 2023.16 18303 0.285 0.042 1.313 22747 0.093 0.060 0.003 0.001 0.002 0.002 0.001 0.002 7537 0.021 0.005 0.002 0.088 1.847 0.012 0.005 0.002 0.133 0.028 0.001 0.011 10.83 36.43 9.39 2623.161 3.46 3.74 7.63 5.63 5.75 1.96 16.43 4.69 1.03 7.20 2.48 1.80 3.33 2.41 4.71 12.06 14.05 15.22 8.38 9.23 37.91 7.63 0.91 2.90 1.85 2.18 1.89 2.15 8.63 21.017 1.532 1.616 3.165 14.786 2.045 0.963 130527 1649.84 33183.121427314 16397.283085518 1099.43 264.784 0.1038 83426 26.165 141.53 37.1909 38.2339 1723333 263.157 17.5019
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Equation of State
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of State
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.015 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 15)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 15)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.055 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Equation of State
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 16384 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 15)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.031 (SE +/- 0.000, N = 3)
Mlpack Benchmark - Benchmark: scikit_qda
Seconds, Fewer Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 46.60 (SE +/- 0.12, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 95.97 (SE +/- 0.33, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 96.46 (SE +/- 0.15, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 109.65 (SE +/- 0.88, N = 3)
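Because scikit_qda is a "Fewer Is Better" result, a lower time means a faster run. As a minimal sketch of how the four reported means compare, the ratio below divides a baseline time by each run's time. The short labels, the `speedup` helper, and the choice of the "Pcore ONLY no AVX512" run as baseline are assumptions made for illustration; only the four mean values come from the result above.

```python
def speedup(baseline_seconds: float, seconds: float) -> float:
    """Ratio > 1 means the run finished faster than the baseline."""
    return baseline_seconds / seconds


# Mean run times in seconds, copied from the scikit_qda result.
qda_seconds = {
    "sapphirerapids": 46.60,
    "avx512": 95.97,
    "no_avx512": 96.46,
    "pcore_ecore": 109.65,
}

baseline = qda_seconds["no_avx512"]  # assumed baseline, chosen for illustration
for label, seconds in qda_seconds.items():
    print(f"{label}: {speedup(baseline, seconds):.2f}x")
```

A ratio like this only restates the reported means; given the result file's own "POSSIBLE BAD DATA" label, it should not be read as confirming that the underlying runs were comparable.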
TNN 0.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 216.87 (SE +/- 0.22, N = 3; MIN: 214.9 / MAX: 226.18; -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 224.71 (SE +/- 1.06, N = 3; MIN: 219.21 / MAX: 248.29; -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 262.51 (SE +/- 2.18, N = 15; MIN: 221.83 / MAX: 377.5; -march=native)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 486.56 (SE +/- 0.45, N = 3; MIN: 479.58 / MAX: 559.42; -march=native)
1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.052 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.052 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.054 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.111 (SE +/- 0.000, N = 3)
Mlpack Benchmark - Benchmark: scikit_linearridgeregression
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.53 (SE +/- 0.02, N = 15)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.64 (SE +/- 0.02, N = 15)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.29 (SE +/- 0.02, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.11 (SE +/- 0.03, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.280 (SE +/- 0.003, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.281 (SE +/- 0.001, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.285 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.499 (SE +/- 0.000, N = 3)
Scikit-Learn 0.22.1
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.883 (SE +/- 0.009, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.069 (SE +/- 0.045, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.167 (SE +/- 0.019, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.193 (SE +/- 0.002, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.325 (SE +/- 0.005, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.332 (SE +/- 0.001, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.336 (SE +/- 0.016, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.212 (SE +/- 0.018, N = 9)
Numpy Benchmark
Score, More Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 615.24 (SE +/- 1.67, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 565.28 (SE +/- 1.21, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 558.72 (SE +/- 2.34, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 422.78 (SE +/- 2.53, N = 3)
TensorFlow Lite 2020-08-23 - Model: SqueezeNet
Microseconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 109893 (SE +/- 195.44, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 141138 (SE +/- 86.06, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 143667 (SE +/- 1142.56, N = 15)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 159253 (SE +/- 701.15, N = 3)
TensorFlow Lite 2020-08-23 - Model: Inception V4
Microseconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1564193 (SE +/- 2696.20, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2022477 (SE +/- 10245.31, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2024680 (SE +/- 1345.60, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2213513 (SE +/- 8085.58, N = 3)
DeepSpeech 0.6 - Acceleration: CPU
Seconds, Fewer Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 48.78 (SE +/- 0.28, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 49.08 (SE +/- 0.29, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 54.55 (SE +/- 0.25, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 65.96 (SE +/- 0.20, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 3)
NAMD 2.14 - ATPase Simulation - 327,506 Atoms
days/ns, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.94957 (SE +/- 0.00925, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.14364 (SE +/- 0.00123, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.18910 (SE +/- 0.00934, N = 10)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.26329 (SE +/- 0.01507, N = 3)
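The NAMD figures are in days/ns, so smaller is better. Taking the reciprocal gives simulated nanoseconds per day, where larger is better. A minimal sketch of that conversion follows; the helper name `ns_per_day` and the short labels are hypothetical, and only the four days/ns values come from the result above.

```python
def ns_per_day(days_per_ns: float) -> float:
    """Convert a days/ns figure to ns/day by taking the reciprocal."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# days/ns values copied from the NAMD ATPase result.
namd_days_per_ns = {
    "pcore_ecore": 0.94957,
    "no_avx512": 1.14364,
    "sapphirerapids": 1.18910,
    "avx512": 1.26329,
}

for label, value in namd_days_per_ns.items():
    print(f"{label}: {ns_per_day(value):.4f} ns/day")
```

The conversion reverses the ordering: the run with the lowest days/ns has the highest ns/day.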
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Equation of State
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.028 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.029 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.030 (SE +/- 0.000, N = 3)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.037 (SE +/- 0.000, N = 3)
FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 4096
Mflops, More Is Better
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 102333 (SE +/- 950.48, N = 3; -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 100155 (SE +/- 688.36, N = 15; -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 80667 (SE +/- 367.77, N = 3; -march=native)
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 78573 (SE +/- 827.63, N = 4; -march=native)
1. (CC) gcc options: -pthread -O3 -lm
SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction
GB/s, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 21.88 (SE +/- 0.10, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 21.21 (SE +/- 0.29, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.91 (SE +/- 0.08, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.81 (SE +/- 0.15, N = 3)
    flags: -march=native
  (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
TensorFlow Lite 2020-08-23
Model: Inception ResNet V2
Microseconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1413363 (SE +/- 972.87, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1783857 (SE +/- 2059.01, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1801520 (SE +/- 380.18, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1837950 (SE +/- 2113.91, N = 3)
PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of State
Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.007 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.008 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.008 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 3)
TensorFlow Lite 2020-08-23
Model: Mobilenet Float
Microseconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 74706.9 (SE +/- 1007.74, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 93377.4 (SE +/- 1032.89, N = 5)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 93959.4 (SE +/- 153.26, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 95569.3 (SE +/- 70.47, N = 3)
ASKAP 1.0
Test: tConvolve OpenMP - Degridding
Million Grid Points Per Second, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3550.08 (SE +/- 0.00, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3423.09 (SE +/- 25.80, N = 5)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2873.34 (SE +/- 10.37, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2812.63 (SE +/- 9.94, N = 3)
  (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
TensorFlow Lite 2020-08-23
Model: Mobilenet Quant
Microseconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 78810.5 (SE +/- 254.55, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 97487.4 (SE +/- 147.61, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 98208.8 (SE +/- 995.15, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 99378.6 (SE +/- 477.30, N = 3)
ASKAP 1.0
Test: tConvolve OpenMP - Gridding
Million Grid Points Per Second, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1853.31 (SE +/- 4.31, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1826.95 (SE +/- 18.90, N = 5)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1584.97 (SE +/- 9.49, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1484.83 (SE +/- 10.00, N = 3)
  (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.635 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.637 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.644 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.779 (SE +/- 0.001, N = 3)
ECP-CANDLE 0.4
Benchmark: P1B2
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.15
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 28.41
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 28.69
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 31.79
PlaidML
FP16: No - Mode: Inference - Network: VGG16 - Device: CPU
FPS, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 23.08 (SE +/- 0.08, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 22.29 (SE +/- 0.07, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 19.57 (SE +/- 0.24, N = 4)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 19.23 (SE +/- 0.22, N = 4)
NCNN 20210720
Target: Vulkan GPU - Model: mobilenet
ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.17 (SE +/- 0.05, N = 3; MIN: 5.85 / MAX: 8.17)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.72 (SE +/- 0.06, N = 9; MIN: 6.36 / MAX: 35.89)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.86 (SE +/- 0.07, N = 15; MIN: 6.23 / MAX: 45.95)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.40 (SE +/- 0.10, N = 3; MIN: 6.41 / MAX: 52.12)
    flags: -march=native
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
ONNX Runtime 1.10
Model: fcn-resnet101-11 - Device: CPU
Inferences Per Minute, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 80 (SE +/- 0.67, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 74 (SE +/- 0.17, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 74 (SE +/- 0.00, N = 3)
    flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 68 (SE +/- 0.75, N = 5)
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ECP-CANDLE 0.4
Benchmark: P3B1
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 790.26
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 794.26
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 887.14
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 918.20
miniFE 2.2
Problem Size: Small
CG Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6321.61 (SE +/- 9.91, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6218.41 (SE +/- 26.31, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5932.11 (SE +/- 4.26, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5458.44 (SE +/- 73.38, N = 3)
  (CXX) g++ options: -O3 -fopenmp -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lrt -lpthread -ldl
ECP-CANDLE 0.4
Benchmark: P3B2
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 353.81
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 401.56
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 403.48
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 409.43
SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: FFT SP
GFLOPS, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 101.12 (SE +/- 0.08, N = 3)
    flags: -march=native
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 99.88 (SE +/- 0.58, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 98.14 (SE +/- 0.45, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 87.62 (SE +/- 1.24, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 13)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.024 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.024 (SE +/- 0.000, N = 3)
SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: S3D
GFLOPS, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.96 (SE +/- 0.12, N = 15)
    flags: -march=native
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.70 (SE +/- 0.11, N = 10)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.66 (SE +/- 0.12, N = 15)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.24 (SE +/- 0.18, N = 15)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.089 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.090 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.097 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.101 (SE +/- 0.000, N = 3)
ONNX Runtime 1.10
Model: yolov4 - Device: CPU
Inferences Per Minute, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 520 (SE +/- 4.64, N = 12)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 469 (SE +/- 0.44, N = 3)
    flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 468 (SE +/- 5.97, N = 12)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 460 (SE +/- 1.48, N = 3)
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ONNX Runtime 1.10
Model: shufflenet-v2-10 - Device: CPU
Inferences Per Minute, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 45034 (SE +/- 46.94, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 42897 (SE +/- 105.96, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 40688 (SE +/- 181.26, N = 3)
    flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 39909 (SE +/- 620.08, N = 11)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Equation of State
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.008 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.409 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.416 (SE +/- 0.002, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.454 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.459 (SE +/- 0.001, N = 3)
Himeno Benchmark 3.0
Poisson Pressure Solver
MFLOPS, More Is Better
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10224.56 (SE +/- 6.21, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 9558.66 (SE +/- 5.76, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 9209.75 (SE +/- 67.32, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9116.46 (SE +/- 32.30, N = 3)
    flags: -march=native
  (CC) gcc options: -O3 -mavx2
ACES DGEMM 1.0
Sustained Floating-Point Rate
GFLOP/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.205018 (SE +/- 0.005750, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.965331 (SE +/- 0.019734, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.909597 (SE +/- 0.027431, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.723792 (SE +/- 0.007819, N = 3)
    flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -O3 -march=native -fopenmp
PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Equation of State
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.010 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
CP2K Molecular Dynamics 8.2
Input: Fayalite-FIST
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 368.13
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 382.83
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 398.30
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 403.73
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.012 (SE +/- 0.000, N = 3)
RNNoise 2020-06-28
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.10 (SE +/- 0.02, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.52 (SE +/- 0.01, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.79 (SE +/- 0.09, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.51 (SE +/- 0.01, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -O3 -pedantic -fvisibility=hidden
FFTW 3.3.6
Build: Stock - Size: 2D FFT Size 4096
Mflops, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 14069 (SE +/- 54.22, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 13568 (SE +/- 25.36, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 13494 (SE +/- 24.06, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12945 (SE +/- 108.98, N = 8)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 32
Mflops, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 32202 (SE +/- 25.36, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30059 (SE +/- 235.28, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30047 (SE +/- 192.25, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 29780 (SE +/- 6.89, N = 3)
    flags: -march=native
  (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6
Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 43777 (SE +/- 301.86, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 43336 (SE +/- 522.29, N = 9)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 42516 (SE +/- 932.44, N = 7)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 40789 (SE +/- 293.32, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -pthread -O3 -lm
ASKAP 1.0
Test: tConvolve MT - Gridding
Million Grid Points Per Second, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1256.30 (SE +/- 0.74, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1255.80 (SE +/- 0.45, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1231.73 (SE +/- 2.99, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1171.04 (SE +/- 0.43, N = 3)
  (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
Kripke 1.2.4
Throughput FoM, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 77469363 (SE +/- 37071.05, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 76868693 (SE +/- 132255.44, N = 3)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 76195000 (SE +/- 396295.91, N = 3)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 72234187 (SE +/- 688051.71, N = 3)
    flags: -march=native
  (CXX) g++ options: -O3 -fopenmp
Timed MAFFT Alignment 7.471
Multiple Sequence Alignment - LSU RNA
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.667 (SE +/- 0.031, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.915 (SE +/- 0.051, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.013 (SE +/- 0.009, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.183 (SE +/- 0.069, N = 3)
  (CC) gcc options: -std=c99 -O3 -lm -lpthread
PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Equation of State
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.168 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.175 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.178 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.179 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Isoneutral Mixing
Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.017 (SE +/- 0.000, N = 15)
PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of State
Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.034 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.035 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.036 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.036 (SE +/- 0.000, N = 3)
ONNX Runtime 1.10
Model: super-resolution-10 - Device: CPU
Inferences Per Minute, More Is Better
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4662 (SE +/- 24.67, N = 3)
    flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4593 (SE +/- 10.54, N = 3)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4579 (SE +/- 7.52, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4417 (SE +/- 5.07, N = 3)
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
TNN 0.3
Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 225.76 (SE +/- 0.04, N = 3; MIN: 225.47 / MAX: 226.77)
    flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 230.46 (SE +/- 0.11, N = 3; MIN: 230.05 / MAX: 230.91)
    flags: -march=native
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 235.53 (SE +/- 0.06, N = 3; MIN: 235.23 / MAX: 237)
    flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 237.60 (SE +/- 1.05, N = 3; MIN: 235.29 / MAX: 265.95)
    flags: -march=native
  (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.137 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.143 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.144 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.144 (SE +/- 0.001, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.042 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
GNU Octave Benchmark 6.1.1~hg.2021.01.26 (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.950 (SE +/- 0.017, N = 5)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.069 (SE +/- 0.025, N = 5)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.108 (SE +/- 0.039, N = 5)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.166 (SE +/- 0.024, N = 5)
Dolfyn 0.527 - Computational Fluid Dynamics (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.69 (SE +/- 0.03, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.04 (SE +/- 0.06, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.06 (SE +/- 0.04, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.16 (SE +/- 0.02, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.194 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.201 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.201 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.202 (SE +/- 0.000, N = 3)
TNN 0.3 - Target: CPU - Model: SqueezeNet v2 (ms, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 44.74 (SE +/- 0.05, N = 3; MIN: 44.37 / MAX: 45.8; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 45.50 (SE +/- 0.08, N = 3; MIN: 45.08 / MAX: 45.93; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 45.90 (SE +/- 0.32, N = 3; MIN: 45.05 / MAX: 46.7; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 46.59 (SE +/- 0.22, N = 3; MIN: 45.39 / MAX: 53.21; flags: -march=native)
  (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.815 (SE +/- 0.003, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.837 (SE +/- 0.005, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.845 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.847 (SE +/- 0.001, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.051 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.052 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.053 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.053 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.219 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.222 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.222 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.227 (SE +/- 0.002, N = 3)
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 32 (Mflops, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 23622 (SE +/- 102.46, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23477 (SE +/- 178.70, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 22861 (SE +/- 54.44, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 22796 (SE +/- 214.39, N = 3)
  Per-configuration flags as exported: -march=native -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -pthread -O3 -lm
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.256 (SE +/- 0.003, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.299 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.299 (SE +/- 0.004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.301 (SE +/- 0.003, N = 3)
ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2023.16 (SE +/- 0.85, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2004.44 (SE +/- 2.06, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1970.57 (SE +/- 1.70, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1966.15 (SE +/- 15.85, N = 3)
  (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096 (Mflops, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18379 (SE +/- 141.46, N = 14)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 18303 (SE +/- 33.69, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18279 (SE +/- 117.70, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 17876 (SE +/- 29.78, N = 3)
  Per-configuration flags as exported: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -march=native
  (CC) gcc options: -pthread -O3 -lm
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.285 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.290 (SE +/- 0.002, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.290 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.293 (SE +/- 0.002, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.042 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.043 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.043 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.043 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.313 (SE +/- 0.002, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.325 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.327 (SE +/- 0.004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.343 (SE +/- 0.003, N = 3)
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 32 (Mflops, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 23240 (SE +/- 13.78, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23117 (SE +/- 122.88, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 22747 (SE +/- 20.34, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 22739 (SE +/- 1.86, N = 3)
  Per-configuration flags as exported: -march=native -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  (CC) gcc options: -pthread -O3 -lm
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.093 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.093 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.094 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.095 (SE +/- 0.001, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.060 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.060 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.061 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.061 (SE +/- 0.001, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 16384 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
OpenCV 4.5.4 - Test: DNN - Deep Neural Network (ms, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7537 (SE +/- 245.00, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 52247 (SE +/- 713.19, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 66637 (SE +/- 1166.72, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 91035 (SE +/- 3031.05, N = 12)
  Per-configuration flags as exported: -march=native -ldl -lm -lpthread -lrt -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -ldl -lm -lpthread -lrt -march=native -ldl -lm -lpthread -lrt
  (CXX) g++ options: -O3 -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -fvisibility=hidden
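The four OpenCV DNN results (7537, 52247, 66637 and 91035 ms) span more than an order of magnitude, far more than any other CPU test in this export, so they may fall under the POSSIBLE BAD DATA caveat in the result title. A minimal sketch for flagging such results follows. The helper names and the threshold of 2.0 are hypothetical choices for illustration, not anything defined by the Phoronix Test Suite.

```python
def spread_ratio(values):
    """Ratio of the largest to the smallest result across configurations."""
    return max(values) / min(values)


def is_suspicious(values, threshold=2.0):
    """Flag a test whose results differ by more than `threshold` times.

    The threshold is an arbitrary illustrative assumption.
    """
    return spread_ratio(values) > threshold


# Values copied from this export.
opencv_dnn_ms = [7537, 52247, 66637, 91035]
octave_seconds = [4.950, 5.069, 5.108, 5.166]

print(round(spread_ratio(opencv_dnn_ms), 1))
print(is_suspicious(opencv_dnn_ms))
print(is_suspicious(octave_seconds))
```

With these inputs the OpenCV DNN test is flagged and the GNU Octave test is not. A flag only marks a result for a rerun or closer inspection; it does not show which configuration is wrong.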
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 4)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 262144 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 65536 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.069 (SE +/- 0.002, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.088 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.090 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.223 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.847 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.861 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.021 (SE +/- 0.004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.131 (SE +/- 0.100, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.012 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.014 (SE +/- 0.001, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.004 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.005 (SE +/- 0.000, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 15)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.133 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.134 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.134 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.170 (SE +/- 0.004, N = 12)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.027 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.028 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.028 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.034 (SE +/- 0.001, N = 12)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 65536 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.010 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.011 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.013 (SE +/- 0.001, N = 12)
Mlpack Benchmark - Benchmark: scikit_svm (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.95 (SE +/- 0.05, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.19 (SE +/- 0.06, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.83 (SE +/- 0.02, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.06 (SE +/- 0.54, N = 12)
Mlpack Benchmark - Benchmark: scikit_ica (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.02 (SE +/- 0.07, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.39 (SE +/- 0.02, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.58 (SE +/- 0.26, N = 12)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 36.43 (SE +/- 0.29, N = 9)
PlaidML - FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU (FPS, More Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.39 (SE +/- 0.03, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.52 (SE +/- 0.01, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.41 (SE +/- 0.25, N = 9)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.32 (SE +/- 0.04, N = 3)
TNN 0.3 - Target: CPU - Model: DenseNet (ms, Fewer Is Better)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2623.16 (SE +/- 3.06, N = 3; MIN: 2564.18 / MAX: 2757.86; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2719.82 (SE +/- 2.71, N = 3; MIN: 2625.5 / MAX: 3184.86; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2854.85 (SE +/- 40.36, N = 3; MIN: 2660.43 / MAX: 3772.17; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3987.72 (SE +/- 299.40, N = 9; MIN: 2619.64 / MAX: 5569.92; flags: -march=native)
  (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
NCNN 20210720 - Target: Vulkan GPU - Model: regnety_400m - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.46 (SE +/- 0.02, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.39 / MAX: 9.61
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.71 (SE +/- 0.05, N = 9) -march=native - MIN: 3.44 / MAX: 27.73
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.28 (SE +/- 0.09, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.49 / MAX: 28.51
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.83 (SE +/- 1.34, N = 3) -march=native - MIN: 3.47 / MAX: 39.09
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: squeezenet_ssd - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.74 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.67 / MAX: 6.02
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.30 (SE +/- 0.06, N = 9) -march=native - MIN: 3.76 / MAX: 18.02
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.63 (SE +/- 0.04, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.77 / MAX: 53.14
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.73 (SE +/- 0.35, N = 3) -march=native - MIN: 3.77 / MAX: 44.23
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: yolov4-tiny - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.63 (SE +/- 0.03, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.42 / MAX: 9.29
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.14 (SE +/- 0.02, N = 9) -march=native - MIN: 7.82 / MAX: 19.98
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.16 (SE +/- 0.03, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.45 / MAX: 57.64
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.03 (SE +/- 1.16, N = 3) -march=native - MIN: 7.85 / MAX: 80.22
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: resnet50 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.63 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.55 / MAX: 11.35
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.11 (SE +/- 0.04, N = 9) -march=native - MIN: 5.6 / MAX: 31.19
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.64 (SE +/- 0.08, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.66 / MAX: 49.12
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.46 (SE +/- 1.23, N = 3) -march=native - MIN: 5.65 / MAX: 33.99
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: alexnet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.75 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.49 / MAX: 12.08
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.41 (SE +/- 0.14, N = 9) -march=native - MIN: 5.54 / MAX: 27.79
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.69 (SE +/- 0.06, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.55 / MAX: 48.95
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.84 (SE +/- 0.55, N = 3) -march=native - MIN: 5.56 / MAX: 36.9
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: resnet18 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.96 (SE +/- 0.04, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.89 / MAX: 9.21
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.34 (SE +/- 0.09, N = 7) -march=native - MIN: 1.95 / MAX: 34.31
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.04 (SE +/- 0.05, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.03 / MAX: 39.18
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.96 (SE +/- 1.09, N = 3) -march=native - MIN: 2.04 / MAX: 34.37
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: vgg16 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.43 (SE +/- 0.07, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 15.67 / MAX: 43.35
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.96 (SE +/- 0.08, N = 9) -march=native - MIN: 15.75 / MAX: 62.69
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.58 (SE +/- 0.12, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 15.73 / MAX: 64.58
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 19.80 (SE +/- 2.77, N = 3) -march=native - MIN: 15.66 / MAX: 57.67
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: googlenet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.69 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.63 / MAX: 7.03
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.10 (SE +/- 0.06, N = 9) -march=native - MIN: 4.68 / MAX: 29.19
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.41 (SE +/- 0.07, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.74 / MAX: 36.71
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.09 (SE +/- 1.19, N = 3) -march=native - MIN: 4.73 / MAX: 34.54
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: blazeface - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.03 (SE +/- 0.04, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.85 / MAX: 22.57
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.43 (SE +/- 0.02, N = 9) -march=native - MIN: 1.17 / MAX: 22.73
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.69 (SE +/- 0.03, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.29 / MAX: 28.85
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.71 (SE +/- 0.30, N = 3) -march=native - MIN: 1.3 / MAX: 55.08
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: efficientnet-b0 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.20 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.08 / MAX: 11.54
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.65 (SE +/- 0.05, N = 9) -march=native - MIN: 7.2 / MAX: 28.56
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.72 (SE +/- 0.04, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.18 / MAX: 34.33
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.34 (SE +/- 1.47, N = 3) -march=native - MIN: 7.17 / MAX: 38.24
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: mnasnet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.48 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.4 / MAX: 6.71
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.72 (SE +/- 0.06, N = 9) -march=native - MIN: 2.46 / MAX: 25.53
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.40 (SE +/- 0.09, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.5 / MAX: 33.07
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.27 (SE +/- 1.10, N = 3) -march=native - MIN: 2.5 / MAX: 32.54
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: shufflenet-v2 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.80 (SE +/- 0.00, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.77 / MAX: 3.92
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.07 (SE +/- 0.02, N = 9) -march=native - MIN: 1.82 / MAX: 24.55
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.68 (SE +/- 0.04, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 25.24
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.65 (SE +/- 0.97, N = 3) -march=native - MIN: 1.88 / MAX: 35.3
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.33 (SE +/- 0.05, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.22 / MAX: 15.12
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.64 (SE +/- 0.05, N = 9) -march=native - MIN: 3.28 / MAX: 24.44
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.02 (SE +/- 0.05, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.28 / MAX: 27.17
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.76 (SE +/- 0.44, N = 3) -march=native - MIN: 3.33 / MAX: 28.56
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.41 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.33 / MAX: 6.88
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.61 (SE +/- 0.04, N = 9) -march=native - MIN: 2.38 / MAX: 19.16
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.19 (SE +/- 0.04, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.43 / MAX: 25.75
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.05 (SE +/- 0.50, N = 3) -march=native - MIN: 2.44 / MAX: 30.19
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: regnety_400m - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.71 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.58 / MAX: 6.32
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.55 (SE +/- 0.01, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.92 / MAX: 31.26
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.80 (SE +/- 0.20, N = 15) -march=native - MIN: 6.23 / MAX: 15.86
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.44 (SE +/- 0.55, N = 15) -march=native - MIN: 5.34 / MAX: 398.2
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: squeezenet_ssd - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.06 (SE +/- 0.02, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 11.82 / MAX: 17.26
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.41 (SE +/- 0.02, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 12.41 / MAX: 45.1
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.85 (SE +/- 0.52, N = 15) -march=native - MIN: 13.57 / MAX: 63.1
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15.89 (SE +/- 0.72, N = 15) -march=native - MIN: 12.62 / MAX: 489.07
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: yolov4-tiny - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.05 (SE +/- 0.20, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 13.36 / MAX: 28.81
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15.34 (SE +/- 0.13, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 13.6 / MAX: 60.85
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.33 (SE +/- 0.43, N = 15) -march=native - MIN: 14.49 / MAX: 220.15
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.62 (SE +/- 0.44, N = 15) -march=native - MIN: 15.29 / MAX: 370.58
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: resnet50 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 15.22 (SE +/- 0.23, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 14.63 / MAX: 21.4
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.38 (SE +/- 0.11, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 14.75 / MAX: 169.63
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 19.41 (SE +/- 0.58, N = 15) -march=native - MIN: 15.46 / MAX: 404.54
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 20.01 (SE +/- 0.32, N = 15) -march=native - MIN: 18.29 / MAX: 72.6
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: alexnet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.38 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.27 / MAX: 14.31
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.29 (SE +/- 0.05, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 8.58 / MAX: 188.45
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.64 (SE +/- 0.31, N = 15) -march=native - MIN: 8.84 / MAX: 348.67
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.23 (SE +/- 0.06, N = 15) -march=native - MIN: 10.78 / MAX: 35.58
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: resnet18 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.23 (SE +/- 0.03, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 9.03 / MAX: 29.68
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.37 (SE +/- 0.05, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 9.43 / MAX: 107.77
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.74 (SE +/- 0.21, N = 15) -march=native - MIN: 10.66 / MAX: 43.48
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.24 (SE +/- 0.65, N = 15) -march=native - MIN: 9.67 / MAX: 377.98
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: vgg16 - ms, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 37.73 (SE +/- 0.15, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 35.14 / MAX: 288.51
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 37.91 (SE +/- 0.31, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 37.18 / MAX: 80.66
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 40.61 (SE +/- 0.40, N = 15) -march=native - MIN: 38.83 / MAX: 98.35
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 43.97 (SE +/- 1.09, N = 15) -march=native - MIN: 36.31 / MAX: 485.44
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: googlenet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.63 (SE +/- 0.04, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.41 / MAX: 17.97
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.00 (SE +/- 0.15, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.77 / MAX: 212.02
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.34 (SE +/- 0.27, N = 15) -march=native - MIN: 9.1 / MAX: 35.14
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.43 (SE +/- 0.39, N = 15) -march=native - MIN: 8.03 / MAX: 223.75
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: blazeface - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.91 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.88 / MAX: 4.07
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.02 (SE +/- 0.00, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 0.93 / MAX: 3.99
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.21 (SE +/- 0.04, N = 15) -march=native - MIN: 0.93 / MAX: 35.68
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.21 (SE +/- 0.04, N = 15) -march=native - MIN: 1.05 / MAX: 6.94
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: efficientnet-b0 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.90 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.83 / MAX: 3.97
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.37 (SE +/- 0.02, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.95 / MAX: 16.3
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.35 (SE +/- 0.20, N = 15) -march=native - MIN: 3.03 / MAX: 261.53
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.47 (SE +/- 0.12, N = 15) -march=native - MIN: 3.96 / MAX: 34.65
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: mnasnet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.85 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.79 / MAX: 2.88
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.12 (SE +/- 0.01, N = 14) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 5.63
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.72 (SE +/- 0.12, N = 14) -march=native - MIN: 1.95 / MAX: 218.37
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.88 (SE +/- 0.12, N = 15) -march=native - MIN: 2.43 / MAX: 186.85
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: shufflenet-v2 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.18 (SE +/- 0.00, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.1 / MAX: 6.91
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.39 (SE +/- 0.01, N = 14) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.14 / MAX: 6.73
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.67 (SE +/- 0.07, N = 14) -march=native - MIN: 2.45 / MAX: 10.63
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.16 (SE +/- 0.31, N = 15) -march=native - MIN: 2.17 / MAX: 324.42
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU-v3-v3 - Model: mobilenet-v3 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.89 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.83 / MAX: 2.92
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.17 (SE +/- 0.01, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.87 / MAX: 11.1
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.74 (SE +/- 0.12, N = 15) -march=native - MIN: 1.97 / MAX: 173.37
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.75 (SE +/- 0.09, N = 15) -march=native - MIN: 2.43 / MAX: 270.92
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU-v2-v2 - Model: mobilenet-v2 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.15 (SE +/- 0.01, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.08 / MAX: 5.56
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.38 (SE +/- 0.01, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.09 / MAX: 19.16
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.98 (SE +/- 0.12, N = 15) -march=native - MIN: 2.18 / MAX: 280.99
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.10 (SE +/- 0.15, N = 15) -march=native - MIN: 2.73 / MAX: 410
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: mobilenet - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.63 (SE +/- 0.10, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.28 / MAX: 13.76
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.07 (SE +/- 0.12, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.88 / MAX: 149.11
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.74 (SE +/- 0.34, N = 15) -march=native - MIN: 8.38 / MAX: 272.88
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 11.06 (SE +/- 0.31, N = 15) -march=native - MIN: 9.8 / MAX: 493.36
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Mobile Neural Network 1.2 - Model: inception-v3 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 21.02 (SE +/- 0.04, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 19.1 / MAX: 37.33
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23.28 (SE +/- 0.05, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 20.16 / MAX: 153.62
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 25.23 (SE +/- 0.55, N = 15) -march=native - MIN: 20.75 / MAX: 242.75
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.69 (SE +/- 0.37, N = 12) -march=native - MIN: 22.92 / MAX: 195.79
  1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: mobilenet-v1-1.0 - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.532 (SE +/- 0.007, N = 3) -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.46 / MAX: 12.02
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.749 (SE +/- 0.016, N = 15) -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.51 / MAX: 60.38
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.054 (SE +/- 0.066, N = 15) -march=native - MIN: 1.55 / MAX: 55.56
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.613 (SE +/- 0.016, N = 12) -march=native - MIN: 2.48 / MAX: 16.01
  1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: MobileNetV2_224
OpenBenchmarking.org - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.616 (SE +/- 0.004, N = 3; MIN: 1.54 / MAX: 7.39; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.892 (SE +/- 0.021, N = 15; MIN: 1.59 / MAX: 22.25; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.217 (SE +/- 0.076, N = 15; MIN: 1.65 / MAX: 59.17; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.565 (SE +/- 0.050, N = 12; MIN: 2.04 / MAX: 63.68; flags: -march=native)
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: SqueezeNetV1.0
OpenBenchmarking.org - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.165 (SE +/- 0.251, N = 3; MIN: 2.84 / MAX: 8.57; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.590 (SE +/- 0.051, N = 15; MIN: 2.83 / MAX: 78.46; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.060 (SE +/- 0.128, N = 15; MIN: 2.94 / MAX: 82.76; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.640 (SE +/- 0.105, N = 12; MIN: 3.45 / MAX: 47.3; flags: -march=native)
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: resnet-v2-50
OpenBenchmarking.org - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.79 (SE +/- 0.05, N = 3; MIN: 14.1 / MAX: 42.25; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.69 (SE +/- 0.06, N = 15; MIN: 14.63 / MAX: 104.64; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.46 (SE +/- 0.50, N = 15; MIN: 14.88 / MAX: 173.91; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 22.62 (SE +/- 0.57, N = 12; MIN: 19.41 / MAX: 83.45; flags: -march=native)
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: squeezenetv1.1
OpenBenchmarking.org - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.045 (SE +/- 0.010, N = 3; MIN: 1.99 / MAX: 8.27; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.264 (SE +/- 0.022, N = 15; MIN: 1.95 / MAX: 45.84; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.559 (SE +/- 0.078, N = 15; MIN: 1.99 / MAX: 49.19; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.865 (SE +/- 0.110, N = 12; MIN: 2.23 / MAX: 52.16; flags: -march=native)
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: mobilenetV3
OpenBenchmarking.org - ms, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.963 (SE +/- 0.001, N = 3; MIN: 0.93 / MAX: 4.84; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.113 (SE +/- 0.016, N = 15; MIN: 0.93 / MAX: 16.19; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.237 (SE +/- 0.037, N = 15; MIN: 0.95 / MAX: 40.28; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.261 (SE +/- 0.017, N = 12; MIN: 1.08 / MAX: 48.36; flags: -march=native)
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
TensorFlow Lite 2020-08-23 - Model: NASNet Mobile
OpenBenchmarking.org - Microseconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 89096.6 (SE +/- 609.47, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 126721.0 (SE +/- 95.70, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 130527.0 (SE +/- 2986.13, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 154285.0 (SE +/- 6529.38, N = 12)
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Euclidean Cluster
OpenBenchmarking.org - Test Cases Per Minute, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1649.84 (SE +/- 12.05, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1615.65 (SE +/- 3.29, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1439.63 (SE +/- 27.64, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1355.37 (SE +/- 12.00, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Points2Image
OpenBenchmarking.org - Test Cases Per Minute, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 33183.12 (SE +/- 638.53, N = 12)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 32570.16 (SE +/- 274.39, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 26580.09 (SE +/- 100.69, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 25089.99 (SE +/- 1308.12, N = 13)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenCL - Kernel: Points2Image
OpenBenchmarking.org - Test Cases Per Minute, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 16397.28 (SE +/- 145.81, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 15539.07 (SE +/- 375.06, N = 12)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15525.66 (SE +/- 177.47, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15404.28 (SE +/- 137.15, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: NDT Mapping
OpenBenchmarking.org - Test Cases Per Minute, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1099.43 (SE +/- 4.87, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1008.65 (SE +/- 4.83, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 993.75 (SE +/- 10.71, N = 5)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 821.38 (SE +/- 26.57, N = 15)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
ASKAP 1.0 - Test: Hogbom Clean OpenMP
OpenBenchmarking.org - Iterations Per Second, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 264.78 (SE +/- 0.23, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 255.32 (SE +/- 0.58, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 253.55 (SE +/- 1.70, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 243.80 (SE +/- 4.49, N = 15)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
R Benchmark
OpenBenchmarking.org - Seconds, Fewer Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.1038 (SE +/- 0.0004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.1070 (SE +/- 0.0004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.1096 (SE +/- 0.0013, N = 3)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.1097 (SE +/- 0.0057, N = 12)
1. R scripting front-end version 4.0.4 (2021-02-15)
FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 32
OpenBenchmarking.org - Mflops, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 83426 (SE +/- 650.31, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 79668 (SE +/- 1461.14, N = 15; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 75886 (SE +/- 618.46, N = 15; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 75032 (SE +/- 820.18, N = 3; flags: -march=native)
1. (CC) gcc options: -pthread -O3 -lm
Nebular Empirical Analysis Tool 2.3
OpenBenchmarking.org - Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 25.68 (SE +/- 0.66, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.17 (SE +/- 0.01, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.68 (SE +/- 0.05, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.98 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
CloverLeaf - Lagrangian-Eulerian Hydrodynamics
OpenBenchmarking.org - Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 138.33 (SE +/- 0.42, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 141.53 (SE +/- 1.45, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 186.74 (SE +/- 2.66, N = 12)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 191.33 (SE +/- 18.74, N = 9)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback
OpenBenchmarking.org - GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.43 (SE +/- 1.55, N = 15; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 37.19 (SE +/- 1.52, N = 12; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 36.04 (SE +/- 1.03, N = 14; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 31.31 (SE +/- 1.21, N = 15; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download
OpenBenchmarking.org - GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.53 (SE +/- 1.64, N = 15; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.23 (SE +/- 1.47, N = 15; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 35.19 (SE +/- 1.31, N = 15; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30.54 (SE +/- 1.01, N = 12; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Max SP Flops
OpenBenchmarking.org - GFLOPS, More Is Better
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1723333 (SE +/- 44470.87, N = 12; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1667232 (SE +/- 76015.19, N = 12; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1581543 (SE +/- 38770.78, N = 15; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1542874 (SE +/- 87626.15, N = 15; flags: -march=native)
1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: GEMM SGEMM_N
OpenBenchmarking.org - GFLOPS, More Is Better
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 284.99 (SE +/- 1.08, N = 3; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 263.16 (SE +/- 0.87, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 254.60 (SE +/- 4.42, N = 12; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 228.41 (SE +/- 3.17, N = 15; flags: -march=native)
1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad
OpenBenchmarking.org - GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.43 (SE +/- 0.20, N = 4; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.38 (SE +/- 0.16, N = 15; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.50 (SE +/- 0.20, N = 15; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15.63 (SE +/- 0.29, N = 15; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
Phoronix Test Suite v10.8.5