HPC benchmark- POSSIBLE BAD DATA
Intel Core i7-12700K testing with an MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112081-TJ-2112076TJ12&grr&sro .
HPC benchmark- POSSIBLE BAD DATA: system details

Configurations tested:
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt

System table (values as listed for the first configuration):
Processor: Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)
Motherboard: MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)
Chipset: Intel Device 7aa7
Memory: 32GB
Disk: 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro
Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 (1650/750MHz)
Audio: Realtek ALC897
Monitor: LG HDR WQHD
Network: Intel I225-V
OS: Pop 21.04
Kernel: 5.15.5-76051505-generic (x86_64)
Desktop: GNOME Shell 3.38.4
Display Server: X Server 1.20.11
OpenGL: 4.6 Mesa 21.3.0-devel (LLVM 12.0.1)
OpenCL: OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 2.2 AMD-APP (3361.0)
Vulkan: 1.2.182
Compiler: GCC 10.3.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910 + CUDA 10.2
File-System: ext4
Screen Resolution: 3440x1440

Entries that differ in later configurations (the export lists only the changed cells):
- Pcore + Gracemont Ecore: Processor: Intel Core i7-12700K @ 6.30GHz (12 Cores / 20 Threads)
- Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: Processor: Intel Core i7-12700K @ 6.50GHz (8 Cores / 16 Threads); Graphics: Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz); OpenGL: 4.6 Mesa 21.2.2 (LLVM 12.0.0); OpenCL: OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX; Compiler: GCC 11.1.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect" CFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect"
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"

Compiler Details
- Pcore AVX512, Pcore ONLY no AVX512, and Pcore + Gracemont Ecore configurations (identical GCC 10 build options): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Disk Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: NONE / errors=remount-ro,noatime,rw / Block Size: 4096

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0x15
- Thermald 2.4.3

Graphics Details
- GLAMOR
- BAR1 / Visible vRAM Size: 6128 MB

Python Details
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.9.5

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected
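The Environment Details above pass explicit -mavx512* options to GCC, which only helps if the cores that run the binary actually expose those instructions. The following is a minimal sketch, not part of the Phoronix Test Suite, of how one might check this on Linux by parsing the text of /proc/cpuinfo. The function names are hypothetical, and stripping underscores to match kernel flag spellings (for example avx512_vnni) against the GCC option names is an assumption made for illustration.

```python
# Sketch: list which AVX-512 features named in the CFLAGS are NOT advertised
# by the CPU, given the text of /proc/cpuinfo. Helper names are hypothetical.

# GCC option names from the Environment Details, without the leading "-m".
AVX512_GCC_FEATURES = [
    "avx512f", "avx512dq", "avx512ifma", "avx512cd", "avx512bw", "avx512vl",
    "avx512bf16", "avx512vbmi", "avx512vbmi2", "avx512vnni", "avx512bitalg",
    "avx512vpopcntdq", "avx512vp2intersect",
]


def cpu_flags(cpuinfo_text):
    """Return the feature flags from the first 'flags' line as a set."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def missing_avx512(cpuinfo_text, wanted=AVX512_GCC_FEATURES):
    """Return the wanted features that the CPU does not advertise.

    Assumption: kernel and GCC spellings differ only by underscores,
    so both sides are compared with underscores removed.
    """
    have = {flag.replace("_", "") for flag in cpu_flags(cpuinfo_text)}
    return [feature for feature in wanted if feature not in have]
```

On a Linux system this could be called as `missing_avx512(open("/proc/cpuinfo").read())`; an empty list would suggest every requested extension is advertised by the first listed core.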
HPC benchmark- POSSIBLE BAD DATA fftw: Float + SSE - 2D FFT Size 4096 cloverleaf: Lagrangian-Eulerian Hydrodynamics plaidml: No - Inference - ResNet 50 - CPU tnn: CPU - DenseNet onnx: yolov4 - CPU ecp-candle: P3B1 mnn: inception-v3 mnn: mobilenet-v1-1.0 mnn: MobileNetV2_224 mnn: SqueezeNetV1.0 mnn: resnet-v2-50 mnn: squeezenetv1.1 mnn: mobilenetV3 opencv: DNN - Deep Neural Network ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet onnx: shufflenet-v2-10 - CPU pyhpc: CPU - Numpy - 4194304 - Isoneutral Mixing tensorflow-lite: NASNet Mobile fftw: Stock - 2D FFT Size 4096 shoc: OpenCL - Max SP Flops onnx: fcn-resnet101-11 - CPU daphne: OpenMP - Points2Image numpy: ecp-candle: P3B2 pyhpc: CPU - PyTorch - 4194304 - Isoneutral Mixing cp2k: Fayalite-FIST plaidml: No - Inference - VGG16 - CPU pyhpc: CPU - JAX - 65536 - Isoneutral Mixing mlpack: scikit_qda tensorflow-lite: SqueezeNet onnx: super-resolution-10 - CPU mlpack: scikit_linearridgeregression pyhpc: CPU - PyTorch - 1048576 - Equation of State daphne: OpenCL - Points2Image tensorflow-lite: Inception V4 namd: ATPase Simulation - 327,506 Atoms ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet tensorflow-lite: Inception ResNet V2 pyhpc: CPU - TensorFlow - 1048576 - Equation of State shoc: OpenCL - S3D askap: 
tConvolve MT - Degridding askap: tConvolve MT - Gridding tensorflow-lite: Mobilenet Float pyhpc: CPU - Aesara - 4194304 - Isoneutral Mixing tensorflow-lite: Mobilenet Quant pyhpc: CPU - JAX - 4194304 - Isoneutral Mixing pyhpc: CPU - JAX - 262144 - Isoneutral Mixing himeno: Poisson Pressure Solver neat: mlpack: scikit_ica pyhpc: CPU - PyTorch - 4194304 - Equation of State pyhpc: CPU - Numba - 4194304 - Isoneutral Mixing pyhpc: CPU - TensorFlow - 65536 - Equation of State mt-dgemm: Sustained Floating-Point Rate minife: Small daphne: OpenMP - Euclidean Cluster pyhpc: CPU - Aesara - 65536 - Isoneutral Mixing kripke: pyhpc: CPU - Numpy - 4194304 - Equation of State daphne: OpenMP - NDT Mapping pyhpc: CPU - PyTorch - 65536 - Isoneutral Mixing deepspeech: CPU shoc: OpenCL - GEMM SGEMM_N askap: Hogbom Clean OpenMP tnn: CPU - MobileNet v2 mlpack: scikit_svm pyhpc: CPU - TensorFlow - 262144 - Equation of State pyhpc: CPU - Numba - 65536 - Equation of State pyhpc: CPU - Aesara - 4194304 - Equation of State pyhpc: CPU - Numpy - 1048576 - Equation of State pyhpc: CPU - JAX - 1048576 - Isoneutral Mixing pyhpc: CPU - Numpy - 65536 - Equation of State rbenchmark: pyhpc: CPU - Numba - 262144 - Equation of State pyhpc: CPU - Numba - 4194304 - Equation of State pyhpc: CPU - PyTorch - 65536 - Equation of State pyhpc: CPU - Numpy - 16384 - Isoneutral Mixing pyhpc: CPU - Numba - 65536 - Isoneutral Mixing pyhpc: CPU - JAX - 1048576 - Equation of State pyhpc: CPU - TensorFlow - 4194304 - Equation of State pyhpc: CPU - Numpy - 1048576 - Isoneutral Mixing pyhpc: CPU - PyTorch - 1048576 - Isoneutral Mixing pyhpc: CPU - Aesara - 1048576 - Isoneutral Mixing pyhpc: CPU - Aesara - 16384 - Isoneutral Mixing pyhpc: CPU - Numba - 16384 - Isoneutral Mixing rnnoise: tnn: CPU - SqueezeNet v1.1 pyhpc: CPU - Numpy - 262144 - Isoneutral Mixing pyhpc: CPU - Aesara - 262144 - Isoneutral Mixing pyhpc: CPU - Aesara - 262144 - Equation of State pyhpc: CPU - PyTorch - 262144 - Isoneutral Mixing pyhpc: CPU - 
PyTorch - 262144 - Equation of State pyhpc: CPU - JAX - 4194304 - Equation of State fftw: Float + SSE - 1D FFT Size 4096 pyhpc: CPU - Numba - 1048576 - Isoneutral Mixing pyhpc: CPU - Aesara - 16384 - Equation of State pyhpc: CPU - JAX - 262144 - Equation of State dolfyn: Computational Fluid Dynamics askap: tConvolve OpenMP - Degridding askap: tConvolve OpenMP - Gridding pyhpc: CPU - Numba - 262144 - Isoneutral Mixing shoc: OpenCL - Reduction ecp-candle: P1B2 pyhpc: CPU - PyTorch - 16384 - Isoneutral Mixing fftw: Stock - 1D FFT Size 4096 octave-benchmark: pyhpc: CPU - Aesara - 1048576 - Equation of State pyhpc: CPU - JAX - 16384 - Isoneutral Mixing pyhpc: CPU - Numba - 16384 - Equation of State mafft: Multiple Sequence Alignment - LSU RNA shoc: OpenCL - Triad pyhpc: CPU - Numpy - 65536 - Isoneutral Mixing fftw: Float + SSE - 2D FFT Size 32 pyhpc: CPU - Numpy - 262144 - Equation of State shoc: OpenCL - Bus Speed Readback shoc: OpenCL - Bus Speed Download pyhpc: CPU - Numba - 1048576 - Equation of State scikit-learn: pyhpc: CPU - Aesara - 65536 - Equation of State fftw: Stock - 1D FFT Size 32 fftw: Stock - 2D FFT Size 32 tnn: CPU - SqueezeNet v2 fftw: Float + SSE - 1D FFT Size 32 pyhpc: CPU - Numpy - 16384 - Equation of State shoc: OpenCL - FFT SP Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 40789 186.74 8.52 2719.820 469 794.258 23.277 1.749 1.892 3.590 16.694 2.264 1.113 66637 5.55 13.41 15.34 16.38 9.29 10.37 37.73 9.00 1.02 3.37 2.12 2.39 2.17 2.38 9.07 40688 1.861 126721 12945 1667232 74 32570.157710805 565.28 403.484 1.332 382.83 22.29 0.011 95.97 141138 4662 1.53 0.021 15525.664613677 2024680 1.26329 4.28 4.63 8.16 6.64 6.69 3.04 17.58 5.41 
1.69 7.72 3.40 2.68 4.02 3.19 6.86 1837950 0.020 12.2422 2004.44 1255.80 95569.3 1.299 99378.6 0.637 0.028 10224.564685 26.980 12.39 0.090 0.847 0.002 4.723792 5458.44 1615.65 0.017 76868693 1.327 1008.65 0.013 49.07625 284.989 253.548 224.710 6.95 0.005 0.002 0.175 0.222 0.134 0.013 0.1070 0.009 0.143 0.002 0.004 0.012 0.008 0.095 0.416 0.285 0.293 0.004 0.003 17.513 235.526 0.090 0.061 0.011 0.052 0.005 0.030 100155 0.202 0.001 0.001 11.158 2812.63 1484.83 0.043 21.2083 31.787 0.003 18379 5.166 0.044 0.002 0.001 7.915 15.6282 0.021 79668 0.053 31.3126 30.5438 0.035 4.883 0.003 22739 22796 45.898 30059 0.002 87.6173 42516 191.33 8.32 2854.854 460 918.197 25.231 2.054 2.217 4.060 18.461 2.559 1.237 91035 7.44 15.89 17.33 19.41 10.64 13.24 43.97 10.43 1.21 4.35 2.72 3.16 2.74 2.98 10.74 42897 2.021 154285 13568 1542874 68 25089.993604089 558.72 401.56 1.336 403.729 19.23 0.010 96.46 159253 4417 1.64 0.015 15404.277700861 2213513 1.14364 5.83 4.73 10.03 7.46 6.84 3.96 19.80 7.09 2.71 9.34 4.27 3.65 4.76 4.05 7.40 1801520 0.020 13.6588 1966.15 1231.73 93959.4 1.301 97487.4 0.644 0.027 9116.455509 26.677 12.02 0.069 0.845 0.001 4.909597 6321.61 1355.37 0.016 72234187 1.325 993.75 0.013 54.54775 254.597 243.796 262.514 7.19 0.004 0.002 0.178 0.222 0.134 0.013 0.1096 0.009 0.144 0.001 0.005 0.011 0.007 0.094 0.454 0.280 0.290 0.003 0.002 16.792 237.601 0.097 0.060 0.011 0.054 0.004 0.028 80667 0.201 0.001 0.001 11.042 3423.09 1826.95 0.043 17.9097 28.405 0.003 18279 5.108 0.044 0.002 0.001 8.013 18.3795 0.024 75886 0.052 38.4318 38.5258 0.036 5.069 0.003 23117 23477 46.588 30047 0.002 98.1365 43336 138.33 8.41 3987.715 520 887.14 26.687 2.613 2.565 4.640 22.622 2.865 1.261 52247 6.80 14.85 17.62 20.01 11.23 11.74 40.61 10.34 1.21 4.47 2.88 2.67 2.75 3.10 11.06 45034 2.131 89096.6 13494 1581543 80 26580.090189963 422.78 409.434 2.212 398.301 19.57 0.013 109.65 109893 4579 2.29 0.055 15539.073739801 1564193 0.94957 3.71 4.30 8.14 6.11 6.41 2.34 16.96 5.10 1.43 7.65 2.72 
2.07 3.64 2.61 6.72 1413363 0.02 13.9616 1970.57 1171.04 74706.9 1.299 78810.5 0.779 0.034 9209.748838 25.676 14.58 0.223 0.837 0.001 5.205018 5932.11 1439.63 0.016 76195000 1.343 821.38 0.031 65.95657 228.412 255.322 486.558 13.06 0.004 0.002 0.179 0.227 0.170 0.014 0.1097 0.009 0.144 0.004 0.005 0.011 0.009 0.093 0.459 0.499 0.290 0.004 0.002 16.524 230.459 0.101 0.061 0.011 0.111 0.013 0.037 78573 0.201 0.001 0.001 11.057 2873.34 1584.97 0.043 16.8136 28.693 0.009 17876 5.069 0.044 0.002 0.001 8.183 18.4348 0.024 75032 0.053 36.0449 35.1906 0.036 8.193 0.003 23240 23622 45.500 29780 0.002 101.122 43777 141.53 9.39 2623.161 468 790.264 21.017 1.532 1.616 3.165 14.786 2.045 0.963 7537 4.71 12.06 14.05 15.22 8.38 9.23 37.91 7.63 0.91 2.90 1.85 2.18 1.89 2.15 8.63 39909 1.847 130527 14069 1723333 74 33183.121427314 615.24 353.81 1.325 368.132 23.08 0.011 46.60 143667 4593 3.11 0.021 16397.283085518 2022477 1.18910 3.46 3.74 7.63 5.63 5.75 1.96 16.43 4.69 1.03 7.20 2.48 1.80 3.33 2.41 6.17 1783857 0.021 13.7011 2023.16 1256.30 93377.4 1.256 98208.8 0.635 0.028 9558.655773 26.165 36.43 0.088 0.815 0.002 4.965331 6218.41 1649.84 0.016 77469363 1.313 1099.43 0.013 48.78499 263.157 264.784 216.871 10.83 0.005 0.002 0.168 0.219 0.133 0.012 0.1038 0.008 0.137 0.002 0.005 0.011 0.008 0.093 0.409 0.281 0.285 0.004 0.002 16.103 225.758 0.089 0.060 0.01 0.052 0.005 0.029 102333 0.194 0.001 0.001 10.693 3550.08 1853.31 0.042 21.8802 26.149 0.003 18303 4.950 0.042 0.002 0.001 7.667 17.5019 0.021 83426 0.051 37.1909 38.2339 0.034 6.167 0.003 22747 22861 44.744 32202 0.002 99.8771 OpenBenchmarking.org
FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096
Mflops, More Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 43336 (SE +/- 522.29, N = 9); -march=native
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 40789 (SE +/- 293.32, N = 3); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 42516 (SE +/- 932.44, N = 7); -march=native
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 43777 (SE +/- 301.86, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
1. (CC) gcc options: -pthread -O3 -lm
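Several of the FFTW results above are close enough that the reported standard errors matter. As a rough sketch, assuming the runs are independent and that "SE" is the standard error of the mean, the gap between two configurations can be expressed in units of their combined standard error. The function name and the short labels are hypothetical, and with N as small as 3 this is only a coarse indication, not a formal significance test.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Return (mean_a - mean_b) divided by the combined standard error.

    Assumes independent runs; the combined SE is sqrt(se_a^2 + se_b^2).
    """
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_a - mean_b) / combined_se


# FFTW 3.3.6, Float + SSE, 2D FFT Size 4096 (Mflops, more is better).
# Values are (mean, SE) copied from the result listing; labels are shorthand.
sapphire_rapids_flags = (43777, 301.86)
pcore_plus_ecore = (43336, 522.29)
pcore_avx512 = (40789, 293.32)

small_gap = difference_in_se_units(*sapphire_rapids_flags, *pcore_plus_ecore)
large_gap = difference_in_se_units(*sapphire_rapids_flags, *pcore_avx512)
print(f"SPR flags vs P+E cores:   {small_gap:.2f} combined SEs")
print(f"SPR flags vs AVX512 flags: {large_gap:.2f} combined SEs")
```

A gap below roughly two combined standard errors is hard to distinguish from run-to-run noise, which is worth keeping in mind given the "POSSIBLE BAD DATA" label on this result file.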
CloverLeaf - Lagrangian-Eulerian Hydrodynamics
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 138.33 (SE +/- 0.42, N = 3)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 186.74 (SE +/- 2.66, N = 12)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 191.33 (SE +/- 18.74, N = 9)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 141.53 (SE +/- 1.45, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
PlaidML - FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU
FPS, More Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.41 (SE +/- 0.25, N = 9)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.52 (SE +/- 0.01, N = 3)
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.32 (SE +/- 0.04, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.39 (SE +/- 0.03, N = 3)
TNN 0.3 - Target: CPU - Model: DenseNet
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3987.72 (SE +/- 299.40, N = 9); -march=native; MIN: 2619.64 / MAX: 5569.92
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2719.82 (SE +/- 2.71, N = 3); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 2625.5 / MAX: 3184.86
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2854.85 (SE +/- 40.36, N = 3); -march=native; MIN: 2660.43 / MAX: 3772.17
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2623.16 (SE +/- 3.06, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 2564.18 / MAX: 2757.86
1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
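The TNN DenseNet numbers above show very different run-to-run spread between configurations. One simple way to surface that is the relative standard error, SE divided by the mean. The sketch below applies it to the four results; the 5% cut-off and the short configuration labels are hypothetical choices made for illustration, not something the Phoronix Test Suite defines.

```python
def relative_se_percent(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


# TNN 0.3, CPU, DenseNet (ms, fewer is better): label -> (mean, SE).
tnn_densenet = {
    "Pcore + Ecore": (3987.72, 299.40),
    "Pcore AVX512": (2719.82, 2.71),
    "Pcore ONLY no AVX512": (2854.85, 40.36),
    "Sapphire Rapids flags": (2623.16, 3.06),
}

NOISE_THRESHOLD_PERCENT = 5.0  # hypothetical cut-off, chosen for illustration

noisy = [
    label
    for label, (mean, se) in tnn_densenet.items()
    if relative_se_percent(mean, se) > NOISE_THRESHOLD_PERCENT
]

for label, (mean, se) in tnn_densenet.items():
    print(f"{label}: {relative_se_percent(mean, se):.2f}% relative SE")
print("Above threshold:", noisy)
```

A configuration that lands above the cut-off is a candidate for re-running before its mean is compared with the others.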
ONNX Runtime 1.10 - Model: yolov4 - Device: CPU
Inferences Per Minute, More Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 520 (SE +/- 4.64, N = 12)
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 469 (SE +/- 0.44, N = 3); -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 460 (SE +/- 1.48, N = 3)
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 468 (SE +/- 5.97, N = 12); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ECP-CANDLE 0.4 - Benchmark: P3B1
Seconds, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 887.14
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 794.26
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 918.20
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 790.26
Mobile Neural Network 1.2 - Model: inception-v3
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.69 (SE +/- 0.37, N = 12); -march=native; MIN: 22.92 / MAX: 195.79
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23.28 (SE +/- 0.05, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 20.16 / MAX: 153.62
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 25.23 (SE +/- 0.55, N = 15); -march=native; MIN: 20.75 / MAX: 242.75
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 21.02 (SE +/- 0.04, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 19.1 / MAX: 37.33
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: mobilenet-v1-1.0
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.613 (SE +/- 0.016, N = 12); -march=native; MIN: 2.48 / MAX: 16.01
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.749 (SE +/- 0.016, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 1.51 / MAX: 60.38
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.054 (SE +/- 0.066, N = 15); -march=native; MIN: 1.55 / MAX: 55.56
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.532 (SE +/- 0.007, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 1.46 / MAX: 12.02
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: MobileNetV2_224
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.565 (SE +/- 0.050, N = 12); -march=native; MIN: 2.04 / MAX: 63.68
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.892 (SE +/- 0.021, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 1.59 / MAX: 22.25
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.217 (SE +/- 0.076, N = 15); -march=native; MIN: 1.65 / MAX: 59.17
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.616 (SE +/- 0.004, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 1.54 / MAX: 7.39
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: SqueezeNetV1.0
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.640 (SE +/- 0.105, N = 12); -march=native; MIN: 3.45 / MAX: 47.3
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.590 (SE +/- 0.051, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 2.83 / MAX: 78.46
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.060 (SE +/- 0.128, N = 15); -march=native; MIN: 2.94 / MAX: 82.76
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.165 (SE +/- 0.251, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 2.84 / MAX: 8.57
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: resnet-v2-50
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 22.62 (SE +/- 0.57, N = 12); -march=native; MIN: 19.41 / MAX: 83.45
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.69 (SE +/- 0.06, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 14.63 / MAX: 104.64
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.46 (SE +/- 0.50, N = 15); -march=native; MIN: 14.88 / MAX: 173.91
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.79 (SE +/- 0.05, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 14.1 / MAX: 42.25
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: squeezenetv1.1
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.865 (SE +/- 0.110, N = 12); -march=native; MIN: 2.23 / MAX: 52.16
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.264 (SE +/- 0.022, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 1.95 / MAX: 45.84
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.559 (SE +/- 0.078, N = 15); -march=native; MIN: 1.99 / MAX: 49.19
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.045 (SE +/- 0.010, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 1.99 / MAX: 8.27
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network 1.2 - Model: mobilenetV3
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.261 (SE +/- 0.017, N = 12); -march=native; MIN: 1.08 / MAX: 48.36
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.113 (SE +/- 0.016, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect; MIN: 0.93 / MAX: 16.19
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.237 (SE +/- 0.037, N = 15); -march=native; MIN: 0.95 / MAX: 40.28
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.963 (SE +/- 0.001, N = 3); -mno-amx-tile -mno-amx-int8 -mno-amx-bf16; MIN: 0.93 / MAX: 4.84
1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
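The seven Mobile Neural Network models above can be summarized with a geometric mean of per-model time ratios, a common way to combine benchmark results that have different scales. The sketch below compares the "Pcore ONLY no AVX512" build with the "SAPPHIRE RAPIDS" flag build using the mean times listed per model. The variable names are hypothetical, and the summary ignores the standard errors and the differing N, so it should be read as indicative only.

```python
import math


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    return math.exp(math.fsum(math.log(v) for v in values) / len(values))


# Mobile Neural Network 1.2 mean times in ms (fewer is better), as
# (Pcore ONLY no AVX512, Sapphire Rapids flags) per model.
mnn_ms = {
    "inception-v3": (25.23, 21.02),
    "mobilenet-v1-1.0": (2.054, 1.532),
    "MobileNetV2_224": (2.217, 1.616),
    "SqueezeNetV1.0": (4.060, 3.165),
    "resnet-v2-50": (18.46, 14.79),
    "squeezenetv1.1": (2.559, 2.045),
    "mobilenetV3": (1.237, 0.963),
}

# baseline time / candidate time: values above 1 mean the candidate is faster.
speedups = {model: base / cand for model, (base, cand) in mnn_ms.items()}
overall = geometric_mean(speedups.values())

for model, ratio in speedups.items():
    print(f"{model}: {ratio:.3f}x")
print(f"Geometric mean speedup: {overall:.3f}x")
```

Using ratios rather than raw milliseconds keeps the two larger models (inception-v3 and resnet-v2-50) from dominating the summary.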
OpenCV 4.5.4 - Test: DNN - Deep Neural Network
ms, Fewer Is Better
- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 52247 (SE +/- 713.19, N = 15); -march=native -ldl -lm -lpthread -lrt
- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 66637 (SE +/- 1166.72, N = 15); -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -ldl -lm -lpthread -lrt
- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 91035 (SE +/- 3031.05, N = 12); -march=native -ldl -lm -lpthread -lrt
- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7537 (SE +/- 245.00, N = 15)
1. (CXX) g++ options: -O3 -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -fvisibility=hidden
NCNN 20210720 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 6.80 (SE +/- 0.20, N = 15; MIN: 6.23 / MAX: 15.86; flags: -march=native)
  Golden Cove Pcore AVX512: 5.55 (SE +/- 0.01, N = 15; MIN: 4.92 / MAX: 31.26; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 7.44 (SE +/- 0.55, N = 15; MIN: 5.34 / MAX: 398.2; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 4.71 (SE +/- 0.01, N = 3; MIN: 4.58 / MAX: 6.32; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 14.85 (SE +/- 0.52, N = 15; MIN: 13.57 / MAX: 63.1; flags: -march=native)
  Golden Cove Pcore AVX512: 13.41 (SE +/- 0.02, N = 15; MIN: 12.41 / MAX: 45.1; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 15.89 (SE +/- 0.72, N = 15; MIN: 12.62 / MAX: 489.07; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 12.06 (SE +/- 0.02, N = 3; MIN: 11.82 / MAX: 17.26; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 17.62 (SE +/- 0.44, N = 15; MIN: 15.29 / MAX: 370.58; flags: -march=native)
  Golden Cove Pcore AVX512: 15.34 (SE +/- 0.13, N = 15; MIN: 13.6 / MAX: 60.85; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 17.33 (SE +/- 0.43, N = 15; MIN: 14.49 / MAX: 220.15; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 14.05 (SE +/- 0.20, N = 3; MIN: 13.36 / MAX: 28.81; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 20.01 (SE +/- 0.32, N = 15; MIN: 18.29 / MAX: 72.6; flags: -march=native)
  Golden Cove Pcore AVX512: 16.38 (SE +/- 0.11, N = 15; MIN: 14.75 / MAX: 169.63; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 19.41 (SE +/- 0.58, N = 15; MIN: 15.46 / MAX: 404.54; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 15.22 (SE +/- 0.23, N = 3; MIN: 14.63 / MAX: 21.4; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 11.23 (SE +/- 0.06, N = 15; MIN: 10.78 / MAX: 35.58; flags: -march=native)
  Golden Cove Pcore AVX512: 9.29 (SE +/- 0.05, N = 15; MIN: 8.58 / MAX: 188.45; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 10.64 (SE +/- 0.31, N = 15; MIN: 8.84 / MAX: 348.67; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 8.38 (SE +/- 0.01, N = 3; MIN: 8.27 / MAX: 14.31; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 11.74 (SE +/- 0.21, N = 15; MIN: 10.66 / MAX: 43.48; flags: -march=native)
  Golden Cove Pcore AVX512: 10.37 (SE +/- 0.05, N = 15; MIN: 9.43 / MAX: 107.77; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 13.24 (SE +/- 0.65, N = 15; MIN: 9.67 / MAX: 377.98; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 9.23 (SE +/- 0.03, N = 3; MIN: 9.03 / MAX: 29.68; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 40.61 (SE +/- 0.40, N = 15; MIN: 38.83 / MAX: 98.35; flags: -march=native)
  Golden Cove Pcore AVX512: 37.73 (SE +/- 0.15, N = 15; MIN: 35.14 / MAX: 288.51; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 43.97 (SE +/- 1.09, N = 15; MIN: 36.31 / MAX: 485.44; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 37.91 (SE +/- 0.31, N = 3; MIN: 37.18 / MAX: 80.66; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 10.34 (SE +/- 0.27, N = 15; MIN: 9.1 / MAX: 35.14; flags: -march=native)
  Golden Cove Pcore AVX512: 9.00 (SE +/- 0.15, N = 15; MIN: 7.77 / MAX: 212.02; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 10.43 (SE +/- 0.39, N = 15; MIN: 8.03 / MAX: 223.75; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 7.63 (SE +/- 0.04, N = 3; MIN: 7.41 / MAX: 17.97; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 1.21 (SE +/- 0.04, N = 15; MIN: 1.05 / MAX: 6.94; flags: -march=native)
  Golden Cove Pcore AVX512: 1.02 (SE +/- 0.00, N = 15; MIN: 0.93 / MAX: 3.99; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 1.21 (SE +/- 0.04, N = 15; MIN: 0.93 / MAX: 35.68; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.91 (SE +/- 0.01, N = 3; MIN: 0.88 / MAX: 4.07; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 4.47 (SE +/- 0.12, N = 15; MIN: 3.96 / MAX: 34.65; flags: -march=native)
  Golden Cove Pcore AVX512: 3.37 (SE +/- 0.02, N = 15; MIN: 2.95 / MAX: 16.3; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 4.35 (SE +/- 0.20, N = 15; MIN: 3.03 / MAX: 261.53; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 2.90 (SE +/- 0.01, N = 3; MIN: 2.83 / MAX: 3.97; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.88 (SE +/- 0.12, N = 15; MIN: 2.43 / MAX: 186.85; flags: -march=native)
  Golden Cove Pcore AVX512: 2.12 (SE +/- 0.01, N = 14; MIN: 1.86 / MAX: 5.63; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 2.72 (SE +/- 0.12, N = 14; MIN: 1.95 / MAX: 218.37; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1.85 (SE +/- 0.01, N = 3; MIN: 1.79 / MAX: 2.88; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.67 (SE +/- 0.07, N = 14; MIN: 2.45 / MAX: 10.63; flags: -march=native)
  Golden Cove Pcore AVX512: 2.39 (SE +/- 0.01, N = 14; MIN: 2.14 / MAX: 6.73; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 3.16 (SE +/- 0.31, N = 15; MIN: 2.17 / MAX: 324.42; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 2.18 (SE +/- 0.00, N = 3; MIN: 2.1 / MAX: 6.91; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.75 (SE +/- 0.09, N = 15; MIN: 2.43 / MAX: 270.92; flags: -march=native)
  Golden Cove Pcore AVX512: 2.17 (SE +/- 0.01, N = 15; MIN: 1.87 / MAX: 11.1; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 2.74 (SE +/- 0.12, N = 15; MIN: 1.97 / MAX: 173.37; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1.89 (SE +/- 0.01, N = 3; MIN: 1.83 / MAX: 2.92; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 3.10 (SE +/- 0.15, N = 15; MIN: 2.73 / MAX: 410; flags: -march=native)
  Golden Cove Pcore AVX512: 2.38 (SE +/- 0.01, N = 15; MIN: 2.09 / MAX: 19.16; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 2.98 (SE +/- 0.12, N = 15; MIN: 2.18 / MAX: 280.99; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 2.15 (SE +/- 0.01, N = 3; MIN: 2.08 / MAX: 5.56; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 11.06 (SE +/- 0.31, N = 15; MIN: 9.8 / MAX: 493.36; flags: -march=native)
  Golden Cove Pcore AVX512: 9.07 (SE +/- 0.12, N = 15; MIN: 7.88 / MAX: 149.11; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 10.74 (SE +/- 0.34, N = 15; MIN: 8.38 / MAX: 272.88; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 8.63 (SE +/- 0.10, N = 3; MIN: 8.28 / MAX: 13.76; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
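The fifteen NCNN CPU results above can be condensed into one relative figure per configuration. The sketch below is only an illustration and not part of the Phoronix Test Suite output: it assumes the four values in each result are listed in the same order as the four configuration names, and the names `NCNN_CPU_MS`, `LABELS`, and `geomean_ratio` are hypothetical. It computes the geometric mean of each configuration's latency relative to the Pcore + Ecore run, so a value below 1.0 means lower (better) latency on average.

```python
import math

# NCNN 20210720 CPU latencies in ms (fewer is better), copied from the results above.
# Assumed column order: Pcore + Ecore, Pcore AVX512, Pcore ONLY no AVX512,
# Pcore AVX512 with the Sapphire Rapids compiler flags.
NCNN_CPU_MS = {
    "regnety_400m": (6.80, 5.55, 7.44, 4.71),
    "squeezenet_ssd": (14.85, 13.41, 15.89, 12.06),
    "yolov4-tiny": (17.62, 15.34, 17.33, 14.05),
    "resnet50": (20.01, 16.38, 19.41, 15.22),
    "alexnet": (11.23, 9.29, 10.64, 8.38),
    "resnet18": (11.74, 10.37, 13.24, 9.23),
    "vgg16": (40.61, 37.73, 43.97, 37.91),
    "googlenet": (10.34, 9.00, 10.43, 7.63),
    "blazeface": (1.21, 1.02, 1.21, 0.91),
    "efficientnet-b0": (4.47, 3.37, 4.35, 2.90),
    "mnasnet": (2.88, 2.12, 2.72, 1.85),
    "shufflenet-v2": (2.67, 2.39, 3.16, 2.18),
    "mobilenet-v3": (2.75, 2.17, 2.74, 1.89),
    "mobilenet-v2": (3.10, 2.38, 2.98, 2.15),
    "mobilenet": (11.06, 9.07, 10.74, 8.63),
}

LABELS = (
    "Pcore + Ecore",
    "Pcore AVX512",
    "Pcore ONLY no AVX512",
    "Pcore AVX512 Sapphire Rapids flags",
)


def geomean_ratio(results, column, baseline=0):
    """Geometric mean of results[model][column] / results[model][baseline]."""
    logs = [math.log(row[column] / row[baseline]) for row in results.values()]
    return math.exp(sum(logs) / len(logs))


for index, label in enumerate(LABELS):
    print(f"{label}: {geomean_ratio(NCNN_CPU_MS, index):.3f}x of baseline latency")
```

A geometric mean is used rather than an arithmetic one so that the slow models (vgg16, resnet50) do not dominate the sub-millisecond ones. Given the "POSSIBLE BAD DATA" label on this result and the large MAX outliers, the summary should be read as indicative only.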
ONNX Runtime 1.10 - Model: shufflenet-v2-10 - Device: CPU (Inferences Per Minute, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 45034 (SE +/- 46.94, N = 3)
  Golden Cove Pcore AVX512: 40688 (SE +/- 181.26, N = 3)
  Golden Cove Pcore ONLY no AVX512: 42897 (SE +/- 105.96, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 39909 (SE +/- 620.08, N = 11)
  Additional flags reported: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.131 (SE +/- 0.100, N = 15)
  Golden Cove Pcore AVX512: 1.861 (SE +/- 0.001, N = 3)
  Golden Cove Pcore ONLY no AVX512: 2.021 (SE +/- 0.004, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1.847 (SE +/- 0.001, N = 3)
TensorFlow Lite 2020-08-23 - Model: NASNet Mobile (Microseconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 89096.6 (SE +/- 609.47, N = 3)
  Golden Cove Pcore AVX512: 126721.0 (SE +/- 95.70, N = 3)
  Golden Cove Pcore ONLY no AVX512: 154285.0 (SE +/- 6529.38, N = 12)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 130527.0 (SE +/- 2986.13, N = 15)
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 4096 (Mflops, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 13494 (SE +/- 24.06, N = 3; flags: -march=native)
  Golden Cove Pcore AVX512: 12945 (SE +/- 108.98, N = 8; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 13568 (SE +/- 25.36, N = 3; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 14069 (SE +/- 54.22, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CC) gcc options: -pthread -O3 -lm
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Max SP Flops (GFLOPS, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 1581543 (SE +/- 38770.78, N = 15; flags: -march=native)
  Golden Cove Pcore AVX512: 1667232 (SE +/- 76015.19, N = 12; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 1542874 (SE +/- 87626.15, N = 15; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1723333 (SE +/- 44470.87, N = 12; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
ONNX Runtime 1.10 - Model: fcn-resnet101-11 - Device: CPU (Inferences Per Minute, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 80 (SE +/- 0.67, N = 3)
  Golden Cove Pcore AVX512: 74 (SE +/- 0.00, N = 3)
  Golden Cove Pcore ONLY no AVX512: 68 (SE +/- 0.75, N = 5)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 74 (SE +/- 0.17, N = 3)
  Additional flags reported: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Points2Image (Test Cases Per Minute, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 26580.09 (SE +/- 100.69, N = 3)
  Golden Cove Pcore AVX512: 32570.16 (SE +/- 274.39, N = 3)
  Golden Cove Pcore ONLY no AVX512: 25089.99 (SE +/- 1308.12, N = 13)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 33183.12 (SE +/- 638.53, N = 12)
  (CXX) g++ options: -O3 -std=c++11 -fopenmp
Numpy Benchmark (Score, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 422.78 (SE +/- 2.53, N = 3)
  Golden Cove Pcore AVX512: 565.28 (SE +/- 1.21, N = 3)
  Golden Cove Pcore ONLY no AVX512: 558.72 (SE +/- 2.34, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 615.24 (SE +/- 1.67, N = 3)
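Each result carries a standard error (SE), which gives a rough way to judge whether two configurations really differ. The sketch below is a hedged illustration, not something the Phoronix Test Suite computes: the helper name `se_separation` is hypothetical, it treats the two runs as independent, and with N = 3 the figure is a rule of thumb rather than a formal test. It divides the difference between two means by the combined standard error, using the Numpy Benchmark scores above and assuming the values are listed in the same order as the configuration names.

```python
import math


def se_separation(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means in units of their combined standard error."""
    combined = math.sqrt(se_a ** 2 + se_b ** 2)
    if combined == 0.0:
        # Both SEs were reported as zero: any difference is unbounded in SE units.
        return math.inf if mean_a != mean_b else 0.0
    return abs(mean_a - mean_b) / combined


# Numpy Benchmark scores (more is better) as (mean, SE), copied from the result above.
pcore_ecore = (422.78, 2.53)
pcore_avx512 = (565.28, 1.21)
pcore_no_avx512 = (558.72, 2.34)

# Pcore + Ecore versus Pcore AVX512: a large gap relative to the noise.
print(se_separation(*pcore_ecore, *pcore_avx512))

# Pcore AVX512 versus Pcore ONLY no AVX512: a much smaller gap.
print(se_separation(*pcore_avx512, *pcore_no_avx512))
```

A separation of only a few combined standard errors, as in the second comparison, is a reason to rerun before drawing conclusions, especially for a result set already flagged as possibly bad data.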
ECP-CANDLE 0.4 - Benchmark: P3B2 (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 409.43
  Golden Cove Pcore AVX512: 403.48
  Golden Cove Pcore ONLY no AVX512: 401.56
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 353.81
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.212 (SE +/- 0.018, N = 9)
  Golden Cove Pcore AVX512: 1.332 (SE +/- 0.001, N = 3)
  Golden Cove Pcore ONLY no AVX512: 1.336 (SE +/- 0.016, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1.325 (SE +/- 0.005, N = 3)
CP2K Molecular Dynamics 8.2 - Input: Fayalite-FIST (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 398.30
  Golden Cove Pcore AVX512: 382.83
  Golden Cove Pcore ONLY no AVX512: 403.73
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 368.13
PlaidML - FP16: No - Mode: Inference - Network: VGG16 - Device: CPU (FPS, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 19.57 (SE +/- 0.24, N = 4)
  Golden Cove Pcore AVX512: 22.29 (SE +/- 0.07, N = 3)
  Golden Cove Pcore ONLY no AVX512: 19.23 (SE +/- 0.22, N = 4)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 23.08 (SE +/- 0.08, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 65536 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 0.013 (SE +/- 0.001, N = 12)
  Golden Cove Pcore AVX512: 0.011 (SE +/- 0.000, N = 15)
  Golden Cove Pcore ONLY no AVX512: 0.010 (SE +/- 0.000, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.011 (SE +/- 0.000, N = 15)
Mlpack Benchmark - Benchmark: scikit_qda (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 109.65 (SE +/- 0.88, N = 3)
  Golden Cove Pcore AVX512: 95.97 (SE +/- 0.33, N = 3)
  Golden Cove Pcore ONLY no AVX512: 96.46 (SE +/- 0.15, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 46.60 (SE +/- 0.12, N = 3)
TensorFlow Lite 2020-08-23 - Model: SqueezeNet (Microseconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 109893 (SE +/- 195.44, N = 3)
  Golden Cove Pcore AVX512: 141138 (SE +/- 86.06, N = 3)
  Golden Cove Pcore ONLY no AVX512: 159253 (SE +/- 701.15, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 143667 (SE +/- 1142.56, N = 15)
ONNX Runtime 1.10 - Model: super-resolution-10 - Device: CPU (Inferences Per Minute, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 4579 (SE +/- 7.52, N = 3)
  Golden Cove Pcore AVX512: 4662 (SE +/- 24.67, N = 3)
  Golden Cove Pcore ONLY no AVX512: 4417 (SE +/- 5.07, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 4593 (SE +/- 10.54, N = 3)
  Additional flags reported: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
Mlpack Benchmark - Benchmark: scikit_linearridgeregression (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 2.29 (SE +/- 0.02, N = 3)
  Golden Cove Pcore AVX512: 1.53 (SE +/- 0.02, N = 15)
  Golden Cove Pcore ONLY no AVX512: 1.64 (SE +/- 0.02, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 3.11 (SE +/- 0.03, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 0.055 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.021 (SE +/- 0.000, N = 15)
  Golden Cove Pcore ONLY no AVX512: 0.015 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.021 (SE +/- 0.000, N = 15)
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenCL - Kernel: Points2Image (Test Cases Per Minute, More Is Better)
  Golden Cove Pcore + Gracemont Ecore: 15539.07 (SE +/- 375.06, N = 12)
  Golden Cove Pcore AVX512: 15525.66 (SE +/- 177.47, N = 3)
  Golden Cove Pcore ONLY no AVX512: 15404.28 (SE +/- 137.15, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 16397.28 (SE +/- 145.81, N = 3)
  (CXX) g++ options: -O3 -std=c++11 -fopenmp
TensorFlow Lite 2020-08-23 - Model: Inception V4 (Microseconds, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 1564193 (SE +/- 2696.20, N = 3)
  Golden Cove Pcore AVX512: 2024680 (SE +/- 1345.60, N = 3)
  Golden Cove Pcore ONLY no AVX512: 2213513 (SE +/- 8085.58, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 2022477 (SE +/- 10245.31, N = 3)
NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 0.94957 (SE +/- 0.00925, N = 3)
  Golden Cove Pcore AVX512: 1.26329 (SE +/- 0.01507, N = 3)
  Golden Cove Pcore ONLY no AVX512: 1.14364 (SE +/- 0.00123, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1.18910 (SE +/- 0.00934, N = 10)
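NAMD reports days/ns, where fewer is better; many readers find the reciprocal, ns/day, easier to compare because higher is better. The short sketch below performs that conversion on the four values above. It is illustrative only: the names `NAMD_DAYS_PER_NS` and `to_ns_per_day` are hypothetical, and the values are assumed to be listed in the same order as the configuration names.

```python
# NAMD 2.14 ATPase results in days/ns (fewer is better), copied from the result above.
NAMD_DAYS_PER_NS = {
    "Golden Cove Pcore + Gracemont Ecore": 0.94957,
    "Golden Cove Pcore AVX512": 1.26329,
    "Golden Cove Pcore ONLY no AVX512": 1.14364,
    "Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-": 1.18910,
}


def to_ns_per_day(days_per_ns):
    """Convert days per simulated nanosecond into simulated nanoseconds per day."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


NAMD_NS_PER_DAY = {name: to_ns_per_day(value) for name, value in NAMD_DAYS_PER_NS.items()}

for name, value in NAMD_NS_PER_DAY.items():
    print(f"{name}: {value:.3f} ns/day")
```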
NCNN 20210720 - Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
  Golden Cove Pcore + Gracemont Ecore: 3.71 (SE +/- 0.05, N = 9; MIN: 3.44 / MAX: 27.73; flags: -march=native)
  Golden Cove Pcore AVX512: 4.28 (SE +/- 0.09, N = 15; MIN: 3.49 / MAX: 28.51; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Golden Cove Pcore ONLY no AVX512: 5.83 (SE +/- 1.34, N = 3; MIN: 3.47 / MAX: 39.09; flags: -march=native)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 3.46 (SE +/- 0.02, N = 3; MIN: 3.39 / MAX: 9.61; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.30 (SE +/- 0.06, N = 9; MIN: 3.76 / MAX: 18.02; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.63 (SE +/- 0.04, N = 15; MIN: 3.77 / MAX: 53.14; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.73 (SE +/- 0.35, N = 3; MIN: 3.77 / MAX: 44.23; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.74 (SE +/- 0.01, N = 3; MIN: 3.67 / MAX: 6.02; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.14 (SE +/- 0.02, N = 9; MIN: 7.82 / MAX: 19.98; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.16 (SE +/- 0.03, N = 15; MIN: 7.45 / MAX: 57.64; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10.03 (SE +/- 1.16, N = 3; MIN: 7.85 / MAX: 80.22; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.63 (SE +/- 0.03, N = 3; MIN: 7.42 / MAX: 9.29; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.11 (SE +/- 0.04, N = 9; MIN: 5.6 / MAX: 31.19; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.64 (SE +/- 0.08, N = 15; MIN: 5.66 / MAX: 49.12; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.46 (SE +/- 1.23, N = 3; MIN: 5.65 / MAX: 33.99; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.63 (SE +/- 0.01, N = 3; MIN: 5.55 / MAX: 11.35; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.41 (SE +/- 0.14, N = 9; MIN: 5.54 / MAX: 27.79; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.69 (SE +/- 0.06, N = 15; MIN: 5.55 / MAX: 48.95; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.84 (SE +/- 0.55, N = 3; MIN: 5.56 / MAX: 36.9; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.75 (SE +/- 0.01, N = 3; MIN: 5.49 / MAX: 12.08; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.34 (SE +/- 0.09, N = 7; MIN: 1.95 / MAX: 34.31; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.04 (SE +/- 0.05, N = 15; MIN: 2.03 / MAX: 39.18; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.96 (SE +/- 1.09, N = 3; MIN: 2.04 / MAX: 34.37; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.96 (SE +/- 0.04, N = 3; MIN: 1.89 / MAX: 9.21; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.96 (SE +/- 0.08, N = 9; MIN: 15.75 / MAX: 62.69; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.58 (SE +/- 0.12, N = 15; MIN: 15.73 / MAX: 64.58; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 19.80 (SE +/- 2.77, N = 3; MIN: 15.66 / MAX: 57.67; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 16.43 (SE +/- 0.07, N = 3; MIN: 15.67 / MAX: 43.35; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.10 (SE +/- 0.06, N = 9; MIN: 4.68 / MAX: 29.19; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.41 (SE +/- 0.07, N = 15; MIN: 4.74 / MAX: 36.71; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.09 (SE +/- 1.19, N = 3; MIN: 4.73 / MAX: 34.54; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.69 (SE +/- 0.01, N = 3; MIN: 4.63 / MAX: 7.03; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.43 (SE +/- 0.02, N = 9; MIN: 1.17 / MAX: 22.73; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.69 (SE +/- 0.03, N = 15; MIN: 1.29 / MAX: 28.85; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.71 (SE +/- 0.30, N = 3; MIN: 1.3 / MAX: 55.08; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.03 (SE +/- 0.04, N = 3; MIN: 0.85 / MAX: 22.57; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.65 (SE +/- 0.05, N = 9; MIN: 7.2 / MAX: 28.56; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.72 (SE +/- 0.04, N = 15; MIN: 7.18 / MAX: 34.33; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9.34 (SE +/- 1.47, N = 3; MIN: 7.17 / MAX: 38.24; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.20 (SE +/- 0.01, N = 3; MIN: 7.08 / MAX: 11.54; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.72 (SE +/- 0.06, N = 9; MIN: 2.46 / MAX: 25.53; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.40 (SE +/- 0.09, N = 15; MIN: 2.5 / MAX: 33.07; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.27 (SE +/- 1.10, N = 3; MIN: 2.5 / MAX: 32.54; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.48 (SE +/- 0.01, N = 3; MIN: 2.4 / MAX: 6.71; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.07 (SE +/- 0.02, N = 9; MIN: 1.82 / MAX: 24.55; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.68 (SE +/- 0.04, N = 15; MIN: 1.86 / MAX: 25.24; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.65 (SE +/- 0.97, N = 3; MIN: 1.88 / MAX: 35.3; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.80 (SE +/- 0.00, N = 3; MIN: 1.77 / MAX: 3.92; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.64 (SE +/- 0.05, N = 9; MIN: 3.28 / MAX: 24.44; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.02 (SE +/- 0.05, N = 15; MIN: 3.28 / MAX: 27.17; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.76 (SE +/- 0.44, N = 3; MIN: 3.33 / MAX: 28.56; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.33 (SE +/- 0.05, N = 3; MIN: 3.22 / MAX: 15.12; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.61 (SE +/- 0.04, N = 9; MIN: 2.38 / MAX: 19.16; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 3.19 (SE +/- 0.04, N = 15; MIN: 2.43 / MAX: 25.75; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.05 (SE +/- 0.50, N = 3; MIN: 2.44 / MAX: 30.19; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2.41 (SE +/- 0.01, N = 3; MIN: 2.33 / MAX: 6.88; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN 20210720 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.72 (SE +/- 0.06, N = 9; MIN: 6.36 / MAX: 35.89; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.86 (SE +/- 0.07, N = 15; MIN: 6.23 / MAX: 45.95; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.40 (SE +/- 0.10, N = 3; MIN: 6.41 / MAX: 52.12; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.17 (SE +/- 0.05, N = 3; MIN: 5.85 / MAX: 8.17; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
TensorFlow Lite 2020-08-23 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1413363 (SE +/- 972.87, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1837950 (SE +/- 2113.91, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1801520 (SE +/- 380.18, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1783857 (SE +/- 2059.01, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 4)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.020 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 15)
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: S3D (GFLOPS, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.96 (SE +/- 0.12, N = 15; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.24 (SE +/- 0.18, N = 15; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.66 (SE +/- 0.12, N = 15; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 13.70 (SE +/- 0.11, N = 10; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
ASKAP 1.0 - Test: tConvolve MT - Degridding (Million Grid Points Per Second, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1970.57 (SE +/- 1.70, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 2004.44 (SE +/- 2.06, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1966.15 (SE +/- 15.85, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 2023.16 (SE +/- 0.85, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP 1.0 - Test: tConvolve MT - Gridding (Million Grid Points Per Second, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1171.04 (SE +/- 0.43, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1255.80 (SE +/- 0.45, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1231.73 (SE +/- 2.99, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1256.30 (SE +/- 0.74, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
TensorFlow Lite 2020-08-23 - Model: Mobilenet Float (Microseconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 74706.9 (SE +/- 1007.74, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 95569.3 (SE +/- 70.47, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 93959.4 (SE +/- 153.26, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 93377.4 (SE +/- 1032.89, N = 5)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.299 (SE +/- 0.004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.299 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.301 (SE +/- 0.003, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.256 (SE +/- 0.003, N = 3)
TensorFlow Lite 2020-08-23 - Model: Mobilenet Quant (Microseconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 78810.5 (SE +/- 254.55, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 99378.6 (SE +/- 477.30, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 97487.4 (SE +/- 147.61, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 98208.8 (SE +/- 995.15, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.779 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.637 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.644 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.635 (SE +/- 0.001, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.034 (SE +/- 0.001, N = 12)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.028 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.027 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.028 (SE +/- 0.000, N = 15)
Himeno Benchmark 3.0 - Poisson Pressure Solver (MFLOPS, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 9209.75 (SE +/- 67.32, N = 3; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 10224.56 (SE +/- 6.21, N = 3; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 9116.46 (SE +/- 32.30, N = 3; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 9558.66 (SE +/- 5.76, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CC) gcc options: -O3 -mavx2
Nebular Empirical Analysis Tool 2.3 (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 25.68 (SE +/- 0.66, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.98 (SE +/- 0.01, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.68 (SE +/- 0.05, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.17 (SE +/- 0.01, N = 3)
  1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
Mlpack Benchmark - Benchmark: scikit_ica (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 14.58 (SE +/- 0.26, N = 12)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.39 (SE +/- 0.02, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 12.02 (SE +/- 0.07, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 36.43 (SE +/- 0.29, N = 9)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.223 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.090 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.069 (SE +/- 0.002, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.088 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.837 (SE +/- 0.005, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.847 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.845 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.815 (SE +/- 0.003, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: TensorFlow - Project Size: 65536 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
ACES DGEMM 1.0 - Sustained Floating-Point Rate (GFLOP/s, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.205018 (SE +/- 0.005750, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.723792 (SE +/- 0.007819, N = 3; flags: -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.909597 (SE +/- 0.027431, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.965331 (SE +/- 0.019734, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CC) gcc options: -O3 -march=native -fopenmp
miniFE 2.2 - Problem Size: Small (CG Mflops, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5932.11 (SE +/- 4.26, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5458.44 (SE +/- 73.38, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 6321.61 (SE +/- 9.91, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6218.41 (SE +/- 26.31, N = 3)
  1. (CXX) g++ options: -O3 -fopenmp -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lrt -lpthread -ldl
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: Euclidean Cluster (Test Cases Per Minute, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1439.63 (SE +/- 27.64, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1615.65 (SE +/- 3.29, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1355.37 (SE +/- 12.00, N = 15)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1649.84 (SE +/- 12.05, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Isoneutral Mixing (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.017 (SE +/- 0.000, N = 15)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.016 (SE +/- 0.000, N = 3)
Kripke 1.2.4 (Throughput FoM, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 76195000 (SE +/- 396295.91, N = 3; flags: -march=native)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 76868693 (SE +/- 132255.44, N = 3; flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 72234187 (SE +/- 688051.71, N = 3; flags: -march=native)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 77469363 (SE +/- 37071.05, N = 3; flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16)
  1. (CXX) g++ options: -O3 -fopenmp
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State (Seconds, Fewer Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.343 (SE +/- 0.003, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.327 (SE +/- 0.004, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.325 (SE +/- 0.001, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1.313 (SE +/- 0.002, N = 3)
Darmstadt Automotive Parallel Heterogeneous Suite - Backend: OpenMP - Kernel: NDT Mapping (Test Cases Per Minute, More Is Better)
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 821.38 (SE +/- 26.57, N = 15)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 1008.65 (SE +/- 4.83, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 993.75 (SE +/- 10.71, N = 5)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 1099.43 (SE +/- 4.87, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
PyHPC Benchmarks 3.0
Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.031 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.013 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.013 (SE +/- 0.000, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.013 (SE +/- 0.000, N = 3)

DeepSpeech 0.6
Acceleration: CPU
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 65.96 (SE +/- 0.20, N = 3)
  Golden Cove Pcore AVX512: 49.08 (SE +/- 0.29, N = 3)
  Golden Cove Pcore ONLY no AVX512: 54.55 (SE +/- 0.25, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 48.78 (SE +/- 0.28, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: GEMM SGEMM_N
OpenBenchmarking.org - GFLOPS, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 228.41 (SE +/- 3.17, N = 15) | flags: -march=native
  Golden Cove Pcore AVX512: 284.99 (SE +/- 1.08, N = 3) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 254.60 (SE +/- 4.42, N = 12) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 263.16 (SE +/- 0.87, N = 3) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ASKAP 1.0
Test: Hogbom Clean OpenMP
OpenBenchmarking.org - Iterations Per Second, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 255.32 (SE +/- 0.58, N = 3)
  Golden Cove Pcore AVX512: 253.55 (SE +/- 1.70, N = 15)
  Golden Cove Pcore ONLY no AVX512: 243.80 (SE +/- 4.49, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 264.78 (SE +/- 0.23, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TNN 0.3
Target: CPU - Model: MobileNet v2
OpenBenchmarking.org - ms, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 486.56 (SE +/- 0.45, N = 3; MIN: 479.58 / MAX: 559.42) | flags: -march=native
  Golden Cove Pcore AVX512: 224.71 (SE +/- 1.06, N = 3; MIN: 219.21 / MAX: 248.29) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 262.51 (SE +/- 2.18, N = 15; MIN: 221.83 / MAX: 377.5) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 216.87 (SE +/- 0.22, N = 3; MIN: 214.9 / MAX: 226.18) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

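For "Fewer Is Better" charts such as the TNN MobileNet v2 result above, a speedup against a chosen baseline is baseline / value; for "More Is Better" charts it is value / baseline. The sketch below assumes the Pcore + Ecore run as the baseline, which is a choice made here for illustration and not something the export states. The shortened labels and the helper name `speedup` are likewise hypothetical.

```python
# Speedup of each configuration relative to a baseline.
# Values are the TNN MobileNet v2 averages in ms (fewer is better),
# copied from the chart; labels are shortened for illustration.
tnn_mobilenet_ms = {
    "Pcore + Ecore": 486.56,
    "Pcore AVX512": 224.71,
    "Pcore only, no AVX512": 262.51,
    "Pcore AVX512, Sapphire Rapids flags": 216.87,
}


def speedup(value, baseline, more_is_better=False):
    """Return how many times better `value` is than `baseline`."""
    return value / baseline if more_is_better else baseline / value


# Assumption: the Pcore + Ecore run is used as the baseline.
baseline_ms = tnn_mobilenet_ms["Pcore + Ecore"]
speedups = {label: speedup(ms, baseline_ms) for label, ms in tnn_mobilenet_ms.items()}

for label, factor in speedups.items():
    print(f"{label}: {factor:.2f}x")
```

Passing `more_is_better=True` flips the ratio so the same helper can be reused for throughput-style charts (GFLOPS, Mflops, iterations per second).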
Mlpack Benchmark
Benchmark: scikit_svm
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 13.06 (SE +/- 0.54, N = 12)
  Golden Cove Pcore AVX512: 6.95 (SE +/- 0.05, N = 15)
  Golden Cove Pcore ONLY no AVX512: 7.19 (SE +/- 0.06, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 10.83 (SE +/- 0.02, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: TensorFlow - Project Size: 262144 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.004 (SE +/- 0.000, N = 15)
  Golden Cove Pcore AVX512: 0.005 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.004 (SE +/- 0.000, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.005 (SE +/- 0.000, N = 15)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.002 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.002 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.002 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.002 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.179 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.175 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.178 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.168 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.227 (SE +/- 0.002, N = 3)
  Golden Cove Pcore AVX512: 0.222 (SE +/- 0.001, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.222 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.219 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.170 (SE +/- 0.004, N = 12)
  Golden Cove Pcore AVX512: 0.134 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.134 (SE +/- 0.001, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.133 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.014 (SE +/- 0.001, N = 15)
  Golden Cove Pcore AVX512: 0.013 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.013 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.012 (SE +/- 0.000, N = 3)

R Benchmark
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.1097 (SE +/- 0.0057, N = 12)
  Golden Cove Pcore AVX512: 0.1070 (SE +/- 0.0004, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.1096 (SE +/- 0.0013, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.1038 (SE +/- 0.0004, N = 3)
  1. R scripting front-end version 4.0.4 (2021-02-15)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.009 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.009 (SE +/- 0.000, N = 15)
  Golden Cove Pcore ONLY no AVX512: 0.009 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.008 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.144 (SE +/- 0.001, N = 3)
  Golden Cove Pcore AVX512: 0.143 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.144 (SE +/- 0.001, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.137 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.004 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.002 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.001 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.002 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.005 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.004 (SE +/- 0.000, N = 15)
  Golden Cove Pcore ONLY no AVX512: 0.005 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.005 (SE +/- 0.000, N = 15)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.011 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.012 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.011 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.011 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.009 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.008 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.007 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.008 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: TensorFlow - Project Size: 4194304 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.093 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.095 (SE +/- 0.001, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.094 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.093 (SE +/- 0.001, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.459 (SE +/- 0.001, N = 3)
  Golden Cove Pcore AVX512: 0.416 (SE +/- 0.002, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.454 (SE +/- 0.001, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.409 (SE +/- 0.001, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.499 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.285 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.280 (SE +/- 0.003, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.281 (SE +/- 0.001, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.290 (SE +/- 0.001, N = 3)
  Golden Cove Pcore AVX512: 0.293 (SE +/- 0.002, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.290 (SE +/- 0.002, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.285 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.004 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.004 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.003 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.004 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.002 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.003 (SE +/- 0.000, N = 15)
  Golden Cove Pcore ONLY no AVX512: 0.002 (SE +/- 0.000, N = 15)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.002 (SE +/- 0.000, N = 3)

RNNoise 2020-06-28
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 16.52 (SE +/- 0.01, N = 3) | flags: -march=native
  Golden Cove Pcore AVX512: 17.51 (SE +/- 0.01, N = 3) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 16.79 (SE +/- 0.09, N = 3) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 16.10 (SE +/- 0.02, N = 3) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden

TNN 0.3
Target: CPU - Model: SqueezeNet v1.1
OpenBenchmarking.org - ms, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 230.46 (SE +/- 0.11, N = 3; MIN: 230.05 / MAX: 230.91) | flags: -march=native
  Golden Cove Pcore AVX512: 235.53 (SE +/- 0.06, N = 3; MIN: 235.23 / MAX: 237) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 237.60 (SE +/- 1.05, N = 3; MIN: 235.29 / MAX: 265.95) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 225.76 (SE +/- 0.04, N = 3; MIN: 225.47 / MAX: 226.77) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.101 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.090 (SE +/- 0.001, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.097 (SE +/- 0.001, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.089 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.061 (SE +/- 0.001, N = 3)
  Golden Cove Pcore AVX512: 0.061 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.060 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.060 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.011 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.011 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.011 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.010 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.111 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.052 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.054 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.052 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.013 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.005 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.004 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.005 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.037 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.030 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.028 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.029 (SE +/- 0.000, N = 3)

FFTW 3.3.6
Build: Float + SSE - Size: 1D FFT Size 4096
OpenBenchmarking.org - Mflops, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 78573 (SE +/- 827.63, N = 4) | flags: -march=native
  Golden Cove Pcore AVX512: 100155 (SE +/- 688.36, N = 15) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 80667 (SE +/- 367.77, N = 3) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 102333 (SE +/- 950.48, N = 3) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CC) gcc options: -pthread -O3 -lm

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.201 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.202 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.201 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.194 (SE +/- 0.001, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.001 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.001 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.001 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.001 (SE +/- 0.000, N = 3)

PyHPC Benchmarks 3.0
Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Equation of State
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.001 (SE +/- 0.000, N = 15)
  Golden Cove Pcore AVX512: 0.001 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.001 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.001 (SE +/- 0.000, N = 3)

Dolfyn 0.527
Computational Fluid Dynamics
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 11.06 (SE +/- 0.04, N = 3)
  Golden Cove Pcore AVX512: 11.16 (SE +/- 0.02, N = 3)
  Golden Cove Pcore ONLY no AVX512: 11.04 (SE +/- 0.06, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 10.69 (SE +/- 0.03, N = 3)

ASKAP 1.0
Test: tConvolve OpenMP - Degridding
OpenBenchmarking.org - Million Grid Points Per Second, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 2873.34 (SE +/- 10.37, N = 3)
  Golden Cove Pcore AVX512: 2812.63 (SE +/- 9.94, N = 3)
  Golden Cove Pcore ONLY no AVX512: 3423.09 (SE +/- 25.80, N = 5)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 3550.08 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0
Test: tConvolve OpenMP - Gridding
OpenBenchmarking.org - Million Grid Points Per Second, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 1584.97 (SE +/- 9.49, N = 3)
  Golden Cove Pcore AVX512: 1484.83 (SE +/- 10.00, N = 3)
  Golden Cove Pcore ONLY no AVX512: 1826.95 (SE +/- 18.90, N = 5)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 1853.31 (SE +/- 4.31, N = 3)
  1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

PyHPC Benchmarks 3.0
Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Isoneutral Mixing
OpenBenchmarking.org - Seconds, Fewer Is Better
  Golden Cove Pcore + Gracemont Ecore: 0.043 (SE +/- 0.000, N = 3)
  Golden Cove Pcore AVX512: 0.043 (SE +/- 0.000, N = 3)
  Golden Cove Pcore ONLY no AVX512: 0.043 (SE +/- 0.000, N = 3)
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 0.042 (SE +/- 0.000, N = 3)

SHOC Scalable HeterOgeneous Computing 2020-04-17
Target: OpenCL - Benchmark: Reduction
OpenBenchmarking.org - GB/s, More Is Better
  Golden Cove Pcore + Gracemont Ecore: 16.81 (SE +/- 0.15, N = 3) | flags: -march=native
  Golden Cove Pcore AVX512: 21.21 (SE +/- 0.29, N = 3) | flags: -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect
  Golden Cove Pcore ONLY no AVX512: 17.91 (SE +/- 0.08, N = 3) | flags: -march=native
  Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS-: 21.88 (SE +/- 0.10, N = 3) | flags: -mno-amx-tile -mno-amx-int8 -mno-amx-bf16
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ECP-CANDLE 0.4 - Benchmark: P1B2
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 28.69
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 31.79
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 28.41
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 26.15
PyHPC Benchmarks 3.0 - Device: CPU - Backend: PyTorch - Project Size: 16384 - Benchmark: Isoneutral Mixing
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.009 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 4096
  Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 17876 (SE +/- 29.78, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18379 (SE +/- 141.46, N = 14) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18279 (SE +/- 117.70, N = 3) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 18303 (SE +/- 33.69, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CC) gcc options: -pthread -O3 -lm
GNU Octave Benchmark 6.1.1~hg.2021.01.26
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.069 (SE +/- 0.025, N = 5)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.166 (SE +/- 0.024, N = 5)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.108 (SE +/- 0.039, N = 5)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.950 (SE +/- 0.017, N = 5)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.044 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.042 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: JAX - Project Size: 16384 - Benchmark: Isoneutral Mixing
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.001 (SE +/- 0.000, N = 3)
Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.183 (SE +/- 0.069, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.915 (SE +/- 0.051, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.013 (SE +/- 0.009, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 7.667 (SE +/- 0.031, N = 3)
  1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Triad
  GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.43 (SE +/- 0.20, N = 4) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 15.63 (SE +/- 0.29, N = 15) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 18.38 (SE +/- 0.16, N = 15) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 17.50 (SE +/- 0.20, N = 15) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral Mixing
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.024 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.024 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.021 (SE +/- 0.000, N = 13)
FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 32
  Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 75032 (SE +/- 820.18, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 79668 (SE +/- 1461.14, N = 15) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 75886 (SE +/- 618.46, N = 15) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 83426 (SE +/- 650.31, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CC) gcc options: -pthread -O3 -lm
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.053 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.053 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.052 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.051 (SE +/- 0.000, N = 3)
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Readback
  GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 36.04 (SE +/- 1.03, N = 14) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 31.31 (SE +/- 1.21, N = 15) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.43 (SE +/- 1.55, N = 15) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 37.19 (SE +/- 1.52, N = 12) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: Bus Speed Download
  GB/s, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 35.19 (SE +/- 1.31, N = 15) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30.54 (SE +/- 1.01, N = 12) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.53 (SE +/- 1.64, N = 15) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 38.23 (SE +/- 1.47, N = 15) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.036 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.035 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.036 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.034 (SE +/- 0.000, N = 3)
Scikit-Learn 0.22.1
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 8.193 (SE +/- 0.002, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 4.883 (SE +/- 0.009, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 5.069 (SE +/- 0.045, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 6.167 (SE +/- 0.019, N = 3)
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.003 (SE +/- 0.000, N = 3)
FFTW 3.3.6 - Build: Stock - Size: 1D FFT Size 32
  Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 23240 (SE +/- 13.78, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 22739 (SE +/- 1.86, N = 3) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23117 (SE +/- 122.88, N = 3) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 22747 (SE +/- 20.34, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CC) gcc options: -pthread -O3 -lm
FFTW 3.3.6 - Build: Stock - Size: 2D FFT Size 32
  Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 23622 (SE +/- 102.46, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 22796 (SE +/- 214.39, N = 3) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 23477 (SE +/- 178.70, N = 3) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 22861 (SE +/- 54.44, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CC) gcc options: -pthread -O3 -lm
TNN 0.3 - Target: CPU - Model: SqueezeNet v2
  ms, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 45.50 (SE +/- 0.08, N = 3; MIN: 45.08 / MAX: 45.93) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 45.90 (SE +/- 0.32, N = 3; MIN: 45.05 / MAX: 46.7) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 46.59 (SE +/- 0.22, N = 3; MIN: 45.39 / MAX: 53.21) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 44.74 (SE +/- 0.05, N = 3; MIN: 44.37 / MAX: 45.8) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
FFTW 3.3.6 - Build: Float + SSE - Size: 1D FFT Size 32
  Mflops, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 29780 (SE +/- 6.89, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30059 (SE +/- 235.28, N = 3) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 30047 (SE +/- 192.25, N = 3) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 32202 (SE +/- 25.36, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CC) gcc options: -pthread -O3 -lm
PyHPC Benchmarks 3.0 - Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of State
  Seconds, Fewer Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 0.002 (SE +/- 0.000, N = 3)
SHOC Scalable HeterOgeneous Computing 2020-04-17 - Target: OpenCL - Benchmark: FFT SP
  GFLOPS, More Is Better
  Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: 101.12 (SE +/- 0.08, N = 3) [-march=native]
  Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 87.62 (SE +/- 1.24, N = 3) [-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect]
  Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: 98.14 (SE +/- 0.45, N = 3) [-march=native]
  Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: 99.88 (SE +/- 0.58, N = 3) [-mno-amx-tile -mno-amx-int8 -mno-amx-bf16]
  1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
Phoronix Test Suite v10.8.5