HPC benchmark- POSSIBLE BAD DATA Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2112081-TJ-2112076TJ12&sro&grt .
HPC benchmark- POSSIBLE BAD DATA Processor Motherboard Chipset Memory Disk Graphics Audio Monitor Network OS Kernel Desktop Display Server OpenGL OpenCL Vulkan Compiler File-System Screen Resolution Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads) MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) Intel Device 7aa7 32GB 500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 Pro Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 (1650/750MHz) Realtek ALC897 LG HDR WQHD Intel I225-V Pop 21.04 5.15.5-76051505-generic (x86_64) GNOME Shell 3.38.4 X Server 1.20.11 4.6 Mesa 21.3.0-devel (LLVM 12.0.1) OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 2.2 AMD-APP (3361.0) 1.2.182 GCC 10.3.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910 + CUDA 10.2 ext4 3440x1440 Intel Core i7-12700K @ 6.30GHz (12 Cores / 20 Threads) Intel Core i7-12700K @ 6.50GHz (8 Cores / 16 Threads) Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz) 4.6 Mesa 21.2.2 (LLVM 12.0.0) OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX GCC 11.1.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Environment Details - Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : CXXFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect" CFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect" - Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" - Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native" - Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" Compiler Details - Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details - Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : NONE / errors=remount-ro,noatime,rw / Block Size: 4096 Processor Details - Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3 Graphics Details - GLAMOR - BAR1 / Visible vRAM Size: 6128 MB Python Details - Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : Python 3.7.11 :: Intel - Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel - Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel - Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.9.5 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
HPC benchmark- POSSIBLE BAD DATA mt-dgemm: Sustained Floating-Point Rate askap: tConvolve MT - Gridding askap: tConvolve MT - Degridding askap: tConvolve OpenMP - Gridding askap: tConvolve OpenMP - Degridding askap: Hogbom Clean OpenMP cloverleaf: Lagrangian-Eulerian Hydrodynamics cp2k: Fayalite-FIST daphne: OpenMP - NDT Mapping daphne: OpenCL - Points2Image daphne: OpenMP - Points2Image daphne: OpenMP - Euclidean Cluster deepspeech: CPU dolfyn: Computational Fluid Dynamics ecp-candle: P1B2 ecp-candle: P3B1 ecp-candle: P3B2 fftw: Stock - 1D FFT Size 32 fftw: Stock - 2D FFT Size 32 fftw: Stock - 1D FFT Size 4096 fftw: Stock - 2D FFT Size 4096 fftw: Float + SSE - 1D FFT Size 32 fftw: Float + SSE - 2D FFT Size 32 fftw: Float + SSE - 1D FFT Size 4096 fftw: Float + SSE - 2D FFT Size 4096 octave-benchmark: himeno: Poisson Pressure Solver kripke: minife: Small mlpack: scikit_ica mlpack: scikit_qda mlpack: scikit_svm mlpack: scikit_linearridgeregression mnn: mobilenetV3 mnn: squeezenetv1.1 mnn: resnet-v2-50 mnn: SqueezeNetV1.0 mnn: MobileNetV2_224 mnn: mobilenet-v1-1.0 mnn: inception-v3 namd: ATPase Simulation - 327,506 Atoms ncnn: CPU - mobilenet ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU - shufflenet-v2 ncnn: CPU - mnasnet ncnn: CPU - efficientnet-b0 ncnn: CPU - blazeface ncnn: CPU - googlenet ncnn: CPU - vgg16 ncnn: CPU - resnet18 ncnn: CPU - alexnet ncnn: CPU - resnet50 ncnn: CPU - yolov4-tiny ncnn: CPU - squeezenet_ssd ncnn: CPU - regnety_400m ncnn: Vulkan GPU - mobilenet ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - regnety_400m neat: numpy: onnx: yolov4 - CPU onnx: fcn-resnet101-11 - CPU onnx: shufflenet-v2-10 - CPU onnx: super-resolution-10 - CPU opencv: DNN - Deep Neural Network plaidml: No - Inference - VGG16 - CPU plaidml: No - Inference - ResNet 50 - CPU pyhpc: CPU - JAX - 16384 - Isoneutral Mixing pyhpc: CPU - JAX - 65536 - Isoneutral Mixing pyhpc: CPU - JAX - 262144 - Equation of State pyhpc: CPU - JAX - 262144 - Isoneutral Mixing pyhpc: CPU - JAX - 1048576 - Equation of State pyhpc: CPU - JAX - 1048576 - Isoneutral Mixing pyhpc: CPU - JAX - 4194304 - Equation of State pyhpc: CPU - JAX - 4194304 - Isoneutral Mixing pyhpc: CPU - Numba - 16384 - Equation of State pyhpc: CPU - Numba - 16384 - Isoneutral Mixing pyhpc: CPU - Numba - 65536 - Equation of State pyhpc: CPU - Numba - 65536 - Isoneutral Mixing pyhpc: CPU - Numpy - 16384 - Equation of State pyhpc: CPU - Numpy - 16384 - Isoneutral Mixing pyhpc: CPU - Numpy - 65536 - Equation of State pyhpc: CPU - Numpy - 65536 - Isoneutral Mixing pyhpc: CPU - Aesara - 16384 - Equation of State pyhpc: CPU - Aesara - 16384 - Isoneutral Mixing pyhpc: CPU - Aesara - 65536 - Equation of State pyhpc: CPU - Aesara - 65536 - Isoneutral Mixing pyhpc: CPU - Numba - 262144 - Equation of State pyhpc: CPU - Numba - 262144 - Isoneutral Mixing pyhpc: CPU - Numpy - 262144 - Equation of State pyhpc: CPU - Numpy - 262144 - Isoneutral Mixing pyhpc: CPU - Aesara - 262144 - Equation of State pyhpc: CPU - Aesara - 262144 - Isoneutral Mixing pyhpc: CPU - Numba - 1048576 - Equation of State pyhpc: CPU - Numba - 1048576 - Isoneutral Mixing pyhpc: CPU - Numba - 4194304 - Equation of State pyhpc: CPU - Numba - 4194304 - Isoneutral Mixing pyhpc: CPU - Numpy - 1048576 - Equation of State pyhpc: CPU - Numpy - 1048576 - Isoneutral Mixing pyhpc: CPU - Numpy - 4194304 - Equation of State pyhpc: CPU - Numpy - 4194304 - Isoneutral Mixing pyhpc: CPU - PyTorch - 16384 - Isoneutral Mixing pyhpc: CPU - PyTorch - 65536 - Equation of State pyhpc: CPU - PyTorch - 65536 - Isoneutral Mixing pyhpc: CPU - Aesara - 1048576 - Equation of State pyhpc: CPU - Aesara - 1048576 - Isoneutral Mixing pyhpc: CPU - Aesara - 4194304 - Equation of State pyhpc: CPU - Aesara - 4194304 - Isoneutral Mixing pyhpc: CPU - PyTorch - 262144 - Equation of State pyhpc: CPU - PyTorch - 262144 - Isoneutral Mixing pyhpc: CPU - PyTorch - 1048576 - Equation of State pyhpc: CPU - PyTorch - 1048576 - Isoneutral Mixing pyhpc: CPU - PyTorch - 4194304 - Equation of State pyhpc: CPU - PyTorch - 4194304 - Isoneutral Mixing pyhpc: CPU - TensorFlow - 65536 - Equation of State pyhpc: CPU - TensorFlow - 262144 - Equation of State pyhpc: CPU - TensorFlow - 1048576 - Equation of State pyhpc: CPU - TensorFlow - 4194304 - Equation of State rbenchmark: rnnoise: scikit-learn: shoc: OpenCL - S3D shoc: OpenCL - Triad shoc: OpenCL - FFT SP shoc: OpenCL - Reduction shoc: OpenCL - GEMM SGEMM_N shoc: OpenCL - Max SP Flops shoc: OpenCL - Bus Speed Download shoc: OpenCL - Bus Speed Readback tensorflow-lite: SqueezeNet tensorflow-lite: Inception V4 tensorflow-lite: NASNet Mobile tensorflow-lite: Mobilenet Float tensorflow-lite: Mobilenet Quant tensorflow-lite: Inception ResNet V2 mafft: Multiple Sequence Alignment - LSU RNA tnn: CPU - DenseNet tnn: CPU - MobileNet v2 tnn: CPU - SqueezeNet v2 tnn: CPU - SqueezeNet v1.1 Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4.723792 1255.80 2004.44 1484.83 2812.63 253.548 186.74 382.83 1008.65 15525.664613677 32570.157710805 1615.65 49.07625 11.158 31.787 794.258 403.484 22739 22796 18379 12945 30059 79668 100155 40789 5.166 10224.564685 76868693 5458.44 12.39 95.97 6.95 1.53 1.113 2.264 16.694 3.590 1.892 1.749 23.277 1.26329 9.07 2.38 2.17 2.39 2.12 3.37 1.02 9.00 37.73 10.37 9.29 16.38 15.34 13.41 5.55 6.86 3.19 4.02 2.68 3.40 7.72 1.69 5.41 17.58 3.04 6.69 6.64 8.16 4.63 4.28 26.980 565.28 469 74 40688 4662 66637 22.29 8.52 0.002 0.011 0.001 0.028 0.008 0.134 0.030 0.637 0.001 0.003 0.002 0.012 0.002 0.004 0.013 0.021 0.001 0.004 0.003 0.017 0.009 0.043 0.053 0.090 0.011 0.061 0.035 0.202 0.143 0.847 0.222 0.416 1.327 1.861 0.003 0.002 0.013 0.044 0.293 0.175 1.299 0.005 0.052 0.021 0.285 0.090 1.332 0.002 0.005 0.020 0.095 0.1070 17.513 4.883 12.2422 15.6282 87.6173 21.2083 284.989 1667232 30.5438 31.3126 141138 2024680 126721 95569.3 99378.6 1837950 7.915 2719.820 224.710 45.898 235.526 4.909597 1231.73 1966.15 1826.95 3423.09 243.796 191.33 403.729 993.75 15404.277700861 25089.993604089 1355.37 54.54775 11.042 28.405 918.197 401.56 23117 23477 18279 13568 30047 75886 80667 42516 5.108 9116.455509 72234187 6321.61 12.02 96.46 7.19 1.64 1.237 2.559 18.461 4.060 2.217 2.054 25.231 1.14364 10.74 2.98 2.74 3.16 2.72 4.35 1.21 10.43 43.97 13.24 10.64 19.41 17.33 15.89 7.44 7.40 4.05 4.76 3.65 4.27 9.34 2.71 7.09 19.80 3.96 6.84 7.46 10.03 4.73 5.83 26.677 558.72 460 68 42897 4417 91035 19.23 8.32 0.002 0.010 0.001 0.027 0.007 0.134 0.028 0.644 0.001 0.002 0.002 0.011 0.002 0.005 0.013 0.024 0.001 0.003 0.003 0.016 0.009 0.043 0.052 0.097 0.011 0.060 0.036 0.201 0.144 0.845 0.222 0.454 1.325 2.021 0.003 0.001 0.013 0.044 0.290 0.178 1.301 0.004 0.054 0.015 0.280 0.069 1.336 0.001 0.004 0.020 0.094 0.1096 16.792 5.069 13.6588 18.3795 98.1365 17.9097 254.597 1542874 38.5258 38.4318 159253 2213513 154285 93959.4 97487.4 1801520 8.013 2854.854 262.514 46.588 237.601 5.205018 1171.04 1970.57 1584.97 2873.34 255.322 138.33 398.301 821.38 15539.073739801 26580.090189963 1439.63 65.95657 11.057 28.693 887.14 409.434 23240 23622 17876 13494 29780 75032 78573 43336 5.069 9209.748838 76195000 5932.11 14.58 109.65 13.06 2.29 1.261 2.865 22.622 4.640 2.565 2.613 26.687 0.94957 11.06 3.10 2.75 2.67 2.88 4.47 1.21 10.34 40.61 11.74 11.23 20.01 17.62 14.85 6.80 6.72 2.61 3.64 2.07 2.72 7.65 1.43 5.10 16.96 2.34 6.41 6.11 8.14 4.30 3.71 25.676 422.78 520 80 45034 4579 52247 19.57 8.41 0.002 0.013 0.001 0.034 0.009 0.170 0.037 0.779 0.001 0.002 0.002 0.011 0.002 0.005 0.014 0.024 0.001 0.004 0.003 0.016 0.009 0.043 0.053 0.101 0.011 0.061 0.036 0.201 0.144 0.837 0.227 0.459 1.343 2.131 0.009 0.004 0.031 0.044 0.290 0.179 1.299 0.013 0.111 0.055 0.499 0.223 2.212 0.001 0.004 0.02 0.093 0.1097 16.524 8.193 13.9616 18.4348 101.122 16.8136 228.412 1581543 35.1906 36.0449 109893 1564193 89096.6 74706.9 78810.5 1413363 8.183 3987.715 486.558 45.500 230.459 4.965331 1256.30 2023.16 1853.31 3550.08 264.784 141.53 368.132 1099.43 16397.283085518 33183.121427314 1649.84 48.78499 10.693 26.149 790.264 353.81 22747 22861 18303 14069 32202 83426 102333 43777 4.950 9558.655773 77469363 6218.41 36.43 46.60 10.83 3.11 0.963 2.045 14.786 3.165 1.616 1.532 21.017 1.18910 8.63 2.15 1.89 2.18 1.85 2.90 0.91 7.63 37.91 9.23 8.38 15.22 14.05 12.06 4.71 6.17 2.41 3.33 1.80 2.48 7.20 1.03 4.69 16.43 1.96 5.75 5.63 7.63 3.74 3.46 26.165 615.24 468 74 39909 4593 7537 23.08 9.39 0.002 0.011 0.001 0.028 0.008 0.133 0.029 0.635 0.001 0.002 0.002 0.011 0.002 0.005 0.012 0.021 0.001 0.004 0.003 0.016 0.008 0.042 0.051 0.089 0.01 0.060 0.034 0.194 0.137 0.815 0.219 0.409 1.313 1.847 0.003 0.002 0.013 0.042 0.285 0.168 1.256 0.005 0.052 0.021 0.281 0.088 1.325 0.002 0.005 0.021 0.093 0.1038 16.103 6.167 13.7011 17.5019 99.8771 21.8802 263.157 1723333 38.2339 37.1909 143667 2022477 130527 93377.4 98208.8 1783857 7.667 2623.161 216.871 44.744 225.758 OpenBenchmarking.org
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.1711 2.3422 3.5133 4.6844 5.8555 SE +/- 0.005750, N = 3 SE +/- 0.007819, N = 3 SE +/- 0.027431, N = 3 SE +/- 0.019734, N = 3 5.205018 4.723792 4.909597 4.965331 -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -O3 -march=native -fopenmp
ASKAP Test: tConvolve MT - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Gridding Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 300 600 900 1200 1500 SE +/- 0.43, N = 3 SE +/- 0.45, N = 3 SE +/- 2.99, N = 3 SE +/- 0.74, N = 3 1171.04 1255.80 1231.73 1256.30 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve MT - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve MT - Degridding Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 400 800 1200 1600 2000 SE +/- 1.70, N = 3 SE +/- 2.06, N = 3 SE +/- 15.85, N = 3 SE +/- 0.85, N = 3 1970.57 2004.44 1966.15 2023.16 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Gridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Gridding Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 400 800 1200 1600 2000 SE +/- 9.49, N = 3 SE +/- 10.00, N = 3 SE +/- 18.90, N = 5 SE +/- 4.31, N = 3 1584.97 1484.83 1826.95 1853.31 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: tConvolve OpenMP - Degridding OpenBenchmarking.org Million Grid Points Per Second, More Is Better ASKAP 1.0 Test: tConvolve OpenMP - Degridding Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 800 1600 2400 3200 4000 SE +/- 10.37, N = 3 SE +/- 9.94, N = 3 SE +/- 25.80, N = 5 SE +/- 0.00, N = 3 2873.34 2812.63 3423.09 3550.08 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
ASKAP Test: Hogbom Clean OpenMP OpenBenchmarking.org Iterations Per Second, More Is Better ASKAP 1.0 Test: Hogbom Clean OpenMP Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 60 120 180 240 300 SE +/- 0.58, N = 3 SE +/- 1.70, N = 15 SE +/- 4.49, N = 15 SE +/- 0.23, N = 3 255.32 253.55 243.80 264.78 1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp
CloverLeaf Lagrangian-Eulerian Hydrodynamics OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf Lagrangian-Eulerian Hydrodynamics Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 40 80 120 160 200 SE +/- 0.42, N = 3 SE +/- 2.66, N = 12 SE +/- 18.74, N = 9 SE +/- 1.45, N = 3 138.33 186.74 191.33 141.53 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CP2K Molecular Dynamics Input: Fayalite-FIST OpenBenchmarking.org Seconds, Fewer Is Better CP2K Molecular Dynamics 8.2 Input: Fayalite-FIST Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 90 180 270 360 450 398.30 382.83 403.73 368.13
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: NDT Mapping Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 200 400 600 800 1000 SE +/- 26.57, N = 15 SE +/- 4.83, N = 3 SE +/- 10.71, N = 5 SE +/- 4.87, N = 3 821.38 1008.65 993.75 1099.43 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenCL - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenCL - Kernel: Points2Image Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4K 8K 12K 16K 20K SE +/- 375.06, N = 12 SE +/- 177.47, N = 3 SE +/- 137.15, N = 3 SE +/- 145.81, N = 3 15539.07 15525.66 15404.28 16397.28 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Points2Image Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 7K 14K 21K 28K 35K SE +/- 100.69, N = 3 SE +/- 274.39, N = 3 SE +/- 1308.12, N = 13 SE +/- 638.53, N = 12 26580.09 32570.16 25089.99 33183.12 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Euclidean Cluster OpenBenchmarking.org Test Cases Per Minute, More Is Better Darmstadt Automotive Parallel Heterogeneous Suite Backend: OpenMP - Kernel: Euclidean Cluster Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 400 800 1200 1600 2000 SE +/- 27.64, N = 15 SE +/- 3.29, N = 3 SE +/- 12.00, N = 15 SE +/- 12.05, N = 3 1439.63 1615.65 1355.37 1649.84 1. (CXX) g++ options: -O3 -std=c++11 -fopenmp
DeepSpeech Acceleration: CPU OpenBenchmarking.org Seconds, Fewer Is Better DeepSpeech 0.6 Acceleration: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 15 30 45 60 75 SE +/- 0.20, N = 3 SE +/- 0.29, N = 3 SE +/- 0.25, N = 3 SE +/- 0.28, N = 3 65.96 49.08 54.55 48.78
Dolfyn Computational Fluid Dynamics OpenBenchmarking.org Seconds, Fewer Is Better Dolfyn 0.527 Computational Fluid Dynamics Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.06, N = 3 SE +/- 0.03, N = 3 11.06 11.16 11.04 10.69
ECP-CANDLE Benchmark: P1B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P1B2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 7 14 21 28 35 28.69 31.79 28.41 26.15
ECP-CANDLE Benchmark: P3B1 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B1 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 200 400 600 800 1000 887.14 794.26 918.20 790.26
ECP-CANDLE Benchmark: P3B2 OpenBenchmarking.org Seconds, Fewer Is Better ECP-CANDLE 0.4 Benchmark: P3B2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 90 180 270 360 450 409.43 403.48 401.56 353.81
FFTW Build: Stock - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 32 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5K 10K 15K 20K 25K SE +/- 13.78, N = 3 SE +/- 1.86, N = 3 SE +/- 122.88, N = 3 SE +/- 20.34, N = 3 23240 22739 23117 22747 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Stock - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 32 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5K 10K 15K 20K 25K SE +/- 102.46, N = 3 SE +/- 214.39, N = 3 SE +/- 178.70, N = 3 SE +/- 54.44, N = 3 23622 22796 23477 22861 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Stock - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 1D FFT Size 4096 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4K 8K 12K 16K 20K SE +/- 29.78, N = 3 SE +/- 141.46, N = 14 SE +/- 117.70, N = 3 SE +/- 33.69, N = 3 17876 18379 18279 18303 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Stock - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Stock - Size: 2D FFT Size 4096 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3K 6K 9K 12K 15K SE +/- 24.06, N = 3 SE +/- 108.98, N = 8 SE +/- 25.36, N = 3 SE +/- 54.22, N = 3 13494 12945 13568 14069 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 32 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 7K 14K 21K 28K 35K SE +/- 6.89, N = 3 SE +/- 235.28, N = 3 SE +/- 192.25, N = 3 SE +/- 25.36, N = 3 29780 30059 30047 32202 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 32 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 32 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20K 40K 60K 80K 100K SE +/- 820.18, N = 3 SE +/- 1461.14, N = 15 SE +/- 618.46, N = 15 SE +/- 650.31, N = 3 75032 79668 75886 83426 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 1D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 1D FFT Size 4096 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20K 40K 60K 80K 100K SE +/- 827.63, N = 4 SE +/- 688.36, N = 15 SE +/- 367.77, N = 3 SE +/- 950.48, N = 3 78573 100155 80667 102333 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
FFTW Build: Float + SSE - Size: 2D FFT Size 4096 OpenBenchmarking.org Mflops, More Is Better FFTW 3.3.6 Build: Float + SSE - Size: 2D FFT Size 4096 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 9K 18K 27K 36K 45K SE +/- 522.29, N = 9 SE +/- 293.32, N = 3 SE +/- 932.44, N = 7 SE +/- 301.86, N = 3 43336 40789 42516 43777 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -pthread -O3 -lm
GNU Octave Benchmark OpenBenchmarking.org Seconds, Fewer Is Better GNU Octave Benchmark 6.1.1~hg.2021.01.26 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.1624 2.3248 3.4872 4.6496 5.812 SE +/- 0.025, N = 5 SE +/- 0.024, N = 5 SE +/- 0.039, N = 5 SE +/- 0.017, N = 5 5.069 5.166 5.108 4.950
Himeno Benchmark Poisson Pressure Solver OpenBenchmarking.org MFLOPS, More Is Better Himeno Benchmark 3.0 Poisson Pressure Solver Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2K 4K 6K 8K 10K SE +/- 67.32, N = 3 SE +/- 6.21, N = 3 SE +/- 32.30, N = 3 SE +/- 5.76, N = 3 9209.75 10224.56 9116.46 9558.66 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -O3 -mavx2
Kripke OpenBenchmarking.org Throughput FoM, More Is Better Kripke 1.2.4 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 17M 34M 51M 68M 85M SE +/- 396295.91, N = 3 SE +/- 132255.44, N = 3 SE +/- 688051.71, N = 3 SE +/- 37071.05, N = 3 76195000 76868693 72234187 77469363 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -fopenmp
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1400 2800 4200 5600 7000 SE +/- 4.26, N = 3 SE +/- 73.38, N = 3 SE +/- 9.91, N = 3 SE +/- 26.31, N = 3 5932.11 5458.44 6321.61 6218.41 1. (CXX) g++ options: -O3 -fopenmp -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lrt -lpthread -ldl
Mlpack Benchmark Benchmark: scikit_ica OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_ica Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 8 16 24 32 40 SE +/- 0.26, N = 12 SE +/- 0.02, N = 3 SE +/- 0.07, N = 3 SE +/- 0.29, N = 9 14.58 12.39 12.02 36.43
Mlpack Benchmark Benchmark: scikit_qda OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_qda Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20 40 60 80 100 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 SE +/- 0.15, N = 3 SE +/- 0.12, N = 3 109.65 95.97 96.46 46.60
Mlpack Benchmark Benchmark: scikit_svm OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_svm Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.54, N = 12 SE +/- 0.05, N = 15 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 13.06 6.95 7.19 10.83
Mlpack Benchmark Benchmark: scikit_linearridgeregression OpenBenchmarking.org Seconds, Fewer Is Better Mlpack Benchmark Benchmark: scikit_linearridgeregression Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.6998 1.3996 2.0994 2.7992 3.499 SE +/- 0.02, N = 3 SE +/- 0.02, N = 15 SE +/- 0.02, N = 15 SE +/- 0.03, N = 3 2.29 1.53 1.64 3.11
Mobile Neural Network Model: mobilenetV3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenetV3 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.2837 0.5674 0.8511 1.1348 1.4185 SE +/- 0.017, N = 12 SE +/- 0.016, N = 15 SE +/- 0.037, N = 15 SE +/- 0.001, N = 3 1.261 1.113 1.237 0.963 -march=native - MIN: 1.08 / MAX: 48.36 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 0.93 / MAX: 16.19 -march=native - MIN: 0.95 / MAX: 40.28 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.93 / MAX: 4.84 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: squeezenetv1.1 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: squeezenetv1.1 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.6446 1.2892 1.9338 2.5784 3.223 SE +/- 0.110, N = 12 SE +/- 0.022, N = 15 SE +/- 0.078, N = 15 SE +/- 0.010, N = 3 2.865 2.264 2.559 2.045 -march=native - MIN: 2.23 / MAX: 52.16 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.95 / MAX: 45.84 -march=native - MIN: 1.99 / MAX: 49.19 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.99 / MAX: 8.27 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: resnet-v2-50 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: resnet-v2-50 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5 10 15 20 25 SE +/- 0.57, N = 12 SE +/- 0.06, N = 15 SE +/- 0.50, N = 15 SE +/- 0.05, N = 3 22.62 16.69 18.46 14.79 -march=native - MIN: 19.41 / MAX: 83.45 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 14.63 / MAX: 104.64 -march=native - MIN: 14.88 / MAX: 173.91 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 14.1 / MAX: 42.25 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: SqueezeNetV1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: SqueezeNetV1.0 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.044 2.088 3.132 4.176 5.22 SE +/- 0.105, N = 12 SE +/- 0.051, N = 15 SE +/- 0.128, N = 15 SE +/- 0.251, N = 3 4.640 3.590 4.060 3.165 -march=native - MIN: 3.45 / MAX: 47.3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.83 / MAX: 78.46 -march=native - MIN: 2.94 / MAX: 82.76 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.84 / MAX: 8.57 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: MobileNetV2_224 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: MobileNetV2_224 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.5771 1.1542 1.7313 2.3084 2.8855 SE +/- 0.050, N = 12 SE +/- 0.021, N = 15 SE +/- 0.076, N = 15 SE +/- 0.004, N = 3 2.565 1.892 2.217 1.616 -march=native - MIN: 2.04 / MAX: 63.68 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.59 / MAX: 22.25 -march=native - MIN: 1.65 / MAX: 59.17 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.54 / MAX: 7.39 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: mobilenet-v1-1.0 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: mobilenet-v1-1.0 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.5879 1.1758 1.7637 2.3516 2.9395 SE +/- 0.016, N = 12 SE +/- 0.016, N = 15 SE +/- 0.066, N = 15 SE +/- 0.007, N = 3 2.613 1.749 2.054 1.532 -march=native - MIN: 2.48 / MAX: 16.01 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.51 / MAX: 60.38 -march=native - MIN: 1.55 / MAX: 55.56 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.46 / MAX: 12.02 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
Mobile Neural Network Model: inception-v3 OpenBenchmarking.org ms, Fewer Is Better Mobile Neural Network 1.2 Model: inception-v3 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 6 12 18 24 30 SE +/- 0.37, N = 12 SE +/- 0.05, N = 15 SE +/- 0.55, N = 15 SE +/- 0.04, N = 3 26.69 23.28 25.23 21.02 -march=native - MIN: 22.92 / MAX: 195.79 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 20.16 / MAX: 153.62 -march=native - MIN: 20.75 / MAX: 242.75 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 19.1 / MAX: 37.33 1. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.2842 0.5684 0.8526 1.1368 1.421 SE +/- 0.00925, N = 3 SE +/- 0.01507, N = 3 SE +/- 0.00123, N = 3 SE +/- 0.00934, N = 10 0.94957 1.26329 1.14364 1.18910
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mobilenet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.31, N = 15 SE +/- 0.12, N = 15 SE +/- 0.34, N = 15 SE +/- 0.10, N = 3 11.06 9.07 10.74 8.63 -march=native - MIN: 9.8 / MAX: 493.36 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.88 / MAX: 149.11 -march=native - MIN: 8.38 / MAX: 272.88 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.28 / MAX: 13.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v2-v2 - Model: mobilenet-v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.6975 1.395 2.0925 2.79 3.4875 SE +/- 0.15, N = 15 SE +/- 0.01, N = 15 SE +/- 0.12, N = 15 SE +/- 0.01, N = 3 3.10 2.38 2.98 2.15 -march=native - MIN: 2.73 / MAX: 410 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.09 / MAX: 19.16 -march=native - MIN: 2.18 / MAX: 280.99 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.08 / MAX: 5.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU-v3-v3 - Model: mobilenet-v3 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.6188 1.2376 1.8564 2.4752 3.094 SE +/- 0.09, N = 15 SE +/- 0.01, N = 15 SE +/- 0.12, N = 15 SE +/- 0.01, N = 3 2.75 2.17 2.74 1.89 -march=native - MIN: 2.43 / MAX: 270.92 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.87 / MAX: 11.1 -march=native - MIN: 1.97 / MAX: 173.37 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.83 / MAX: 2.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: shufflenet-v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.711 1.422 2.133 2.844 3.555 SE +/- 0.07, N = 14 SE +/- 0.01, N = 14 SE +/- 0.31, N = 15 SE +/- 0.00, N = 3 2.67 2.39 3.16 2.18 -march=native - MIN: 2.45 / MAX: 10.63 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.14 / MAX: 6.73 -march=native - MIN: 2.17 / MAX: 324.42 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.1 / MAX: 6.91 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: mnasnet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.648 1.296 1.944 2.592 3.24 SE +/- 0.12, N = 15 SE +/- 0.01, N = 14 SE +/- 0.12, N = 14 SE +/- 0.01, N = 3 2.88 2.12 2.72 1.85 -march=native - MIN: 2.43 / MAX: 186.85 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 5.63 -march=native - MIN: 1.95 / MAX: 218.37 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.79 / MAX: 2.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: efficientnet-b0 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.0058 2.0116 3.0174 4.0232 5.029 SE +/- 0.12, N = 15 SE +/- 0.02, N = 15 SE +/- 0.20, N = 15 SE +/- 0.01, N = 3 4.47 3.37 4.35 2.90 -march=native - MIN: 3.96 / MAX: 34.65 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.95 / MAX: 16.3 -march=native - MIN: 3.03 / MAX: 261.53 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.83 / MAX: 3.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: blazeface Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.2723 0.5446 0.8169 1.0892 1.3615 SE +/- 0.04, N = 15 SE +/- 0.00, N = 15 SE +/- 0.04, N = 15 SE +/- 0.01, N = 3 1.21 1.02 1.21 0.91 -march=native - MIN: 1.05 / MAX: 6.94 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 0.93 / MAX: 3.99 -march=native - MIN: 0.93 / MAX: 35.68 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.88 / MAX: 4.07 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: googlenet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.27, N = 15 SE +/- 0.15, N = 15 SE +/- 0.39, N = 15 SE +/- 0.04, N = 3 10.34 9.00 10.43 7.63 -march=native - MIN: 9.1 / MAX: 35.14 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.77 / MAX: 212.02 -march=native - MIN: 8.03 / MAX: 223.75 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.41 / MAX: 17.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: vgg16 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 10 20 30 40 50 SE +/- 0.40, N = 15 SE +/- 0.15, N = 15 SE +/- 1.09, N = 15 SE +/- 0.31, N = 3 40.61 37.73 43.97 37.91 -march=native - MIN: 38.83 / MAX: 98.35 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 35.14 / MAX: 288.51 -march=native - MIN: 36.31 / MAX: 485.44 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 37.18 / MAX: 80.66 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet18 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.21, N = 15 SE +/- 0.05, N = 15 SE +/- 0.65, N = 15 SE +/- 0.03, N = 3 11.74 10.37 13.24 9.23 -march=native - MIN: 10.66 / MAX: 43.48 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 9.43 / MAX: 107.77 -march=native - MIN: 9.67 / MAX: 377.98 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 9.03 / MAX: 29.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: alexnet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.06, N = 15 SE +/- 0.05, N = 15 SE +/- 0.31, N = 15 SE +/- 0.01, N = 3 11.23 9.29 10.64 8.38 -march=native - MIN: 10.78 / MAX: 35.58 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 8.58 / MAX: 188.45 -march=native - MIN: 8.84 / MAX: 348.67 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.27 / MAX: 14.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: resnet50 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5 10 15 20 25 SE +/- 0.32, N = 15 SE +/- 0.11, N = 15 SE +/- 0.58, N = 15 SE +/- 0.23, N = 3 20.01 16.38 19.41 15.22 -march=native - MIN: 18.29 / MAX: 72.6 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 14.75 / MAX: 169.63 -march=native - MIN: 15.46 / MAX: 404.54 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 14.63 / MAX: 21.4 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: yolov4-tiny Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4 8 12 16 20 SE +/- 0.44, N = 15 SE +/- 0.13, N = 15 SE +/- 0.43, N = 15 SE +/- 0.20, N = 3 17.62 15.34 17.33 14.05 -march=native - MIN: 15.29 / MAX: 370.58 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 13.6 / MAX: 60.85 -march=native - MIN: 14.49 / MAX: 220.15 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 13.36 / MAX: 28.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: squeezenet_ssd Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4 8 12 16 20 SE +/- 0.52, N = 15 SE +/- 0.02, N = 15 SE +/- 0.72, N = 15 SE +/- 0.02, N = 3 14.85 13.41 15.89 12.06 -march=native - MIN: 13.57 / MAX: 63.1 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 12.41 / MAX: 45.1 -march=native - MIN: 12.62 / MAX: 489.07 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 11.82 / MAX: 17.26 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: CPU - Model: regnety_400m Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.20, N = 15 SE +/- 0.01, N = 15 SE +/- 0.55, N = 15 SE +/- 0.01, N = 3 6.80 5.55 7.44 4.71 -march=native - MIN: 6.23 / MAX: 15.86 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.92 / MAX: 31.26 -march=native - MIN: 5.34 / MAX: 398.2 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.58 / MAX: 6.32 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mobilenet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.06, N = 9 SE +/- 0.07, N = 15 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 6.72 6.86 7.40 6.17 -march=native - MIN: 6.36 / MAX: 35.89 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 6.23 / MAX: 45.95 -march=native - MIN: 6.41 / MAX: 52.12 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.85 / MAX: 8.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.9113 1.8226 2.7339 3.6452 4.5565 SE +/- 0.04, N = 9 SE +/- 0.04, N = 15 SE +/- 0.50, N = 3 SE +/- 0.01, N = 3 2.61 3.19 4.05 2.41 -march=native - MIN: 2.38 / MAX: 19.16 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.43 / MAX: 25.75 -march=native - MIN: 2.44 / MAX: 30.19 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.33 / MAX: 6.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.071 2.142 3.213 4.284 5.355 SE +/- 0.05, N = 9 SE +/- 0.05, N = 15 SE +/- 0.44, N = 3 SE +/- 0.05, N = 3 3.64 4.02 4.76 3.33 -march=native - MIN: 3.28 / MAX: 24.44 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.28 / MAX: 27.17 -march=native - MIN: 3.33 / MAX: 28.56 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.22 / MAX: 15.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: shufflenet-v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.8213 1.6426 2.4639 3.2852 4.1065 SE +/- 0.02, N = 9 SE +/- 0.04, N = 15 SE +/- 0.97, N = 3 SE +/- 0.00, N = 3 2.07 2.68 3.65 1.80 -march=native - MIN: 1.82 / MAX: 24.55 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 25.24 -march=native - MIN: 1.88 / MAX: 35.3 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.77 / MAX: 3.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: mnasnet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.9608 1.9216 2.8824 3.8432 4.804 SE +/- 0.06, N = 9 SE +/- 0.09, N = 15 SE +/- 1.10, N = 3 SE +/- 0.01, N = 3 2.72 3.40 4.27 2.48 -march=native - MIN: 2.46 / MAX: 25.53 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.5 / MAX: 33.07 -march=native - MIN: 2.5 / MAX: 32.54 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.4 / MAX: 6.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: efficientnet-b0 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.05, N = 9 SE +/- 0.04, N = 15 SE +/- 1.47, N = 3 SE +/- 0.01, N = 3 7.65 7.72 9.34 7.20 -march=native - MIN: 7.2 / MAX: 28.56 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.18 / MAX: 34.33 -march=native - MIN: 7.17 / MAX: 38.24 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.08 / MAX: 11.54 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: blazeface Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.6098 1.2196 1.8294 2.4392 3.049 SE +/- 0.02, N = 9 SE +/- 0.03, N = 15 SE +/- 0.30, N = 3 SE +/- 0.04, N = 3 1.43 1.69 2.71 1.03 -march=native - MIN: 1.17 / MAX: 22.73 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.29 / MAX: 28.85 -march=native - MIN: 1.3 / MAX: 55.08 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.85 / MAX: 22.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: googlenet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.06, N = 9 SE +/- 0.07, N = 15 SE +/- 1.19, N = 3 SE +/- 0.01, N = 3 5.10 5.41 7.09 4.69 -march=native - MIN: 4.68 / MAX: 29.19 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.74 / MAX: 36.71 -march=native - MIN: 4.73 / MAX: 34.54 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.63 / MAX: 7.03 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: vgg16 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5 10 15 20 25 SE +/- 0.08, N = 9 SE +/- 0.12, N = 15 SE +/- 2.77, N = 3 SE +/- 0.07, N = 3 16.96 17.58 19.80 16.43 -march=native - MIN: 15.75 / MAX: 62.69 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 15.73 / MAX: 64.58 -march=native - MIN: 15.66 / MAX: 57.67 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 15.67 / MAX: 43.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet18 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.891 1.782 2.673 3.564 4.455 SE +/- 0.09, N = 7 SE +/- 0.05, N = 15 SE +/- 1.09, N = 3 SE +/- 0.04, N = 3 2.34 3.04 3.96 1.96 -march=native - MIN: 1.95 / MAX: 34.31 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.03 / MAX: 39.18 -march=native - MIN: 2.04 / MAX: 34.37 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.89 / MAX: 9.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: alexnet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.14, N = 9 SE +/- 0.06, N = 15 SE +/- 0.55, N = 3 SE +/- 0.01, N = 3 6.41 6.69 6.84 5.75 -march=native - MIN: 5.54 / MAX: 27.79 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.55 / MAX: 48.95 -march=native - MIN: 5.56 / MAX: 36.9 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.49 / MAX: 12.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: resnet50 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.04, N = 9 SE +/- 0.08, N = 15 SE +/- 1.23, N = 3 SE +/- 0.01, N = 3 6.11 6.64 7.46 5.63 -march=native - MIN: 5.6 / MAX: 31.19 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.66 / MAX: 49.12 -march=native - MIN: 5.65 / MAX: 33.99 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.55 / MAX: 11.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: yolov4-tiny Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.02, N = 9 SE +/- 0.03, N = 15 SE +/- 1.16, N = 3 SE +/- 0.03, N = 3 8.14 8.16 10.03 7.63 -march=native - MIN: 7.82 / MAX: 19.98 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.45 / MAX: 57.64 -march=native - MIN: 7.85 / MAX: 80.22 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.42 / MAX: 9.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: squeezenet_ssd Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.0643 2.1286 3.1929 4.2572 5.3215 SE +/- 0.06, N = 9 SE +/- 0.04, N = 15 SE +/- 0.35, N = 3 SE +/- 0.01, N = 3 4.30 4.63 4.73 3.74 -march=native - MIN: 3.76 / MAX: 18.02 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.77 / MAX: 53.14 -march=native - MIN: 3.77 / MAX: 44.23 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.67 / MAX: 6.02 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20210720 Target: Vulkan GPU - Model: regnety_400m Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1.3118 2.6236 3.9354 5.2472 6.559 SE +/- 0.05, N = 9 SE +/- 0.09, N = 15 SE +/- 1.34, N = 3 SE +/- 0.02, N = 3 3.71 4.28 5.83 3.46 -march=native - MIN: 3.44 / MAX: 27.73 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.49 / MAX: 28.51 -march=native - MIN: 3.47 / MAX: 39.09 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.39 / MAX: 9.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread
Nebular Empirical Analysis Tool OpenBenchmarking.org Seconds, Fewer Is Better Nebular Empirical Analysis Tool 2.3 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 6 12 18 24 30 SE +/- 0.66, N = 15 SE +/- 0.01, N = 3 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 25.68 26.98 26.68 26.17 1. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio
Numpy Benchmark OpenBenchmarking.org Score, More Is Better Numpy Benchmark Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 130 260 390 520 650 SE +/- 2.53, N = 3 SE +/- 1.21, N = 3 SE +/- 2.34, N = 3 SE +/- 1.67, N = 3 422.78 565.28 558.72 615.24
ONNX Runtime Model: yolov4 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: yolov4 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 110 220 330 440 550 SE +/- 4.64, N = 12 SE +/- 0.44, N = 3 SE +/- 1.48, N = 3 SE +/- 5.97, N = 12 520 469 460 468 -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ONNX Runtime Model: fcn-resnet101-11 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: fcn-resnet101-11 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20 40 60 80 100 SE +/- 0.67, N = 3 SE +/- 0.00, N = 3 SE +/- 0.75, N = 5 SE +/- 0.17, N = 3 80 74 68 74 -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ONNX Runtime Model: shufflenet-v2-10 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: shufflenet-v2-10 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 10K 20K 30K 40K 50K SE +/- 46.94, N = 3 SE +/- 181.26, N = 3 SE +/- 105.96, N = 3 SE +/- 620.08, N = 11 45034 40688 42897 39909 -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
ONNX Runtime Model: super-resolution-10 - Device: CPU OpenBenchmarking.org Inferences Per Minute, More Is Better ONNX Runtime 1.10 Model: super-resolution-10 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 1000 2000 3000 4000 5000 SE +/- 7.52, N = 3 SE +/- 24.67, N = 3 SE +/- 5.07, N = 3 SE +/- 10.54, N = 3 4579 4662 4417 4593 -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread
OpenCV Test: DNN - Deep Neural Network OpenBenchmarking.org ms, Fewer Is Better OpenCV 4.5.4 Test: DNN - Deep Neural Network Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20K 40K 60K 80K 100K SE +/- 713.19, N = 15 SE +/- 1166.72, N = 15 SE +/- 3031.05, N = 12 SE +/- 245.00, N = 15 52247 66637 91035 7537 -march=native -ldl -lm -lpthread -lrt -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -ldl -lm -lpthread -lrt -march=native -ldl -lm -lpthread -lrt 1. (CXX) g++ options: -O3 -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -fvisibility=hidden
PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: VGG16 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 6 12 18 24 30 SE +/- 0.24, N = 4 SE +/- 0.07, N = 3 SE +/- 0.22, N = 4 SE +/- 0.08, N = 3 19.57 22.29 19.23 23.08
PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU OpenBenchmarking.org FPS, More Is Better PlaidML FP16: No - Mode: Inference - Network: ResNet 50 - Device: CPU Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 3 6 9 12 15 SE +/- 0.25, N = 9 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 8.41 8.52 8.32 9.39
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 16384 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 16384 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0005 0.001 0.0015 0.002 0.0025 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.002 0.002 0.002 0.002
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 65536 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 65536 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0029 0.0058 0.0087 0.0116 0.0145 SE +/- 0.001, N = 12 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 0.013 0.011 0.010 0.011
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.001 0.001 0.001 0.001
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0077 0.0154 0.0231 0.0308 0.0385 SE +/- 0.001, N = 12 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 0.034 0.028 0.027 0.028
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.002 0.004 0.006 0.008 0.01 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.009 0.008 0.007 0.008
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0383 0.0766 0.1149 0.1532 0.1915 SE +/- 0.004, N = 12 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 0.170 0.134 0.134 0.133
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0083 0.0166 0.0249 0.0332 0.0415 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.037 0.030 0.028 0.029
PyHPC Benchmarks Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.1753 0.3506 0.5259 0.7012 0.8765 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.779 0.637 0.644 0.635
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.001 0.001 0.001 0.001
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0007 0.0014 0.0021 0.0028 0.0035 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 0.002 0.003 0.002 0.002
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0005 0.001 0.0015 0.002 0.0025 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.002 0.002 0.002 0.002
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0027 0.0054 0.0081 0.0108 0.0135 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.011 0.012 0.011 0.011
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0005 0.001 0.0015 0.002 0.0025 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.002 0.002 0.002 0.002
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0011 0.0022 0.0033 0.0044 0.0055 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 0.005 0.004 0.005 0.005
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0032 0.0064 0.0096 0.0128 0.016 SE +/- 0.001, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.014 0.013 0.013 0.012
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0054 0.0108 0.0162 0.0216 0.027 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 13 0.024 0.021 0.024 0.021
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0002 0.0004 0.0006 0.0008 0.001 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.001 0.001 0.001 0.001
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0009 0.0018 0.0027 0.0036 0.0045 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.004 0.004 0.003 0.004
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0007 0.0014 0.0021 0.0028 0.0035 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.003 0.003 0.003 0.003
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0038 0.0076 0.0114 0.0152 0.019 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.016 0.017 0.016 0.016
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.002 0.004 0.006 0.008 0.01 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.009 0.009 0.009 0.008
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0097 0.0194 0.0291 0.0388 0.0485 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.043 0.043 0.043 0.042
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0119 0.0238 0.0357 0.0476 0.0595 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.053 0.053 0.052 0.051
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0227 0.0454 0.0681 0.0908 0.1135 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 0.101 0.090 0.097 0.089
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0025 0.005 0.0075 0.01 0.0125 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.011 0.011 0.011 0.010
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0137 0.0274 0.0411 0.0548 0.0685 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.061 0.061 0.060 0.060
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0081 0.0162 0.0243 0.0324 0.0405 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.036 0.035 0.036 0.034
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0455 0.091 0.1365 0.182 0.2275 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.201 0.202 0.201 0.194
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0324 0.0648 0.0972 0.1296 0.162 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 0.144 0.143 0.144 0.137
PyHPC Benchmarks Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.1906 0.3812 0.5718 0.7624 0.953 SE +/- 0.005, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 0.837 0.847 0.845 0.815
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0511 0.1022 0.1533 0.2044 0.2555 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.227 0.222 0.222 0.219
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.1033 0.2066 0.3099 0.4132 0.5165 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.001, N = 3 SE +/- 0.001, N = 3 0.459 0.416 0.454 0.409
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.3022 0.6044 0.9066 1.2088 1.511 SE +/- 0.003, N = 3 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 1.343 1.327 1.325 1.313
PyHPC Benchmarks Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.4795 0.959 1.4385 1.918 2.3975 SE +/- 0.100, N = 15 SE +/- 0.001, N = 3 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 2.131 1.861 2.021 1.847
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 16384 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 16384 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.002 0.004 0.006 0.008 0.01 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.009 0.003 0.003 0.003
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0009 0.0018 0.0027 0.0036 0.0045 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.004 0.002 0.001 0.002
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.007 0.014 0.021 0.028 0.035 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 0.031 0.013 0.013 0.013
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0099 0.0198 0.0297 0.0396 0.0495 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.044 0.044 0.044 0.042
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0659 0.1318 0.1977 0.2636 0.3295 SE +/- 0.001, N = 3 SE +/- 0.002, N = 3 SE +/- 0.002, N = 3 SE +/- 0.000, N = 3 0.290 0.293 0.290 0.285
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0403 0.0806 0.1209 0.1612 0.2015 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.179 0.175 0.178 0.168
PyHPC Benchmarks Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.2927 0.5854 0.8781 1.1708 1.4635 SE +/- 0.004, N = 3 SE +/- 0.001, N = 3 SE +/- 0.003, N = 3 SE +/- 0.003, N = 3 1.299 1.299 1.301 1.256
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0029 0.0058 0.0087 0.0116 0.0145 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.013 0.005 0.004 0.005
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.025 0.05 0.075 0.1 0.125 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.111 0.052 0.054 0.052
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0124 0.0248 0.0372 0.0496 0.062 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 0.055 0.021 0.015 0.021
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.1123 0.2246 0.3369 0.4492 0.5615 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.003, N = 3 SE +/- 0.001, N = 3 0.499 0.285 0.280 0.281
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0502 0.1004 0.1506 0.2008 0.251 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.002, N = 15 SE +/- 0.000, N = 3 0.223 0.090 0.069 0.088
PyHPC Benchmarks Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Isoneutral Mixing OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Isoneutral Mixing Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.4977 0.9954 1.4931 1.9908 2.4885 SE +/- 0.018, N = 9 SE +/- 0.001, N = 3 SE +/- 0.016, N = 3 SE +/- 0.005, N = 3 2.212 1.332 1.336 1.325
PyHPC Benchmarks Device: CPU - Backend: TensorFlow - Project Size: 65536 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: TensorFlow - Project Size: 65536 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0005 0.001 0.0015 0.002 0.0025 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 SE +/- 0.000, N = 3 0.001 0.002 0.001 0.002
PyHPC Benchmarks Device: CPU - Backend: TensorFlow - Project Size: 262144 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: TensorFlow - Project Size: 262144 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0011 0.0022 0.0033 0.0044 0.0055 SE +/- 0.000, N = 15 SE +/- 0.000, N = 3 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 0.004 0.005 0.004 0.005
PyHPC Benchmarks Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0047 0.0094 0.0141 0.0188 0.0235 SE +/- 0.000, N = 3 SE +/- 0.000, N = 4 SE +/- 0.000, N = 15 SE +/- 0.000, N = 15 0.020 0.020 0.020 0.021
PyHPC Benchmarks Device: CPU - Backend: TensorFlow - Project Size: 4194304 - Benchmark: Equation of State OpenBenchmarking.org Seconds, Fewer Is Better PyHPC Benchmarks 3.0 Device: CPU - Backend: TensorFlow - Project Size: 4194304 - Benchmark: Equation of State Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0214 0.0428 0.0642 0.0856 0.107 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 SE +/- 0.000, N = 3 SE +/- 0.001, N = 3 0.093 0.095 0.094 0.093
R Benchmark OpenBenchmarking.org Seconds, Fewer Is Better R Benchmark Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0247 0.0494 0.0741 0.0988 0.1235 SE +/- 0.0057, N = 12 SE +/- 0.0004, N = 3 SE +/- 0.0013, N = 3 SE +/- 0.0004, N = 3 0.1097 0.1070 0.1096 0.1038 1. R scripting front-end version 4.0.4 (2021-02-15)
RNNoise OpenBenchmarking.org Seconds, Fewer Is Better RNNoise 2020-06-28 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4 8 12 16 20 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.09, N = 3 SE +/- 0.02, N = 3 16.52 17.51 16.79 16.10 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CC) gcc options: -O3 -pedantic -fvisibility=hidden
Scikit-Learn OpenBenchmarking.org Seconds, Fewer Is Better Scikit-Learn 0.22.1 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.002, N = 3 SE +/- 0.009, N = 3 SE +/- 0.045, N = 3 SE +/- 0.019, N = 3 8.193 4.883 5.069 6.167
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: S3D OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: S3D Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 4 8 12 16 20 SE +/- 0.12, N = 15 SE +/- 0.18, N = 15 SE +/- 0.12, N = 15 SE +/- 0.11, N = 10 13.96 12.24 13.66 13.70 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Triad OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Triad Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5 10 15 20 25 SE +/- 0.20, N = 4 SE +/- 0.29, N = 15 SE +/- 0.16, N = 15 SE +/- 0.20, N = 15 18.43 15.63 18.38 17.50 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: FFT SP OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: FFT SP Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20 40 60 80 100 SE +/- 0.08, N = 3 SE +/- 1.24, N = 3 SE +/- 0.45, N = 3 SE +/- 0.58, N = 3 101.12 87.62 98.14 99.88 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Reduction OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Reduction Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 5 10 15 20 25 SE +/- 0.15, N = 3 SE +/- 0.29, N = 3 SE +/- 0.08, N = 3 SE +/- 0.10, N = 3 16.81 21.21 17.91 21.88 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: GEMM SGEMM_N OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: GEMM SGEMM_N Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 60 120 180 240 300 SE +/- 3.17, N = 15 SE +/- 1.08, N = 3 SE +/- 4.42, N = 12 SE +/- 0.87, N = 3 228.41 284.99 254.60 263.16 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Max SP Flops OpenBenchmarking.org GFLOPS, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Max SP Flops Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 400K 800K 1200K 1600K 2000K SE +/- 38770.78, N = 15 SE +/- 76015.19, N = 12 SE +/- 87626.15, N = 15 SE +/- 44470.87, N = 12 1581543 1667232 1542874 1723333 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Download OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Download Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 9 18 27 36 45 SE +/- 1.31, N = 15 SE +/- 1.01, N = 12 SE +/- 1.64, N = 15 SE +/- 1.47, N = 15 35.19 30.54 38.53 38.23 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
SHOC Scalable HeterOgeneous Computing Target: OpenCL - Benchmark: Bus Speed Readback OpenBenchmarking.org GB/s, More Is Better SHOC Scalable HeterOgeneous Computing 2020-04-17 Target: OpenCL - Benchmark: Bus Speed Readback Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 9 18 27 36 45 SE +/- 1.03, N = 14 SE +/- 1.21, N = 15 SE +/- 1.55, N = 15 SE +/- 1.52, N = 12 36.04 31.31 38.43 37.19 -march=native -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -march=native -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl
TensorFlow Lite Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: SqueezeNet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 30K 60K 90K 120K 150K SE +/- 195.44, N = 3 SE +/- 86.06, N = 3 SE +/- 701.15, N = 3 SE +/- 1142.56, N = 15 109893 141138 159253 143667
TensorFlow Lite Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception V4 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 500K 1000K 1500K 2000K 2500K SE +/- 2696.20, N = 3 SE +/- 1345.60, N = 3 SE +/- 8085.58, N = 3 SE +/- 10245.31, N = 3 1564193 2024680 2213513 2022477
TensorFlow Lite Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: NASNet Mobile Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 30K 60K 90K 120K 150K SE +/- 609.47, N = 3 SE +/- 95.70, N = 3 SE +/- 6529.38, N = 12 SE +/- 2986.13, N = 15 89096.6 126721.0 154285.0 130527.0
TensorFlow Lite Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Float Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20K 40K 60K 80K 100K SE +/- 1007.74, N = 3 SE +/- 70.47, N = 3 SE +/- 153.26, N = 3 SE +/- 1032.89, N = 5 74706.9 95569.3 93959.4 93377.4
TensorFlow Lite Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Mobilenet Quant Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 20K 40K 60K 80K 100K SE +/- 254.55, N = 3 SE +/- 477.30, N = 3 SE +/- 147.61, N = 3 SE +/- 995.15, N = 3 78810.5 99378.6 97487.4 98208.8
TensorFlow Lite Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better TensorFlow Lite 2020-08-23 Model: Inception ResNet V2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 400K 800K 1200K 1600K 2000K SE +/- 972.87, N = 3 SE +/- 2113.91, N = 3 SE +/- 380.18, N = 3 SE +/- 2059.01, N = 3 1413363 1837950 1801520 1783857
Timed MAFFT Alignment Multiple Sequence Alignment - LSU RNA OpenBenchmarking.org Seconds, Fewer Is Better Timed MAFFT Alignment 7.471 Multiple Sequence Alignment - LSU RNA Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 2 4 6 8 10 SE +/- 0.069, N = 3 SE +/- 0.051, N = 3 SE +/- 0.009, N = 3 SE +/- 0.031, N = 3 8.183 7.915 8.013 7.667 1. (CC) gcc options: -std=c99 -O3 -lm -lpthread
TNN Target: CPU - Model: DenseNet OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: DenseNet Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 900 1800 2700 3600 4500 SE +/- 299.40, N = 9 SE +/- 2.71, N = 3 SE +/- 40.36, N = 3 SE +/- 3.06, N = 3 3987.72 2719.82 2854.85 2623.16 -march=native - MIN: 2619.64 / MAX: 5569.92 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2625.5 / MAX: 3184.86 -march=native - MIN: 2660.43 / MAX: 3772.17 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2564.18 / MAX: 2757.86 1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: MobileNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: MobileNet v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 110 220 330 440 550 SE +/- 0.45, N = 3 SE +/- 1.06, N = 3 SE +/- 2.18, N = 15 SE +/- 0.22, N = 3 486.56 224.71 262.51 216.87 -march=native - MIN: 479.58 / MAX: 559.42 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 219.21 / MAX: 248.29 -march=native - MIN: 221.83 / MAX: 377.5 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 214.9 / MAX: 226.18 1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v2 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v2 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 11 22 33 44 55 SE +/- 0.08, N = 3 SE +/- 0.32, N = 3 SE +/- 0.22, N = 3 SE +/- 0.05, N = 3 45.50 45.90 46.59 44.74 -march=native - MIN: 45.08 / MAX: 45.93 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 45.05 / MAX: 46.7 -march=native - MIN: 45.39 / MAX: 53.21 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 44.37 / MAX: 45.8 1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
TNN Target: CPU - Model: SqueezeNet v1.1 OpenBenchmarking.org ms, Fewer Is Better TNN 0.3 Target: CPU - Model: SqueezeNet v1.1 Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt 50 100 150 200 250 SE +/- 0.11, N = 3 SE +/- 0.06, N = 3 SE +/- 1.05, N = 3 SE +/- 0.04, N = 3 230.46 235.53 237.60 225.76 -march=native - MIN: 230.05 / MAX: 230.91 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 235.23 / MAX: 237 -march=native - MIN: 235.29 / MAX: 265.95 -mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 225.47 / MAX: 226.77 1. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
Phoronix Test Suite v10.8.5