HPC benchmark- POSSIBLE BAD DATA

Intel Core i7-12700K testing with a MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS) and Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB on Pop 21.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112081-TJ-2112076TJ12
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

Bioinformatics 2 Tests
C/C++ Compiler Tests 3 Tests
CPU Massive 11 Tests
Creator Workloads 2 Tests
Fortran Tests 3 Tests
HPC - High Performance Computing 30 Tests
Machine Learning 15 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 2 Tests
Multi-Core 3 Tests
NVIDIA GPU Compute 3 Tests
OpenMPI Tests 4 Tests
Python 4 Tests
Scientific Computing 12 Tests
Server CPU Tests 5 Tests
Single-Threaded 3 Tests
Speech 2 Tests
Telephony 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
December 05 2021
  12 Hours, 53 Minutes
Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt
December 06 2021
  12 Hours, 28 Minutes
Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt
December 06 2021
  14 Hours, 10 Minutes
Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt
December 08 2021
  6 Hours, 54 Minutes
Invert Hiding All Results Option
  11 Hours, 36 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


HPC benchmark- POSSIBLE BAD DATAProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLOpenCLVulkanCompilerFile-SystemScreen ResolutionIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700K @ 6.30GHz (8 Cores / 16 Threads)MSI PRO Z690-A DDR4(MS-7D25) v1.0 (1.15 BIOS)Intel Device 7aa732GB500GB Western Digital WDS500G2B0C-00PXH0 + 3 x 10001GB Seagate ST10000DM0004-1Z + 128GB HP SSD S700 ProGigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 (1650/750MHz)Realtek ALC897LG HDR WQHDIntel I225-VPop 21.045.15.5-76051505-generic (x86_64)GNOME Shell 3.38.4X Server 1.20.114.6 Mesa 21.3.0-devel (LLVM 12.0.1)OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUX + OpenCL 2.2 AMD-APP (3361.0)1.2.182GCC 10.3.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910 + CUDA 10.2ext43440x1440Intel Core i7-12700K @ 6.30GHz (12 Cores / 20 Threads)Intel Core i7-12700K @ 6.50GHz (8 Cores / 16 Threads)Gigabyte AMD Radeon RX 5600 OEM/5600 XT / 5700/5700 6GB (1650/750MHz)4.6 Mesa 21.2.2 (LLVM 12.0.0)OpenCL 1.2 Intel FPGA SDK for OpenCL 20.3 + OpenCL 3.0 LINUXGCC 11.1.0 + Intel oneAPI DPC++/C++ Compiler 2021.4.0 (2021.4.0.20210924) + ICC 2021.4.0 20210910OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : CXXFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect" CFLAGS="-O3 -march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect"- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: CXXFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16" CFLAGS="-O3 -march=sapphirerapids -mno-amx-tile -mno-amx-int8 -mno-amx-bf16"Compiler Details- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-gDeRY6/gcc-10-10.3.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-RPS7jb/gcc-11-11.1.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : NONE / errors=remount-ro,noatime,rw / Block Size: 4096Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x15 - Thermald 2.4.3Graphics Details- GLAMOR - BAR1 / Visible vRAM Size: 6128 MBPython Details- Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt : Python 3.7.11 :: Intel- Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel- Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.7.11 :: Intel- Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt: Python 3.9.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtLogarithmic Result OverviewPhoronix Test SuiteOpenCVScikit-LearnMobile Neural NetworkMlpack BenchmarkNumpy BenchmarkNCNNTensorFlow LiteCloverLeafTNNDeepSpeechNAMDPyHPC BenchmarksDarmstadt Automotive Parallel Heterogeneous SuitePlaidMLminiFEECP-CANDLEHimeno BenchmarkASKAPACES DGEMMONNX RuntimeCP2K Molecular DynamicsSHOC Scalable HeterOgeneous ComputingRNNoiseKripkeTimed MAFFT AlignmentFFTWR BenchmarkNebular Empirical Analysis ToolGNU Octave BenchmarkDolfyn

HPC benchmark- POSSIBLE BAD DATAfftw: Float + SSE - 2D FFT Size 4096cloverleaf: Lagrangian-Eulerian Hydrodynamicsplaidml: No - Inference - ResNet 50 - CPUtnn: CPU - DenseNetonnx: yolov4 - CPUecp-candle: P3B1mnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: squeezenetv1.1mnn: mobilenetV3opencv: DNN - Deep Neural Networkncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetonnx: shufflenet-v2-10 - CPUpyhpc: CPU - Numpy - 4194304 - Isoneutral Mixingtensorflow-lite: NASNet Mobilefftw: Stock - 2D FFT Size 4096shoc: OpenCL - Max SP Flopsonnx: fcn-resnet101-11 - CPUdaphne: OpenMP - Points2Imagenumpy: ecp-candle: P3B2pyhpc: CPU - PyTorch - 4194304 - Isoneutral Mixingcp2k: Fayalite-FISTplaidml: No - Inference - VGG16 - CPUpyhpc: CPU - JAX - 65536 - Isoneutral Mixingmlpack: scikit_qdatensorflow-lite: SqueezeNetonnx: super-resolution-10 - CPUmlpack: scikit_linearridgeregressionpyhpc: CPU - PyTorch - 1048576 - Equation of Statedaphne: OpenCL - Points2Imagetensorflow-lite: Inception V4namd: ATPase Simulation - 327,506 Atomsncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenettensorflow-lite: Inception ResNet V2pyhpc: CPU - TensorFlow - 1048576 - Equation of Stateshoc: OpenCL - S3Daskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingtensorflow-lite: Mobilenet Floatpyhpc: CPU - Aesara - 4194304 - Isoneutral Mixingtensorflow-lite: Mobilenet Quantpyhpc: CPU - JAX - 4194304 - Isoneutral Mixingpyhpc: CPU - JAX - 262144 - Isoneutral Mixinghimeno: Poisson Pressure Solverneat: mlpack: scikit_icapyhpc: CPU - PyTorch - 4194304 - Equation of Statepyhpc: CPU - Numba - 4194304 - Isoneutral Mixingpyhpc: CPU - TensorFlow - 65536 - Equation of Statemt-dgemm: Sustained Floating-Point Rateminife: Smalldaphne: OpenMP - Euclidean Clusterpyhpc: CPU - Aesara - 65536 - Isoneutral Mixingkripke: pyhpc: CPU - Numpy - 4194304 - Equation of Statedaphne: OpenMP - NDT Mappingpyhpc: CPU - PyTorch - 65536 - Isoneutral Mixingdeepspeech: CPUshoc: OpenCL - GEMM SGEMM_Naskap: Hogbom Clean OpenMPtnn: CPU - MobileNet v2mlpack: scikit_svmpyhpc: CPU - TensorFlow - 262144 - Equation of Statepyhpc: CPU - Numba - 65536 - Equation of Statepyhpc: CPU - Aesara - 4194304 - Equation of Statepyhpc: CPU - Numpy - 1048576 - Equation of Statepyhpc: CPU - JAX - 1048576 - Isoneutral Mixingpyhpc: CPU - Numpy - 65536 - Equation of Staterbenchmark: pyhpc: CPU - Numba - 262144 - Equation of Statepyhpc: CPU - Numba - 4194304 - Equation of Statepyhpc: CPU - PyTorch - 65536 - Equation of Statepyhpc: CPU - Numpy - 16384 - Isoneutral Mixingpyhpc: CPU - Numba - 65536 - Isoneutral Mixingpyhpc: CPU - JAX - 1048576 - Equation of Statepyhpc: CPU - TensorFlow - 4194304 - Equation of Statepyhpc: CPU - Numpy - 1048576 - Isoneutral Mixingpyhpc: CPU - PyTorch - 1048576 - Isoneutral Mixingpyhpc: CPU - Aesara - 1048576 - Isoneutral Mixingpyhpc: CPU - Aesara - 16384 - Isoneutral Mixingpyhpc: CPU - Numba - 16384 - Isoneutral Mixingrnnoise: tnn: CPU - SqueezeNet v1.1pyhpc: CPU - Numpy - 262144 - Isoneutral Mixingpyhpc: CPU - Aesara - 262144 - Isoneutral Mixingpyhpc: CPU - Aesara - 262144 - Equation of Statepyhpc: CPU - PyTorch - 262144 - Isoneutral Mixingpyhpc: CPU - PyTorch - 262144 - Equation of Statepyhpc: CPU - JAX - 4194304 - Equation of Statefftw: Float + SSE - 1D FFT Size 4096pyhpc: CPU - Numba - 1048576 - Isoneutral Mixingpyhpc: CPU - Aesara - 16384 - Equation of Statepyhpc: CPU - JAX - 262144 - Equation of Statedolfyn: Computational Fluid Dynamicsaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingpyhpc: CPU - Numba - 262144 - Isoneutral Mixingshoc: OpenCL - Reductionecp-candle: P1B2pyhpc: CPU - PyTorch - 16384 - Isoneutral Mixingfftw: Stock - 1D FFT Size 4096octave-benchmark: pyhpc: CPU - Aesara - 1048576 - Equation of Statepyhpc: CPU - JAX - 16384 - Isoneutral Mixingpyhpc: CPU - Numba - 16384 - Equation of Statemafft: Multiple Sequence Alignment - LSU RNAshoc: OpenCL - Triadpyhpc: CPU - Numpy - 65536 - Isoneutral Mixingfftw: Float + SSE - 2D FFT Size 32pyhpc: CPU - Numpy - 262144 - Equation of Stateshoc: OpenCL - Bus Speed Readbackshoc: OpenCL - Bus Speed Downloadpyhpc: CPU - Numba - 1048576 - Equation of Statescikit-learn: pyhpc: CPU - Aesara - 65536 - Equation of Statefftw: Stock - 1D FFT Size 32fftw: Stock - 2D FFT Size 32tnn: CPU - SqueezeNet v2fftw: Float + SSE - 1D FFT Size 32pyhpc: CPU - Numpy - 16384 - Equation of Stateshoc: OpenCL - FFT SPIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt40789186.748.522719.820469794.25823.2771.7491.8923.59016.6942.2641.113666375.5513.4115.3416.389.2910.3737.739.001.023.372.122.392.172.389.07406881.8611267211294516672327432570.157710805565.28403.4841.332382.8322.290.01195.9714113846621.530.02115525.66461367720246801.263294.284.638.166.646.693.0417.585.411.697.723.402.684.023.196.8618379500.02012.24222004.441255.8095569.31.29999378.60.6370.02810224.56468526.98012.390.0900.8470.0024.7237925458.441615.650.017768686931.3271008.650.01349.07625284.989253.548224.7106.950.0050.0020.1750.2220.1340.0130.10700.0090.1430.0020.0040.0120.0080.0950.4160.2850.2930.0040.00317.513235.5260.0900.0610.0110.0520.0050.0301001550.2020.0010.00111.1582812.631484.830.04321.208331.7870.003183795.1660.0440.0020.0017.91515.62820.021796680.05331.312630.54380.0354.8830.003227392279645.898300590.00287.617342516191.338.322854.854460918.19725.2312.0542.2174.06018.4612.5591.237910357.4415.8917.3319.4110.6413.2443.9710.431.214.352.723.162.742.9810.74428972.0211542851356815428746825089.993604089558.72401.561.336403.72919.230.01096.4615925344171.640.01515404.27770086122135131.143645.834.7310.037.466.843.9619.807.092.719.344.273.654.764.057.4018015200.02013.65881966.151231.7393959.41.30197487.40.6440.0279116.45550926.67712.020.0690.8450.0014.9095976321.611355.370.016722341871.325993.750.01354.54775254.597243.796262.5147.190.0040.0020.1780.2220.1340.0130.10960.0090.1440.0010.0050.0110.0070.0940.4540.2800.2900.0030.00216.792237.6010.0970.0600.0110.0540.0040.028806670.2010.0010.00111.0423423.091826.950.04317.909728.4050.003182795.1080.0440.0020.0018.01318.37950.024758860.05238.431838.52580.0365.0690.003231172347746.588300470.00298.136543336138.338.413987.715520887.1426.6872.6132.5654.64022.6222.8651.261522476.8014.8517.6220.0111.2311.7440.6110.341.214.472.882.672.753.1011.06450342.13189096.61349415815438026580.090189963422.78409.4342.212398.30119.570.013109.6510989345792.290.05515539.07373980115641930.949573.714.308.146.116.412.3416.965.101.437.652.722.073.642.616.7214133630.0213.96161970.571171.0474706.91.29978810.50.7790.0349209.74883825.67614.580.2230.8370.0015.2050185932.111439.630.016761950001.343821.380.03165.95657228.412255.322486.55813.060.0040.0020.1790.2270.1700.0140.10970.0090.1440.0040.0050.0110.0090.0930.4590.4990.2900.0040.00216.524230.4590.1010.0610.0110.1110.0130.037785730.2010.0010.00111.0572873.341584.970.04316.813628.6930.009178765.0690.0440.0020.0018.18318.43480.024750320.05336.044935.19060.0368.1930.003232402362245.500297800.002101.12243777141.539.392623.161468790.26421.0171.5321.6163.16514.7862.0450.96375374.7112.0614.0515.228.389.2337.917.630.912.901.852.181.892.158.63399091.8471305271406917233337433183.121427314615.24353.811.325368.13223.080.01146.6014366745933.110.02116397.28308551820224771.189103.463.747.635.635.751.9616.434.691.037.202.481.803.332.416.1717838570.02113.70112023.161256.3093377.41.25698208.80.6350.0289558.65577326.16536.430.0880.8150.0024.9653316218.411649.840.016774693631.3131099.430.01348.78499263.157264.784216.87110.830.0050.0020.1680.2190.1330.0120.10380.0080.1370.0020.0050.0110.0080.0930.4090.2810.2850.0040.00216.103225.7580.0890.0600.010.0520.0050.0291023330.1940.0010.00110.6933550.081853.310.04221.880226.1490.003183034.9500.0420.0020.0017.66717.50190.021834260.05137.190938.23390.0346.1670.003227472286144.744322020.00299.8771OpenBenchmarking.org

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt9K18K27K36K45KSE +/- 293.32, N = 3SE +/- 932.44, N = 7SE +/- 522.29, N = 9SE +/- 301.86, N = 340789425164333643777-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -pthread -O3 -lm

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt4080120160200SE +/- 18.74, N = 9SE +/- 2.66, N = 12SE +/- 1.45, N = 3SE +/- 0.42, N = 3191.33186.74141.53138.331. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: ResNet 50 - Device: CPUIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.04, N = 3SE +/- 0.25, N = 9SE +/- 0.01, N = 3SE +/- 0.03, N = 38.328.418.529.39

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt9001800270036004500SE +/- 299.40, N = 9SE +/- 40.36, N = 3SE +/- 2.71, N = 3SE +/- 3.06, N = 33987.722854.852719.822623.16-march=native - MIN: 2619.64 / MAX: 5569.92-march=native - MIN: 2660.43 / MAX: 3772.17-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2625.5 / MAX: 3184.86-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2564.18 / MAX: 2757.861. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: yolov4 - Device: CPUIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt110220330440550SE +/- 1.48, N = 3SE +/- 5.97, N = 12SE +/- 0.44, N = 3SE +/- 4.64, N = 12460468469520-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P3B1Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt2004006008001000918.20887.14794.26790.26

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: inception-v3Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt612182430SE +/- 0.37, N = 12SE +/- 0.55, N = 15SE +/- 0.05, N = 15SE +/- 0.04, N = 326.6925.2323.2821.02-march=native - MIN: 22.92 / MAX: 195.79-march=native - MIN: 20.75 / MAX: 242.75-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 20.16 / MAX: 153.62-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 19.1 / MAX: 37.331. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenet-v1-1.0Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.58791.17581.76372.35162.9395SE +/- 0.016, N = 12SE +/- 0.066, N = 15SE +/- 0.016, N = 15SE +/- 0.007, N = 32.6132.0541.7491.532-march=native - MIN: 2.48 / MAX: 16.01-march=native - MIN: 1.55 / MAX: 55.56-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.51 / MAX: 60.38-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.46 / MAX: 12.021. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: MobileNetV2_224Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.57711.15421.73132.30842.8855SE +/- 0.050, N = 12SE +/- 0.076, N = 15SE +/- 0.021, N = 15SE +/- 0.004, N = 32.5652.2171.8921.616-march=native - MIN: 2.04 / MAX: 63.68-march=native - MIN: 1.65 / MAX: 59.17-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.59 / MAX: 22.25-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.54 / MAX: 7.391. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: SqueezeNetV1.0Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.0442.0883.1324.1765.22SE +/- 0.105, N = 12SE +/- 0.128, N = 15SE +/- 0.051, N = 15SE +/- 0.251, N = 34.6404.0603.5903.165-march=native - MIN: 3.45 / MAX: 47.3-march=native - MIN: 2.94 / MAX: 82.76-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.83 / MAX: 78.46-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.84 / MAX: 8.571. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: resnet-v2-50Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt510152025SE +/- 0.57, N = 12SE +/- 0.50, N = 15SE +/- 0.06, N = 15SE +/- 0.05, N = 322.6218.4616.6914.79-march=native - MIN: 19.41 / MAX: 83.45-march=native - MIN: 14.88 / MAX: 173.91-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 14.63 / MAX: 104.64-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 14.1 / MAX: 42.251. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: squeezenetv1.1Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.64461.28921.93382.57843.223SE +/- 0.110, N = 12SE +/- 0.078, N = 15SE +/- 0.022, N = 15SE +/- 0.010, N = 32.8652.5592.2642.045-march=native - MIN: 2.23 / MAX: 52.16-march=native - MIN: 1.99 / MAX: 49.19-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.95 / MAX: 45.84-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.99 / MAX: 8.271. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.2Model: mobilenetV3Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.28370.56740.85111.13481.4185SE +/- 0.017, N = 12SE +/- 0.037, N = 15SE +/- 0.016, N = 15SE +/- 0.001, N = 31.2611.2371.1130.963-march=native - MIN: 1.08 / MAX: 48.36-march=native - MIN: 0.95 / MAX: 40.28-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 0.93 / MAX: 16.19-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.93 / MAX: 4.841. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.5.4Test: DNN - Deep Neural NetworkIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt20K40K60K80K100KSE +/- 3031.05, N = 12SE +/- 1166.72, N = 15SE +/- 713.19, N = 15SE +/- 245.00, N = 159103566637522477537-march=native -ldl -lm -lpthread -lrt-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect -ldl -lm -lpthread -lrt-march=native -ldl -lm -lpthread -lrt1. (CXX) g++ options: -O3 -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -fvisibility=hidden

I'll need to retest the other setups here, something seems VERY wrong

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: regnety_400mIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 0.55, N = 15SE +/- 0.20, N = 15SE +/- 0.01, N = 15SE +/- 0.01, N = 37.446.805.554.71-march=native - MIN: 5.34 / MAX: 398.2-march=native - MIN: 6.23 / MAX: 15.86-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.92 / MAX: 31.26-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.58 / MAX: 6.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: squeezenet_ssdIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt48121620SE +/- 0.72, N = 15SE +/- 0.52, N = 15SE +/- 0.02, N = 15SE +/- 0.02, N = 315.8914.8513.4112.06-march=native - MIN: 12.62 / MAX: 489.07-march=native - MIN: 13.57 / MAX: 63.1-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 12.41 / MAX: 45.1-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 11.82 / MAX: 17.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: yolov4-tinyIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt48121620SE +/- 0.44, N = 15SE +/- 0.43, N = 15SE +/- 0.13, N = 15SE +/- 0.20, N = 317.6217.3315.3414.05-march=native - MIN: 15.29 / MAX: 370.58-march=native - MIN: 14.49 / MAX: 220.15-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 13.6 / MAX: 60.85-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 13.36 / MAX: 28.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet50Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt510152025SE +/- 0.32, N = 15SE +/- 0.58, N = 15SE +/- 0.11, N = 15SE +/- 0.23, N = 320.0119.4116.3815.22-march=native - MIN: 18.29 / MAX: 72.6-march=native - MIN: 15.46 / MAX: 404.54-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 14.75 / MAX: 169.63-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 14.63 / MAX: 21.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: alexnetIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.06, N = 15SE +/- 0.31, N = 15SE +/- 0.05, N = 15SE +/- 0.01, N = 311.2310.649.298.38-march=native - MIN: 10.78 / MAX: 35.58-march=native - MIN: 8.84 / MAX: 348.67-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 8.58 / MAX: 188.45-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.27 / MAX: 14.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: resnet18Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.65, N = 15SE +/- 0.21, N = 15SE +/- 0.05, N = 15SE +/- 0.03, N = 313.2411.7410.379.23-march=native - MIN: 9.67 / MAX: 377.98-march=native - MIN: 10.66 / MAX: 43.48-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 9.43 / MAX: 107.77-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 9.03 / MAX: 29.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: vgg16Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 1020304050SE +/- 1.09, N = 15SE +/- 0.40, N = 15SE +/- 0.31, N = 3SE +/- 0.15, N = 1543.9740.6137.9137.73-march=native - MIN: 36.31 / MAX: 485.44-march=native - MIN: 38.83 / MAX: 98.35-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 37.18 / MAX: 80.66-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 35.14 / MAX: 288.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: googlenetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.39, N = 15SE +/- 0.27, N = 15SE +/- 0.15, N = 15SE +/- 0.04, N = 310.4310.349.007.63-march=native - MIN: 8.03 / MAX: 223.75-march=native - MIN: 9.1 / MAX: 35.14-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.77 / MAX: 212.02-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.41 / MAX: 17.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: blazefaceIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.27230.54460.81691.08921.3615SE +/- 0.04, N = 15SE +/- 0.04, N = 15SE +/- 0.00, N = 15SE +/- 0.01, N = 31.211.211.020.91-march=native - MIN: 1.05 / MAX: 6.94-march=native - MIN: 0.93 / MAX: 35.68-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 0.93 / MAX: 3.99-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.88 / MAX: 4.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: efficientnet-b0Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.00582.01163.01744.02325.029SE +/- 0.12, N = 15SE +/- 0.20, N = 15SE +/- 0.02, N = 15SE +/- 0.01, N = 34.474.353.372.90-march=native - MIN: 3.96 / MAX: 34.65-march=native - MIN: 3.03 / MAX: 261.53-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.95 / MAX: 16.3-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.83 / MAX: 3.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mnasnetIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.6481.2961.9442.5923.24SE +/- 0.12, N = 15SE +/- 0.12, N = 14SE +/- 0.01, N = 14SE +/- 0.01, N = 32.882.722.121.85-march=native - MIN: 2.43 / MAX: 186.85-march=native - MIN: 1.95 / MAX: 218.37-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 5.63-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.79 / MAX: 2.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: shufflenet-v2Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.7111.4222.1332.8443.555SE +/- 0.31, N = 15SE +/- 0.07, N = 14SE +/- 0.01, N = 14SE +/- 0.00, N = 33.162.672.392.18-march=native - MIN: 2.17 / MAX: 324.42-march=native - MIN: 2.45 / MAX: 10.63-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.14 / MAX: 6.73-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.1 / MAX: 6.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v3-v3 - Model: mobilenet-v3Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.61881.23761.85642.47523.094SE +/- 0.09, N = 15SE +/- 0.12, N = 15SE +/- 0.01, N = 15SE +/- 0.01, N = 32.752.742.171.89-march=native - MIN: 2.43 / MAX: 270.92-march=native - MIN: 1.97 / MAX: 173.37-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.87 / MAX: 11.1-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.83 / MAX: 2.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU-v2-v2 - Model: mobilenet-v2Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.69751.3952.09252.793.4875SE +/- 0.15, N = 15SE +/- 0.12, N = 15SE +/- 0.01, N = 15SE +/- 0.01, N = 33.102.982.382.15-march=native - MIN: 2.73 / MAX: 410-march=native - MIN: 2.18 / MAX: 280.99-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.09 / MAX: 19.16-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.08 / MAX: 5.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: CPU - Model: mobilenetIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.31, N = 15SE +/- 0.34, N = 15SE +/- 0.12, N = 15SE +/- 0.10, N = 311.0610.749.078.63-march=native - MIN: 9.8 / MAX: 493.36-march=native - MIN: 8.38 / MAX: 272.88-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.88 / MAX: 149.11-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 8.28 / MAX: 13.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: shufflenet-v2-10 - Device: CPUIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt10K20K30K40K50KSE +/- 620.08, N = 11SE +/- 181.26, N = 3SE +/- 105.96, N = 3SE +/- 46.94, N = 339909406884289745034-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CXX) g++ options: -O3 -ffunction-sections -fdata-sections -march=native -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.47950.9591.43851.9182.3975SE +/- 0.100, N = 15SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 32.1312.0211.8611.847

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt30K60K90K120K150KSE +/- 6529.38, N = 12SE +/- 2986.13, N = 15SE +/- 95.70, N = 3SE +/- 609.47, N = 3154285.0130527.0126721.089096.6

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3K6K9K12K15KSE +/- 108.98, N = 8SE +/- 24.06, N = 3SE +/- 25.36, N = 3SE +/- 54.22, N = 312945134941356814069-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -pthread -O3 -lm

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Max SP FlopsIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt400K800K1200K1600K2000KSE +/- 87626.15, N = 15SE +/- 38770.78, N = 15SE +/- 76015.19, N = 12SE +/- 44470.87, N = 121542874158154316672321723333-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: fcn-resnet101-11 - Device: CPUIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt20406080100SE +/- 0.75, N = 5SE +/- 0.00, N = 3SE +/- 0.17, N = 3SE +/- 0.67, N = 368747480-mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2ImageIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt7K14K21K28K35KSE +/- 1308.12, N = 13SE +/- 100.69, N = 3SE +/- 274.39, N = 3SE +/- 638.53, N = 1225089.9926580.0932570.1633183.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt130260390520650SE +/- 2.53, N = 3SE +/- 2.34, N = 3SE +/- 1.21, N = 3SE +/- 1.67, N = 3422.78558.72565.28615.24

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P3B2Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt90180270360450409.43403.48401.56353.81

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.49770.99541.49311.99082.4885SE +/- 0.018, N = 9SE +/- 0.016, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 32.2121.3361.3321.325

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the SSMP (OpenMP) version of cp2k. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.2Input: Fayalite-FISTIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt90180270360450403.73398.30382.83368.13

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterPlaidMLFP16: No - Mode: Inference - Network: VGG16 - Device: CPUIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt612182430SE +/- 0.22, N = 4SE +/- 0.24, N = 4SE +/- 0.07, N = 3SE +/- 0.08, N = 319.2319.5722.2923.08

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 65536 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00290.00580.00870.01160.0145SE +/- 0.001, N = 12SE +/- 0.000, N = 15SE +/- 0.000, N = 15SE +/- 0.000, N = 150.0130.0110.0110.010

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qdaIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt20406080100SE +/- 0.88, N = 3SE +/- 0.15, N = 3SE +/- 0.33, N = 3SE +/- 0.12, N = 3109.6596.4695.9746.60

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt30K60K90K120K150KSE +/- 701.15, N = 3SE +/- 1142.56, N = 15SE +/- 86.06, N = 3SE +/- 195.44, N = 3159253143667141138109893

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.10Model: super-resolution-10 - Device: CPUIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 10002000300040005000SE +/- 5.07, N = 3SE +/- 7.52, N = 3SE +/- 10.54, N = 3SE +/- 24.67, N = 34417457945934662-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto -fno-fat-lto-objects -ldl -lrt -pthread -lpthread

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregressionIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.69981.39962.09942.79923.499SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 15SE +/- 0.02, N = 153.112.291.641.53

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.01240.02480.03720.04960.062SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 15SE +/- 0.000, N = 30.0550.0210.0210.015

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenCL - Kernel: Points2ImageIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt4K8K12K16K20KSE +/- 137.15, N = 3SE +/- 177.47, N = 3SE +/- 375.06, N = 12SE +/- 145.81, N = 315404.2815525.6615539.0716397.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt500K1000K1500K2000K2500KSE +/- 8085.58, N = 3SE +/- 1345.60, N = 3SE +/- 10245.31, N = 3SE +/- 2696.20, N = 32213513202468020224771564193

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt0.28420.56840.85261.13681.421SE +/- 0.01507, N = 3SE +/- 0.00934, N = 10SE +/- 0.00123, N = 3SE +/- 0.00925, N = 31.263291.189101.143640.94957

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: regnety_400mIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.31182.62363.93545.24726.559SE +/- 1.34, N = 3SE +/- 0.09, N = 15SE +/- 0.05, N = 9SE +/- 0.02, N = 35.834.283.713.46-march=native - MIN: 3.47 / MAX: 39.09-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.49 / MAX: 28.51-march=native - MIN: 3.44 / MAX: 27.73-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.39 / MAX: 9.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: squeezenet_ssdIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.06432.12863.19294.25725.3215SE +/- 0.35, N = 3SE +/- 0.04, N = 15SE +/- 0.06, N = 9SE +/- 0.01, N = 34.734.634.303.74-march=native - MIN: 3.77 / MAX: 44.23-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.77 / MAX: 53.14-march=native - MIN: 3.76 / MAX: 18.02-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.67 / MAX: 6.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: yolov4-tinyIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 1.16, N = 3SE +/- 0.03, N = 15SE +/- 0.02, N = 9SE +/- 0.03, N = 310.038.168.147.63-march=native - MIN: 7.85 / MAX: 80.22-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.45 / MAX: 57.64-march=native - MIN: 7.82 / MAX: 19.98-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.42 / MAX: 9.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet50Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 1.23, N = 3SE +/- 0.08, N = 15SE +/- 0.04, N = 9SE +/- 0.01, N = 37.466.646.115.63-march=native - MIN: 5.65 / MAX: 33.99-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.66 / MAX: 49.12-march=native - MIN: 5.6 / MAX: 31.19-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.55 / MAX: 11.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: alexnetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 0.55, N = 3SE +/- 0.06, N = 15SE +/- 0.14, N = 9SE +/- 0.01, N = 36.846.696.415.75-march=native - MIN: 5.56 / MAX: 36.9-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 5.55 / MAX: 48.95-march=native - MIN: 5.54 / MAX: 27.79-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.49 / MAX: 12.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: resnet18Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.8911.7822.6733.5644.455SE +/- 1.09, N = 3SE +/- 0.05, N = 15SE +/- 0.09, N = 7SE +/- 0.04, N = 33.963.042.341.96-march=native - MIN: 2.04 / MAX: 34.37-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.03 / MAX: 39.18-march=native - MIN: 1.95 / MAX: 34.31-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.89 / MAX: 9.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: vgg16Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt510152025SE +/- 2.77, N = 3SE +/- 0.12, N = 15SE +/- 0.08, N = 9SE +/- 0.07, N = 319.8017.5816.9616.43-march=native - MIN: 15.66 / MAX: 57.67-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 15.73 / MAX: 64.58-march=native - MIN: 15.75 / MAX: 62.69-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 15.67 / MAX: 43.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: googlenetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 1.19, N = 3SE +/- 0.07, N = 15SE +/- 0.06, N = 9SE +/- 0.01, N = 37.095.415.104.69-march=native - MIN: 4.73 / MAX: 34.54-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 4.74 / MAX: 36.71-march=native - MIN: 4.68 / MAX: 29.19-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 4.63 / MAX: 7.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: blazefaceIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.60981.21961.82942.43923.049SE +/- 0.30, N = 3SE +/- 0.03, N = 15SE +/- 0.02, N = 9SE +/- 0.04, N = 32.711.691.431.03-march=native - MIN: 1.3 / MAX: 55.08-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.29 / MAX: 28.85-march=native - MIN: 1.17 / MAX: 22.73-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 0.85 / MAX: 22.571. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: efficientnet-b0Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 1.47, N = 3SE +/- 0.04, N = 15SE +/- 0.05, N = 9SE +/- 0.01, N = 39.347.727.657.20-march=native - MIN: 7.17 / MAX: 38.24-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 7.18 / MAX: 34.33-march=native - MIN: 7.2 / MAX: 28.56-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 7.08 / MAX: 11.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mnasnetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.96081.92162.88243.84324.804SE +/- 1.10, N = 3SE +/- 0.09, N = 15SE +/- 0.06, N = 9SE +/- 0.01, N = 34.273.402.722.48-march=native - MIN: 2.5 / MAX: 32.54-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.5 / MAX: 33.07-march=native - MIN: 2.46 / MAX: 25.53-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.4 / MAX: 6.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: shufflenet-v2Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.82131.64262.46393.28524.1065SE +/- 0.97, N = 3SE +/- 0.04, N = 15SE +/- 0.02, N = 9SE +/- 0.00, N = 33.652.682.071.80-march=native - MIN: 1.88 / MAX: 35.3-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 1.86 / MAX: 25.24-march=native - MIN: 1.82 / MAX: 24.55-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 1.77 / MAX: 3.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.0712.1423.2134.2845.355SE +/- 0.44, N = 3SE +/- 0.05, N = 15SE +/- 0.05, N = 9SE +/- 0.05, N = 34.764.023.643.33-march=native - MIN: 3.33 / MAX: 28.56-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 3.28 / MAX: 27.17-march=native - MIN: 3.28 / MAX: 24.44-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 3.22 / MAX: 15.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.91131.82262.73393.64524.5565SE +/- 0.50, N = 3SE +/- 0.04, N = 15SE +/- 0.04, N = 9SE +/- 0.01, N = 34.053.192.612.41-march=native - MIN: 2.44 / MAX: 30.19-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 2.43 / MAX: 25.75-march=native - MIN: 2.38 / MAX: 19.16-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 2.33 / MAX: 6.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20210720Target: Vulkan GPU - Model: mobilenetIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 0.10, N = 3SE +/- 0.07, N = 15SE +/- 0.06, N = 9SE +/- 0.05, N = 37.406.866.726.17-march=native - MIN: 6.41 / MAX: 52.12-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 6.23 / MAX: 45.95-march=native - MIN: 6.36 / MAX: 35.89-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 5.85 / MAX: 8.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread -pthread

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt400K800K1200K1600K2000KSE +/- 2113.91, N = 3SE +/- 380.18, N = 3SE +/- 2059.01, N = 3SE +/- 972.87, N = 31837950180152017838571413363

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: TensorFlow - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00470.00940.01410.01880.0235SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 40.0210.0200.0200.020

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: S3DIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt48121620SE +/- 0.18, N = 15SE +/- 0.12, N = 15SE +/- 0.11, N = 10SE +/- 0.12, N = 1512.2413.6613.7013.96-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - DegriddingIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt400800120016002000SE +/- 15.85, N = 3SE +/- 1.70, N = 3SE +/- 2.06, N = 3SE +/- 0.85, N = 31966.151970.572004.442023.161. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - GriddingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt30060090012001500SE +/- 0.43, N = 3SE +/- 2.99, N = 3SE +/- 0.45, N = 3SE +/- 0.74, N = 31171.041231.731255.801256.301. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt20K40K60K80K100KSE +/- 70.47, N = 3SE +/- 153.26, N = 3SE +/- 1032.89, N = 5SE +/- 1007.74, N = 395569.393959.493377.474706.9

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.29270.58540.87811.17081.4635SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 31.3011.2991.2991.256

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt20K40K60K80K100KSE +/- 477.30, N = 3SE +/- 995.15, N = 3SE +/- 147.61, N = 3SE +/- 254.55, N = 399378.698208.897487.478810.5

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.17530.35060.52590.70120.8765SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.7790.6440.6370.635

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00770.01540.02310.03080.0385SE +/- 0.001, N = 12SE +/- 0.000, N = 15SE +/- 0.000, N = 15SE +/- 0.000, N = 150.0340.0280.0280.027

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 2K4K6K8K10KSE +/- 32.30, N = 3SE +/- 67.32, N = 3SE +/- 5.76, N = 3SE +/- 6.21, N = 39116.469209.759558.6610224.56-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CC) gcc options: -O3 -mavx2

Nebular Empirical Analysis Tool

NEAT is the Nebular Empirical Analysis Tool for empirical analysis of ionised nebulae, with uncertainty propagation. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNebular Empirical Analysis Tool 2.3Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt612182430SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.66, N = 1526.9826.6826.1725.681. (F9X) gfortran options: -O3 -cpp -ffree-line-length-0 -Jsource/ -fopenmp -fno-backtrace -lcfitsio

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_icaIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt816243240SE +/- 0.29, N = 9SE +/- 0.26, N = 12SE +/- 0.02, N = 3SE +/- 0.07, N = 336.4314.5812.3912.02

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.05020.10040.15060.20080.251SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 150.2230.0900.0880.069

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.19060.38120.57180.76240.953SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.003, N = 30.8470.8450.8370.815

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: TensorFlow - Project Size: 65536 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00050.0010.00150.0020.0025SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 30.0020.0020.0010.001

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt1.17112.34223.51334.68445.8555SE +/- 0.007819, N = 3SE +/- 0.027431, N = 3SE +/- 0.019734, N = 3SE +/- 0.005750, N = 34.7237924.9095974.9653315.205018-mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -O3 -march=native -fopenmp

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt14002800420056007000SE +/- 73.38, N = 3SE +/- 4.26, N = 3SE +/- 26.31, N = 3SE +/- 9.91, N = 35458.445932.116218.416321.611. (CXX) g++ options: -O3 -fopenmp -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lrt -lpthread -ldl

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Euclidean ClusterIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt400800120016002000SE +/- 12.00, N = 15SE +/- 27.64, N = 15SE +/- 3.29, N = 3SE +/- 12.05, N = 31355.371439.631615.651649.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00380.00760.01140.01520.019SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0170.0160.0160.016

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt17M34M51M68M85MSE +/- 688051.71, N = 3SE +/- 396295.91, N = 3SE +/- 132255.44, N = 3SE +/- 37071.05, N = 372234187761950007686869377469363-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CXX) g++ options: -O3 -fopenmp

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.30220.60440.90661.20881.511SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 31.3431.3271.3251.313

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT MappingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt2004006008001000SE +/- 26.57, N = 15SE +/- 10.71, N = 5SE +/- 4.83, N = 3SE +/- 4.87, N = 3821.38993.751008.651099.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0070.0140.0210.0280.035SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 30.0310.0130.0130.013

DeepSpeech

Mozilla DeepSpeech is a speech-to-text engine powered by TensorFlow for machine learning and derived from Baidu's Deep Speech research paper. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPUIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1530456075SE +/- 0.20, N = 3SE +/- 0.25, N = 3SE +/- 0.29, N = 3SE +/- 0.28, N = 365.9654.5549.0848.78

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: GEMM SGEMM_NIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 60120180240300SE +/- 3.17, N = 15SE +/- 4.42, N = 12SE +/- 0.87, N = 3SE +/- 1.08, N = 3228.41254.60263.16284.99-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt60120180240300SE +/- 4.49, N = 15SE +/- 1.70, N = 15SE +/- 0.58, N = 3SE +/- 0.23, N = 3243.80253.55255.32264.781. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt110220330440550SE +/- 0.45, N = 3SE +/- 2.18, N = 15SE +/- 1.06, N = 3SE +/- 0.22, N = 3486.56262.51224.71216.87-march=native - MIN: 479.58 / MAX: 559.42-march=native - MIN: 221.83 / MAX: 377.5-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 219.21 / MAX: 248.29-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 214.9 / MAX: 226.181. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svmIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 3691215SE +/- 0.54, N = 12SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 1513.0610.837.196.95

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: TensorFlow - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00110.00220.00330.00440.0055SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 150.0050.0050.0040.004

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00050.0010.00150.0020.0025SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0020.0020.0020.002

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.04030.08060.12090.16120.2015SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1790.1780.1750.168

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.05110.10220.15330.20440.2555SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.2270.2220.2220.219

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.03830.07660.11490.15320.1915SE +/- 0.004, N = 12SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1700.1340.1340.133

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.00320.00640.00960.01280.016SE +/- 0.001, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0140.0130.0130.012

R Benchmark

This test is a quick-running survey of general R performance Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterR BenchmarkIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.02470.04940.07410.09880.1235SE +/- 0.0057, N = 12SE +/- 0.0013, N = 3SE +/- 0.0004, N = 3SE +/- 0.0004, N = 30.10970.10960.10700.10381. R scripting front-end version 4.0.4 (2021-02-15)

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.0020.0040.0060.0080.01SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 30.0090.0090.0090.008

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.03240.06480.09720.12960.162SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1440.1440.1430.137

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 65536 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00090.00180.00270.00360.0045SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0040.0020.0020.001

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Isoneutral MixingIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00110.00220.00330.00440.0055SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 150.0050.0050.0050.004

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 65536 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00270.00540.00810.01080.0135SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0120.0110.0110.011

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.0020.0040.0060.0080.01SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0090.0080.0080.007

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: TensorFlow - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt0.02140.04280.06420.08560.107SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.0950.0940.0930.093

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 1048576 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.10330.20660.30990.41320.5165SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.4590.4540.4160.409

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 1048576 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.11230.22460.33690.44920.5615SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 30.4990.2850.2810.280

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.06590.13180.19770.26360.3295SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 30.2930.2900.2900.285

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Isoneutral MixingIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00090.00180.00270.00360.0045SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0040.0040.0040.003

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00070.00140.00210.00280.0035SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 150.0030.0020.0020.002

RNNoise

RNNoise is a recurrent neural network for audio noise reduction developed by Mozilla and Xiph.Org. This test profile is a single-threaded test measuring the time to denoise a sample 26 minute long 16-bit RAW audio file using this recurrent neural network noise suppression library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt48121620SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 317.5116.7916.5216.10-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -O3 -pedantic -fvisibility=hidden

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt50100150200250SE +/- 1.05, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 3237.60235.53230.46225.76-march=native - MIN: 235.29 / MAX: 265.95-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 235.23 / MAX: 237-march=native - MIN: 230.05 / MAX: 230.91-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 225.47 / MAX: 226.771. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.02270.04540.06810.09080.1135SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1010.0970.0900.089

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.01370.02740.04110.05480.0685SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0610.0610.0600.060

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.00250.0050.00750.010.0125SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0110.0110.0110.010

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0250.050.0750.10.125SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1110.0540.0520.052

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00290.00580.00870.01160.0145SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0130.0050.0050.004

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 4194304 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt0.00830.01660.02490.03320.0415SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0370.0300.0290.028

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt20K40K60K80K100KSE +/- 827.63, N = 4SE +/- 367.77, N = 3SE +/- 688.36, N = 15SE +/- 950.48, N = 37857380667100155102333-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -pthread -O3 -lm

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.04550.0910.13650.1820.2275SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2020.2010.2010.194

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 16384 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00020.00040.00060.00080.001SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0010.0010.0010.001

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00020.00040.00060.00080.001SE +/- 0.000, N = 3SE +/- 0.000, N = 15SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0010.0010.0010.001

Dolfyn

Dolfyn is a Computational Fluid Dynamics (CFD) code of modern numerical simulation techniques. The Dolfyn test profile measures the execution time of the bundled computational fluid dynamics demos that are bundled with Dolfyn. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDolfyn 0.527Computational Fluid DynamicsIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt3691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 311.1611.0611.0410.69

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - DegriddingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt8001600240032004000SE +/- 9.94, N = 3SE +/- 10.37, N = 3SE +/- 25.80, N = 5SE +/- 0.00, N = 32812.632873.343423.093550.081. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - GriddingIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt400800120016002000SE +/- 10.00, N = 3SE +/- 9.49, N = 3SE +/- 18.90, N = 5SE +/- 4.31, N = 31484.831584.971826.951853.311. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 262144 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.00970.01940.02910.03880.0485SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0430.0430.0430.042

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: ReductionIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt510152025SE +/- 0.15, N = 3SE +/- 0.08, N = 3SE +/- 0.29, N = 3SE +/- 0.10, N = 316.8117.9121.2121.88-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterECP-CANDLE 0.4Benchmark: P1B2Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt71421283531.7928.6928.4126.15

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: PyTorch - Project Size: 16384 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.0020.0040.0060.0080.01SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0090.0030.0030.003

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 4K8K12K16K20KSE +/- 29.78, N = 3SE +/- 117.70, N = 3SE +/- 33.69, N = 3SE +/- 141.46, N = 1417876182791830318379-march=native-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect1. (CC) gcc options: -pthread -O3 -lm

GNU Octave Benchmark

This test profile measures how long it takes to complete several reference GNU Octave files via octave-benchmark. GNU Octave is used for numerical computations and is an open-source alternative to MATLAB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGNU Octave Benchmark 6.1.1~hg.2021.01.26Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1.16242.32483.48724.64965.812SE +/- 0.024, N = 5SE +/- 0.039, N = 5SE +/- 0.025, N = 5SE +/- 0.017, N = 55.1665.1085.0694.950

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.00990.01980.02970.03960.0495SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0440.0440.0440.042

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: JAX - Project Size: 16384 - Benchmark: Isoneutral MixingIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00050.0010.00150.0020.0025SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0020.0020.0020.002

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 16384 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00020.00040.00060.00080.001SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0010.0010.0010.001

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNAIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt246810SE +/- 0.069, N = 3SE +/- 0.009, N = 3SE +/- 0.051, N = 3SE +/- 0.031, N = 38.1838.0137.9157.6671. (CC) gcc options: -std=c99 -O3 -lm -lpthread

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: TriadIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt510152025SE +/- 0.29, N = 15SE +/- 0.20, N = 15SE +/- 0.16, N = 15SE +/- 0.20, N = 415.6317.5018.3818.43-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native-march=native1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 65536 - Benchmark: Isoneutral MixingIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00540.01080.01620.02160.027SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 13SE +/- 0.000, N = 30.0240.0240.0210.021

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 32Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt20K40K60K80K100KSE +/- 820.18, N = 3SE +/- 618.46, N = 15SE +/- 1461.14, N = 15SE +/- 650.31, N = 375032758867966883426-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -pthread -O3 -lm

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 262144 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.01190.02380.03570.04760.0595SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0530.0530.0520.051

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed ReadbackIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt918273645SE +/- 1.21, N = 15SE +/- 1.03, N = 14SE +/- 1.52, N = 12SE +/- 1.55, N = 1531.3136.0437.1938.43-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

OpenBenchmarking.orgGB/s, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: Bus Speed DownloadIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt918273645SE +/- 1.01, N = 12SE +/- 1.31, N = 15SE +/- 1.47, N = 15SE +/- 1.64, N = 1530.5435.1938.2338.53-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numba - Project Size: 1048576 - Benchmark: Equation of StateIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt0.00810.01620.02430.03240.0405SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0360.0360.0350.034

Scikit-Learn

Scikit-learn is a Python module for machine learning Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterScikit-Learn 0.22.1Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 246810SE +/- 0.002, N = 3SE +/- 0.019, N = 3SE +/- 0.045, N = 3SE +/- 0.009, N = 38.1936.1675.0694.883

Another strange result, merits more testing

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Aesara - Project Size: 65536 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00070.00140.00210.00280.0035SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0030.0030.0030.003

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt5K10K15K20K25KSE +/- 1.86, N = 3SE +/- 20.34, N = 3SE +/- 122.88, N = 3SE +/- 13.78, N = 322739227472311723240-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 32Intel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt5K10K15K20K25KSE +/- 214.39, N = 3SE +/- 54.44, N = 3SE +/- 178.70, N = 3SE +/- 102.46, N = 322796228612347723622-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native-march=native1. (CC) gcc options: -pthread -O3 -lm

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt1122334455SE +/- 0.22, N = 3SE +/- 0.32, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 346.5945.9045.5044.74-march=native - MIN: 45.39 / MAX: 53.21-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect - MIN: 45.05 / MAX: 46.7-march=native - MIN: 45.08 / MAX: 45.93-mno-amx-tile -mno-amx-int8 -mno-amx-bf16 - MIN: 44.37 / MAX: 45.81. (CXX) g++ options: -O3 -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32Intel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xt7K14K21K28K35KSE +/- 6.89, N = 3SE +/- 192.25, N = 3SE +/- 235.28, N = 3SE +/- 25.36, N = 329780300473005932202-march=native-march=native-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-mno-amx-tile -mno-amx-int8 -mno-amx-bf161. (CC) gcc options: -pthread -O3 -lm

PyHPC Benchmarks

PyHPC-Benchmarks is a suite of Python high performance computing benchmarks for execution on CPUs and GPUs using various popular Python HPC libraries. The PyHPC CPU-based benchmarks focus on sequential CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPyHPC Benchmarks 3.0Device: CPU - Backend: Numpy - Project Size: 16384 - Benchmark: Equation of StateIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt 0.00050.0010.00150.0020.0025SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0020.0020.0020.002

SHOC Scalable HeterOgeneous Computing

The CUDA and OpenCL version of Vetter's Scalable HeterOgeneous Computing benchmark suite. SHOC provides a number of different benchmark programs for evaluating the performance and stability of compute devices. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterSHOC Scalable HeterOgeneous Computing 2020-04-17Target: OpenCL - Benchmark: FFT SPIntel Core i7-12700k Golden Cove Pcore AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xt Intel Core i7-12700k Golden Cove Pcore ONLY no AVX512 DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Pcore AVX512-SAPPHIRE RAPIDS COMPILER SHENANIGANS- DDR4 3200 CL 15 16 16 36 RX 5600xtIntel Core i7-12700k Golden Cove Pcore + Gracemont Ecore DDR4 3200 CL 15 16 16 36 RX 5600xt20406080100SE +/- 1.24, N = 3SE +/- 0.45, N = 3SE +/- 0.58, N = 3SE +/- 0.08, N = 387.6298.1499.88101.12-march=native -mavx512f -mavx512dq -mavx512ifma -mavx512cd -mavx512bw -mavx512vl -mavx512bf16 -mavx512vbmi -mavx512vbmi2 -mavx512vnni -mavx512bitalg -mavx512vpopcntdq -mavx512vp2intersect-march=native-mno-amx-tile -mno-amx-int8 -mno-amx-bf16-march=native1. (CXX) g++ options: -O3 -lSHOCCommonMPI -lSHOCCommonOpenCL -lSHOCCommon -lOpenCL -lrt -Xlinker --enable-new-dtags -rpath -lmpicxx -lmpifort -lmpi -lpthread -ldl

153 Results Shown

FFTW
CloverLeaf
PlaidML
TNN
ONNX Runtime
ECP-CANDLE
Mobile Neural Network:
  inception-v3
  mobilenet-v1-1.0
  MobileNetV2_224
  SqueezeNetV1.0
  resnet-v2-50
  squeezenetv1.1
  mobilenetV3
OpenCV
NCNN:
  CPU - regnety_400m
  CPU - squeezenet_ssd
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
ONNX Runtime
PyHPC Benchmarks
TensorFlow Lite
FFTW
SHOC Scalable HeterOgeneous Computing
ONNX Runtime
Darmstadt Automotive Parallel Heterogeneous Suite
Numpy Benchmark
ECP-CANDLE
PyHPC Benchmarks
CP2K Molecular Dynamics
PlaidML
PyHPC Benchmarks
Mlpack Benchmark
TensorFlow Lite
ONNX Runtime
Mlpack Benchmark
PyHPC Benchmarks
Darmstadt Automotive Parallel Heterogeneous Suite
TensorFlow Lite
NAMD
NCNN:
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
TensorFlow Lite
PyHPC Benchmarks
SHOC Scalable HeterOgeneous Computing
ASKAP:
  tConvolve MT - Degridding
  tConvolve MT - Gridding
TensorFlow Lite
PyHPC Benchmarks
TensorFlow Lite
PyHPC Benchmarks:
  CPU - JAX - 4194304 - Isoneutral Mixing
  CPU - JAX - 262144 - Isoneutral Mixing
Himeno Benchmark
Nebular Empirical Analysis Tool
Mlpack Benchmark
PyHPC Benchmarks:
  CPU - PyTorch - 4194304 - Equation of State
  CPU - Numba - 4194304 - Isoneutral Mixing
  CPU - TensorFlow - 65536 - Equation of State
ACES DGEMM
miniFE
Darmstadt Automotive Parallel Heterogeneous Suite
PyHPC Benchmarks
Kripke
PyHPC Benchmarks
Darmstadt Automotive Parallel Heterogeneous Suite
PyHPC Benchmarks
DeepSpeech
SHOC Scalable HeterOgeneous Computing
ASKAP
TNN
Mlpack Benchmark
PyHPC Benchmarks:
  CPU - TensorFlow - 262144 - Equation of State
  CPU - Numba - 65536 - Equation of State
  CPU - Aesara - 4194304 - Equation of State
  CPU - Numpy - 1048576 - Equation of State
  CPU - JAX - 1048576 - Isoneutral Mixing
  CPU - Numpy - 65536 - Equation of State
R Benchmark
PyHPC Benchmarks:
  CPU - Numba - 262144 - Equation of State
  CPU - Numba - 4194304 - Equation of State
  CPU - PyTorch - 65536 - Equation of State
  CPU - Numpy - 16384 - Isoneutral Mixing
  CPU - Numba - 65536 - Isoneutral Mixing
  CPU - JAX - 1048576 - Equation of State
  CPU - TensorFlow - 4194304 - Equation of State
  CPU - Numpy - 1048576 - Isoneutral Mixing
  CPU - PyTorch - 1048576 - Isoneutral Mixing
  CPU - Aesara - 1048576 - Isoneutral Mixing
  CPU - Aesara - 16384 - Isoneutral Mixing
  CPU - Numba - 16384 - Isoneutral Mixing
RNNoise
TNN
PyHPC Benchmarks:
  CPU - Numpy - 262144 - Isoneutral Mixing
  CPU - Aesara - 262144 - Isoneutral Mixing
  CPU - Aesara - 262144 - Equation of State
  CPU - PyTorch - 262144 - Isoneutral Mixing
  CPU - PyTorch - 262144 - Equation of State
  CPU - JAX - 4194304 - Equation of State
FFTW
PyHPC Benchmarks:
  CPU - Numba - 1048576 - Isoneutral Mixing
  CPU - Aesara - 16384 - Equation of State
  CPU - JAX - 262144 - Equation of State
Dolfyn
ASKAP:
  tConvolve OpenMP - Degridding
  tConvolve OpenMP - Gridding
PyHPC Benchmarks
SHOC Scalable HeterOgeneous Computing
ECP-CANDLE
PyHPC Benchmarks
FFTW
GNU Octave Benchmark
PyHPC Benchmarks:
  CPU - Aesara - 1048576 - Equation of State
  CPU - JAX - 16384 - Isoneutral Mixing
  CPU - Numba - 16384 - Equation of State
Timed MAFFT Alignment
SHOC Scalable HeterOgeneous Computing
PyHPC Benchmarks
FFTW
PyHPC Benchmarks
SHOC Scalable HeterOgeneous Computing:
  OpenCL - Bus Speed Readback
  OpenCL - Bus Speed Download
PyHPC Benchmarks
Scikit-Learn
PyHPC Benchmarks
FFTW:
  Stock - 1D FFT Size 32
  Stock - 2D FFT Size 32
TNN
FFTW
PyHPC Benchmarks
SHOC Scalable HeterOgeneous Computing