Google Cloud c3 Sapphire Rapids

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2303218-PTS-2303217N47

Result Identifier: c3-highcpu-8 SPR
Test Date: March 21 2023
Test Run Duration: 13 Hours, 29 Minutes


Google Cloud C3 Sapphire Rapids Performance - System Details

Processor: Intel Xeon Platinum 8481C (4 Cores / 8 Threads)
Motherboard: Google Compute Engine c3-highcpu-8
Chipset: Intel 440FX 82441FX PMC
Memory: 16GB
Disk: 322GB nvme_card-pd
Network: Google Compute Engine Virtual
OS: Ubuntu 22.10
Kernel: 5.19.0-1015-gcp (x86_64)
Vulkan: 1.3.224
Compiler: GCC 12.2.0
File-System: ext4
System Layer: KVM

System Logs:
- Transparent Huge Pages: madvise
- Compiler configured with: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- CPU Microcode: 0xffffffff
- Python 3.10.7
- Security: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected


nekRS

nekRS is an open-source Navier-Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming. Learn more via the OpenBenchmarking.org test page.

nekRS 22.0, Input: TurboPipe Periodic - FLOP/s, more is better - c3-highcpu-8 SPR: 30667900000 (SE +/- 92013205.57, N = 3)

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss, which was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13, Model: Bird Strike on Windshield - Seconds, fewer is better - c3-highcpu-8 SPR: 595.14 (SE +/- 0.30, N = 3)

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenCV 4.7, Test: Image Processing - ms, fewer is better - c3-highcpu-8 SPR: 128163 (SE +/- 1624.35, N = 12)

OpenCV 4.7, Test: Stitching - ms, fewer is better - c3-highcpu-8 SPR: 214760 (SE +/- 1973.06, N = 7)

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data-intensive applications. This test profile uses a serverless CockroachDB configuration to test various CockroachDB workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.
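For readers wanting to approximate these workloads by hand outside of the Phoronix Test Suite, a minimal sketch against a local single-node cluster could look like the following (illustrative flags and connection string, not necessarily the exact invocation used by the test profile):

    cockroach start-single-node --insecure --store=/tmp/crdb-bench --background
    cockroach workload init kv 'postgresql://root@localhost:26257?sslmode=disable'
    # KV workload with 95% reads and 128 concurrent workers for 60 seconds
    cockroach workload run kv --read-percent=95 --concurrency=128 --duration=60s \
        'postgresql://root@localhost:26257?sslmode=disable'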

CockroachDB 22.2, Workload: MoVR - Concurrency: 128 - ops/s, more is better - c3-highcpu-8 SPR: 458.8 (SE +/- 5.62, N = 15)

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries. Learn more via the OpenBenchmarking.org test page.
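As a rough point of reference, the underlying reference benchmark can be driven by hand along these lines (repository path and flag values are illustrative, not the exact arguments the test profile passes):

    git clone https://github.com/tensorflow/benchmarks.git
    cd benchmarks/scripts/tf_cnn_benchmarks
    # ResNet-50 CPU throughput at batch size 64
    python3 tf_cnn_benchmarks.py --device=cpu --data_format=NHWC \
        --model=resnet50 --batch_size=64 --num_batches=100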

TensorFlow 2.10, Device: CPU - Batch Size: 64 - Model: ResNet-50 - images/sec, more is better - c3-highcpu-8 SPR: 15.69 (SE +/- 0.01, N = 3)

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and is part of the Intel oneAPI Rendering Toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 1.3.1, Benchmark: vklBenchmark ISPC - Items / Sec, more is better - c3-highcpu-8 SPR: 98 (SE +/- 0.33, N = 3; MIN: 11 / MAX: 1579)

VVenC

VVenC 1.7, Video Input: Bosphorus 4K - Video Preset: Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 1.556 (SE +/- 0.005, N = 3)

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Layered Halfspace - Seconds, fewer is better - c3-highcpu-8 SPR: 372.09 (SE +/- 0.64, N = 3)

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss, which was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13, Model: Rubber O-Ring Seal Installation - Seconds, fewer is better - c3-highcpu-8 SPR: 367.00 (SE +/- 0.20, N = 3)

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28, Backend: Eigen - Nodes Per Second, more is better - c3-highcpu-8 SPR: 1221 (SE +/- 7.69, N = 3)

LeelaChessZero 0.28, Backend: BLAS - Nodes Per Second, more is better - c3-highcpu-8 SPR: 1272 (SE +/- 16.59, N = 3)

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Water-layered Halfspace - Seconds, fewer is better - c3-highcpu-8 SPR: 321.63 (SE +/- 0.31, N = 3)

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.
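A hand-run approximation with mysqlslap might look like this (flag values illustrative; the test profile's exact query mix and server tuning may differ):

    mysqlslap --user=root --auto-generate-sql \
        --concurrency=4096 --number-of-queries=100000 --iterations=3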

MariaDB 11.0.1, Clients: 4096 - Queries Per Second, more is better - c3-highcpu-8 SPR: 317 (SE +/- 2.62, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

Blender 3.4, Blend File: BMW27 - Compute: CPU-Only - Seconds, fewer is better - c3-highcpu-8 SPR: 315.24 (SE +/- 1.13, N = 3)

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

MariaDB 11.0.1, Clients: 2048 - Queries Per Second, more is better - c3-highcpu-8 SPR: 332 (SE +/- 3.01, N = 3)

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss, which was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13, Model: Bumper Beam - Seconds, fewer is better - c3-highcpu-8 SPR: 303.38 (SE +/- 0.51, N = 3)

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray tracing and is part of the Intel oneAPI Rendering Toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 1.4.0, Run: RTLightmap.hdr.4096x4096 - Images / Sec, more is better - c3-highcpu-8 SPR: 0.12 (SE +/- 0.00, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.
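The manual equivalent is essentially a timed default-configuration build from a kernel source tree (assuming the usual build dependencies are installed):

    make defconfig
    time make -j"$(nproc)"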

Timed Linux Kernel Compilation 6.1, Build: defconfig - Seconds, fewer is better - c3-highcpu-8 SPR: 244.80 (SE +/- 0.64, N = 3)

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.10, Device: CPU - Batch Size: 32 - Model: ResNet-50 - images/sec, more is better - c3-highcpu-8 SPR: 14.93 (SE +/- 0.01, N = 3)

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

BRL-CAD 7.34 - VGR Performance Metric, more is better - c3-highcpu-8 SPR: 71072

GROMACS

GROMACS (GROningen MAchine for Chemical Simulations) is a molecular dynamics package tested here with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare - Ns Per Day, more is better - c3-highcpu-8 SPR: 0.777 (SE +/- 0.001, N = 3)

OpenRadioss

OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss, which was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenRadioss 2022.10.13, Model: Cell Phone Drop Test - Seconds, fewer is better - c3-highcpu-8 SPR: 219.73 (SE +/- 0.38, N = 3)

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenCV 4.7, Test: Graph API - ms, fewer is better - c3-highcpu-8 SPR: 219931 (SE +/- 931.36, N = 3)

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Homogeneous Halfspace - Seconds, fewer is better - c3-highcpu-8 SPR: 179.55 (SE +/- 0.09, N = 3)

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
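The integrated benchmark can also be run directly with the 7z command-line tool, for example:

    7z b            # run the built-in compression/decompression benchmark
    7z b -mmt8      # optionally pin the benchmark to 8 threads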

7-Zip Compression 22.01, Test: Decompression Rating - MIPS, more is better - c3-highcpu-8 SPR: 20468 (SE +/- 183.58, N = 15)

7-Zip Compression 22.01, Test: Compression Rating - MIPS, more is better - c3-highcpu-8 SPR: 35306 (SE +/- 246.17, N = 15)

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
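Roughly equivalent manual invocations of openssl speed would be along these lines (algorithm names and thread count are illustrative, not the exact arguments the test profile uses):

    openssl speed -multi "$(nproc)" -evp aes-256-gcm
    openssl speed -multi "$(nproc)" -evp chacha20-poly1305
    openssl speed -multi "$(nproc)" sha256
    openssl speed -multi "$(nproc)" rsa4096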

OpenSSL 3.1, Algorithm: SHA256 - byte/s, more is better - c3-highcpu-8 SPR: 4283873987 (SE +/- 2722265.85, N = 3)

OpenSSL 3.1, Algorithm: ChaCha20-Poly1305 - byte/s, more is better - c3-highcpu-8 SPR: 15970781140 (SE +/- 21990039.72, N = 3)

OpenSSL 3.1, Algorithm: AES-256-GCM - byte/s, more is better - c3-highcpu-8 SPR: 48008361573 (SE +/- 48074910.93, N = 3)

OpenSSL 3.1, Algorithm: AES-128-GCM - byte/s, more is better - c3-highcpu-8 SPR: 57594077823 (SE +/- 33921495.47, N = 3)

OpenSSL 3.1, Algorithm: ChaCha20 - byte/s, more is better - c3-highcpu-8 SPR: 22091557637 (SE +/- 35781789.46, N = 3)

OpenSSL 3.1, Algorithm: SHA512 - byte/s, more is better - c3-highcpu-8 SPR: 1568572920 (SE +/- 1152941.98, N = 3)

VVenC

VVenC 1.7, Video Input: Bosphorus 4K - Video Preset: Faster - Frames Per Second, more is better - c3-highcpu-8 SPR: 3.524 (SE +/- 0.000, N = 3)

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14, ATPase Simulation - 327,506 Atoms - days/ns, fewer is better - c3-highcpu-8 SPR: 3.35779 (SE +/- 0.00254, N = 3)

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Execution Time - Seconds, fewer is better - c3-highcpu-8 SPR: 422.68

OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Mesh Time - Seconds, fewer is better - c3-highcpu-8 SPR: 62.04

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Tomographic Model - Seconds, fewer is better - c3-highcpu-8 SPR: 143.94 (SE +/- 1.65, N = 3)

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
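A hand-run equivalent of the read-only configurations shown below would be roughly as follows (database name and duration are illustrative):

    createdb pgbench_test
    pgbench -i -s 100 pgbench_test             # initialize at scaling factor 100
    pgbench -S -c 800 -j 8 -T 60 pgbench_test  # select-only, 800 clients, 60 seconds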

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency - ms, fewer is better - c3-highcpu-8 SPR: 2.565 (SE +/- 0.028, N = 3)

PostgreSQL 15, Scaling Factor: 100 - Clients: 800 - Mode: Read Only - TPS, more is better - c3-highcpu-8 SPR: 311942 (SE +/- 3369.27, N = 3)

PostgreSQL 15, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency - ms, fewer is better - c3-highcpu-8 SPR: 3.405 (SE +/- 0.028, N = 3)

PostgreSQL 15, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - TPS, more is better - c3-highcpu-8 SPR: 293725 (SE +/- 2414.33, N = 3)

SPECFEM3D

SPECFEM3D simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and uses a variety of its built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

SPECFEM3D 4.0, Model: Mount St. Helens - Seconds, fewer is better - c3-highcpu-8 SPR: 139.52 (SE +/- 0.10, N = 3)

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note that with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.10, Device: CPU - Batch Size: 16 - Model: ResNet-50 - images/sec, more is better - c3-highcpu-8 SPR: 14.20 (SE +/- 0.04, N = 3)

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

Intel Open Image Denoise 1.4.0, Run: RT.hdr_alb_nrm.3840x2160 - Images / Sec, more is better - c3-highcpu-8 SPR: 0.24 (SE +/- 0.00, N = 3)

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 0.11, Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - ms, fewer is better - c3-highcpu-8 SPR: 25819 (SE +/- 364.36, N = 3)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.
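The manual equivalent is essentially a timed default build from a checked-out FFmpeg source tree:

    ./configure
    time make -j"$(nproc)"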

Timed FFmpeg Compilation 6.0, Time To Compile - Seconds, fewer is better - c3-highcpu-8 SPR: 120.44 (SE +/- 0.10, N = 3)

VVenC

VVenC 1.7, Video Input: Bosphorus 1080p - Video Preset: Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 5.305 (SE +/- 0.018, N = 3)

Embree

Embree 4.0.1, Binary: Pathtracer ISPC - Model: Crown - Frames Per Second, more is better - c3-highcpu-8 SPR: 5.8475 (SE +/- 0.0120, N = 3; MIN: 5.81 / MAX: 5.92)

CockroachDB

CockroachDB is a cloud-native, distributed SQL database for data-intensive applications. This test profile uses a serverless CockroachDB configuration to test various CockroachDB workloads on the local host with a single node. Learn more via the OpenBenchmarking.org test page.

CockroachDB 22.2, Workload: KV, 50% Reads - Concurrency: 128 - ops/s, more is better - c3-highcpu-8 SPR: 19321.6 (SE +/- 40.86, N = 3)

CockroachDB 22.2, Workload: KV, 95% Reads - Concurrency: 128 - ops/s, more is better - c3-highcpu-8 SPR: 24960.1 (SE +/- 127.78, N = 3)

miniBUDE

MiniBUDE is a mini application for the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 - Billion Interactions/s, more is better - c3-highcpu-8 SPR: 7.546 (SE +/- 0.001, N = 3)

miniBUDE 20210901, Implementation: OpenMP - Input Deck: BM1 - GFInst/s, more is better - c3-highcpu-8 SPR: 188.65 (SE +/- 0.03, N = 3)

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 0.11, Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - ms, fewer is better - c3-highcpu-8 SPR: 20952 (SE +/- 4.91, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 4660.70 (SE +/- 0.86, N = 3; MIN: 4648.25)

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
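A comparable manual run with wrk against a locally configured Nginx instance might look like this (URL, port, and thread/connection counts are illustrative):

    wrk -t 8 -c 1000 -d 60s https://localhost:8089/index.html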

nginx 1.23.2, Connections: 1000 - Requests Per Second, more is better - c3-highcpu-8 SPR: 32118.58 (SE +/- 22.42, N = 3)

nginx 1.23.2, Connections: 4000 - Requests Per Second, more is better - c3-highcpu-8 SPR: 32814.75 (SE +/- 27.93, N = 3)

nginx 1.23.2, Connections: 100 - Requests Per Second, more is better - c3-highcpu-8 SPR: 36310.35 (SE +/- 23.95, N = 3)

nginx 1.23.2, Connections: 500 - Requests Per Second, more is better - c3-highcpu-8 SPR: 34672.65 (SE +/- 321.85, N = 3)

nginx 1.23.2, Connections: 200 - Requests Per Second, more is better - c3-highcpu-8 SPR: 35602.10 (SE +/- 83.52, N = 3)

Embree

Embree 4.0.1, Binary: Pathtracer ISPC - Model: Asian Dragon - Frames Per Second, more is better - c3-highcpu-8 SPR: 7.3642 (SE +/- 0.0079, N = 3; MIN: 7.33 / MAX: 7.44)

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenCV 4.7, Test: Core - ms, fewer is better - c3-highcpu-8 SPR: 87372 (SE +/- 280.31, N = 3)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Very Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 6.99 (SE +/- 0.03, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 2337.31 (SE +/- 1.76, N = 3; MIN: 2326.77)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Super Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 7.48 (SE +/- 0.00, N = 3)

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.
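zstd's own benchmark mode can reproduce this style of measurement by hand, for example (assuming a local copy of silesia.tar):

    zstd -b19 silesia.tar            # level 19 compression/decompression speed
    zstd -b19 --long silesia.tar     # the same with long-distance matching ("long mode")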

Zstd Compression 1.5.4, Compression Level: 19 - Decompression Speed - MB/s, more is better - c3-highcpu-8 SPR: 905.2 (SE +/- 1.50, N = 3)

Zstd Compression 1.5.4, Compression Level: 19 - Compression Speed - MB/s, more is better - c3-highcpu-8 SPR: 10.3 (SE +/- 0.12, N = 3)

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
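A rough manual equivalent (server options, client counts, and run time are illustrative) would be:

    memcached -d -p 11211
    memtier_benchmark -s 127.0.0.1 -p 11211 -P memcache_binary \
        --ratio=1:10 -t 4 -c 50 --test-time=60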

Memcached 1.6.18, Set To Get Ratio: 1:10 - Ops/sec, more is better - c3-highcpu-8 SPR: 1044947.13 (SE +/- 3280.78, N = 3)

Memcached 1.6.18, Set To Get Ratio: 1:100 - Ops/sec, more is better - c3-highcpu-8 SPR: 1030937.28 (SE +/- 11157.73, N = 3)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1, Video Input: Bosphorus 4K - Video Preset: Ultra Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 9.12 (SE +/- 0.01, N = 3)

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenCV 4.7, Test: Object Detection - ms, fewer is better - c3-highcpu-8 SPR: 38999 (SE +/- 384.74, N = 5)

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.4, Compression Level: 19, Long Mode - Decompression Speed - MB/s, more is better - c3-highcpu-8 SPR: 907.2 (SE +/- 1.48, N = 3)

Zstd Compression 1.5.4, Compression Level: 19, Long Mode - Compression Speed - MB/s, more is better - c3-highcpu-8 SPR: 6.5 (SE +/- 0.00, N = 3)

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.
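John's built-in self-benchmark can be invoked per format, e.g. (the format names here are the upstream ones and may not map one-to-one onto the test titles below):

    john --test --format=md5crypt
    john --test --format=bcrypt
    john --test --format=HMAC-SHA512
    john --test --format=wpapsk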

John The Ripper 2023.03.14, Test: MD5 - Real C/S, more is better - c3-highcpu-8 SPR: 765713 (SE +/- 1472.13, N = 3)

John The Ripper 2023.03.14, Test: HMAC-SHA512 - Real C/S, more is better - c3-highcpu-8 SPR: 37509000 (SE +/- 21071.31, N = 3)

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.1, Algorithm: RSA4096 - verify/s, more is better - c3-highcpu-8 SPR: 67857.7 (SE +/- 8.53, N = 3)

OpenSSL 3.1, Algorithm: RSA4096 - sign/s, more is better - c3-highcpu-8 SPR: 2062.7 (SE +/- 1.17, N = 3)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.
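The underlying utility can also be driven by hand; a minimal sketch (the SparseZoo stub is a placeholder and the flag values are illustrative, not the test profile's exact arguments) is:

    pip install deepsparse
    deepsparse.benchmark zoo:<sparsezoo-model-stub> --scenario async --batch_size 1 --time 60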

Neural Magic DeepSparse 1.3.2, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 305.90 (SE +/- 0.37, N = 3)

Neural Magic DeepSparse 1.3.2, Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 6.5372 (SE +/- 0.0078, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 528.02 (SE +/- 1.78, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 3.7873 (SE +/- 0.0130, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 530.61 (SE +/- 3.34, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 3.7693 (SE +/- 0.0239, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 123.29 (SE +/- 0.95, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 16.22 (SE +/- 0.13, N = 3)

VVenC

VVenC 1.7, Video Input: Bosphorus 1080p - Video Preset: Faster - Frames Per Second, more is better - c3-highcpu-8 SPR: 12.92 (SE +/- 0.01, N = 3)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.3.2, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 38.51 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 51.89 (SE +/- 0.05, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 104.31 (SE +/- 0.14, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 19.17 (SE +/- 0.03, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 60.39 (SE +/- 0.28, N = 3)

Neural Magic DeepSparse 1.3.2, Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 33.11 (SE +/- 0.15, N = 3)

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Google Draco 1.5.6, Model: Lion - ms, fewer is better - c3-highcpu-8 SPR: 6250 (SE +/- 80.23, N = 15)

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

Neural Magic DeepSparse 1.3.2, Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream - ms/batch, fewer is better - c3-highcpu-8 SPR: 30.80 (SE +/- 0.02, N = 3)

Neural Magic DeepSparse 1.3.2, Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream - items/sec, more is better - c3-highcpu-8 SPR: 64.87 (SE +/- 0.04, N = 3)

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction - Seconds, fewer is better - c3-highcpu-8 SPR: 32.39 (SE +/- 0.02, N = 3)

OpenCV

This is a benchmark of the OpenCV (Computer Vision) library's built-in performance tests. Learn more via the OpenBenchmarking.org test page.

OpenCV 4.7, Test: Video - ms, fewer is better - c3-highcpu-8 SPR: 31654 (SE +/- 198.80, N = 3)

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 2023.03.14, Test: WPA PSK - Real C/S, more is better - c3-highcpu-8 SPR: 28818 (SE +/- 27.15, N = 3)

John The Ripper 2023.03.14, Test: bcrypt - Real C/S, more is better - c3-highcpu-8 SPR: 6932 (SE +/- 0.67, N = 3)

John The Ripper 2023.03.14, Test: Blowfish - Real C/S, more is better - c3-highcpu-8 SPR: 6930 (SE +/- 2.08, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 1.47145 (SE +/- 0.00472, N = 3; MIN: 1.4)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1, Video Input: Bosphorus 1080p - Video Preset: Very Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 32.39 (SE +/- 0.25, N = 3)

uvg266 0.4.1, Video Input: Bosphorus 1080p - Video Preset: Super Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 34.50 (SE +/- 0.03, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 1.50004 (SE +/- 0.00340, N = 3; MIN: 1.37)

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

uvg266 0.4.1, Video Input: Bosphorus 1080p - Video Preset: Ultra Fast - Frames Per Second, more is better - c3-highcpu-8 SPR: 42.24 (SE +/- 0.01, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 0.968986 (SE +/- 0.013114, N = 3; MIN: 0.92)

oneDNN 3.0, Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 5.34218 (SE +/- 0.00535, N = 3; MIN: 4.94)

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

Google Draco 1.5.6, Model: Church Facade - ms, fewer is better - c3-highcpu-8 SPR: 7573 (SE +/- 10.48, N = 3)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

oneDNN 3.0, Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 4.17707 (SE +/- 0.00870, N = 3; MIN: 4.04)

oneDNN 3.0, Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU - ms, fewer is better - c3-highcpu-8 SPR: 3.54944 (SE +/- 0.01072, N = 3; MIN: 3.47)

110 Results Shown

nekRS
OpenRadioss
OpenCV:
  Image Processing
  Stitching
CockroachDB
TensorFlow
OpenVKL
VVenC
SPECFEM3D
OpenRadioss
LeelaChessZero:
  Eigen
  BLAS
SPECFEM3D
MariaDB
Blender
MariaDB
OpenRadioss
Intel Open Image Denoise
Timed Linux Kernel Compilation
TensorFlow
BRL-CAD
GROMACS
OpenRadioss
OpenCV
SPECFEM3D
7-Zip Compression:
  Decompression Rating
  Compression Rating
OpenSSL:
  SHA256
  ChaCha20-Poly1305
  AES-256-GCM
  AES-128-GCM
  ChaCha20
  SHA512
VVenC
NAMD
OpenFOAM:
  drivaerFastback, Small Mesh Size - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
SPECFEM3D
PostgreSQL:
  100 - 800 - Read Only - Average Latency
  100 - 800 - Read Only
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Only
SPECFEM3D
TensorFlow
Intel Open Image Denoise
OSPRay Studio
Timed FFmpeg Compilation
VVenC
Embree
CockroachDB:
  KV, 50% Reads - 128
  KV, 95% Reads - 128
miniBUDE:
  OpenMP - BM1:
    Billion Interactions/s
    GFInst/s
OSPRay Studio
oneDNN
nginx:
  1000
  4000
  100
  500
  200
Embree
OpenCV
uvg266
oneDNN
uvg266
Zstd Compression:
  19 - Decompression Speed
  19 - Compression Speed
Memcached:
  1:10
  1:100
uvg266
OpenCV
Zstd Compression:
  19, Long Mode - Decompression Speed
  19, Long Mode - Compression Speed
John The Ripper:
  MD5
  HMAC-SHA512
OpenSSL:
  RSA4096:
    verify/s
    sign/s
Neural Magic DeepSparse:
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
VVenC
Neural Magic DeepSparse:
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
    ms/batch
    items/sec
Google Draco
Neural Magic DeepSparse:
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
    ms/batch
    items/sec
Xcompact3d Incompact3d
OpenCV
John The Ripper:
  WPA PSK
  bcrypt
  Blowfish
oneDNN
uvg266:
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
oneDNN
uvg266
oneDNN:
  Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU
  IP Shapes 3D - bf16bf16bf16 - CPU
Google Draco
oneDNN:
  Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
  Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU