Xeon Cascade Lake R Intel FSGSBASE

Intel FSGSBASE benchmarking by Michael Larabel for a future article.

nofsgsbase

Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x5002f01
Java Notes: OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-3ubuntu1)
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

FSGSBASE Enabled

Processor: 2 x Intel Xeon Gold 5220R @ 3.90GHz (36 Cores / 72 Threads), Motherboard: TYAN S7106 (V2.01.B40 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 94GB, Disk: 500GB Samsung SSD 860, Graphics: ASPEED, Monitor: VE228, Network: 2 x Intel I210 + 2 x QLogic cLOM8214 1/10GbE

OS: Ubuntu 20.04, Kernel: 5.8.0-rc1-phx-fsgsbase (x86_64) 20200620, Desktop: GNOME Shell 3.36.1, Display Server: X Server 1.20.8, Display Driver: modesetting 1.20.8, Compiler: GCC 9.3.0, File-System: ext4, Screen Resolution: 1920x1080

Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c
Java Notes: OpenJDK Runtime Environment (build 11.0.7-ea+9-post-Ubuntu-1ubuntu1)
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Java Gradle Build

This test runs Java software project builds using the Gradle build system. It is intended to give developers an idea as to the build performance for development activities and build servers. Learn more via the OpenBenchmarking.org test page.

ctx_clock

Ctx_clock is a simple test program to measure the context switch time in clock cycles. Learn more via the OpenBenchmarking.org test page.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Renaissance

Renaissance is a suite of benchmarks designed to test the Java JVM from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

Flexible IO Tester

Fio is an advanced disk benchmark that depends upon the kernel's AIO access library. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

PlaidML

This test profile uses PlaidML deep learning framework developed by Intel for offering up various benchmarks. Learn more via the OpenBenchmarking.org test page.

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial timeseries data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries Learn more via the OpenBenchmarking.org test page.

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. Learn more via the OpenBenchmarking.org test page.

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently makes use of the OpenMP implementation and using the Fayalite-FIST molecular dynamics run and measures the total time to complete. Learn more via the OpenBenchmarking.org test page.

pmbench

Pmbench is a Linux paging and virtual memory benchmark. This test profile will report the average page latency of the system. Learn more via the OpenBenchmarking.org test page.

PostMark

This is a test of NetApp's PostMark benchmark designed to simulate small-file testing similar to the tasks endured by web and mail servers. This test profile will set PostMark to perform 25,000 transactions with 500 files simultaneously with the file sizes ranging between 5 and 512 kilobytes. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

AOM AV1

This is a simple test of the AOMedia AV1 encoder run on the CPU with a sample video file. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9/WebM format using a sample 1080p video. Learn more via the OpenBenchmarking.org test page.

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

YafaRay

YafaRay is an open-source physically based montecarlo ray-tracing engine. Learn more via the OpenBenchmarking.org test page.

BlogBench

BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. The behavior is mimicked of that of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of these blogs generated are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.

Apache Siege

This is a test of the Apache web server performance being facilitated by the Siege web serverb enchmark program. Learn more via the OpenBenchmarking.org test page.

Node.js Express HTTP Load Test

A Node.js Express server with a Node-based loadtest client for facilitating HTTP benchmarking. Learn more via the OpenBenchmarking.org test page.

Apache HBase

This is a benchmark of the Apache HBase non-relational distributed database system inspired from Google's Bigtable. Learn more via the OpenBenchmarking.org test page.

Memtier_benchmark

Memtier_benchmark is a NoSQL Redis/Memcache traffic generation plus benchmarking tool. This current test profile currently just stresses the Redis protocol and basic options exposed wotj a 1:1 Set/Get ratio, 30 pipeline, 100 clients per thread, and thread count equal to the number of CPU cores/threads present. Patches to extend the test are welcome as always. Learn more via the OpenBenchmarking.org test page.

KeyDB

A benchmark of KeyDB as a multi-threaded fork of the Redis server. The KeyDB benchmark is conducted using memtier-benchmark. Learn more via the OpenBenchmarking.org test page.

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench

This is a simple benchmark of PostgreSQL using pgbench. Learn more via the OpenBenchmarking.org test page.

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

ebizzy

This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.

Geometric Mean Of All Test Results

Number Of First Place Finishes

Number Of Last Place Finishes

114 Results Shown

Java Gradle Build
ctx_clock
Stress-NG:
Atomic
SENDFILE
CPU Stress
Context Switching
Renaissance:
Apache Spark ALS
Savina Reactors.IO
Flexible IO Tester:
Rand Write - IO_uring - Yes - No - 2MB - Default Test Directory
Rand Write - IO_uring - Yes - No - 4KB - Default Test Directory
Seq Write - IO_uring - Yes - No - 2MB - Default Test Directory
Seq Write - IO_uring - Yes - No - 2MB - Default Test Directory
Timed HMMer Search
Timed MAFFT Alignment
Himeno Benchmark
PlaidML:
No - Inference - VGG16 - CPU
No - Inference - VGG19 - CPU
No - Inference - IMDB LSTM - CPU
No - Inference - Mobilenet - CPU
No - Inference - ResNet 50 - CPU
No - Inference - DenseNet 201 - CPU
No - Inference - Inception V3 - CPU
No - Inference - NASNer Large - CPU
Numenta Anomaly Benchmark:
EXPoSE
Relative Entropy
Earthgecko Skyline
Bayesian Changepoint
Mlpack Benchmark:
scikit_ica
scikit_qda
scikit_svm
scikit_linearridgeregression
GROMACS
LAMMPS Molecular Dynamics Simulator
NAMD
oneDNN:
IP Batch 1D - bf16bf16bf16 - CPU
IP Batch All - bf16bf16bf16 - CPU
Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
Deconvolution Batch deconv_1d - bf16bf16bf16 - CPU
Deconvolution Batch deconv_3d - bf16bf16bf16 - CPU
Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU
QMCPACK
CP2K Molecular Dynamics
pmbench:
72 - 100% Reads
72 - 100% Writes
1 - 80% Reads 20% Writes
PostMark
Timed GDB GNU Debugger Compilation
Timed Apache Compilation
Timed LLVM Compilation
Timed Linux Kernel Compilation
AOM AV1:
Speed 0 Two-Pass
Speed 4 Two-Pass
Speed 6 Realtime
Speed 6 Two-Pass
Speed 8 Realtime
VP9 libvpx Encoding:
Speed 0
Speed 5
dav1d:
Chimera 1080p
Summer Nature 4K
Summer Nature 1080p
Chimera 1080p 10-bit
SVT-AV1:
Enc Mode 0 - 1080p
Enc Mode 4 - 1080p
Enc Mode 8 - 1080p
YafaRay
BlogBench
Apache Siege:
10
50
200
Node.js Express HTTP Load Test
Apache HBase:
Increment - 1:
Rows Per Second
Microseconds - Average Latency
Rand Read - 1:
Rows Per Second
Microseconds - Average Latency
Seq Read - 1:
Rows Per Second
Microseconds - Average Latency
Async Rand Read - 1:
Rows Per Second
Microseconds - Average Latency
Memtier_benchmark
KeyDB
Redis:
LPOP
SADD
GET
SET
Facebook RocksDB:
Rand Fill
Rand Read
Seq Fill
Rand Fill Sync
Read While Writing
LevelDB:
Hot Read
Fill Sync
Fill Sync
Overwrite
Overwrite
Rand Fill
Rand Fill
Rand Read
Seek Rand
Rand Delete
Seq Fill
Seq Fill
Apache Cassandra
PostgreSQL pgbench:
Buffer Test - Normal Load - Read Only
Buffer Test - Normal Load - Read Write
Buffer Test - Heavy Contention - Read Only
Buffer Test - Heavy Contention - Read Write
MariaDB:
64
128
256
512
ebizzy
Geometric Mean Of All Test Results:
Result Composite - Xeon Cascade Lake R Intel FSGSBASE
Wins - 111 Tests
Losses - 111 Tests

nofsgsbase

Testing initiated at 22 June 2020 16:05 by user phoronix.

FSGSBASE Enabled

Environment Notes: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Disk Notes: MQ-DEADLINE / errors=remount-ro,relatime,rw
Processor Notes: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x500002c
Java Notes: OpenJDK Runtime Environment (build 11.0.7-ea+9-post-Ubuntu-1ubuntu1)
Python Notes: Python 3.8.2
Security Notes: itlb_multihit: KVM: Mitigation of Split huge pages + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled

Testing initiated at 20 June 2020 20:34 by user phoronix.

Xeon Cascade Lake R Intel FSGSBASE

View

Limit displaying results to tests within:

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

nofsgsbase

FSGSBASE Enabled

Java Gradle Build

ctx_clock

Stress-NG

Renaissance

Flexible IO Tester

Timed HMMer Search

Timed MAFFT Alignment

Himeno Benchmark

PlaidML

Numenta Anomaly Benchmark

Mlpack Benchmark

GROMACS

LAMMPS Molecular Dynamics Simulator

NAMD

oneDNN

QMCPACK

CP2K Molecular Dynamics

pmbench

PostMark

Timed GDB GNU Debugger Compilation

Timed Apache Compilation

Timed LLVM Compilation

Timed Linux Kernel Compilation

AOM AV1

VP9 libvpx Encoding

dav1d

SVT-AV1

YafaRay

BlogBench

Apache Siege

Node.js Express HTTP Load Test

Apache HBase

Memtier_benchmark

KeyDB

Redis

Facebook RocksDB

LevelDB

Apache Cassandra

PostgreSQL pgbench

MariaDB

ebizzy

Geometric Mean Of All Test Results

Number Of First Place Finishes

Number Of Last Place Finishes

114 Results Shown

nofsgsbase

FSGSBASE Enabled