satty

Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.

a

Processor: Intel Xeon Platinum 8490H @ 3.50GHz (60 Cores / 120 Threads), Motherboard: Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS), Chipset: Intel Device 1bce, Memory: 512GB, Disk: 3 x 3841GB Micron_9300_MTFDHAL3T8TDP, Graphics: ASPEED, Network: 4 x Intel E810-C for QSFP

OS: Ubuntu 22.04, Kernel: 5.15.0-47-generic (x86_64), Desktop: GNOME Shell 42.4, Display Server: X Server 1.21.1.3, Vulkan: 1.2.204, Compiler: GCC 11.2.0, File-System: ext4, Screen Resolution: 1024x768

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0000c0
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

b

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

Palabos

The Palabos library is a framework for general purpose Computational Fluid Dynamics (CFD). Palabos uses a kernel based on the Lattice Boltzmann method. This test profile uses the Palabos MPI-based Cavity3D benchmark. Learn more via the OpenBenchmarking.org test page.

HeFFTe - Highly Efficient FFT for Exascale

Palabos

HeFFTe - Highly Efficient FFT for Exascale

Palabos

Grid Size: 4000

a: The test quit with a non-zero exit status.

b: The test quit with a non-zero exit status.

32 Results Shown

libxsmm
HeFFTe - Highly Efficient FFT for Exascale:
r2c - Stock - float - 256
r2c - FFTW - float - 256
libxsmm
HeFFTe - Highly Efficient FFT for Exascale:
r2c - Stock - double - 128
c2c - Stock - double - 128
libxsmm:
32
64
HeFFTe - Highly Efficient FFT for Exascale
Palabos
HeFFTe - Highly Efficient FFT for Exascale:
r2c - FFTW - float - 512
c2c - FFTW - double - 128
r2c - Stock - float - 128
r2c - Stock - float - 512
r2c - FFTW - double - 256
c2c - Stock - float - 256
r2c - FFTW - double - 128
r2c - Stock - double - 512
c2c - FFTW - float - 512
c2c - Stock - float - 512
Palabos
HeFFTe - Highly Efficient FFT for Exascale:
r2c - FFTW - double - 512
c2c - FFTW - float - 128
c2c - Stock - double - 512
r2c - FFTW - float - 128
c2c - FFTW - double - 256
c2c - Stock - float - 128
c2c - FFTW - double - 512
r2c - Stock - double - 256
c2c - Stock - double - 256
Palabos:
400
500

a

Testing initiated at 29 July 2023 05:52 by user phoronix.

b

Testing initiated at 29 July 2023 07:32 by user phoronix.

satty

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

a

b

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

libxsmm

HeFFTe - Highly Efficient FFT for Exascale

Palabos

HeFFTe - Highly Efficient FFT for Exascale

Palabos

HeFFTe - Highly Efficient FFT for Exascale

Palabos

32 Results Shown

a

b