Neu Benchmarks [2403108-NE-NEU24701101]

AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.7 BIOS) and ASPEED 24GB on EndeavourOS rolling via the Phoronix Test Suite.

AMD EPYC 7R13 48-Core - ASPEED 24GB - Supermicro

Kernel Notes: Transparent Huge Pages: always
Environment Notes: NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d3
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC 7R13 48-Core

Processor: AMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads), Motherboard: Supermicro H12SSL-I v1.02 (2.7 BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 15363GB Micron_7450_MTFDKCC15T3TFR, Graphics: ASPEED 24GB, Audio: NVIDIA AD102 HD Audio, Monitor: 38GN950, Network: 2 x Intel X710 for 10GbE SFP+

OS: EndeavourOS rolling, Kernel: 6.7.9-zen1-1-zen (x86_64), Display Server: X Server 1.21.1.11, Display Driver: NVIDIA, Compiler: GCC 13.2.1 20230801 + Clang 17.0.6 + LLVM 17.0.6 + CUDA 12.4, File-System: btrfs, Screen Resolution: 1024x768

Llama.cpp

Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI toolkit. Learn more via the OpenBenchmarking.org test page.

8 Results Shown

Llama.cpp
oneDNN:
IP Shapes 1D - CPU
IP Shapes 3D - CPU
Convolution Batch Shapes Auto - CPU
Deconvolution Batch shapes_1d - CPU
Deconvolution Batch shapes_3d - CPU
Recurrent Neural Network Training - CPU
Recurrent Neural Network Inference - CPU

AMD EPYC 7R13 48-Core - ASPEED 24GB - Supermicro

Testing initiated at 10 March 2024 22:28 by user jmanley.

AMD EPYC 7R13 48-Core

Testing initiated at 10 March 2024 22:33 by user jmanley.

neu

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

AMD EPYC 7R13 48-Core - ASPEED 24GB - Supermicro

AMD EPYC 7R13 48-Core

Llama.cpp

oneDNN

8 Results Shown

AMD EPYC 7R13 48-Core - ASPEED 24GB - Supermicro

AMD EPYC 7R13 48-Core