Gigabyte G242-P36 Ampere Altra Max Server Benchmarks

Llama.cpp

Llama.cpp is a C/C++ port of Facebook's LLaMA model developed by Georgi Gerganov. It runs inference for LLaMA and other supported models. For CPU inference, Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs, along with features such as OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

Recorded for this test: Result, CPU Peak Freq (Highest CPU Core Frequency), CPU Power Consumption, CPU Temp, System Power Consumption

Quicksilver

Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL), and this test profile currently uses the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
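To give a rough sense of what a Monte Carlo particle transport problem involves, here is a minimal Python sketch of particles crossing a one-dimensional slab. It is not Quicksilver's code or its physics model: the function name `simulate_slab`, its parameters, and the 1D isotropic-scatter simplification are all assumptions made for illustration.

```python
import math
import random


def simulate_slab(n_particles, thickness, sigma_total, absorb_prob, seed=0):
    """Track particles through a 1D slab and tally how each history ends.

    Hypothetical toy model: sigma_total is the collision rate per unit
    length, absorb_prob is the chance that a collision absorbs the particle.
    """
    rng = random.Random(seed)
    tallies = {"absorbed": 0, "transmitted": 0, "reflected": 0}
    for _ in range(n_particles):
        x, direction = 0.0, 1.0  # start on the left face, moving right
        while True:
            # Distance to the next collision is exponentially distributed.
            distance = -math.log(1.0 - rng.random()) / sigma_total
            x += direction * distance
            if x >= thickness:
                tallies["transmitted"] += 1
                break
            if x < 0.0:
                tallies["reflected"] += 1
                break
            if rng.random() < absorb_prob:
                tallies["absorbed"] += 1
                break
            # Otherwise scatter: pick a new direction cosine in [-1, 1].
            direction = rng.uniform(-1.0, 1.0)
    return tallies


if __name__ == "__main__":
    print(simulate_slab(10_000, thickness=2.0, sigma_total=1.0, absorb_prob=0.3))
```

Each particle history is independent of the others, which is why workloads of this kind map naturally onto a threaded code path such as the OpenMP one this test profile uses.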

Recorded for this test: Result, CPU Peak Freq (Highest CPU Core Frequency), CPU Power Consumption, CPU Temp, System Power Consumption

114 Results Shown

CacheBench:
Read
Write
Read / Modify / Write
Neural Magic DeepSparse:
NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
items/sec
ms/batch
NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
items/sec
ms/batch
CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
items/sec
ms/batch
ResNet-50, Baseline - Asynchronous Multi-Stream:
items/sec
ms/batch
ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
items/sec
ms/batch
BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
items/sec
ms/batch
LeelaChessZero:
BLAS
Eigen
Llama.cpp:
llama-2-7b.Q4_0.gguf
llama-2-13b.Q4_0.gguf
llama-2-70b-chat.Q5_0.gguf
PyTorch:
CPU - 1 - ResNet-50
CPU - 1 - ResNet-152
CPU - 1 - Efficientnet_v2_l
CPU - 16 - ResNet-50
CPU - 16 - ResNet-152
Quicksilver:
CORAL2 P1
CORAL2 P2
CTS2
Speedb:
Seq Fill
Rand Fill
Rand Fill Sync
Rand Read
Read While Writing
Read Rand Write Rand
Update Rand
Xmrig:
Monero - 1M
Wownero - 1M
Stress-NG:
CPU Stress
Crypto
Memory Copying
Glibc Qsort Data Sorting
Glibc C String Functions
Vector Math
Matrix Math
Forking
System V Message Passing
Semaphores
Socket Activity
Context Switching
Atomic
CPU Cache
Malloc
MEMFD
MMAP
NUMA
SENDFILE
IO_uring
Futex
Mutex
Function Call
Poll
Hash
Pthread
Zlib
Floating Point
Fused Multiply-Add
Pipe
Matrix 3D Math
AVL Tree
Vector Floating Point
Vector Shuffle
Wide Vector Math
Cloning
AVX-512 VNNI
Mixed Scheduler
Timed Linux Kernel Compilation:
defconfig
allmodconfig
Timed LLVM Compilation:
Ninja
Unix Makefiles
7-Zip Compression:
Compression Rating
Decompression Rating
RocksDB:
Rand Read
Read While Writing
Read Rand Write Rand
Update Rand
Stockfish
OpenSSL:
RSA4096:
sign/s
verify/s
SHA256:
byte/s
SHA512:
byte/s
AES-128-GCM:
byte/s
AES-256-GCM:
byte/s
ChaCha20:
byte/s
ChaCha20-Poly1305:
byte/s
miniFE
ACES DGEMM
Algebraic Multi-Grid Benchmark
GROMACS
Phoronix Test Suite System Monitoring:
CPU Peak Freq (Highest CPU Core Frequency) Monitor: Megahertz
CPU Power Consumption Monitor: Watts
CPU Temperature Monitor: Celsius
System Power Consumption Monitor: Watts

G242-P36

Processor: ARMv8 Neoverse-N1 @ 3.00GHz (128 Cores)
Motherboard: GIGABYTE G242-P36-00 MP32-AR2-00 v01000100 (F31k SCP: 2.10.20220531 BIOS)
Chipset: Ampere Computing LLC Altra PCI Root Complex A
Memory: 16 x 32 GB DDR4-3200MT/s Samsung M393A4K40DB3-CWE
Disk: 800GB Micron_7450_MTFDKBA800TFS
Graphics: ASPEED
Monitor: VGA HDMI
Network: 2 x Intel I350

OS: Ubuntu 23.10
Kernel: 6.5.0-13-generic (aarch64)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Processor Notes: Scaling Governor: cppc_cpufreq performance (Boost: Disabled)
Python Notes: Python 3.11.6
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
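
The notes above are single lines of `name: status` pairs joined by ` + `, which is awkward to scan by eye. As a small sketch, assuming that separator convention holds for the whole line, the Security Notes entry can be split into a dictionary. The helper name `parse_security_notes` is hypothetical and not part of the Phoronix Test Suite.

```python
def parse_security_notes(line):
    """Split a 'Security Notes: a: x + b: y' line into a {name: status} dict."""
    # Drop the leading label if it is present; otherwise parse the whole line.
    head, sep, body = line.partition("Security Notes:")
    if not sep:
        body = head
    notes = {}
    for item in body.split(" + "):
        name, _, status = item.strip().partition(": ")
        if name:
            notes[name] = status
    return notes


if __name__ == "__main__":
    sample = (
        "Security Notes: meltdown: Not affected + "
        "spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected"
    )
    for name, status in parse_security_notes(sample).items():
        print(f"{name}: {status}")
```

Filtering the resulting dictionary for values other than "Not affected" lists the mitigations that are active on the test system.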

Testing initiated at 16 January 2024 23:01 by user phoronix.
