ARMv8 Neoverse-V2 testing with a Pegatron JIMBO P4352 (00022432 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2411235-NE-LLAMACPPG43
llama cpp grace
ARMv8 Neoverse-V2 testing with a Pegatron JIMBO P4352 (00022432 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.
,,"a","b","c","d"
Processor,,ARMv8 Neoverse-V2 @ 3.47GHz (72 Cores),ARMv8 Neoverse-V2 @ 3.47GHz (72 Cores),ARMv8 Neoverse-V2 @ 3.47GHz (72 Cores),ARMv8 Neoverse-V2 @ 3.47GHz (72 Cores)
Motherboard,,Pegatron JIMBO P4352 (00022432 BIOS),Pegatron JIMBO P4352 (00022432 BIOS),Pegatron JIMBO P4352 (00022432 BIOS),Pegatron JIMBO P4352 (00022432 BIOS)
Memory,,1 x 480GB LPDDR5-6400MT/s NVIDIA 699-2G530-0236-RC1,1 x 480GB LPDDR5-6400MT/s NVIDIA 699-2G530-0236-RC1,1 x 480GB LPDDR5-6400MT/s NVIDIA 699-2G530-0236-RC1,1 x 480GB LPDDR5-6400MT/s NVIDIA 699-2G530-0236-RC1
Disk,,1000GB CT1000T700SSD3,1000GB CT1000T700SSD3,1000GB CT1000T700SSD3,1000GB CT1000T700SSD3
Graphics,,ASPEED,ASPEED,ASPEED,ASPEED
Network,,2 x Intel X550,2 x Intel X550,2 x Intel X550,2 x Intel X550
OS,,Ubuntu 24.04,Ubuntu 24.04,Ubuntu 24.04,Ubuntu 24.04
Kernel,,6.8.0-49-generic-64k (aarch64),6.8.0-49-generic-64k (aarch64),6.8.0-49-generic-64k (aarch64),6.8.0-49-generic-64k (aarch64)
Compiler,,GCC 13.2.0 + Clang 18.1.3 + CUDA 11.8,GCC 13.2.0 + Clang 18.1.3 + CUDA 11.8,GCC 13.2.0 + Clang 18.1.3 + CUDA 11.8,GCC 13.2.0 + Clang 18.1.3 + CUDA 11.8
File-System,,ext4,ext4,ext4,ext4
Screen Resolution,,1920x1200,1920x1200,1920x1200,1920x1200
,,"a","b","c","d"
"Llama.cpp - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 (Tokens/sec)",HIB,20.07,20.56,20.70,18.24
"Llama.cpp - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512 (Tokens/sec)",HIB,121.74,121.76,121.85,120.71
"Llama.cpp - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024 (Tokens/sec)",HIB,118.77,118.88,119.02,118.98
"Llama.cpp - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048 (Tokens/sec)",HIB,105.71,105.56,105.75,106.1
"Llama.cpp - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 (Tokens/sec)",HIB,21.48,21.78,20.06,19.48
"Llama.cpp - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512 (Tokens/sec)",HIB,122.12,122.28,122.54,121.45
"Llama.cpp - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024 (Tokens/sec)",HIB,119.72,119.92,119.62,119.36
"Llama.cpp - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048 (Tokens/sec)",HIB,107.00,106.65,106.90,106.49
"Llama.cpp - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 (Tokens/sec)",HIB,50.75,50.87,51.00,49.91
"Llama.cpp - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512 (Tokens/sec)",HIB,123.29,123.69,123.66,131.19
"Llama.cpp - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024 (Tokens/sec)",HIB,132.61,132.75,128.63,131.74
"Llama.cpp - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048 (Tokens/sec)",HIB,131.81,134.42,130.52,133.64