Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage.
To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark llama-cpp.
OpenBenchmarking.org metrics for this test profile configuration based on 50 public results since 29 December 2024 with the latest data as of 7 January 2025.
Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.
Based on OpenBenchmarking.org data, the selected test / test configuration (Llama.cpp b4397 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048) has an average run-time of 13 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.
Notable instruction set extensions supported by this test, based on an automatic analysis by the Phoronix Test Suite / OpenBenchmarking.org analytics engine.
This test profile binary relies on the shared libraries libopenblas.so.0, libm.so.6, libc.so.6, libgfortran.so.5, libquadmath.so.0.
This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.
1 System - 7 Benchmark Results |
2 x Intel Xeon Gold 6138 - GIGABYTE V4288 MD61-SC2-00 v01000100 - Intel Sky Lake-E DMI3 Registers Debian 12 - 6.1.0-26-amd64 - GCC 12.2.0 |
3 Systems - 48 Benchmark Results |
Intel Core i7-1280P - MSI Prestige 14Evo A12M MS-14C6 - Intel Alder Lake PCH Ubuntu 24.10 - 6.11.0-rc6-phx - GNOME Shell 47.0 |
3 Systems - 48 Benchmark Results |
Intel Core Ultra 7 155H - MTL Swift SFG14-72T Coral_MTH - Intel Device 7e7f Ubuntu 24.10 - 6.11.0-rc6-phx - GNOME Shell 47.0 |
3 Systems - 219 Benchmark Results |
2 x AMD EPYC 9684X 96-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 24.10 - 6.11.0-13-generic - GNOME Shell 47.0 |
4 Systems - 48 Benchmark Results |
AMD Ryzen AI 9 365 - ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 - AMD Device 1507 Ubuntu 24.10 - 6.12.0-rc7-phx-eraps - GNOME Shell 47.0 |
4 Systems - 48 Benchmark Results |
ARMv8 Neoverse-N1 - System76 Thelio Astra - Ampere Computing LLC Altra PCI Root Complex A Ubuntu 24.04 - 6.8.0-48-generic-64k - GNOME Shell 46.0 |
3 Systems - 48 Benchmark Results |
AMD Ryzen AI 9 HX 370 - ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 - AMD Device 1507 Ubuntu 24.10 - 6.11.0-rc6-phx - GNOME Shell 47.0 |
4 Systems - 48 Benchmark Results |
Intel Core Ultra 9 285K - ASUS ROG MAXIMUS Z890 HERO - Intel Device ae7f Ubuntu 24.10 - 6.11.0-13-generic - GNOME Shell 47.0 |
4 Systems - 48 Benchmark Results |
AMD Ryzen 9 9950X 16-Core - ASRock X870E Taichi - AMD Device 14d8 Ubuntu 24.04 - 6.12.3-061203-generic - GNOME Shell 46.0 |
3 Systems - 48 Benchmark Results |
AMD Ryzen 7 7840HS - Framework Laptop 16 - AMD Device 14e8 Ubuntu 24.04 - 6.8.0-49-generic - GNOME Shell 46.0 |
4 Systems - 48 Benchmark Results |
Intel Core Ultra 7 256V - ASUS Zenbook S 14 UX5406SA_UX5406SA UX5406SA v1.0 - Intel Device a87f Ubuntu 24.10 - 6.12.0-rc6-phx-drm-next - GNOME Shell 47.0 |
4 Systems - 48 Benchmark Results |
AMD Ryzen Threadripper 7980X 64-Cores - System76 Thelio Major - AMD Device 14a4 Ubuntu 24.04 - 6.12.3-061203-generic - GNOME Shell 46.0 |
2 Systems - 30 Benchmark Results |
2 x AMD EPYC 9654 96-Core - AMD Titanite_4G - AMD Device 14a4 Ubuntu 24.10 - 6.11.0-13-generic - GNOME Shell 47.0 |