ncnn llama

Intel Core Ultra 7 155H testing with a MTL Swift SFG14-72T Coral_MTH (V1.01 BIOS) and Intel Arc MTL 8GB on Ubuntu 24.10 via the Phoronix Test Suite.

a

Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x1e - Thermald 2.5.8
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

b

c

Processor: Intel Core Ultra 7 155H @ 4.80GHz (16 Cores / 22 Threads), Motherboard: MTL Swift SFG14-72T Coral_MTH (V1.01 BIOS), Chipset: Intel Device 7e7f, Memory: 8 x 2GB LPDDR5-6400MT/s Micron MT62F1G32D2DS-026, Disk: 1024GB Micron_2550_MTFDKBA1T0TGE, Graphics: Intel Arc MTL 8GB, Audio: Intel Meteor Lake-P HD Audio, Network: Intel Meteor Lake PCH CNVi WiFi

OS: Ubuntu 24.10, Kernel: 6.11.0-rc6-phx (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server + Wayland, OpenGL: 4.6 Mesa 25.0~git2411250600.45c523~oibaf~o (git-45c5231 2024-11-25 oracular-oibaf-pp, OpenCL: OpenCL 3.0, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 3840x1200

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

48 Results Shown

NCNN:
CPU - regnety_400m
CPU - efficientnet-b0
Vulkan GPU - shufflenet-v2
Vulkan GPU - mnasnet
Vulkan GPU-v3-v3 - mobilenet-v3
Vulkan GPU - resnet18
Vulkan GPU - efficientnet-b0
Llama.cpp
NCNN:
CPU - blazeface
CPU - yolov4-tiny
CPU - vision_transformer
CPU - squeezenet_ssd
CPU - alexnet
Vulkan GPU-v2-v2 - mobilenet-v2
Llama.cpp
NCNN:
CPU-v3-v3 - mobilenet-v3
CPU - googlenet
Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
Vulkan GPU - mobilenet
Llama.cpp:
CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512
CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 512
NCNN
Llama.cpp
NCNN:
Vulkan GPU - alexnet
CPU - resnet50
Llama.cpp
NCNN:
CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3
CPU - mobilenet
CPU - vgg16
Llama.cpp
NCNN
Llama.cpp
NCNN:
CPU - shufflenet-v2
Vulkan GPU - vgg16
Vulkan GPU - resnet50
Vulkan GPU - yolov4-tiny
CPU - mnasnet
Vulkan GPU - blazeface
Llama.cpp
NCNN:
CPU - FastestDet
CPU-v2-v2 - mobilenet-v2
Llama.cpp
NCNN:
Vulkan GPU - vision_transformer
Vulkan GPU - regnety_400m
Vulkan GPU - squeezenet_ssd
Vulkan GPU - googlenet
Llama.cpp:
CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 1024
CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024

a

Testing initiated at 30 December 2024 13:57 by user phoronix.

b

Testing initiated at 30 December 2024 16:15 by user phoronix.

c

Testing initiated at 30 December 2024 17:24 by user phoronix.

ncnn llama

View

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

a

b

c

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

NCNN

Llama.cpp

48 Results Shown

a

b

c