dddda

AMD Ryzen AI 9 365 testing with a ASUS Zenbook S 16 UM5606WA_UM5606WA UM5606WA v1.0 (UM5606WA.308 BIOS) and AMD Radeon 512MB on Ubuntu 24.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2412110-NE-DDDDA678639&grr.

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Text Generation 16

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 2048

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO

Model: Noise Suppression Poconet-Like FP16 - Device: CPU

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO

Model: Road Segmentation ADAS FP16 - Device: CPU

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 128

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO

Model: Handwritten English Recognition FP16-INT8 - Device: CPU

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

OpenVINO

Model: Road Segmentation ADAS FP16-INT8 - Device: CPU

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 1024

OpenVINO GenAI

Model: Gemma-7b-int4-ov - Device: CPU - Time Per Output Token

OpenVINO GenAI

Model: Gemma-7b-int4-ov - Device: CPU - Time To First Token

OpenVINO GenAI

Model: Gemma-7b-int4-ov - Device: CPU

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 2048

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 128

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO

Model: Person Re-Identification Retail FP16 - Device: CPU

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO

Model: Face Detection Retail FP16-INT8 - Device: CPU

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO

Model: Handwritten English Recognition FP16 - Device: CPU

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO

Model: Face Detection Retail FP16 - Device: CPU

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO GenAI

Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time Per Output Token

OpenVINO GenAI

Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time To First Token

OpenVINO GenAI

Model: Falcon-7b-instruct-int4-ov - Device: CPU

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 128

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 16

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 512

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 1024

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 16

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 256

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 16

OpenVINO GenAI

Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time Per Output Token

OpenVINO GenAI

Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time To First Token

OpenVINO GenAI

Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU

OpenVINO GenAI

Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time Per Output Token

OpenVINO GenAI

Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time To First Token

OpenVINO GenAI

Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 512

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 256

Phoronix Test Suite v10.8.5