llama

AMD Ryzen 7 9800X3D 8-Core testing with an ASRock X870E Taichi (3.12.AS02 BIOS) and AMD Radeon PRO W7500 8GB on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2412052-PTS-LLAMA11353&sor&grr.

System details

Runs: a, b, AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB, c

Processor: AMD Ryzen 7 9800X3D 8-Core @ 6.22GHz (8 Cores / 16 Threads)
Motherboard: ASRock X870E Taichi (3.12.AS02 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DDR5-6000MT/s F5-6000J2836G16G
Disk: Western Digital WD_BLACK SN850X 2000GB
Graphics: AMD Radeon PRO W7500 8GB
Audio: AMD Navi 31 HDMI/DP
Monitor: DELL U2723QE
Network: Realtek Device 8126 + MEDIATEK Device 0717
OS: Ubuntu 24.04
Kernel: 6.8.0-49-generic (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server 1.21.1.11 + Wayland
OpenGL: 4.6 Mesa 24.2.0-devel (LLVM 18.1.7 DRM 3.58)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Processor Details
- Scaling Governor: amd-pstate-epp powersave (EPP: balance_performance)
- CPU Microcode: 0xb404023

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

Results overview (llamafile, Tokens Per Second)

Value columns, left to right: a, b, AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB, c

wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 2048   12288   12288   12288   12288
wizardcoder-python-34b-v1.0.Q6_K - Text Generation 128      2.1     2.1     2.1     2.1
mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 2048    32768   32768   32768   32768
wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 1024   6144    6144    6144    6144
mistral-7b-instruct-v0.2.Q5_K_M - Text Generation 128       11.31   11.3    11.3    11.3
mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 1024    16384   16384   16384   16384
wizardcoder-python-34b-v1.0.Q6_K - Text Generation 16       2.09    2       1.98    1.96
Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 2048         32768   32768   32768   32768
wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 512    3072    3072    3072    3072
Llama-3.2-3B-Instruct.Q6_K - Text Generation 128            21.42   21.42   21.43   21.43
TinyLlama-1.1B-Chat-v1.0.BF16 - Text Generation 128         27.57   27.51   27.55   27.52
mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 512     8192    8192    8192    8192
wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 256    1536    1536    1536    1536
Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 1024         16384   16384   16384   16384
TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 2048      32768   32768   32768   32768
mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 256     4096    4096    4096    4096
mistral-7b-instruct-v0.2.Q5_K_M - Text Generation 16        11.19   10.94   10.94   10.95
Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 512          8192    8192    8192    8192
TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 1024      16384   16384   16384   16384
Llama-3.2-3B-Instruct.Q6_K - Text Generation 16             21.47   20.78   20.77   20.79
Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 256          4096    4096    4096    4096
TinyLlama-1.1B-Chat-v1.0.BF16 - Text Generation 16          27.54   26.28   26.42   26.51
TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 512       8192    8192    8192    8192
TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 256       4096    4096    4096    4096
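The Text Generation 16 rows are the only ones in the results overview where the four runs differ by more than a few hundredths of a token per second. As a rough illustration of how that run-to-run gap could be quantified, the sketch below copies those four rows by hand and computes the difference between the fastest and slowest run as a percentage of the slowest. The helper name `spread_percent` and the choice of the slowest run as the baseline are assumptions made for this example; they are not part of the Phoronix Test Suite output.

```python
# Text Generation 16 results (tokens per second), copied by hand from the
# results overview. Tuple order: a, b,
# "AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB", c.
text_generation_16 = {
    "wizardcoder-python-34b-v1.0.Q6_K": (2.09, 2.00, 1.98, 1.96),
    "mistral-7b-instruct-v0.2.Q5_K_M": (11.19, 10.94, 10.94, 10.95),
    "Llama-3.2-3B-Instruct.Q6_K": (21.47, 20.78, 20.77, 20.79),
    "TinyLlama-1.1B-Chat-v1.0.BF16": (27.54, 26.28, 26.42, 26.51),
}


def spread_percent(values):
    """Gap between the fastest and slowest run, as a percentage of the slowest.

    Hypothetical helper for this sketch; the baseline choice is an assumption.
    """
    slowest = min(values)
    fastest = max(values)
    return (fastest - slowest) / slowest * 100.0


spreads = {model: spread_percent(v) for model, v in text_generation_16.items()}

for model, pct in spreads.items():
    print(f"{model}: {pct:.2f}% between slowest and fastest run")
```

In every one of these rows the fastest value belongs to run a, so the percentage is effectively the lead of run a over the slowest of the other three runs.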

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 2048

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 12288
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 12288
  b: 12288
  a: 12288
SE +/- 0.00, N = 3

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Text Generation 128

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 2.1
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 2.1
  b: 2.1
  a: 2.1
SE +/- 0.00, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 2048

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 32768
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 32768
  b: 32768
  a: 32768
SE +/- 0.00, N = 3

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 1024

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 6144
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 6144
  b: 6144
  a: 6144
SE +/- 0.00, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 128

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 11.31
  c: 11.30
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 11.30
  b: 11.30
SE +/- 0.00, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 1024

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 16384
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 16384
  b: 16384
  a: 16384
SE +/- 0.00, N = 3

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Text Generation 16

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 2.09
  b: 2.00
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 1.98
  c: 1.96
SE +/- 0.02, N = 12

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 2048

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 32768
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 32768
  b: 32768
  a: 32768
SE +/- 0.00, N = 3

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 512

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 3072
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 3072
  b: 3072
  a: 3072
SE +/- 0.00, N = 3

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 128

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 21.43
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 21.43
  b: 21.42
  a: 21.42
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 128

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 27.57
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 27.55
  c: 27.52
  b: 27.51
SE +/- 0.02, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 512

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 8192
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 8192
  b: 8192
  a: 8192
SE +/- 0.00, N = 3

Llamafile

Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 256

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 1536
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 1536
  b: 1536
  a: 1536
SE +/- 0.00, N = 3

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 1024

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 16384
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 16384
  b: 16384
  a: 16384
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 2048

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 32768
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 32768
  b: 32768
  a: 32768
SE +/- 0.00, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 256

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 4096
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 4096
  b: 4096
  a: 4096
SE +/- 0.00, N = 3

Llamafile

Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 16

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 11.19
  c: 10.95
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 10.94
  b: 10.94
SE +/- 0.13, N = 3

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 512

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 8192
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 8192
  b: 8192
  a: 8192
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 1024

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 16384
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 16384
  b: 16384
  a: 16384
SE +/- 0.00, N = 3

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 16

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 21.47
  c: 20.79
  b: 20.78
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 20.77
SE +/- 0.01, N = 3

Llamafile

Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 256

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 4096
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 4096
  b: 4096
  a: 4096
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 16

Llamafile 0.8.16, Tokens Per Second, More Is Better
  a: 27.54
  c: 26.51
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 26.42
  b: 26.28
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 512

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 8192
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 8192
  b: 8192
  a: 8192
SE +/- 0.00, N = 3

Llamafile

Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 256

Llamafile 0.8.16, Tokens Per Second, More Is Better
  c: 4096
  AMD Ryzen 7 9800X3D 8-Core - AMD Radeon PRO W7500 8GB: 4096
  b: 4096
  a: 4096
SE +/- 0.00, N = 3


Phoronix Test Suite v10.8.5