hw-mig

AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.7 BIOS) and NVIDIA GeForce RTX 4090 24GB on EndeavourOS rolling via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2402106-NE-HWMIG354306

Jump To Table - Results

AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB

Processor: AMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads), Motherboard: Supermicro H12SSL-I v1.02 (2.7 BIOS), Chipset: AMD Starship/Matisse, Memory: 256GB, Disk: 15363GB Micron_7450_MTFDKCC15T3TFR, Graphics: NVIDIA GeForce RTX 4090 24GB, Audio: NVIDIA AD102 HD Audio, Monitor: 38GN950, Network: 2 x Intel X710 for 10GbE SFP+

OS: EndeavourOS rolling, Kernel: 6.7.4-zen1-1-zen (x86_64), Desktop: Xfce 4.18, Display Server: X Server 1.21.1.11, Display Driver: NVIDIA 545.29.06, OpenGL: 4.6.0, Compiler: GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3, File-System: btrfs, Screen Resolution: 3840x1600

Kernel Notes: Transparent Huge Pages: always
Environment Notes: NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"
Compiler Notes: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu
Processor Notes: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d1
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Llama.cpp

Llama.cpp is a port of Facebook's LLaMA model in C/C++ developed by Georgi Gerganov. Llama.cpp allows the inference of LLaMA and other supported models in C/C++. For CPU inference Llama.cpp supports AVX2/AVX-512, ARM NEON, and other modern ISAs along with features like OpenBLAS usage. Learn more via the OpenBenchmarking.org test page.

Llamafile

Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

12 Results Shown

Llama.cpp:
llama-2-7b.Q4_0.gguf
llama-2-13b.Q4_0.gguf
llama-2-70b-chat.Q5_0.gguf
Llamafile:
llava-v1.5-7b-q4 - CPU
mistral-7b-instruct-v0.2.Q8_0 - CPU
wizardcoder-python-34b-v1.0.Q6_K - CPU
Redis:
GET - 50
GET - 1000
SET - 1000
LPOP - 1000
SADD - 1000
LPUSH - 1000

AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB

Testing initiated at 10 February 2024 10:33 by user jmanley.

hw-mig

Statistics

Graph Settings

Multi-Way Comparison

Table

Run Management

AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB

Llama.cpp

Llamafile

Redis

12 Results Shown

AMD EPYC 7R13 48-Core - NVIDIA GeForce RTX 4090 24GB