feb 9950X

AMD Ryzen 9 9950X 16-Core testing with an ASRock X870E Taichi (3.12.AS02 BIOS) and XFX AMD Radeon RX 7900 XTX 24GB on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2502104-PTS-FEB9950X21&sro&grs.

feb 9950X

Runs: a, b, c, d

Processor: AMD Ryzen 9 9950X 16-Core @ 5.75GHz (16 Cores / 32 Threads)
Motherboard: ASRock X870E Taichi (3.12.AS02 BIOS)
Chipset: AMD Device 14d8
Memory: 2 x 16GB DDR5-6000MT/s F5-6000J2836G16G
Disk: Western Digital WD_BLACK SN850X 2000GB
Graphics: XFX AMD Radeon RX 7900 XTX 24GB
Audio: AMD Navi 31 HDMI/DP
Monitor: DELL U2723QE
Network: Realtek Device 8126 + MEDIATEK Device 0717
OS: Ubuntu 24.04
Kernel: 6.12.3-061203-generic (x86_64)
Desktop: GNOME Shell 46.0
Display Server: X Server 1.21.1.11 + Wayland
OpenGL: 4.6 Mesa 24.2.0-devel (LLVM 18.1.7 DRM 3.59)
Compiler: GCC 13.3.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fG75Ri/gcc-13-13.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: amd-pstate-epp powersave (Boost: Enabled EPP: balance_performance)
- CPU Microcode: 0xb404023

Python Details
- Python 2.7.16 + Python 3.12.3

Security Details
- gather_data_sampling: Not affected
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- reg_file_data_sampling: Not affected
- retbleed: Not affected
- spec_rstack_overflow: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected
- srbds: Not affected
- tsx_async_abort: Not affected

feb 9950X

          a          b          c          d  Test
         11      10.36      10.69      10.54  qmcpack: H4_ae
      89.38       88.6       93.6      92.18  llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 512
      91.25      92.04      88.49      90.77  llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512
   56787000   56789000   57980000   59018000  liquid-dsp: 1 - 256 - 32
     413.38     418.76     406.48     420.46  llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 512
   82303000   82576000   82543000   80008000  liquid-dsp: 2 - 256 - 512
  158040000  160690000  160370000  155760000  liquid-dsp: 4 - 256 - 512
  315200000  322050000  312230000  317070000  liquid-dsp: 8 - 256 - 512
      90.33      92.07       89.3      91.21  llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 1024
   41410000   41571000   42347000   42423000  liquid-dsp: 1 - 256 - 512
      88.09      90.16      89.69      89.74  llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048
     125.15     126.79     124.76     124.25  qmcpack: O_ae_pyscf_UHF
     395.79     402.62     394.84     399.88  llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024
  582860000  574670000  580280000  585540000  liquid-dsp: 16 - 256 - 512
     121.76     123.24     122.97     123.99  qmcpack: Li2_STO_ae
     375.05     370.37     371.21     376.59  llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048
      91.42      91.37      90.08      90.29  llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 1024
      88.41      89.13      89.14      89.55  llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048
  586180000  585930000  589920000  592680000  liquid-dsp: 8 - 256 - 57
     52.822     52.295     52.529     52.313  qmcpack: FeCO6_b3lyp_gms
  315060000  313980000  315910000  315130000  liquid-dsp: 4 - 256 - 57
 1096000000 1093500000 1091900000 1089400000  liquid-dsp: 16 - 256 - 57
     42.036     42.005     41.883     41.811  qmcpack: LiH_ae_MSD
  182470000  183150000  183260000  182390000  liquid-dsp: 2 - 256 - 57
   90088000   90375000   90142000   90466000  liquid-dsp: 1 - 256 - 57
       9.71       9.74       9.75       9.74  llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128
  113930000  113780000  114160000  113880000  liquid-dsp: 2 - 256 - 32
      65.08      65.01       65.1       64.9  llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128
 1596800000 1601200000 1599700000 1599300000  liquid-dsp: 32 - 256 - 57
 1583300000 1587600000 1584800000 1583700000  liquid-dsp: 32 - 256 - 32
  618230000  617980000  618330000  616800000  liquid-dsp: 32 - 256 - 512
       9.17       9.17       9.17       9.19  llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128
  892520000  893150000  891310000  892320000  liquid-dsp: 16 - 256 - 32
  454100000  454350000  453740000  454640000  liquid-dsp: 8 - 256 - 32
  229660000  229740000  229920000  229660000  liquid-dsp: 4 - 256 - 32

QMCPACK

Input: H4_ae

QMCPACK 4.0: Total Execution Time - Seconds, Fewer Is Better
a: 11.00, b: 10.36, c: 10.69, d: 10.54
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 89.38, b: 88.60, c: 93.60, d: 92.18
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 91.25, b: 92.04, c: 88.49, d: 90.77
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 56787000, b: 56789000, c: 57980000, d: 59018000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 413.38, b: 418.76, c: 406.48, d: 420.46
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 82303000, b: 82576000, c: 82543000, d: 80008000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 158040000, b: 160690000, c: 160370000, d: 155760000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 315200000, b: 322050000, c: 312230000, d: 317070000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 90.33, b: 92.07, c: 89.30, d: 91.21
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 41410000, b: 41571000, c: 42347000, d: 42423000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 88.09, b: 90.16, c: 89.69, d: 89.74
1. (CXX) g++ options: -O3

QMCPACK

Input: O_ae_pyscf_UHF

QMCPACK 4.0: Total Execution Time - Seconds, Fewer Is Better
a: 125.15, b: 126.79, c: 124.76, d: 124.25
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 395.79, b: 402.62, c: 394.84, d: 399.88
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 582860000, b: 574670000, c: 580280000, d: 585540000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

QMCPACK

Input: Li2_STO_ae

QMCPACK 4.0: Total Execution Time - Seconds, Fewer Is Better
a: 121.76, b: 123.24, c: 122.97, d: 123.99
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 375.05, b: 370.37, c: 371.21, d: 376.59
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 91.42, b: 91.37, c: 90.08, d: 90.29
1. (CXX) g++ options: -O3

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 88.41, b: 89.13, c: 89.14, d: 89.55
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 586180000, b: 585930000, c: 589920000, d: 592680000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

QMCPACK

Input: FeCO6_b3lyp_gms

QMCPACK 4.0: Total Execution Time - Seconds, Fewer Is Better
a: 52.82, b: 52.30, c: 52.53, d: 52.31
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 315060000, b: 313980000, c: 315910000, d: 315130000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 1096000000, b: 1093500000, c: 1091900000, d: 1089400000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

QMCPACK

Input: LiH_ae_MSD

QMCPACK 4.0: Total Execution Time - Seconds, Fewer Is Better
a: 42.04, b: 42.01, c: 41.88, d: 41.81
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 182470000, b: 183150000, c: 183260000, d: 182390000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 90088000, b: 90375000, c: 90142000, d: 90466000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 9.71, b: 9.74, c: 9.75, d: 9.74
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 113930000, b: 113780000, c: 114160000, d: 113880000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 65.08, b: 65.01, c: 65.10, d: 64.90
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.7: samples/s, More Is Better
a: 1596800000, b: 1601200000, c: 1599700000, d: 1599300000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 1583300000, b: 1587600000, c: 1584800000, d: 1583700000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.7: samples/s, More Is Better
a: 618230000, b: 617980000, c: 618330000, d: 616800000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Llama.cpp

Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128

Llama.cpp b4397: Tokens Per Second, More Is Better
a: 9.17, b: 9.17, c: 9.17, d: 9.19
1. (CXX) g++ options: -O3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 892520000, b: 893150000, c: 891310000, d: 892320000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 454100000, b: 454350000, c: 453740000, d: 454640000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.7: samples/s, More Is Better
a: 229660000, b: 229740000, c: 229920000, d: 229660000
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
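
Because runs a, b, c, and d share one system configuration, a quick way to read the results is to check how far the four runs differ from each other on a given test. The following is a minimal Python sketch of that check, not part of the Phoronix Test Suite output. The helper name `spread_percent` and the choice of (max - min) / mean as the metric are assumptions made for illustration; the numbers are copied from three of the results above.

```python
from statistics import mean

# Values for runs a, b, c, d, copied from the results in this document.
results = {
    "QMCPACK H4_ae (seconds)": [11.00, 10.36, 10.69, 10.54],
    "Llama.cpp Llama-3.1-Tulu-3-8B-Q8_0 Prompt Processing 512 (tokens/s)": [
        89.38, 88.60, 93.60, 92.18,
    ],
    "Liquid-DSP 2 threads, filter length 512 (samples/s)": [
        82303000, 82576000, 82543000, 80008000,
    ],
}


def spread_percent(values):
    """Hypothetical helper: (max - min) relative to the mean, in percent."""
    return (max(values) - min(values)) / mean(values) * 100.0


for name, values in results.items():
    print(f"{name}: mean={mean(values):.4g}, spread={spread_percent(values):.2f}%")
```

A small spread suggests the four runs are effectively repeat measurements of the same configuration; a larger one is a hint to look at that test more closely before drawing conclusions from a single run.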


Phoronix Test Suite v10.8.5