Linux 6.14 Early Benchmarks AMD EPYC

Benchmarks for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501308-NE-LINUX614E01
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
v6.13
January 29
  4 Hours, 32 Minutes
v6.14 29 Jan
January 30
  3 Hours, 52 Minutes
Invert Behavior (Only Show Selected Data)
  4 Hours, 12 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Linux 6.14 Early Benchmarks AMD EPYCOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.13.0-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionLinux 6.14 Early Benchmarks AMD EPYC PerformanceSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116 - OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)- Python 3.12.7- v6.13: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - v6.14 29 Jan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

v6.13 vs. v6.14 29 Jan ComparisonPhoronix Test SuiteBaseline+2.5%+2.5%+5%+5%+7.5%+7.5%+10%+10%10.1%7.3%3%2.1%2%A.L.S.EWritesBasic - CPU5.4%NUMA4%CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - P.P.23.5%Read While Writing100 - 800 - Read Write2.8%100 - 800 - Read Write - Average Latency2.8%TinyLlama-1.1B-Chat-v1.0 - CPUTinyLlama-1.1B-Chat-v1.0 - CPU - T.P.O.TDaCapo BenchmarkApache CassandraRELIONStress-NGLlama.cppRocksDBPostgreSQLPostgreSQLOpenVINO GenAIOpenVINO GenAIv6.13v6.14 29 Jan

Linux 6.14 Early Benchmarks AMD EPYCstress-ng: NUMAstress-ng: Pipestress-ng: Futexstress-ng: Mutexstress-ng: SENDFILEstress-ng: Socket Activitystress-ng: Context Switchingsvt-av1: Preset 3 - Bosphorus 4Ksvt-av1: Preset 5 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Ktensorflow: CPU - 512 - ResNet-50astcenc: Exhaustiveastcenc: Very Thoroughgromacs: MPI CPU - water_GMX50_barenamd: ATPase with 327,506 Atomsnamd: STMV with 1,066,628 Atomscassandra: Writesrocksdb: Rand Readrocksdb: Update Randrocksdb: Read While Writingrocksdb: Read Rand Write Randjava-jmh: Throughputmemcached: 1:5memcached: 1:10memcached: 1:100clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPUpgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Writepgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 800 - Read Write - Average Latencyopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time To First Tokenopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output Tokendacapobench: Eclipsedacapobench: Tradesoapdacapobench: Tradebeansdacapobench: Apache Tomcatdacapobench: Apache Lucene Search Indexdacapobench: Apache Lucene Search Enginedacapobench: Avrora AVR Simulation Frameworkopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timerelion: Basic - CPUbuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigblender: Junkshop - CPU-Onlyblender: Classroom - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlybuild-llvm: Ninjabuild-nodejs: Time To Compilev6.13v6.14 29 Jan2144.5782801426.524271894.4742972205.272105661.9547642.2652538077.7717.03360.574200.461471.679253.766.233610.175014.58513.073003.75749461247535255862692027117349037398435316508832117.053888552.687054421.8813460038.76778.02802.09803.9949.91143.4552.55143.26118.47448.2276.5748504551287620.1656.21616.1613.06624429774870104322363981218389.311917230.22542165.83221.756191.01419.9741.15146.6946.85101.046124.2402062.2781959177.614224604.0742507064.952099036.5646759.6152207921.6517.09460.392197.302471.825253.536.229310.175614.58312.834013.75641494806535465377691421120839357348485316375911973.063911714.467187440.1213605595.65778.75802.93801.1350.00144.2352.55145.08117.88432.9478.1448010971252630.1676.38716.1012.80632130334872106022333615218488.166581226.99365174.72921.758190.88219.9441.16147.0547.08101.413124.276OpenBenchmarking.org

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: NUMAv6.13v6.14 29 Jan5001000150020002500SE +/- 3.79, N = 3SE +/- 4.06, N = 32144.572062.271. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Pipev6.13v6.14 29 Jan20M40M60M80M100MSE +/- 1013892.33, N = 15SE +/- 1868912.16, N = 1582801426.5281959177.611. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Futexv6.13v6.14 29 Jan900K1800K2700K3600K4500KSE +/- 31436.04, N = 3SE +/- 47235.90, N = 34271894.474224604.071. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Mutexv6.13v6.14 29 Jan9M18M27M36M45MSE +/- 125947.68, N = 3SE +/- 317020.74, N = 342972205.2742507064.951. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: SENDFILEv6.13v6.14 29 Jan500K1000K1500K2000K2500KSE +/- 3455.33, N = 3SE +/- 4114.32, N = 32105661.952099036.561. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Socket Activityv6.13v6.14 29 Jan10K20K30K40K50KSE +/- 61.51, N = 3SE +/- 50.78, N = 347642.2646759.611. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Context Switchingv6.13v6.14 29 Jan11M22M33M44M55MSE +/- 424783.11, N = 3SE +/- 58765.40, N = 352538077.7752207921.651. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 3 - Input: Bosphorus 4Kv6.14 29 Janv6.1348121620SE +/- 0.05, N = 3SE +/- 0.03, N = 317.0917.031. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 5 - Input: Bosphorus 4Kv6.13v6.14 29 Jan1428425670SE +/- 0.27, N = 3SE +/- 0.05, N = 360.5760.391. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 8 - Input: Bosphorus 4Kv6.13v6.14 29 Jan4080120160200SE +/- 0.02, N = 3SE +/- 1.79, N = 3200.46197.301. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 13 - Input: Bosphorus 4Kv6.14 29 Janv6.13100200300400500SE +/- 2.61, N = 3SE +/- 4.76, N = 3471.83471.681. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.16.1Device: CPU - Batch Size: 512 - Model: ResNet-50v6.13v6.14 29 Jan60120180240300SE +/- 0.26, N = 3SE +/- 0.19, N = 3253.76253.53

ASTC Encoder

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Exhaustivev6.13v6.14 29 Jan246810SE +/- 0.0010, N = 3SE +/- 0.0014, N = 36.23366.22931. (CXX) g++ options: -O3 -flto -pthread

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 5.0Preset: Very Thoroughv6.14 29 Janv6.133691215SE +/- 0.01, N = 3SE +/- 0.00, N = 310.1810.181. (CXX) g++ options: -O3 -flto -pthread

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_barev6.13v6.14 29 Jan48121620SE +/- 0.08, N = 3SE +/- 0.16, N = 314.5914.581. (CXX) g++ options: -O3 -lm

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: ATPase with 327,506 Atomsv6.13v6.14 29 Jan3691215SE +/- 0.09, N = 3SE +/- 0.16, N = 413.0712.83

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: STMV with 1,066,628 Atomsv6.13v6.14 29 Jan0.84541.69082.53623.38164.227SE +/- 0.01116, N = 3SE +/- 0.00440, N = 33.757493.75641

Apache Cassandra

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 5.0Test: Writesv6.14 29 Janv6.13110K220K330K440K550KSE +/- 4037.80, N = 3SE +/- 2230.81, N = 3494806461247

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random Readv6.14 29 Janv6.13110M220M330M440M550MSE +/- 657385.47, N = 3SE +/- 1082500.13, N = 35354653775352558621. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update Randomv6.13v6.14 29 Jan150K300K450K600K750KSE +/- 200.83, N = 3SE +/- 1034.41, N = 36920276914211. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While Writingv6.14 29 Janv6.133M6M9M12M15MSE +/- 116959.75, N = 3SE +/- 94771.95, N = 312083935117349031. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write Randomv6.13v6.14 29 Jan1.6M3.2M4.8M6.4M8MSE +/- 47503.65, N = 3SE +/- 69319.41, N = 3739843573484851. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Java JMH

This very basic test profile runs the stock benchmark of the Java JMH benchmark via Maven. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/s, More Is BetterJava JMHThroughputv6.13v6.14 29 Jan70000M140000M210000M280000M350000M316508832117.05316375911973.06

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:5v6.14 29 Janv6.13800K1600K2400K3200K4000KSE +/- 5274.38, N = 3SE +/- 987.88, N = 33911714.463888552.681. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10v6.14 29 Janv6.131.5M3M4.5M6M7.5MSE +/- 21122.44, N = 3SE +/- 11715.29, N = 37187440.127054421.881. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100v6.14 29 Janv6.133M6M9M12M15MSE +/- 9305.87, N = 3SE +/- 65975.03, N = 313605595.6513460038.761. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold Cachev6.14 29 Janv6.132004006008001000SE +/- 4.26, N = 3SE +/- 6.64, N = 3778.75778.02MIN: 65.57 / MAX: 7500MIN: 65.22 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second Runv6.14 29 Janv6.132004006008001000SE +/- 5.08, N = 3SE +/- 7.48, N = 3802.93802.09MIN: 66.01 / MAX: 8571.43MIN: 65.43 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third Runv6.13v6.14 29 Jan2004006008001000SE +/- 1.04, N = 3SE +/- 5.26, N = 3803.99801.13MIN: 66.08 / MAX: 10000MIN: 65.01 / MAX: 8571.43

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128v6.14 29 Janv6.131122334455SE +/- 0.03, N = 3SE +/- 0.09, N = 350.0049.911. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048v6.14 29 Janv6.13306090120150SE +/- 0.77, N = 3SE +/- 0.48, N = 3144.23143.451. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128v6.14 29 Janv6.131224364860SE +/- 0.02, N = 3SE +/- 0.15, N = 352.5552.551. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048v6.14 29 Janv6.13306090120150SE +/- 0.59, N = 3SE +/- 0.13, N = 3145.08143.261. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128v6.13v6.14 29 Jan306090120150SE +/- 0.52, N = 3SE +/- 0.46, N = 3118.47117.881. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048v6.13v6.14 29 Jan100200300400500SE +/- 3.77, N = 3SE +/- 2.05, N = 3448.22432.941. (CXX) g++ options: -O3

OpenVINO GenAI

Model: Gemma-7b-int4-ov - Device: CPU

v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

OpenBenchmarking.orgtokens/s, More Is BetterOpenVINO GenAI 2024.5Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPUv6.14 29 Janv6.1320406080100SE +/- 0.12, N = 3SE +/- 0.76, N = 378.1476.57

Model: Falcon-7b-instruct-int4-ov - Device: CPU

v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU

v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

PostgreSQL

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Onlyv6.13v6.14 29 Jan1000K2000K3000K4000K5000KSE +/- 12547.95, N = 3SE +/- 2758.89, N = 3485045548010971. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Writev6.13v6.14 29 Jan30K60K90K120K150KSE +/- 895.52, N = 12SE +/- 608.11, N = 31287621252631. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencyv6.13v6.14 29 Jan0.03760.07520.11280.15040.188SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1650.1671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencyv6.13v6.14 29 Jan246810SE +/- 0.042, N = 12SE +/- 0.031, N = 36.2166.3871. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Eclipsev6.13v6.14 29 Jan14002800420056007000SE +/- 9.71, N = 3SE +/- 13.28, N = 362446321

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Tradesoapv6.13v6.14 29 Jan7001400210028003500SE +/- 17.32, N = 3SE +/- 24.47, N = 1529773033

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Tradebeansv6.13v6.14 29 Jan10002000300040005000SE +/- 93.65, N = 15SE +/- 91.74, N = 1548704872

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Tomcatv6.13v6.14 29 Jan2004006008001000SE +/- 2.08, N = 3SE +/- 12.17, N = 310431060

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search Indexv6.14 29 Janv6.135001000150020002500SE +/- 17.23, N = 3SE +/- 6.03, N = 322332236

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Apache Lucene Search Enginev6.14 29 Janv6.139001800270036004500SE +/- 39.07, N = 4SE +/- 47.78, N = 1536153981

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 23.11Java Test: Avrora AVR Simulation Frameworkv6.13v6.14 29 Jan5001000150020002500SE +/- 27.74, N = 15SE +/- 33.19, N = 1521832184

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Timev6.14 29 Janv6.132040608010088.1789.311. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Timev6.14 29 Janv6.1350100150200250226.99230.231. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

RELION

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 5.0Test: Basic - Device: CPUv6.13v6.14 29 Jan4080120160200SE +/- 3.02, N = 12SE +/- 1.63, N = 12165.83174.731. (CXX) g++ options: -fPIC -std=c++14 -fopenmp -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: defconfigv6.13v6.14 29 Jan510152025SE +/- 0.24, N = 4SE +/- 0.28, N = 321.7621.76

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfigv6.14 29 Janv6.134080120160200SE +/- 0.30, N = 3SE +/- 0.40, N = 3190.88191.01

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: CPU-Onlyv6.14 29 Janv6.13510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 319.9419.97

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: CPU-Onlyv6.13v6.14 29 Jan918273645SE +/- 0.05, N = 3SE +/- 0.01, N = 341.1541.16

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: CPU-Onlyv6.13v6.14 29 Jan306090120150SE +/- 0.07, N = 3SE +/- 0.22, N = 3146.69147.05

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: CPU-Onlyv6.13v6.14 29 Jan1122334455SE +/- 0.07, N = 3SE +/- 0.09, N = 346.8547.08

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninjav6.13v6.14 29 Jan20406080100SE +/- 0.25, N = 3SE +/- 0.18, N = 3101.05101.41

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To Compilev6.13v6.14 29 Jan306090120150SE +/- 0.04, N = 3SE +/- 0.16, N = 3124.24124.28

58 Results Shown

Stress-NG:
  NUMA
  Pipe
  Futex
  Mutex
  SENDFILE
  Socket Activity
  Context Switching
SVT-AV1:
  Preset 3 - Bosphorus 4K
  Preset 5 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
TensorFlow
ASTC Encoder:
  Exhaustive
  Very Thorough
GROMACS
NAMD:
  ATPase with 327,506 Atoms
  STMV with 1,066,628 Atoms
Apache Cassandra
RocksDB:
  Rand Read
  Update Rand
  Read While Writing
  Read Rand Write Rand
Java JMH
Memcached:
  1:5
  1:10
  1:100
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
Llama.cpp:
  CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128
  CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048
  CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128
  CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048
  CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128
  CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048
OpenVINO GenAI
PostgreSQL:
  100 - 800 - Read Only
  100 - 800 - Read Write
  100 - 800 - Read Only - Average Latency
  100 - 800 - Read Write - Average Latency
DaCapo Benchmark:
  Eclipse
  Tradesoap
  Tradebeans
  Apache Tomcat
  Apache Lucene Search Index
  Apache Lucene Search Engine
  Avrora AVR Simulation Framework
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
RELION
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Blender:
  Junkshop - CPU-Only
  Classroom - CPU-Only
  Barbershop - CPU-Only
  Pabellon Barcelona - CPU-Only
Timed LLVM Compilation
Timed Node.js Compilation