Benchmarks for a future article.
v6.13
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116
Java Notes: OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)
Python Notes: Python 3.12.7
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
v6.14 29 Jan Processor: AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads), Motherboard: Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS), Chipset: AMD 1Ah, Memory: 12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 24.10, Kernel: 6.13.0-phx (x86_64), Desktop: GNOME Shell 47.0, Display Server: X Server, Compiler: GCC 14.2.0, File-System: ext4, Screen Resolution: 1024x768
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116
Java Notes: OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10)
Python Notes: Python 3.12.7
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Linux 6.14 Early Benchmarks AMD EPYC
v6.13 vs. v6.14 29 Jan Comparison (Phoronix Test Suite), relative difference per test:
DaCapo Benchmark - Apache Lucene Search Engine: 10.1%
Apache Cassandra - Writes: 7.3%
RELION - Basic - CPU: 5.4%
Stress-NG - NUMA: 4%
Llama.cpp - CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048: 3.5%
RocksDB - Read While Writing: 3%
PostgreSQL - 100 - 800 - Read Write: 2.8%
PostgreSQL - 100 - 800 - Read Write - Average Latency: 2.8%
OpenVINO GenAI - TinyLlama-1.1B-Chat-v1.0 - CPU: 2.1%
OpenVINO GenAI - TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output Token: 2%
Linux 6.14 Early Benchmarks AMD EPYC - result overview
Test | v6.13 | v6.14 29 Jan
relion: Basic - CPU | 165.832 | 174.729
pgbench: 100 - 800 - Read Write - Average Latency | 6.216 | 6.387
pgbench: 100 - 800 - Read Write | 128762 | 125263
tensorflow: CPU - 512 - ResNet-50 | 253.76 | 253.53
build-linux-kernel: allmodconfig | 191.014 | 190.882
java-jmh: Throughput | 316508832117.05 | 316375911973.06
stress-ng: Pipe | 82801426.52 | 81959177.61
blender: Barbershop - CPU-Only | 146.69 | 147.05
svt-av1: Preset 3 - Bosphorus 4K | 17.033 | 17.094
pgbench: 100 - 800 - Read Only - Average Latency | 0.165 | 0.167
pgbench: 100 - 800 - Read Only | 4850455 | 4801097
cassandra: Writes | 461247 | 494806
build-nodejs: Time To Compile | 124.240 | 124.276
clickhouse: 100M Rows Hits Dataset, Third Run | 803.99 | 801.13
clickhouse: 100M Rows Hits Dataset, Second Run | 802.09 | 802.93
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache | 778.02 | 778.75
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 230.22542 | 226.99365
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 89.311917 | 88.166581
build-llvm: Ninja | 101.046 | 101.413
dacapobench: Tradebeans | 4870 | 4872
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048 | 143.45 | 144.23
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048 | 143.26 | 145.08
memcached: 1:5 | 3888552.68 | 3911714.46
memcached: 1:10 | 7054421.88 | 7187440.12
memcached: 1:100 | 13460038.76 | 13605595.65
rocksdb: Update Rand | 692027 | 691421
rocksdb: Read Rand Write Rand | 7398435 | 7348485
rocksdb: Read While Writing | 11734903 | 12083935
rocksdb: Rand Read | 535255862 | 535465377
blender: Pabellon Barcelona - CPU-Only | 46.85 | 47.08
dacapobench: Tradesoap | 2977 | 3033
dacapobench: Apache Lucene Search Engine | 3981 | 3615
blender: Classroom - CPU-Only | 41.15 | 41.16
svt-av1: Preset 5 - Bosphorus 4K | 60.574 | 60.392
dacapobench: Avrora AVR Simulation Framework | 2183 | 2184
stress-ng: NUMA | 2144.57 | 2062.27
stress-ng: SENDFILE | 2105661.95 | 2099036.56
stress-ng: Futex | 4271894.47 | 4224604.07
stress-ng: Context Switching | 52538077.77 | 52207921.65
stress-ng: Socket Activity | 47642.26 | 46759.61
stress-ng: Mutex | 42972205.27 | 42507064.95
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048 | 448.22 | 432.94
astcenc: Very Thorough | 10.1750 | 10.1756
astcenc: Exhaustive | 6.2336 | 6.2293
build-linux-kernel: defconfig | 21.756 | 21.758
dacapobench: Eclipse | 6244 | 6321
namd: STMV with 1,066,628 Atoms | 3.75749 | 3.75641
blender: Junkshop - CPU-Only | 19.97 | 19.94
gromacs: MPI CPU - water_GMX50_bare | 14.585 | 14.583
dacapobench: Apache Lucene Search Index | 2236 | 2233
openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output Token | 13.06 | 12.80
openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time To First Token | 16.16 | 16.10
openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU | 76.57 | 78.14
llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128 | 49.91 | 50.00
llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128 | 52.55 | 52.55
svt-av1: Preset 8 - Bosphorus 4K | 200.461 | 197.302
namd: ATPase with 327,506 Atoms | 13.07300 | 12.83401
llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128 | 118.47 | 117.88
svt-av1: Preset 13 - Bosphorus 4K | 471.679 | 471.825
dacapobench: Apache Tomcat | 1043 | 1060
openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU | no result | no result
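The percentages in the comparison can be reproduced from the raw overview values. The sketch below assumes the chart reports how much larger the bigger result is than the smaller one, which matches the listed figures for these four tests; the helper name `gap_percent` is hypothetical and the values are copied from the overview above.

```python
# Sketch: relative gap between two kernel results, in percent.
# Assumption: the comparison chart divides the larger value by the smaller one.

def gap_percent(a: float, b: float) -> float:
    """Return how much larger the bigger value is than the smaller one, in percent."""
    hi, lo = max(a, b), min(a, b)
    return (hi / lo - 1.0) * 100.0

# (v6.13, v6.14 29 Jan) pairs copied from the result overview.
results = {
    "dacapobench: Apache Lucene Search Engine": (3981, 3615),
    "cassandra: Writes": (461247, 494806),
    "relion: Basic - CPU": (165.832, 174.729),
    "stress-ng: NUMA": (2144.57, 2062.27),
}

for name, (v613, v614) in results.items():
    print(f"{name}: {gap_percent(v613, v614):.1f}%")
```

The gap alone does not say which kernel is ahead; that depends on whether the test is "More Is Better" or "Fewer Is Better".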
RELION
RELION 5.0 - Test: Basic - Device: CPU (Seconds, Fewer Is Better)
v6.14 29 Jan: 174.73 (SE +/- 1.63, N = 12)
v6.13: 165.83 (SE +/- 3.02, N = 12)
1. (CXX) g++ options: -fPIC -std=c++14 -fopenmp -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi
PostgreSQL
PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency (ms, Fewer Is Better)
v6.14 29 Jan: 6.387 (SE +/- 0.031, N = 3)
v6.13: 6.216 (SE +/- 0.042, N = 12)
PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write (TPS, More Is Better)
v6.14 29 Jan: 125263 (SE +/- 608.11, N = 3)
v6.13: 128762 (SE +/- 895.52, N = 12)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm
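The two PostgreSQL read-write figures can be cross-checked against each other. The sketch below assumes the average latency is approximately the client count divided by throughput (1000 * clients / TPS, in milliseconds); that relationship is an assumption about how the figure is derived, not something stated in this result file, and the helper name `expected_latency_ms` is hypothetical.

```python
# Sketch: relate pgbench throughput to average latency.
# Assumption: average latency (ms) is roughly 1000 * clients / TPS.

def expected_latency_ms(clients: int, tps: float) -> float:
    """Average latency in milliseconds implied by a client count and a TPS figure."""
    return 1000.0 * clients / tps

# Read Write results at 800 clients: (TPS, reported average latency in ms).
reported = {
    "v6.13": (128762, 6.216),
    "v6.14 29 Jan": (125263, 6.387),
}

for kernel, (tps, latency_ms) in reported.items():
    implied = expected_latency_ms(800, tps)
    print(f"{kernel}: implied {implied:.3f} ms, reported {latency_ms} ms")
```

Under that assumption the implied and reported latencies agree to within a few microseconds, so the latency and TPS results describe the same difference from two angles rather than two independent findings.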
TensorFlow This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.16.1 - Device: CPU - Batch Size: 512 - Model: ResNet-50 (images/sec, More Is Better)
v6.14 29 Jan: 253.53 (SE +/- 0.19, N = 3)
v6.13: 253.76 (SE +/- 0.26, N = 3)
Stress-NG
Stress-NG 0.18.09 - Test: Pipe (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 81959177.61 (SE +/- 1868912.16, N = 15)
v6.13: 82801426.52 (SE +/- 1013892.33, N = 15)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched
Blender
Blender 4.3 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
v6.14 29 Jan: 147.05 (SE +/- 0.22, N = 3)
v6.13: 146.69 (SE +/- 0.07, N = 3)
SVT-AV1
SVT-AV1 2.3 - Encoder Mode: Preset 3 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
v6.14 29 Jan: 17.09 (SE +/- 0.05, N = 3)
v6.13: 17.03 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
PostgreSQL
PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency (ms, Fewer Is Better)
v6.14 29 Jan: 0.167 (SE +/- 0.000, N = 3)
v6.13: 0.165 (SE +/- 0.000, N = 3)
PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only (TPS, More Is Better)
v6.14 29 Jan: 4801097 (SE +/- 2758.89, N = 3)
v6.13: 4850455 (SE +/- 12547.95, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm
ClickHouse ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.
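As a rough illustration of how a geometric-mean aggregate over separate queries behaves, the sketch below converts per-query times to queries per minute and averages them geometrically. The query times are made up for illustration, and the 60 / seconds conversion is an assumption about how the test profile derives its "Queries Per Minute" figure.

```python
import math

def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms for stability."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Hypothetical per-query processing times in seconds (not taken from this result file).
query_seconds = [0.007, 0.05, 0.2, 0.9]

# Assumed conversion: one query taking s seconds corresponds to 60 / s queries per minute.
queries_per_minute = [60.0 / s for s in query_seconds]

print(f"geometric mean: {geometric_mean(queries_per_minute):.2f} queries per minute")
print(f"arithmetic mean: {sum(queries_per_minute) / len(queries_per_minute):.2f} queries per minute")
```

The geometric mean keeps a handful of very fast queries from dominating the aggregate, which is why it sits far below the arithmetic mean when per-query rates span several orders of magnitude.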
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Third Run (Queries Per Minute, Geo Mean, More Is Better)
v6.14 29 Jan: 801.13 (SE +/- 5.26, N = 3; MIN: 65.01 / MAX: 8571.43)
v6.13: 803.99 (SE +/- 1.04, N = 3; MIN: 66.08 / MAX: 10000)
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Second Run (Queries Per Minute, Geo Mean, More Is Better)
v6.14 29 Jan: 802.93 (SE +/- 5.08, N = 3; MIN: 66.01 / MAX: 8571.43)
v6.13: 802.09 (SE +/- 7.48, N = 3; MIN: 65.43 / MAX: 8571.43)
ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, First Run / Cold Cache (Queries Per Minute, Geo Mean, More Is Better)
v6.14 29 Jan: 778.75 (SE +/- 4.26, N = 3; MIN: 65.57 / MAX: 7500)
v6.13: 778.02 (SE +/- 6.64, N = 3; MIN: 65.22 / MAX: 8571.43)
OpenFOAM OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better)
v6.14 29 Jan: 226.99
v6.13: 230.23
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better)
v6.14 29 Jan: 88.17
v6.13: 89.31
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm
Llama.cpp
Llama.cpp b4397 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 144.23 (SE +/- 0.77, N = 3)
v6.13: 143.45 (SE +/- 0.48, N = 3)
Llama.cpp b4397 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 145.08 (SE +/- 0.59, N = 3)
v6.13: 143.26 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3
Memcached Memcached is a high performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.
Memcached 1.6.19 - Set To Get Ratio: 1:5 (Ops/sec, More Is Better)
v6.14 29 Jan: 3911714.46 (SE +/- 5274.38, N = 3)
v6.13: 3888552.68 (SE +/- 987.88, N = 3)
Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
v6.14 29 Jan: 7187440.12 (SE +/- 21122.44, N = 3)
v6.13: 7054421.88 (SE +/- 11715.29, N = 3)
Memcached 1.6.19 - Set To Get Ratio: 1:100 (Ops/sec, More Is Better)
v6.14 29 Jan: 13605595.65 (SE +/- 9305.87, N = 3)
v6.13: 13460038.76 (SE +/- 65975.03, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
RocksDB This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.
RocksDB 9.0 - Test: Update Random (Op/s, More Is Better)
v6.14 29 Jan: 691421 (SE +/- 1034.41, N = 3)
v6.13: 692027 (SE +/- 200.83, N = 3)
RocksDB 9.0 - Test: Read Random Write Random (Op/s, More Is Better)
v6.14 29 Jan: 7348485 (SE +/- 69319.41, N = 3)
v6.13: 7398435 (SE +/- 47503.65, N = 3)
RocksDB 9.0 - Test: Read While Writing (Op/s, More Is Better)
v6.14 29 Jan: 12083935 (SE +/- 116959.75, N = 3)
v6.13: 11734903 (SE +/- 94771.95, N = 3)
RocksDB 9.0 - Test: Random Read (Op/s, More Is Better)
v6.14 29 Jan: 535465377 (SE +/- 657385.47, N = 3)
v6.13: 535255862 (SE +/- 1082500.13, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
Blender
Blender 4.3 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
v6.14 29 Jan: 47.08 (SE +/- 0.09, N = 3)
v6.13: 46.85 (SE +/- 0.07, N = 3)
Blender 4.3 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
v6.14 29 Jan: 41.16 (SE +/- 0.01, N = 3)
v6.13: 41.15 (SE +/- 0.05, N = 3)
SVT-AV1
SVT-AV1 2.3 - Encoder Mode: Preset 5 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
v6.14 29 Jan: 60.39 (SE +/- 0.05, N = 3)
v6.13: 60.57 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Stress-NG
Stress-NG 0.18.09 - Test: NUMA (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 2062.27 (SE +/- 4.06, N = 3)
v6.13: 2144.57 (SE +/- 3.79, N = 3)
Stress-NG 0.18.09 - Test: SENDFILE (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 2099036.56 (SE +/- 4114.32, N = 3)
v6.13: 2105661.95 (SE +/- 3455.33, N = 3)
Stress-NG 0.18.09 - Test: Futex (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 4224604.07 (SE +/- 47235.90, N = 3)
v6.13: 4271894.47 (SE +/- 31436.04, N = 3)
Stress-NG 0.18.09 - Test: Context Switching (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 52207921.65 (SE +/- 58765.40, N = 3)
v6.13: 52538077.77 (SE +/- 424783.11, N = 3)
Stress-NG 0.18.09 - Test: Socket Activity (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 46759.61 (SE +/- 50.78, N = 3)
v6.13: 47642.26 (SE +/- 61.51, N = 3)
Stress-NG 0.18.09 - Test: Mutex (Bogo Ops/s, More Is Better)
v6.14 29 Jan: 42507064.95 (SE +/- 317020.74, N = 3)
v6.13: 42972205.27 (SE +/- 125947.68, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched
Llama.cpp
Llama.cpp b4397 - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 432.94 (SE +/- 2.05, N = 3)
v6.13: 448.22 (SE +/- 3.77, N = 3)
1. (CXX) g++ options: -O3
ASTC Encoder
ASTC Encoder 5.0 - Preset: Very Thorough (MT/s, More Is Better)
v6.14 29 Jan: 10.18 (SE +/- 0.01, N = 3)
v6.13: 10.18 (SE +/- 0.00, N = 3)
ASTC Encoder 5.0 - Preset: Exhaustive (MT/s, More Is Better)
v6.14 29 Jan: 6.2293 (SE +/- 0.0014, N = 3)
v6.13: 6.2336 (SE +/- 0.0010, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread
NAMD
NAMD 3.0 - Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
v6.14 29 Jan: 3.75641 (SE +/- 0.00440, N = 3)
v6.13: 3.75749 (SE +/- 0.01116, N = 3)
Blender
Blender 4.3 - Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better)
v6.14 29 Jan: 19.94 (SE +/- 0.01, N = 3)
v6.13: 19.97 (SE +/- 0.01, N = 3)
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
GROMACS 2024 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
v6.14 29 Jan: 14.58 (SE +/- 0.16, N = 3)
v6.13: 14.59 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -lm
OpenVINO GenAI
OpenVINO GenAI 2024.5 - Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time Per Output Token (ms, Fewer Is Better)
v6.14 29 Jan: 12.80 (SE +/- 0.02, N = 3)
v6.13: 13.06 (SE +/- 0.13, N = 3)
Llama.cpp
Llama.cpp b4397 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 50.00 (SE +/- 0.03, N = 3)
v6.13: 49.91 (SE +/- 0.09, N = 3)
Llama.cpp b4397 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 52.55 (SE +/- 0.02, N = 3)
v6.13: 52.55 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -O3
SVT-AV1
SVT-AV1 2.3 - Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
v6.14 29 Jan: 197.30 (SE +/- 1.79, N = 3)
v6.13: 200.46 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
NAMD
NAMD 3.0 - Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
v6.14 29 Jan: 12.83 (SE +/- 0.16, N = 4)
v6.13: 13.07 (SE +/- 0.09, N = 3)
Llama.cpp
Llama.cpp b4397 - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
v6.14 29 Jan: 117.88 (SE +/- 0.46, N = 3)
v6.13: 118.47 (SE +/- 0.52, N = 3)
1. (CXX) g++ options: -O3
SVT-AV1
SVT-AV1 2.3 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
v6.14 29 Jan: 471.83 (SE +/- 2.61, N = 3)
v6.13: 471.68 (SE +/- 4.76, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenVINO GenAI Model: Gemma-7b-int4-ov - Device: CPU
v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
Model: Falcon-7b-instruct-int4-ov - Device: CPU
v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU
v6.13: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
v6.14 29 Jan: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:
v6.13: Testing initiated at 29 January 2025 19:58 by user phoronix.
v6.14 29 Jan: Testing initiated at 30 January 2025 01:11 by user phoronix.