Benchmarks by Michael Larabel for a future article.
Linux 6.8 Processor: 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.8.0-060800-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113ePython Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Linux 6.9 27 Mar OS: Ubuntu 23.10, Kernel: 6.9.0-060900rc1daily20240327-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113ePython Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
AMD EPYC Genoa-X Linux 6.9 Kernel Benchmarks OpenBenchmarking.org Phoronix Test Suite 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads) AMD Titanite_4G (RTI1007B BIOS) AMD Device 14a4 1520GB 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive ASPEED Broadcom NetXtreme BCM5720 PCIe Ubuntu 23.10 6.8.0-060800-generic (x86_64) 6.9.0-060900rc1daily20240327-generic (x86_64) GCC 13.2.0 ext4 1920x1200 Processor Motherboard Chipset Memory Disk Graphics Network OS Kernels Compiler File-System Screen Resolution AMD EPYC Genoa-X Linux 6.9 Kernel Benchmarks Performance System Logs - Transparent Huge Pages: madvise - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e - Python 3.11.6 - Linux 6.8: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - Linux 6.9 27 Mar: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Linux 6.8 vs. Linux 6.9 27 Mar Comparison Phoronix Test Suite Baseline +18.9% +18.9% +37.8% +37.8% +56.7% +56.7% 75.4% 11.2% 10.8% 9.5% 9.3% 8.5% 8.1% 7% 6.8% 5.2% 5.1% 5.1% 4.7% 4.5% 4.4% 4.4% 3.9% 3.3% 3.2% 3.1% 3% 3% 2.8% 2.7% 2.7% 2.6% 2.5% 2.4% 2.3% 2% llava-v1.5-7b-q4 - CPU S.V.M.P Semaphores R.R.W.R Overwrite Read While Writing Update Rand NUMA Update Rand mistral-7b-instruct-v0.2.Q8_0 - CPU R.S.A.F.I - CPU R.S.A.F.I - CPU Rand Read V.D.F.I - CPU Context Switching V.D.F.I - CPU 32 - Process AVX-512 VNNI Vector Math M.T.E.T.D.F - CPU M.T.E.T.D.F - CPU Preset 12 - Bosphorus 4K F.D.R.F.I - CPU F.D.R.F.I - CPU CORAL2 P2 Rand Read H.E.R.F.I - CPU Matrix Math H.E.R.F.I - CPU R.R.W.R 2% 1024 Llamafile Stress-NG Stress-NG RocksDB RocksDB RocksDB RocksDB Stress-NG Speedb Llamafile OpenVINO OpenVINO RocksDB OpenVINO Stress-NG OpenVINO Hackbench Stress-NG Stress-NG OpenVINO OpenVINO SVT-AV1 OpenVINO OpenVINO Quicksilver Speedb OpenVINO Stress-NG OpenVINO Speedb MariaDB mariadb-slap Linux 6.8 Linux 6.9 27 Mar
RocksDB OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read Random Write Random Linux 6.9 27 Mar Linux 6.8 700K 1400K 2100K 2800K 3500K SE +/- 3987.34, N = 3 SE +/- 3761.31, N = 3 3484877 3182572 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Overwrite Linux 6.9 27 Mar Linux 6.8 90K 180K 270K 360K 450K SE +/- 1320.84, N = 3 SE +/- 180.51, N = 3 416860 381523 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Update Random Linux 6.9 27 Mar Linux 6.8 90K 180K 270K 360K 450K SE +/- 1155.95, N = 3 SE +/- 379.66, N = 3 406258 375714 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Speedb Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Update Random Linux 6.9 27 Mar Linux 6.8 70K 140K 210K 280K 350K SE +/- 2172.75, N = 3 SE +/- 945.90, N = 3 313791 293746 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 3 6 9 12 15 SE +/- 0.04, N = 3 SE +/- 0.04, N = 3 13.46 12.79
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 900 1800 2700 3600 4500 SE +/- 3.68, N = 3 SE +/- 1.98, N = 3 4063.11 3864.65 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 3 6 9 12 15 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 11.80 12.40 MIN: 11.09 / MAX: 37.4 MIN: 11.56 / MAX: 38.83 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
RocksDB OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Random Read Linux 6.9 27 Mar Linux 6.8 200M 400M 600M 800M 1000M SE +/- 2941599.90, N = 3 SE +/- 4458137.36, N = 3 1096081951 1047290863 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 3K 6K 9K 12K 15K SE +/- 24.42, N = 3 SE +/- 5.76, N = 3 11719.99 11218.12 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Vehicle Detection FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 0.9585 1.917 2.8755 3.834 4.7925 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 4.08 4.26 MIN: 3.64 / MAX: 20.85 MIN: 3.76 / MAX: 20.9 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org Bogo Ops/s, More Is Better Stress-NG 0.16.04 Test: Vector Math Linux 6.9 27 Mar Linux 6.8 300K 600K 900K 1200K 1500K SE +/- 1326.11, N = 3 SE +/- 1312.02, N = 3 1349057.51 1306706.19 1. (CXX) g++ options: -O2 -std=gnu99 -lc
OpenVINO OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU Linux 6.9 27 Mar Linux 6.8 10 20 30 40 50 SE +/- 0.02, N = 3 SE +/- 0.05, N = 3 42.82 44.13 MIN: 34.68 / MAX: 192.44 MIN: 36.04 / MAX: 216.03 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Machine Translation EN To DE FP16 - Device: CPU Linux 6.9 27 Mar Linux 6.8 200 400 600 800 1000 SE +/- 0.57, N = 3 SE +/- 1.18, N = 3 1119.56 1086.48 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 12 - Input: Bosphorus 4K Linux 6.9 27 Mar Linux 6.8 40 80 120 160 200 SE +/- 0.71, N = 3 SE +/- 1.75, N = 3 164.31 159.51 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenVINO OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 1.2375 2.475 3.7125 4.95 6.1875 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 5.35 5.50 MIN: 4.84 / MAX: 24.91 MIN: 4.83 / MAX: 27 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Face Detection Retail FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 8K 16K 24K 32K 40K SE +/- 83.35, N = 3 SE +/- 19.55, N = 3 35740.91 34785.75 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P2 Linux 6.9 27 Mar Linux 6.8 3M 6M 9M 12M 15M SE +/- 272810.02, N = 9 SE +/- 81921.37, N = 3 16294444 15873333 1. (CXX) g++ options: -fopenmp -O3 -march=native
Speedb Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Read Linux 6.9 27 Mar Linux 6.8 200M 400M 600M 800M 1000M SE +/- 848602.66, N = 3 SE +/- 5405476.66, N = 3 1112167042 1084511886 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenVINO OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 8 16 24 32 40 SE +/- 0.13, N = 3 SE +/- 0.05, N = 3 32.43 33.25 MIN: 29.07 / MAX: 50.08 MIN: 29.55 / MAX: 52.69 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Handwritten English Recognition FP16-INT8 - Device: CPU Linux 6.9 27 Mar Linux 6.8 1300 2600 3900 5200 6500 SE +/- 26.57, N = 3 SE +/- 9.86, N = 3 5898.06 5765.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Speedb Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Read Random Write Random Linux 6.9 27 Mar Linux 6.8 300K 600K 900K 1200K 1500K SE +/- 7976.44, N = 3 SE +/- 2371.05, N = 3 1572109 1603891 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
MariaDB mariadb-slap OpenBenchmarking.org Queries Per Second, More Is Better MariaDB mariadb-slap 11.5 Clients: 1024 Linux 6.9 27 Mar Linux 6.8 12 24 36 48 60 SE +/- 0.26, N = 3 SE +/- 0.46, N = 3 52 51 1. (CXX) g++ options: -fPIC -pie -fstack-protector -O3 -shared -lrt -lpthread -lz -ldl -lm -lstdc++
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 8 - Input: Bosphorus 4K Linux 6.9 27 Mar Linux 6.8 20 40 60 80 100 SE +/- 0.66, N = 15 SE +/- 0.60, N = 15 88.14 86.62 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 4 - Input: Bosphorus 4K Linux 6.9 27 Mar Linux 6.8 2 4 6 8 10 SE +/- 0.054, N = 3 SE +/- 0.052, N = 3 8.565 8.431 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Sockperf This is a network socket API performance benchmark developed by Mellanox. This test profile runs both the client and server on the local host for evaluating individual system performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org usec, Fewer Is Better Sockperf 3.7 Test: Latency Under Load Linux 6.9 27 Mar Linux 6.8 4 8 12 16 20 SE +/- 0.12, N = 24 SE +/- 0.13, N = 10 16.11 16.36 1. (CXX) g++ options: --param -O3 -rdynamic
Sockperf This is a network socket API performance benchmark developed by Mellanox. This test profile runs both the client and server on the local host for evaluating individual system performance. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Messages Per Second, More Is Better Sockperf 3.7 Test: Throughput Linux 6.9 27 Mar Linux 6.8 70K 140K 210K 280K 350K SE +/- 3244.28, N = 6 SE +/- 2015.52, N = 5 324029 319386 1. (CXX) g++ options: --param -O3 -rdynamic
TensorFlow OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 256 - Model: ResNet-50 Linux 6.9 27 Mar Linux 6.8 30 60 90 120 150 SE +/- 0.86, N = 3 SE +/- 0.34, N = 3 119.29 117.77
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms Linux 6.9 27 Mar Linux 6.8 2 4 6 8 10 SE +/- 0.00170, N = 3 SE +/- 0.00857, N = 3 6.38903 6.46062
Quicksilver Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 Linux 6.9 27 Mar Linux 6.8 4M 8M 12M 16M 20M SE +/- 156027.06, N = 3 SE +/- 17638.34, N = 3 17903333 17713333 1. (CXX) g++ options: -fopenmp -O3 -march=native
OpenVINO OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU Linux 6.9 27 Mar Linux 6.8 1700 3400 5100 6800 8500 SE +/- 55.96, N = 3 SE +/- 90.39, N = 3 7762.09 7703.03 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 Linux 6.9 27 Mar Linux 6.8 20000M 40000M 60000M 80000M 100000M SE +/- 200003199.85, N = 3 SE +/- 24428915.02, N = 3 91315613557 92015573177 1. (CC) gcc options: -pthread -m64 -O3 -ldl
SVT-AV1 OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.0 Encoder Mode: Preset 13 - Input: Bosphorus 4K Linux 6.9 27 Mar Linux 6.8 40 80 120 160 200 SE +/- 1.96, N = 15 SE +/- 1.80, N = 15 159.12 157.92 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OSPRay Studio OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 140 280 420 560 700 SE +/- 0.58, N = 3 SE +/- 3.28, N = 3 655 659
OpenVINO OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.0 Model: Noise Suppression Poconet-Like FP16 - Device: CPU Linux 6.9 27 Mar Linux 6.8 6 12 18 24 30 SE +/- 0.26, N = 3 SE +/- 0.34, N = 3 23.50 23.59 MIN: 9.59 / MAX: 65.24 MIN: 9.67 / MAX: 85.05 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
NAMD NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms Linux 6.9 27 Mar Linux 6.8 5 10 15 20 25 SE +/- 0.07, N = 3 SE +/- 0.03, N = 3 20.68 20.75
OSPRay Studio OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 4K 8K 12K 16K 20K SE +/- 48.89, N = 3 SE +/- 40.08, N = 3 20914 20984
TensorFlow OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.16.1 Device: CPU - Batch Size: 512 - Model: ResNet-50 Linux 6.9 27 Mar Linux 6.8 30 60 90 120 150 SE +/- 0.14, N = 3 SE +/- 0.10, N = 3 136.37 136.82
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 Linux 6.9 27 Mar Linux 6.8 170000M 340000M 510000M 680000M 850000M SE +/- 709551963.95, N = 3 SE +/- 372177165.42, N = 3 774761881287 777163987993 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OSPRay Studio OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 4K 8K 12K 16K 20K SE +/- 68.53, N = 3 SE +/- 30.37, N = 3 17877 17834
OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 4K 8K 12K 16K 20K SE +/- 64.98, N = 3 SE +/- 60.00, N = 3 17746 17788
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM Linux 6.9 27 Mar Linux 6.8 400000M 800000M 1200000M 1600000M 2000000M SE +/- 1466705411.41, N = 3 SE +/- 826670837.74, N = 3 1732048811713 1735984667803 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 Linux 6.9 27 Mar Linux 6.8 200000M 400000M 600000M 800000M 1000000M SE +/- 4778305950.15, N = 3 SE +/- 5401967811.92, N = 3 1093188749947 1095564664557 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 Linux 6.9 27 Mar Linux 6.8 20K 40K 60K 80K 100K SE +/- 73.99, N = 3 SE +/- 54.75, N = 3 97661.7 97844.2 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OSPRay Studio OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 120 240 360 480 600 SE +/- 1.15, N = 3 SE +/- 1.00, N = 3 559 558
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 Linux 6.9 27 Mar Linux 6.8 60000M 120000M 180000M 240000M 300000M SE +/- 91878369.01, N = 3 SE +/- 6021947.24, N = 3 283971383020 284349804093 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM Linux 6.9 27 Mar Linux 6.8 400000M 800000M 1200000M 1600000M 2000000M SE +/- 483814569.81, N = 3 SE +/- 5046453117.89, N = 3 2001406545920 2003432456830 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 Linux 6.9 27 Mar Linux 6.8 700K 1400K 2100K 2800K 3500K SE +/- 1390.93, N = 3 SE +/- 1592.51, N = 3 3230787.6 3232989.3 1. (CC) gcc options: -pthread -m64 -O3 -ldl
OSPRay Studio OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 120 240 360 480 600 SE +/- 1.53, N = 3 SE +/- 1.33, N = 3 563 563
Llamafile Mozilla's Llamafile allows distributing and running large language models (LLMs) as a single file. Llamafile aims to make open-source LLMs more accessible to developers and users. Llamafile supports a variety of models, CPUs and GPUs, and other options. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Tokens Per Second, More Is Better Llamafile 0.6 Test: llava-v1.5-7b-q4 - Acceleration: CPU Linux 6.9 27 Mar Linux 6.8 5 10 15 20 25 SE +/- 0.49, N = 15 SE +/- 0.42, N = 15 21.10 12.03
RocksDB OpenBenchmarking.org Op/s, More Is Better RocksDB 9.0 Test: Read While Writing Linux 6.9 27 Mar Linux 6.8 5M 10M 15M 20M 25M SE +/- 390827.28, N = 12 SE +/- 155882.11, N = 3 22418140 20665324 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
Linux 6.8 Processor: 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.8.0-060800-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113ePython Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 27 March 2024 16:44 by user phoronix.
Linux 6.9 27 Mar Processor: 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads), Motherboard: AMD Titanite_4G (RTI1007B BIOS), Chipset: AMD Device 14a4, Memory: 1520GB, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive, Graphics: ASPEED, Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.9.0-060900rc1daily20240327-generic (x86_64), Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -vProcessor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113ePython Notes: Python 3.11.6Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 28 March 2024 01:54 by user phoronix.