AMD EPYC 8534P 64-Core testing with an AMD Cinnabar (RCB1009C BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.
6.5.0-15-lowlatency
Processor: AMD EPYC 8534P 64-Core @ 2.30GHz (64 Cores / 128 Threads)
Motherboard: AMD Cinnabar (RCB1009C BIOS)
Chipset: AMD Device 14a4
Memory: 6 x 32GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
Graphics: ASPEED
Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10
Kernel: 6.5.0-15-lowlatency (x86_64)
Desktop: GNOME Shell
Display Server: X Server 1.21.1.7
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212
Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu1)
Python Notes: Python 3.11.5
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
6.5.0-15-generic
OS: Ubuntu 23.10
Kernel: 6.5.0-15-generic (x86_64)
Desktop: GNOME Shell
Display Server: X Server 1.21.1.7
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 1920x1200
PyTorch
This is a benchmark of PyTorch making use of pytorch-benchmark [https://github.com/LukasHedegaard/pytorch-benchmark]. Currently this test profile is catered to CPU-based testing. Learn more via the OpenBenchmarking.org test page.
PyTorch 2.1 (batches/sec, More Is Better)
Device: CPU - Batch Size: 1 - Model: ResNet-50; 6.5.0-15-lowlatency: 44.68 (SE +/- 0.12, N = 3, MIN: 20.29 / MAX: 45.41); 6.5.0-15-generic: 45.56 (SE +/- 0.11, N = 3, MIN: 14.65 / MAX: 46.41)
Device: CPU - Batch Size: 16 - Model: ResNet-50; 6.5.0-15-lowlatency: 36.15 (SE +/- 0.13, N = 3, MIN: 35.07 / MAX: 36.69); 6.5.0-15-generic: 36.71 (SE +/- 0.22, N = 3, MIN: 35.15 / MAX: 37.53)
Device: CPU - Batch Size: 32 - Model: ResNet-50; 6.5.0-15-lowlatency: 36.04 (SE +/- 0.05, N = 3, MIN: 21.12 / MAX: 36.53); 6.5.0-15-generic: 36.52 (SE +/- 0.06, N = 3, MIN: 14.8 / MAX: 37.07)
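To put the three ResNet-50 results side by side, the relative difference between the two kernels can be computed from the reported means. The following is a minimal sketch rather than anything the Phoronix Test Suite itself runs; the names `RESNET50_RESULTS` and `percent_change` are introduced here purely for illustration.

```python
# Mean batches/sec reported above for PyTorch 2.1 ResNet-50 on CPU,
# keyed by batch size: (6.5.0-15-lowlatency, 6.5.0-15-generic).
RESNET50_RESULTS = {
    1: (44.68, 45.56),
    16: (36.15, 36.71),
    32: (36.04, 36.52),
}


def percent_change(baseline: float, other: float) -> float:
    """Return the change of `other` relative to `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


if __name__ == "__main__":
    for batch_size, (lowlatency, generic) in RESNET50_RESULTS.items():
        delta = percent_change(lowlatency, generic)
        print(f"batch size {batch_size:>2}: generic {delta:+.2f}% vs. lowlatency")
```

Because the metric is "More Is Better", a positive percentage means the generic kernel delivered more batches per second than the lowlatency kernel.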
Stress-NG 0.16.04 (Bogo Ops/s, More Is Better)
Test: MMAP; 6.5.0-15-lowlatency: 704.82 (SE +/- 0.98, N = 3); 6.5.0-15-generic: 716.72 (SE +/- 0.69, N = 3)
Test: NUMA; 6.5.0-15-lowlatency: 636.53 (SE +/- 1.48, N = 3); 6.5.0-15-generic: 700.51 (SE +/- 0.89, N = 3)
Test: Pipe; 6.5.0-15-lowlatency: 18643347.91 (SE +/- 240322.92, N = 3); 6.5.0-15-generic: 20557017.78 (SE +/- 54256.67, N = 3)
Test: Poll; 6.5.0-15-lowlatency: 5648980.49 (SE +/- 1984.30, N = 3); 6.5.0-15-generic: 5722201.36 (SE +/- 3278.55, N = 3)
Test: Zlib; 6.5.0-15-lowlatency: 7378.54 (SE +/- 2.46, N = 3); 6.5.0-15-generic: 7405.07 (SE +/- 10.81, N = 3)
Test: Futex; 6.5.0-15-lowlatency: 3327937.38 (SE +/- 41561.40, N = 12); 6.5.0-15-generic: 3462526.41 (SE +/- 27733.99, N = 15)
Test: MEMFD; 6.5.0-15-lowlatency: 404.61 (SE +/- 0.36, N = 3); 6.5.0-15-generic: 397.78 (SE +/- 0.62, N = 3)
Test: Mutex; 6.5.0-15-lowlatency: 14551312.18 (SE +/- 52786.16, N = 3); 6.5.0-15-generic: 14926322.79 (SE +/- 98927.59, N = 3)
Test: Atomic; 6.5.0-15-lowlatency: 159.48 (SE +/- 0.32, N = 3); 6.5.0-15-generic: 159.74 (SE +/- 0.17, N = 3)
Test: Crypto; 6.5.0-15-lowlatency: 136909.80 (SE +/- 1011.54, N = 3); 6.5.0-15-generic: 138204.03 (SE +/- 71.78, N = 3)
Test: Malloc; 6.5.0-15-lowlatency: 67596590.54 (SE +/- 184967.23, N = 3); 6.5.0-15-generic: 67506235.12 (SE +/- 55877.51, N = 3)
Test: Cloning; 6.5.0-15-lowlatency: 9370.01 (SE +/- 19.32, N = 3); 6.5.0-15-generic: 9216.17 (SE +/- 4.93, N = 3)
Test: Forking; 6.5.0-15-lowlatency: 59764.27 (SE +/- 598.81, N = 3); 6.5.0-15-generic: 55199.93 (SE +/- 413.74, N = 3)
Test: Pthread; 6.5.0-15-lowlatency: 96546.53 (SE +/- 107.61, N = 3); 6.5.0-15-generic: 92150.11 (SE +/- 18.69, N = 3)
Test: AVL Tree; 6.5.0-15-lowlatency: 468.96 (SE +/- 0.35, N = 3); 6.5.0-15-generic: 472.28 (SE +/- 0.14, N = 3)
Test: IO_uring; 6.5.0-15-lowlatency: 3442148.86 (SE +/- 51499.17, N = 15); 6.5.0-15-generic: 3138743.68 (SE +/- 198534.21, N = 12)
Test: SENDFILE; 6.5.0-15-lowlatency: 521912.25 (SE +/- 4199.63, N = 15); 6.5.0-15-generic: 535491.60 (SE +/- 6422.27, N = 4)
Test: CPU Cache; 6.5.0-15-lowlatency: 1032723.13 (SE +/- 21496.63, N = 15); 6.5.0-15-generic: 1065746.51 (SE +/- 18757.84, N = 15)
Test: CPU Stress; 6.5.0-15-lowlatency: 132403.75 (SE +/- 26.52, N = 3); 6.5.0-15-generic: 132983.37 (SE +/- 26.95, N = 3)
Test: Semaphores; 6.5.0-15-lowlatency: 75089447.32 (SE +/- 277504.14, N = 3); 6.5.0-15-generic: 78155878.19 (SE +/- 613300.97, N = 3)
Test: Matrix Math; 6.5.0-15-lowlatency: 264943.59 (SE +/- 606.41, N = 3); 6.5.0-15-generic: 266118.69 (SE +/- 1059.88, N = 3)
Test: Vector Math; 6.5.0-15-lowlatency: 370337.00 (SE +/- 75.87, N = 3); 6.5.0-15-generic: 371488.55 (SE +/- 144.39, N = 3)
Test: AVX-512 VNNI; 6.5.0-15-lowlatency: 5486245.61 (SE +/- 671.99, N = 3); 6.5.0-15-generic: 5497771.73 (SE +/- 3126.14, N = 3)
Test: Function Call; 6.5.0-15-lowlatency: 44601.60 (SE +/- 14.17, N = 3); 6.5.0-15-generic: 44557.59 (SE +/- 201.97, N = 3)
Test: x86_64 RdRand; 6.5.0-15-lowlatency: 20511068.69 (SE +/- 529.91, N = 3); 6.5.0-15-generic: 20563515.77 (SE +/- 779.64, N = 3)
Test: Floating Point; 6.5.0-15-lowlatency: 18558.55 (SE +/- 28.95, N = 3); 6.5.0-15-generic: 18675.28 (SE +/- 1.33, N = 3)
Test: Matrix 3D Math; 6.5.0-15-lowlatency: 6040.88 (SE +/- 4.25, N = 3); 6.5.0-15-generic: 6049.34 (SE +/- 4.59, N = 3)
Test: Memory Copying; 6.5.0-15-lowlatency: 16427.96 (SE +/- 0.69, N = 3); 6.5.0-15-generic: 16467.86 (SE +/- 5.49, N = 3)
Test: Vector Shuffle; 6.5.0-15-lowlatency: 39928.75 (SE +/- 1.05, N = 3); 6.5.0-15-generic: 40037.38 (SE +/- 0.49, N = 3)
Test: Mixed Scheduler; 6.5.0-15-lowlatency: 32486.75 (SE +/- 34.60, N = 3); 6.5.0-15-generic: 31279.37 (SE +/- 167.10, N = 3)
Test: Socket Activity; 6.5.0-15-lowlatency: 13469.98 (SE +/- 103.72, N = 3); 6.5.0-15-generic: 29718.94 (SE +/- 106.01, N = 3)
Test: Wide Vector Math; 6.5.0-15-lowlatency: 1662923.70 (SE +/- 364.98, N = 3); 6.5.0-15-generic: 1668092.45 (SE +/- 151.71, N = 3)
Test: Context Switching; 6.5.0-15-lowlatency: 11475558.36 (SE +/- 26060.73, N = 3); 6.5.0-15-generic: 11325453.15 (SE +/- 13763.06, N = 3)
Test: Fused Multiply-Add; 6.5.0-15-lowlatency: 123555973.64 (SE +/- 236978.02, N = 3); 6.5.0-15-generic: 123709386.43 (SE +/- 720348.50, N = 3)
Test: Vector Floating Point; 6.5.0-15-lowlatency: 133716.35 (SE +/- 193.55, N = 3); 6.5.0-15-generic: 134898.53 (SE +/- 255.73, N = 3)
Test: Glibc C String Functions; 6.5.0-15-lowlatency: 56978621.35 (SE +/- 450311.20, N = 15); 6.5.0-15-generic: 55365663.36 (SE +/- 790042.39, N = 3)
Test: Glibc Qsort Data Sorting; 6.5.0-15-lowlatency: 1275.99 (SE +/- 1.18, N = 3); 6.5.0-15-generic: 1281.79 (SE +/- 0.29, N = 3)
Test: System V Message Passing; 6.5.0-15-lowlatency: 19340813.32 (SE +/- 26226.97, N = 3); 6.5.0-15-generic: 20507819.85 (SE +/- 24607.27, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
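Many of the Stress-NG gaps are small next to the reported standard errors, while a few (Socket Activity in particular) are far larger. One rough way to separate the two cases is to express the gap between the means in units of the combined standard error. This is only a heuristic sketch: with N = 3 runs it is not a formal significance test, and the function and constant names are hypothetical, chosen for this illustration.

```python
import math


def difference_in_standard_errors(mean_a: float, se_a: float,
                                  mean_b: float, se_b: float) -> float:
    """Gap (b - a) between two means divided by their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return (mean_b - mean_a) / combined_se


# (lowlatency mean, lowlatency SE, generic mean, generic SE), copied from
# the Stress-NG 0.16.04 results above (Bogo Ops/s, More Is Better).
SOCKET_ACTIVITY = (13469.98, 103.72, 29718.94, 106.01)
MALLOC = (67596590.54, 184967.23, 67506235.12, 55877.51)

if __name__ == "__main__":
    for name, row in (("Socket Activity", SOCKET_ACTIVITY), ("Malloc", MALLOC)):
        score = difference_in_standard_errors(*row)
        print(f"{name}: generic minus lowlatency = {score:+.2f} combined SEs")
```

A value well above 1 in magnitude, as for Socket Activity, suggests a gap that run-to-run variation alone is unlikely to explain; a value below 1, as for Malloc, suggests the two kernels are indistinguishable on that test.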
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1 (byte/s, More Is Better)
Algorithm: SHA256; 6.5.0-15-lowlatency: 78501524673 (SE +/- 42796422.99, N = 3); 6.5.0-15-generic: 78797021957 (SE +/- 30370906.26, N = 3)
Algorithm: SHA512; 6.5.0-15-lowlatency: 25463814910 (SE +/- 3067863.29, N = 3); 6.5.0-15-generic: 25551254463 (SE +/- 745050.04, N = 3)
Algorithm: ChaCha20; 6.5.0-15-lowlatency: 317443372413 (SE +/- 25916007.30, N = 3); 6.5.0-15-generic: 318200857393 (SE +/- 6879596.46, N = 3)
Algorithm: AES-128-GCM; 6.5.0-15-lowlatency: 555173773503 (SE +/- 875380897.88, N = 3); 6.5.0-15-generic: 557764215970 (SE +/- 591990057.48, N = 3)
Algorithm: AES-256-GCM; 6.5.0-15-lowlatency: 478096851310 (SE +/- 183215368.38, N = 3); 6.5.0-15-generic: 478996153957 (SE +/- 708194585.39, N = 3)
Algorithm: ChaCha20-Poly1305; 6.5.0-15-lowlatency: 218588896550 (SE +/- 47929208.90, N = 3); 6.5.0-15-generic: 219536681733 (SE +/- 42805654.09, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -ldl
Quicksilver
Quicksilver is a proxy application that represents some elements of the Mercury workload by solving a simplified dynamic Monte Carlo particle transport problem. Quicksilver is developed by Lawrence Livermore National Laboratory (LLNL) and this test profile currently makes use of the OpenMP CPU threaded code path. Learn more via the OpenBenchmarking.org test page.
Quicksilver 20230818 (Figure Of Merit, More Is Better)
Input: CTS2; 6.5.0-15-lowlatency: 16226667 (SE +/- 8819.17, N = 3); 6.5.0-15-generic: 16360000 (SE +/- 11547.01, N = 3)
Input: CORAL2 P1; 6.5.0-15-lowlatency: 21266667 (SE +/- 8819.17, N = 3); 6.5.0-15-generic: 21393333 (SE +/- 46308.15, N = 3)
Input: CORAL2 P2; 6.5.0-15-lowlatency: 16190000 (SE +/- 15275.25, N = 3); 6.5.0-15-generic: 16276667 (SE +/- 3333.33, N = 3)
1. (CXX) g++ options: -fopenmp -O3 -march=native
Embree
Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.
Embree 4.3 (Frames Per Second, More Is Better)
Binary: Pathtracer ISPC - Model: Crown; 6.5.0-15-lowlatency: 65.56 (SE +/- 0.11, N = 3, MIN: 64.4 / MAX: 67.95); 6.5.0-15-generic: 66.08 (SE +/- 0.08, N = 3, MIN: 64.99 / MAX: 68.67)
Binary: Pathtracer ISPC - Model: Asian Dragon; 6.5.0-15-lowlatency: 79.42 (SE +/- 0.06, N = 3, MIN: 78.74 / MAX: 80.49); 6.5.0-15-generic: 80.33 (SE +/- 0.11, N = 3, MIN: 79.52 / MAX: 81.9)
Binary: Pathtracer ISPC - Model: Asian Dragon Obj; 6.5.0-15-lowlatency: 68.15 (SE +/- 0.02, N = 3, MIN: 67.57 / MAX: 69); 6.5.0-15-generic: 68.73 (SE +/- 0.02, N = 3, MIN: 68.21 / MAX: 69.63)
SVT-AV1
This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-AV1 1.8 (Frames Per Second, More Is Better)
Encoder Mode: Preset 4 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 6.671 (SE +/- 0.029, N = 3); 6.5.0-15-generic: 6.765 (SE +/- 0.022, N = 3)
Encoder Mode: Preset 8 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 67.39 (SE +/- 0.25, N = 3); 6.5.0-15-generic: 67.31 (SE +/- 0.04, N = 3)
Encoder Mode: Preset 12 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 191.90 (SE +/- 0.85, N = 3); 6.5.0-15-generic: 196.02 (SE +/- 1.30, N = 3)
Encoder Mode: Preset 13 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 191.27 (SE +/- 1.84, N = 3); 6.5.0-15-generic: 194.79 (SE +/- 0.65, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-HEVC
This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.
SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
Tuning: 1 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 11.71 (SE +/- 0.02, N = 3); 6.5.0-15-generic: 11.80 (SE +/- 0.03, N = 3)
Tuning: 7 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 157.55 (SE +/- 0.41, N = 3); 6.5.0-15-generic: 160.18 (SE +/- 0.54, N = 3)
Tuning: 10 - Input: Bosphorus 4K; 6.5.0-15-lowlatency: 187.30 (SE +/- 1.29, N = 13); 6.5.0-15-generic: 193.48 (SE +/- 2.64, N = 3)
Tuning: 1 - Input: Bosphorus 1080p; 6.5.0-15-lowlatency: 43.04 (SE +/- 0.06, N = 3); 6.5.0-15-generic: 43.61 (SE +/- 0.06, N = 3)
Tuning: 7 - Input: Bosphorus 1080p; 6.5.0-15-lowlatency: 373.60 (SE +/- 0.36, N = 3); 6.5.0-15-generic: 381.04 (SE +/- 1.09, N = 3)
Tuning: 10 - Input: Bosphorus 1080p; 6.5.0-15-lowlatency: 605.07 (SE +/- 3.00, N = 3); 6.5.0-15-generic: 605.78 (SE +/- 4.86, N = 15)
1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
uvg266
uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.
uvg266 0.4.1 (Frames Per Second, More Is Better)
Video Input: Bosphorus 4K - Video Preset: Ultra Fast; 6.5.0-15-lowlatency: 53.06 (SE +/- 0.08, N = 3); 6.5.0-15-generic: 55.18 (SE +/- 0.20, N = 3)
VVenC
VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
VVenC 1.9 (Frames Per Second, More Is Better)
Video Input: Bosphorus 4K - Video Preset: Fast; 6.5.0-15-lowlatency: 6.270 (SE +/- 0.009, N = 3); 6.5.0-15-generic: 6.252 (SE +/- 0.036, N = 3)
Video Input: Bosphorus 4K - Video Preset: Faster; 6.5.0-15-lowlatency: 11.48 (SE +/- 0.03, N = 3); 6.5.0-15-generic: 11.56 (SE +/- 0.01, N = 3)
Video Input: Bosphorus 1080p - Video Preset: Fast; 6.5.0-15-lowlatency: 17.35 (SE +/- 0.02, N = 3); 6.5.0-15-generic: 17.42 (SE +/- 0.08, N = 3)
Video Input: Bosphorus 1080p - Video Preset: Faster; 6.5.0-15-lowlatency: 30.74 (SE +/- 0.07, N = 3); 6.5.0-15-generic: 30.84 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
x265
This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.
x265 3.4 (Frames Per Second, More Is Better)
Video Input: Bosphorus 4K; 6.5.0-15-lowlatency: 27.47 (SE +/- 0.07, N = 3); 6.5.0-15-generic: 27.54 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Xmrig
Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is set up to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.
Xmrig 6.21 (H/s, More Is Better)
Variant: Monero - Hash Count: 1M; 6.5.0-15-lowlatency: 20615.4 (SE +/- 12.05, N = 3); 6.5.0-15-generic: 20800.8 (SE +/- 11.59, N = 3)
Variant: Wownero - Hash Count: 1M; 6.5.0-15-lowlatency: 39311.8 (SE +/- 37.63, N = 3); 6.5.0-15-generic: 39793.8 (SE +/- 57.20, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
TensorFlow
This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.
TensorFlow 2.12 (images/sec, More Is Better)
Device: CPU - Batch Size: 1 - Model: ResNet-50; 6.5.0-15-lowlatency: 5.83 (SE +/- 0.01, N = 3); 6.5.0-15-generic: 5.91 (SE +/- 0.01, N = 3)
Device: CPU - Batch Size: 256 - Model: ResNet-50; 6.5.0-15-lowlatency: 99.66 (SE +/- 0.02, N = 3); 6.5.0-15-generic: 99.86 (SE +/- 0.02, N = 3)
7-Zip Compression 22.01 (MIPS, More Is Better)
Test: Decompression Rating; 6.5.0-15-lowlatency: 385445 (SE +/- 71.67, N = 3); 6.5.0-15-generic: 386375 (SE +/- 213.86, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
nginx
This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
nginx 1.23.2 (Requests Per Second, More Is Better)
Connections: 500; 6.5.0-15-lowlatency: 164282.90 (SE +/- 1947.02, N = 3); 6.5.0-15-generic: 164926.61 (SE +/- 1636.08, N = 3)
Connections: 1000; 6.5.0-15-lowlatency: 161643.39 (SE +/- 2020.85, N = 3); 6.5.0-15-generic: 161255.10 (SE +/- 562.98, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenSSL
OpenSSL 3.1 (sign/s, More Is Better)
Algorithm: RSA4096; 6.5.0-15-lowlatency: 27452.4 (SE +/- 12.12, N = 3); 6.5.0-15-generic: 27526.1 (SE +/- 11.43, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -ldl
PostgreSQL
This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.
PostgreSQL 16 (TPS, More Is Better)
Scaling Factor: 100 - Clients: 1000 - Mode: Read Only; 6.5.0-15-lowlatency: 2721155 (SE +/- 11767.66, N = 3); 6.5.0-15-generic: 2780491 (SE +/- 9375.92, N = 3)
Scaling Factor: 100 - Clients: 1000 - Mode: Read Write; 6.5.0-15-lowlatency: 59561 (SE +/- 161.98, N = 3); 6.5.0-15-generic: 59690 (SE +/- 139.98, N = 3)
Scaling Factor: 1000 - Clients: 1000 - Mode: Read Only; 6.5.0-15-lowlatency: 1423119 (SE +/- 6639.24, N = 3); 6.5.0-15-generic: 1424534 (SE +/- 6376.04, N = 3)
Scaling Factor: 1000 - Clients: 1000 - Mode: Read Write; 6.5.0-15-lowlatency: 36129 (SE +/- 166.48, N = 3); 6.5.0-15-generic: 33967 (SE +/- 166.45, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenSSL
OpenSSL 3.1 (verify/s, More Is Better)
Algorithm: RSA4096; 6.5.0-15-lowlatency: 923491.5 (SE +/- 75.90, N = 3); 6.5.0-15-generic: 927407.6 (SE +/- 118.84, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -ldl
NAMD
NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.
NAMD 2.14 (days/ns, Fewer Is Better)
ATPase Simulation - 327,506 Atoms; 6.5.0-15-lowlatency: 0.38321 (SE +/- 0.00028, N = 3); 6.5.0-15-generic: 0.38212 (SE +/- 0.00139, N = 3)
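The NAMD result is reported in days/ns, where fewer is better, which can be awkward to compare against the throughput-style metrics elsewhere in this result file. Taking the reciprocal gives simulated nanoseconds per day. The snippet below is a small sketch of that conversion; the helper name `days_per_ns_to_ns_per_day` is an assumption made for this example and is not part of NAMD or the Phoronix Test Suite.

```python
# Reported NAMD 2.14 ATPase results in days/ns (Fewer Is Better).
NAMD_DAYS_PER_NS = {
    "6.5.0-15-lowlatency": 0.38321,
    "6.5.0-15-generic": 0.38212,
}


def days_per_ns_to_ns_per_day(days_per_ns: float) -> float:
    """Convert wall-clock days per simulated ns into simulated ns per day."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


if __name__ == "__main__":
    for kernel, value in NAMD_DAYS_PER_NS.items():
        print(f"{kernel}: {days_per_ns_to_ns_per_day(value):.4f} ns/day")
```

After the conversion, a larger number is better, so the kernel with the lower days/ns figure shows the higher ns/day figure.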
OSPRay Studio
Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.
OSPRay Studio 0.13 (ms, Fewer Is Better)
Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 1919 (SE +/- 0.67, N = 3); 6.5.0-15-generic: 1909 (SE +/- 0.33, N = 3)
Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 1937 (SE +/- 0.33, N = 3); 6.5.0-15-generic: 1924 (SE +/- 1.86, N = 3)
Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 2276 (SE +/- 1.20, N = 3); 6.5.0-15-generic: 2264 (SE +/- 0.58, N = 3)
Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 37253 (SE +/- 165.13, N = 3); 6.5.0-15-generic: 37167 (SE +/- 73.06, N = 3)
Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 68016 (SE +/- 79.50, N = 3); 6.5.0-15-generic: 67599 (SE +/- 177.83, N = 3)
Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 37375 (SE +/- 91.85, N = 3); 6.5.0-15-generic: 37275 (SE +/- 250.73, N = 3)
Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 68417 (SE +/- 67.71, N = 3); 6.5.0-15-generic: 68047 (SE +/- 146.08, N = 3)
Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 42737 (SE +/- 168.90, N = 3); 6.5.0-15-generic: 42736 (SE +/- 298.50, N = 3)
Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU; 6.5.0-15-lowlatency: 79528 (SE +/- 130.78, N = 3); 6.5.0-15-generic: 78986 (SE +/- 56.83, N = 3)
PostgreSQL
This is a benchmark of PostgreSQL that uses the integrated pgbench tool to drive the database workloads. Learn more via the OpenBenchmarking.org test page.
PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better): 6.5.0-15-lowlatency 0.367 (SE +/- 0.002, N = 3); 6.5.0-15-generic 0.359 (SE +/- 0.001, N = 3). (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL 16, Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better): 6.5.0-15-lowlatency 16.79 (SE +/- 0.05, N = 3); 6.5.0-15-generic 16.75 (SE +/- 0.04, N = 3). (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL 16, Scaling Factor: 1000 - Clients: 1000 - Mode: Read Only - Average Latency (ms, Fewer Is Better): 6.5.0-15-lowlatency 0.703 (SE +/- 0.003, N = 3); 6.5.0-15-generic 0.702 (SE +/- 0.003, N = 3). (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL 16, Scaling Factor: 1000 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better): 6.5.0-15-lowlatency 27.68 (SE +/- 0.13, N = 3); 6.5.0-15-generic 29.44 (SE +/- 0.14, N = 3). (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
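The PostgreSQL results above mix gaps that sit well inside the reported standard error with one that does not. A minimal Python sketch of how such a pair could be checked follows: it computes the percentage gap of the generic kernel relative to the lowlatency kernel and divides the absolute gap by the combined standard error of the two means. The helper name `compare` and the cut-off of 2 are illustrative assumptions, not part of the Phoronix Test Suite, and with only N = 3 runs per result the ratio is a rough guide at best.

```python
import math


def compare(low, low_se, gen, gen_se):
    """Compare one lowlatency/generic result pair.

    Returns (percent_gap, noise_ratio): the gap of the generic result
    relative to the lowlatency result in percent, and the absolute gap
    divided by the combined standard error of the two means.
    """
    percent_gap = (gen - low) / low * 100.0
    combined_se = math.sqrt(low_se ** 2 + gen_se ** 2)
    noise_ratio = abs(gen - low) / combined_se
    return percent_gap, noise_ratio


# Average latency in ms (fewer is better), copied from the
# Scaling Factor: 1000 - Clients: 1000 results above.
pairs = {
    "Read Only": (0.703, 0.003, 0.702, 0.003),
    "Read Write": (27.68, 0.13, 29.44, 0.14),
}

for mode, values in pairs.items():
    gap, ratio = compare(*values)
    # A cut-off of 2 combined standard errors is an arbitrary illustrative choice.
    verdict = "larger than the noise" if ratio > 2 else "within the noise"
    print(f"{mode}: generic vs lowlatency {gap:+.2f}% ({verdict})")
```

Because these are fewer-is-better latencies, a positive gap means the generic kernel was slower in that test.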
DaCapo Benchmark 23.11, Java Test: H2O In-Memory Platform For Machine Learning (msec, Fewer Is Better): 6.5.0-15-lowlatency 4037 (SE +/- 37.89, N = 3); 6.5.0-15-generic 4036 (SE +/- 38.17, N = 3).
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equation along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 4.60799265 (SE +/- 0.04467137, N = 3); 6.5.0-15-generic 4.55911493 (SE +/- 0.01890146, N = 3). (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 20.56 (SE +/- 0.01, N = 3); 6.5.0-15-generic 20.57 (SE +/- 0.10, N = 3). (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenFOAM
OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.
OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 32.03; 6.5.0-15-generic 29.50. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM 10, Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 55.79; 6.5.0-15-generic 55.14. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM 10, Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 168.85; 6.5.0-15-generic 168.82. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM 10, Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 678.46; 6.5.0-15-generic 685.38. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenRadioss
OpenRadioss is an open-source, AGPL-licensed finite element solver for dynamic event analysis. OpenRadioss is based on Altair Radioss and was open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.
OpenRadioss 2023.09.15, Model: Cell Phone Drop Test (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 33.16 (SE +/- 0.14, N = 3); 6.5.0-15-generic 32.62 (SE +/- 0.05, N = 3).
Hackbench, Count: 32 - Type: Thread: no result on either kernel.
6.5.0-15-lowlatency: The test quit with a non-zero exit status. E: Creating fdpair (error: Too many open files)
6.5.0-15-generic: The test quit with a non-zero exit status. E: Creating fdpair (error: Too many open files)
Hackbench, Count: 16 - Type: Process (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 13.54 (SE +/- 0.05, N = 3); 6.5.0-15-generic 13.65 (SE +/- 0.09, N = 3). (CC) gcc options: -lpthread
Hackbench, Count: 32 - Type: Process (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 24.92 (SE +/- 0.05, N = 3); 6.5.0-15-generic 24.68 (SE +/- 0.04, N = 3). (CC) gcc options: -lpthread
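The "Too many open files" error reported for the Count: 32 - Type: Thread run is the message Linux gives when a process hits its open file descriptor limit (RLIMIT_NOFILE); in thread mode every descriptor pair the benchmark creates belongs to one process. The Python sketch below shows one way the soft limit could be inspected and raised before launching such a workload. The function names and the figure of 4096 descriptors are illustrative assumptions, not values taken from the test profile, and the new limit only applies to the calling process and the children it starts.

```python
import resource


def planned_soft_limit(soft, hard, required):
    """Return the soft limit to request so that `required` descriptors fit.

    `soft` and `hard` are the current RLIMIT_NOFILE values. Raises
    ValueError when the hard limit is too low for an unprivileged raise.
    """
    unlimited = resource.RLIM_INFINITY
    if soft == unlimited or soft >= required:
        return soft  # already high enough, leave untouched
    if hard != unlimited and hard < required:
        raise ValueError(
            f"hard limit {hard} is below the {required} descriptors needed"
        )
    return required


def raise_nofile_limit(required):
    """Raise the soft RLIMIT_NOFILE of this process to at least `required`."""
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    new_soft = planned_soft_limit(soft, hard, required)
    if new_soft != soft:
        resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))
    return new_soft


if __name__ == "__main__":
    # 4096 is a hypothetical target chosen only for illustration.
    try:
        print("soft limit now:", raise_nofile_limit(4096))
    except ValueError as exc:
        print("cannot raise limit:", exc)
```

From a shell, `ulimit -n` reports the same soft limit and can raise it for the current session up to the hard limit.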
Darktable
Darktable is an open-source photography / workflow application. This test will use any system-installed Darktable program or, on Windows, will automatically download the pre-built binary from the project. Learn more via the OpenBenchmarking.org test page.
Darktable 4.4.2, Test: Boat - Acceleration: CPU-only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 2.394 (SE +/- 0.012, N = 3); 6.5.0-15-generic 2.412 (SE +/- 0.014, N = 3).
Darktable 4.4.2, Test: Masskrug - Acceleration: CPU-only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 1.553 (SE +/- 0.007, N = 3); 6.5.0-15-generic 1.526 (SE +/- 0.005, N = 3).
Darktable 4.4.2, Test: Server Rack - Acceleration: CPU-only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 0.102 (SE +/- 0.001, N = 3); 6.5.0-15-generic 0.103 (SE +/- 0.001, N = 4).
Darktable 4.4.2, Test: Server Room - Acceleration: CPU-only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 1.850 (SE +/- 0.007, N = 3); 6.5.0-15-generic 1.850 (SE +/- 0.002, N = 3).
GIMP 2.10.34, Test: auto-levels (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 21.17 (SE +/- 0.08, N = 3); 6.5.0-15-generic 21.14 (SE +/- 0.07, N = 3).
GIMP 2.10.34, Test: unsharp-mask (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 26.63 (SE +/- 0.11, N = 3); 6.5.0-15-generic 26.65 (SE +/- 0.10, N = 3).
Blender
Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.
Blender 4.0, Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 26.66 (SE +/- 0.17, N = 3); 6.5.0-15-generic 26.42 (SE +/- 0.04, N = 3).
Blender 4.0, Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 66.34 (SE +/- 0.09, N = 3); 6.5.0-15-generic 65.88 (SE +/- 0.04, N = 3).
Blender 4.0, Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better): 6.5.0-15-lowlatency 238.86 (SE +/- 0.10, N = 3); 6.5.0-15-generic 236.99 (SE +/- 0.07, N = 3).
Geometric Mean Of All Test Results
Result Composite - Ubuntu Kernel Comparison Low-Latency Kernel (Geometric Mean, More Is Better): 6.5.0-15-lowlatency 333.22; 6.5.0-15-generic 336.81.
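A composite across tests with different units and directions needs each result put on a common footing first. The Python sketch below illustrates one common approach, offered as an assumption rather than as the method OpenBenchmarking.org uses: each test is reduced to a generic-over-lowlatency speedup ratio (inverted for fewer-is-better results), and the ratios are combined with a geometric mean. The function names are hypothetical, and the three sample results are a small subset of the 148 tests, so the output is not expected to reproduce the composite figures reported here.

```python
import math


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def speedup(low, gen, more_is_better):
    """Ratio above 1.0 means the generic kernel did better in this test."""
    return gen / low if more_is_better else low / gen


# (lowlatency, generic, more_is_better), values copied from results in this report.
results = [
    (923491.5, 927407.6, True),   # OpenSSL 3.1 RSA4096, verify/s
    (0.38321, 0.38212, False),    # NAMD 2.14 ATPase, days/ns
    (27.68, 29.44, False),        # PostgreSQL 16, SF 1000, Read Write latency, ms
]

ratios = [speedup(*r) for r in results]
print(f"generic vs lowlatency, geometric mean of ratios: {geometric_mean(ratios):.4f}")
```

The geometric mean is used instead of the arithmetic mean so that a 2x gain in one test and a 2x loss in another cancel out to exactly 1.0.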
Number Of First Place Finishes (Wins - 148 Tests): 6.5.0-15-generic 114 [77.0%]; 6.5.0-15-lowlatency 34 [23.0%].
Number Of Last Place Finishes (Losses - 148 Tests): 6.5.0-15-lowlatency 115 [77.7%]; 6.5.0-15-generic 33 [22.3%].
6.5.0-15-lowlatency
Processor: AMD EPYC 8534P 64-Core @ 2.30GHz (64 Cores / 128 Threads), Motherboard: AMD Cinnabar (RCB1009C BIOS), Chipset: AMD Device 14a4, Memory: 6 x 32GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.5.0-15-lowlatency (x86_64), Desktop: GNOME Shell, Display Server: X Server 1.21.1.7, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212
Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu1)
Python Notes: Python 3.11.5
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 29 January 2024 02:38 by user phoronix.
6.5.0-15-generic
Processor: AMD EPYC 8534P 64-Core @ 2.30GHz (64 Cores / 128 Threads), Motherboard: AMD Cinnabar (RCB1009C BIOS), Chipset: AMD Device 14a4, Memory: 6 x 32GB DRAM-4800MT/s Samsung M321R4GA0BB0-CQKMG, Disk: 3201GB Micron_7450_MTFDKCB3T2TFS, Graphics: ASPEED, Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 23.10, Kernel: 6.5.0-15-generic (x86_64), Desktop: GNOME Shell, Display Server: X Server 1.21.1.7, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 1920x1200
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-FTCNCZ/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00212
Java Notes: OpenJDK Runtime Environment (build 11.0.20+8-post-Ubuntu-1ubuntu1)
Python Notes: Python 3.11.5
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 29 January 2024 11:19 by user phoronix.