amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
r6a.4xlarge Processor: AMD EPYC 7R13 (8 Cores / 16 Threads), Motherboard: Amazon EC2 r6a.4xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 1 x 128 GB DDR4-3200MT/s, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Amazon Linux 2023.4.20240401, Kernel: 6.1.82-99.168.amzn2023.x86_64 (x86_64), Compiler: GCC 11.4.1 20230605, File-System: xfs, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
Processor Notes: CPU Microcode: 0xa0011d1
Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Intel Memory Latency Checker
Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. If the download fails, you may need to manually download the file from https://www.intel.com/content/www/us/en/developer/articles/tool/intelr-memory-latency-checker.html and place it in your PTS download cache. On some systems, root privileges are needed to run the MLC tester. Learn more via the OpenBenchmarking.org test page.
Intel Memory Latency Checker 3.10, Test: Idle Latency (ns, Fewer Is Better): r6a.4xlarge 123.2 (SE +/- 0.00, N = 3)
CacheBench, r6a.4xlarge (MB/s, More Is Better):
Test: Write: 51742.22 (SE +/- 13.75, N = 3, MIN: 45737.39 / MAX: 54502.64)
Test: Read / Modify / Write: 102080.81 (SE +/- 71.89, N = 3, MIN: 87920.86 / MAX: 108481.71)
(CC) gcc options: -O3 -lrt
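Each result in this report is given as a mean with "SE +/-" and "N". The report does not state how the Phoronix Test Suite derives SE, so the following is only a sketch under the conventional definition: sample standard deviation divided by the square root of the number of runs. The per-run values are hypothetical and were not taken from this report.

```python
import math
import statistics


def standard_error(samples):
    """Return the standard error of the mean: sample stdev / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate spread")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run values (NOT taken from the report).
hypothetical_runs = [51720.0, 51740.0, 51766.0]

mean = statistics.mean(hypothetical_runs)
se = standard_error(hypothetical_runs)
print(f"mean = {mean:.2f}, SE +/- {se:.2f}, N = {len(hypothetical_runs)}")
```

Under this definition, an SE that is small relative to the mean suggests the runs agreed closely with one another.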
GNU GMP GMPbench
GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.
r6a.4xlarge: The test run did not produce a result. E: GMPbench.app.pi(1000000) ./runbench: line 121: ./pi: No such file or directory
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1, r6a.4xlarge (More Is Better):
Algorithm: SHA256: 11165896017 byte/s (SE +/- 11087062.81, N = 3)
Algorithm: SHA512: 3755914720 byte/s (SE +/- 1360255.13, N = 3)
Algorithm: RSA4096: 2099.8 sign/s (SE +/- 0.59, N = 3)
Algorithm: RSA4096: 137205.3 verify/s (SE +/- 3.52, N = 3)
Algorithm: ChaCha20: 35343929890 byte/s (SE +/- 5373039.12, N = 3)
Algorithm: AES-128-GCM: 37945733573 byte/s (SE +/- 1654552.15, N = 3)
Algorithm: AES-256-GCM: 34776523753 byte/s (SE +/- 5090891.00, N = 3)
Algorithm: ChaCha20-Poly1305: 24046419150 byte/s (SE +/- 18263607.95, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
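The hash results above are throughput figures in byte/s. As a rough illustration of what such a measurement involves, here is a minimal sketch using Python's standard `hashlib`. It is not the "openssl speed" harness: it runs in one thread, the block size and data volume are arbitrary assumptions, and its numbers are not comparable to the figures in this report.

```python
import hashlib
import time


def measure_throughput(algorithm="sha256", block_size=16384,
                       total_bytes=64 * 1024 * 1024):
    """Hash total_bytes of zero-filled data in block_size chunks.

    Returns the observed rate in bytes per second.
    """
    block = bytes(block_size)           # zero-filled input buffer
    digest = hashlib.new(algorithm)
    blocks = total_bytes // block_size
    start = time.perf_counter()
    for _ in range(blocks):
        digest.update(block)
    elapsed = time.perf_counter() - start
    return (blocks * block_size) / elapsed


for name in ("sha256", "sha512"):
    rate = measure_throughput(name)
    print(f"{name}: {rate:,.0f} byte/s")
```

The function name and parameters are hypothetical. The point is only the shape of the calculation: bytes processed divided by elapsed wall-clock time.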
Botan
Botan is a BSD-licensed, cross-platform, open-source C++ crypto library ("cryptography toolkit") that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.
Botan 2.17.3, r6a.4xlarge (MiB/s, More Is Better):
Test: KASUMI: 85.66 (SE +/- 0.01, N = 3)
Test: KASUMI - Decrypt: 82.50 (SE +/- 0.00, N = 3)
Test: AES-256: 5582.96 (SE +/- 6.18, N = 3)
Test: AES-256 - Decrypt: 5583.55 (SE +/- 5.64, N = 3)
Test: Twofish: 318.03 (SE +/- 0.10, N = 3)
Test: Twofish - Decrypt: 323.04 (SE +/- 0.14, N = 3)
Test: Blowfish: 376.27 (SE +/- 0.07, N = 3)
Test: Blowfish - Decrypt: 377.43 (SE +/- 0.03, N = 3)
Test: CAST-256: 129.10 (SE +/- 0.00, N = 3)
Test: CAST-256 - Decrypt: 129.07 (SE +/- 0.01, N = 3)
Test: ChaCha20Poly1305: 731.35 (SE +/- 0.44, N = 3)
Test: ChaCha20Poly1305 - Decrypt: 714.73 (SE +/- 0.93, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
x264 2022-02-22, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better): r6a.4xlarge 114.66 (SE +/- 0.09, N = 3)
(CC) gcc options: -ldl -m64 -lm -lpthread -O3 -flto
x265 3.6, Video Input: Bosphorus 1080p (Frames Per Second, More Is Better): r6a.4xlarge 55.19 (SE +/- 0.26, N = 3)
(CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
GraphicsMagick 1.3.43, r6a.4xlarge (Iterations Per Minute, More Is Better):
Operation: Swirl: 109 (SE +/- 0.00, N = 3)
Operation: Rotate: 134 (SE +/- 0.33, N = 3)
Operation: Sharpen: 22 (SE +/- 0.00, N = 3)
Operation: Enhanced: 36 (SE +/- 0.00, N = 3)
Operation: Resizing: 177 (SE +/- 0.00, N = 3)
Operation: Noise-Gaussian: 53 (SE +/- 0.00, N = 3)
Operation: HWB Color Space: 178 (SE +/- 0.33, N = 3)
(CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
Smallpt
Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
Smallpt 1.0, Global Illumination Renderer; 128 Samples (Seconds, Fewer Is Better): r6a.4xlarge 11.27 (SE +/- 0.02, N = 3)
(CXX) g++ options: -fopenmp -O3
Stockfish
This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.
Stockfish 16.1, Chess Benchmark (Nodes Per Second, More Is Better): r6a.4xlarge 14950049 (SE +/- 200886.58, N = 3)
(CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
C-Ray
This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better): r6a.4xlarge 68.97 (SE +/- 0.05, N = 3)
(CC) gcc options: -lm -lpthread -O3
SciMark
This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
SciMark 2.0, r6a.4xlarge (Mflops, More Is Better):
Computational Test: Composite: 661.60 (SE +/- 1.31, N = 3)
Computational Test: Monte Carlo: 141.03 (SE +/- 0.39, N = 3)
Computational Test: Fast Fourier Transform: 355.55 (SE +/- 6.43, N = 3)
Computational Test: Sparse Matrix Multiply: 646.94 (SE +/- 0.48, N = 3)
Computational Test: Dense LU Matrix Factorization: 1139.34 (SE +/- 0.77, N = 3)
Computational Test: Jacobi Successive Over-Relaxation: 1025.13 (SE +/- 0.08, N = 3)
(CC) gcc options: -lm
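The Composite figure appears to be consistent with the arithmetic mean of the five kernel scores reported here. The report itself does not state how the composite is derived, so treat that relationship as an assumption. A quick check using only the values from this run:

```python
# Kernel scores in Mflops, copied from the SciMark 2.0 results in this report.
kernel_mflops = {
    "Monte Carlo": 141.03,
    "Fast Fourier Transform": 355.55,
    "Sparse Matrix Multiply": 646.94,
    "Dense LU Matrix Factorization": 1139.34,
    "Jacobi Successive Over-Relaxation": 1025.13,
}

# Assumption: the composite is the plain arithmetic mean of the kernels.
composite = sum(kernel_mflops.values()) / len(kernel_mflops)
print(f"mean of kernel scores = {composite:.2f} Mflops")  # prints 661.60
```

The mean matches the reported Composite of 661.60 Mflops to two decimal places.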
Renaissance
Renaissance is a suite of benchmarks designed to test the JVM, with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.
Test: Scala Dotty
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Random Forest
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: ALS Movie Lens
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark ALS
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark Bayes
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Savina Reactors.IO
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark PageRank
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Finagle HTTP Requests
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: In-Memory Database Shootout
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Akka Unbalanced Cobwebbed Tree
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Genetic Algorithm Using Jenetics + Futures
r6a.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
DaCapo Benchmark
This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
Java Test: Jython
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Eclipse
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: GraphChi
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Tradesoap
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Tradebeans
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Spring Boot
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Kafka
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Tomcat
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: jMonkeyEngine
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Cassandra
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Xalan XSLT
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Batik SVG Toolkit
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: H2 Database Engine
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: FOP Print Formatter
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: PMD Source Code Analyzer
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Lucene Search Index
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Lucene Search Engine
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Avrora AVR Simulation Framework
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: BioJava Biological Data Framework
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Zxing 1D/2D Barcode Image Processing
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: H2O In-Memory Platform For Machine Learning
r6a.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
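Every Renaissance and DaCapo test above failed with "java: command not found", which suggests that no `java` executable was on the PATH of the user running the tests. A minimal pre-flight sketch that checks for required executables before starting a run follows. The helper name and the list of required commands are hypothetical and are not part of the Phoronix Test Suite.

```python
import shutil


def missing_executables(names):
    """Return the subset of names that cannot be found on PATH."""
    return [name for name in names if shutil.which(name) is None]


# Assumption: both suites launch the JVM through a bare `java` command,
# as the error messages in this report suggest.
required = ["java"]

missing = missing_executables(required)
if missing:
    print("Missing from PATH:", ", ".join(missing))
else:
    print("All required executables found.")
```

Running a check like this before a long benchmark session may surface missing dependencies before the tests themselves fail.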
BlogBench
BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. It mimics the behavior of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of the generated blogs are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.
BlogBench 1.1, Test: Read (Final Score, More Is Better): r6a.4xlarge 3303161 (SE +/- 11890.55, N = 3)
(CC) gcc options: -O2
nginx
This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
Connections: 1
r6a.4xlarge: The test quit with a non-zero exit status.
nginx 1.23.2, r6a.4xlarge (Requests Per Second, More Is Better):
Connections: 20: 43467.35 (SE +/- 39.93, N = 3)
Connections: 100: 53527.79 (SE +/- 26.49, N = 3)
Connections: 200: 53546.88 (SE +/- 35.41, N = 3)
Connections: 500: 51769.47 (SE +/- 32.55, N = 3)
Connections: 1000: 49459.27 (SE +/- 34.11, N = 3)
Connections: 4000: 46860.10 (SE +/- 147.89, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
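To see how throughput changes with the number of concurrent connections, the nginx results can be expressed relative to the best-performing connection count. The sketch below uses only the Requests Per Second values reported above. The variable names are illustrative.

```python
# Requests per second by connection count, copied from the nginx results.
requests_per_second = {
    20: 43467.35,
    100: 53527.79,
    200: 53546.88,
    500: 51769.47,
    1000: 49459.27,
    4000: 46860.10,
}

# Connection count with the highest measured throughput.
peak_connections = max(requests_per_second, key=requests_per_second.get)
peak = requests_per_second[peak_connections]

for connections, rps in requests_per_second.items():
    share = 100 * rps / peak
    print(f"{connections:>5} connections: {rps:>9.2f} req/s ({share:5.1f}% of peak)")
```

In this run the highest throughput was measured at 200 connections, with the 100-connection result very close behind.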
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis, r6a.4xlarge (Ops/sec, More Is Better):
Clients: 50 - Set To Get Ratio: 1:5: 2346107.71 (SE +/- 4040.76, N = 3)
Clients: 50 - Set To Get Ratio: 5:1: 1998837.30 (SE +/- 5206.96, N = 3)
Clients: 100 - Set To Get Ratio: 1:1: 2081724.01 (SE +/- 4812.47, N = 3)
Clients: 100 - Set To Get Ratio: 1:5: 2248697.50 (SE +/- 16543.77, N = 3)
Clients: 100 - Set To Get Ratio: 5:1: 1961016.78 (SE +/- 4285.87, N = 3)
Clients: 50 - Set To Get Ratio: 10:1: 1922744.73 (SE +/- 9727.66, N = 3)
Clients: 50 - Set To Get Ratio: 1:10: 2199721.37 (SE +/- 11872.60, N = 3)
Clients: 500 - Set To Get Ratio: 1:1: 1900547.15 (SE +/- 20065.63, N = 5)
Clients: 500 - Set To Get Ratio: 1:5: 2047586.15 (SE +/- 12254.86, N = 3)
Clients: 500 - Set To Get Ratio: 5:1: 1751865.28 (SE +/- 11207.74, N = 3)
Clients: 100 - Set To Get Ratio: 10:1: 1892964.18 (SE +/- 5921.27, N = 3)
Clients: 100 - Set To Get Ratio: 1:10: 2096259.64 (SE +/- 18308.95, N = 3)
Clients: 500 - Set To Get Ratio: 10:1: 1747045.85 (SE +/- 8786.86, N = 3)
Clients: 500 - Set To Get Ratio: 1:10: 1977046.15 (SE +/- 11457.17, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Test: Mixed 1:1
r6a.4xlarge: The test run did not produce a result.
Test: Mixed 1:3
r6a.4xlarge: The test run did not produce a result.
VVenC
VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
VVenC 1.11, r6a.4xlarge (Frames Per Second, More Is Better):
Video Input: Bosphorus 4K - Video Preset: Fast: 3.777 (SE +/- 0.009, N = 3)
Video Input: Bosphorus 4K - Video Preset: Faster: 8.746 (SE +/- 0.003, N = 3)
Video Input: Bosphorus 1080p - Video Preset: Fast: 12.31 (SE +/- 0.01, N = 3)
Video Input: Bosphorus 1080p - Video Preset: Faster: 30.21 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
NCNN 20230517, Target: CPU, r6a.4xlarge (ms, Fewer Is Better):
Model: mobilenet-v2: 2.39 (SE +/- 0.00, N = 3, MIN: 2.33 / MAX: 3.3)
Model: mobilenet-v3: 2.05 (SE +/- 0.00, N = 3, MIN: 2.01 / MAX: 2.21)
Model: shufflenet-v2: 1.99 (SE +/- 0.00, N = 3, MIN: 1.94 / MAX: 2.17)
Model: mnasnet: 2.23 (SE +/- 0.00, N = 3, MIN: 2.18 / MAX: 2.4)
Model: efficientnet-b0: 4.53 (SE +/- 0.00, N = 3, MIN: 4.45 / MAX: 4.83)
Model: blazeface: 0.73 (SE +/- 0.00, N = 3, MIN: 0.71 / MAX: 0.88)
Model: googlenet: 7.71 (SE +/- 0.06, N = 3, MIN: 7.52 / MAX: 8.04)
Model: vgg16: 36.18 (SE +/- 0.02, N = 3, MIN: 35.78 / MAX: 41.48)
Model: resnet18: 5.75 (SE +/- 0.01, N = 3, MIN: 5.66 / MAX: 7.06)
Model: alexnet: 6.43 (SE +/- 0.01, N = 3, MIN: 6.32 / MAX: 8.67)
Model: resnet50: 12.68 (SE +/- 0.11, N = 3, MIN: 12.43 / MAX: 22.54)
Model: mobilenetv2-yolov3: 9.45 (SE +/- 0.10, N = 3, MIN: 9.22 / MAX: 21.29)
Model: yolov4-tiny: 15.02 (SE +/- 0.02, N = 3, MIN: 14.81 / MAX: 21.08)
Model: squeezenet_ssd: 5.87 (SE +/- 0.02, N = 3, MIN: 5.73 / MAX: 15.39)
Model: regnety_400m: 6.32 (SE +/- 0.01, N = 3, MIN: 6.15 / MAX: 6.69)
Model: vision_transformer: 96.68 (SE +/- 0.22, N = 3, MIN: 96.04 / MAX: 153.77)
Model: FastestDet: 2.56 (SE +/- 0.02, N = 3, MIN: 2.47 / MAX: 2.83)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN 20230517, Target: Vulkan GPU, r6a.4xlarge (ms, Fewer Is Better):
Model: mobilenet: 9.55 (SE +/- 0.09, N = 15, MIN: 9.19 / MAX: 12.54)
Model: mobilenet-v2: 2.38 (SE +/- 0.00, N = 15, MIN: 2.33 / MAX: 7.46)
Model: mobilenet-v3: 2.05 (SE +/- 0.00, N = 15, MIN: 2.01 / MAX: 2.37)
Model: shufflenet-v2: 2.01 (SE +/- 0.01, N = 15, MIN: 1.93 / MAX: 11.76)
Model: mnasnet: 2.24 (SE +/- 0.01, N = 15, MIN: 2.18 / MAX: 12.13)
Model: efficientnet-b0: 4.54 (SE +/- 0.01, N = 15, MIN: 4.45 / MAX: 23.67)
Model: blazeface: 0.73 (SE +/- 0.00, N = 15, MIN: 0.7 / MAX: 2.46)
Model: googlenet: 7.79 (SE +/- 0.07, N = 15, MIN: 7.5 / MAX: 18.87)
Model: vgg16: 36.26 (SE +/- 0.04, N = 15, MIN: 35.75 / MAX: 55.34)
Model: resnet18: 5.90 (SE +/- 0.07, N = 15, MIN: 5.66 / MAX: 15.58)
Model: alexnet: 6.43 (SE +/- 0.00, N = 15, MIN: 6.29 / MAX: 8.92)
Model: resnet50: 12.86 (SE +/- 0.12, N = 15, MIN: 12.43 / MAX: 16.4)
Model: mobilenetv2-yolov3: 9.55 (SE +/- 0.09, N = 15, MIN: 9.19 / MAX: 12.54)
Model: yolov4-tiny: 15.64 (SE +/- 0.15, N = 15, MIN: 14.84 / MAX: 25.45)
Model: squeezenet_ssd: 6.00 (SE +/- 0.06, N = 15, MIN: 5.73 / MAX: 24.98)
Model: regnety_400m: 6.33 (SE +/- 0.01, N = 15, MIN: 6.13 / MAX: 12.51)
Model: vision_transformer: 96.79 (SE +/- 0.06, N = 15, MIN: 95.96 / MAX: 174.71)
Model: FastestDet: 2.66 (SE +/- 0.03, N = 15, MIN: 2.48 / MAX: 4.91)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
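The NCNN results list most models under both the CPU and the Vulkan GPU targets. A small sketch to quantify how far the two targets differ per model, using only the mean latencies reported above, follows. The dictionary and variable names are illustrative, and only models that appear under both targets are compared.

```python
# Mean latency in ms per model, copied from the NCNN results in this report.
cpu_ms = {
    "mobilenet-v2": 2.39, "mobilenet-v3": 2.05, "shufflenet-v2": 1.99,
    "mnasnet": 2.23, "efficientnet-b0": 4.53, "blazeface": 0.73,
    "googlenet": 7.71, "vgg16": 36.18, "resnet18": 5.75, "alexnet": 6.43,
    "resnet50": 12.68, "mobilenetv2-yolov3": 9.45, "yolov4-tiny": 15.02,
    "squeezenet_ssd": 5.87, "regnety_400m": 6.32,
    "vision_transformer": 96.68, "FastestDet": 2.56,
}
vulkan_ms = {
    "mobilenet-v2": 2.38, "mobilenet-v3": 2.05, "shufflenet-v2": 2.01,
    "mnasnet": 2.24, "efficientnet-b0": 4.54, "blazeface": 0.73,
    "googlenet": 7.79, "vgg16": 36.26, "resnet18": 5.90, "alexnet": 6.43,
    "resnet50": 12.86, "mobilenetv2-yolov3": 9.55, "yolov4-tiny": 15.64,
    "squeezenet_ssd": 6.00, "regnety_400m": 6.33,
    "vision_transformer": 96.79, "FastestDet": 2.66,
}

# Absolute percentage difference of the Vulkan result relative to CPU.
diff_percent = {
    model: abs(vulkan_ms[model] - cpu_ms[model]) / cpu_ms[model] * 100
    for model in cpu_ms
    if model in vulkan_ms
}

worst_model = max(diff_percent, key=diff_percent.get)
worst_diff = diff_percent[worst_model]
print(f"largest difference: {worst_model} at {worst_diff:.1f}%")
```

For the models compared here, the two targets stay within a few percent of each other in this run.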
Testing initiated at 10 April 2024 05:54 by user root.