amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
r8g.4xlarge Processor: ARMv8 Neoverse-V2 (16 Cores), Motherboard: Amazon EC2 r8g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Amazon Linux 2023.4.20240401, Kernel: 6.1.82-99.168.amzn2023.aarch64 (aarch64) 20240325, Compiler: GCC 11.4.1 20230605, File-System: xfs, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=aarch64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch=armv8.2-a+crypto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=neoverse-n1Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite r8g.4xlarge 130 260 390 520 650 SE +/- 0.23, N = 3 606.65 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo r8g.4xlarge 30 60 90 120 150 SE +/- 0.12, N = 3 135.35 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform r8g.4xlarge 80 160 240 320 400 SE +/- 1.23, N = 3 386.09 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply r8g.4xlarge 140 280 420 560 700 SE +/- 0.18, N = 3 630.76 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization r8g.4xlarge 160 320 480 640 800 SE +/- 0.29, N = 3 759.55 1. (CC) gcc options: -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation r8g.4xlarge 200 400 600 800 1000 SE +/- 0.20, N = 3 1121.54 1. (CC) gcc options: -lm
DaCapo Benchmark This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance of various popular real-world Java workloads. Learn more via the OpenBenchmarking.org test page.
Java Test: Jython
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Eclipse
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: GraphChi
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Tradesoap
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Tradebeans
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Spring Boot
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Kafka
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Tomcat
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: jMonkeyEngine
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Cassandra
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Xalan XSLT
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Batik SVG Toolkit
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: H2 Database Engine
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: FOP Print Formatter
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: PMD Source Code Analyzer
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Lucene Search Index
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Apache Lucene Search Engine
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Avrora AVR Simulation Framework
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: BioJava Biological Data Framework
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: Zxing 1D/2D Barcode Image Processing
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Java Test: H2O In-Memory Platform For Machine Learning
r8g.4xlarge: The test quit with a non-zero exit status. E: dacapobench: line 2: java: command not found
Renaissance Renaissance is a suite of benchmarks designed to test the Java JVM from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.
Test: Scala Dotty
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Random Forest
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: ALS Movie Lens
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark ALS
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark Bayes
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Savina Reactors.IO
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Apache Spark PageRank
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Finagle HTTP Requests
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: In-Memory Database Shootout
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Akka Unbalanced Cobwebbed Tree
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Test: Genetic Algorithm Using Jenetics + Futures
r8g.4xlarge: The test quit with a non-zero exit status. E: renaissance: line 2: java: command not found
Botan Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI r8g.4xlarge 20 40 60 80 100 SE +/- 0.01, N = 3 81.17 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt r8g.4xlarge 20 40 60 80 100 SE +/- 0.00, N = 3 81.70 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 r8g.4xlarge 1500 3000 4500 6000 7500 SE +/- 3.76, N = 3 6831.76 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt r8g.4xlarge 1500 3000 4500 6000 7500 SE +/- 4.49, N = 3 6841.95 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish r8g.4xlarge 60 120 180 240 300 SE +/- 0.01, N = 3 279.77 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt r8g.4xlarge 60 120 180 240 300 SE +/- 0.05, N = 3 282.36 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish r8g.4xlarge 70 140 210 280 350 SE +/- 0.06, N = 3 324.59 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt r8g.4xlarge 70 140 210 280 350 SE +/- 0.18, N = 3 322.63 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 r8g.4xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 121.38 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt r8g.4xlarge 30 60 90 120 150 SE +/- 0.02, N = 3 121.79 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 r8g.4xlarge 90 180 270 360 450 SE +/- 0.15, N = 3 423.73 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt r8g.4xlarge 90 180 270 360 450 SE +/- 0.65, N = 3 415.62 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write r8g.4xlarge 9K 18K 27K 36K 45K SE +/- 352.60, N = 8 39949.98 MIN: 32064.74 / MAX: 42409.68 1. (CC) gcc options: -O3 -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 575.43, N = 3 81049.39 MIN: 67937.13 / MAX: 84809.36 1. (CC) gcc options: -O3 -lrt
libjpeg-turbo tjbench tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.
Test: Decompression Throughput
r8g.4xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 r8g.4xlarge 0.3443 0.6886 1.0329 1.3772 1.7215 SE +/- 0.00, N = 3 1.53 MIN: 1.47 / MAX: 1.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 r8g.4xlarge 0.2903 0.5806 0.8709 1.1612 1.4515 SE +/- 0.00, N = 3 1.29 MIN: 1.25 / MAX: 1.36 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 r8g.4xlarge 0.261 0.522 0.783 1.044 1.305 SE +/- 0.01, N = 3 1.16 MIN: 1.13 / MAX: 1.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet r8g.4xlarge 0.306 0.612 0.918 1.224 1.53 SE +/- 0.00, N = 3 1.36 MIN: 1.34 / MAX: 1.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 r8g.4xlarge 0.486 0.972 1.458 1.944 2.43 SE +/- 0.01, N = 3 2.16 MIN: 2.12 / MAX: 2.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface r8g.4xlarge 0.2205 0.441 0.6615 0.882 1.1025 SE +/- 0.11, N = 3 0.98 MIN: 0.85 / MAX: 30.7 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet r8g.4xlarge 0.8145 1.629 2.4435 3.258 4.0725 SE +/- 0.00, N = 3 3.62 MIN: 3.57 / MAX: 3.87 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 r8g.4xlarge 2 4 6 8 10 SE +/- 0.04, N = 3 8.54 MIN: 8.39 / MAX: 13.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 r8g.4xlarge 0.4658 0.9316 1.3974 1.8632 2.329 SE +/- 0.00, N = 3 2.07 MIN: 2.03 / MAX: 2.16 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet r8g.4xlarge 0.4995 0.999 1.4985 1.998 2.4975 SE +/- 0.00, N = 3 2.22 MIN: 2.19 / MAX: 2.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 r8g.4xlarge 1.224 2.448 3.672 4.896 6.12 SE +/- 0.12, N = 3 5.44 MIN: 5.25 / MAX: 50.84 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r8g.4xlarge 1.2443 2.4886 3.7329 4.9772 6.2215 SE +/- 0.03, N = 3 5.53 MIN: 5.41 / MAX: 28.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny r8g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 7.65 MIN: 7.58 / MAX: 8.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd r8g.4xlarge 0.855 1.71 2.565 3.42 4.275 SE +/- 0.01, N = 3 3.80 MIN: 3.73 / MAX: 4.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m r8g.4xlarge 1.0193 2.0386 3.0579 4.0772 5.0965 SE +/- 0.07, N = 3 4.53 MIN: 4.41 / MAX: 4.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer r8g.4xlarge 16 32 48 64 80 SE +/- 1.11, N = 3 73.55 MIN: 71.66 / MAX: 108.23 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet r8g.4xlarge 0.3533 0.7066 1.0599 1.4132 1.7665 SE +/- 0.05, N = 3 1.57 MIN: 1.48 / MAX: 32.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet r8g.4xlarge 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 5.52 MIN: 5.43 / MAX: 6.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 r8g.4xlarge 0.3443 0.6886 1.0329 1.3772 1.7215 SE +/- 0.00, N = 3 1.53 MIN: 1.48 / MAX: 1.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 r8g.4xlarge 0.2925 0.585 0.8775 1.17 1.4625 SE +/- 0.00, N = 3 1.30 MIN: 1.25 / MAX: 1.69 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 r8g.4xlarge 0.261 0.522 0.783 1.044 1.305 SE +/- 0.00, N = 3 1.16 MIN: 1.13 / MAX: 1.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet r8g.4xlarge 0.306 0.612 0.918 1.224 1.53 SE +/- 0.00, N = 3 1.36 MIN: 1.33 / MAX: 1.61 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 r8g.4xlarge 0.513 1.026 1.539 2.052 2.565 SE +/- 0.11, N = 3 2.28 MIN: 2.13 / MAX: 56.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface r8g.4xlarge 0.1958 0.3916 0.5874 0.7832 0.979 SE +/- 0.01, N = 3 0.87 MIN: 0.86 / MAX: 0.92 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet r8g.4xlarge 0.8168 1.6336 2.4504 3.2672 4.084 SE +/- 0.00, N = 3 3.63 MIN: 3.58 / MAX: 3.76 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 r8g.4xlarge 2 4 6 8 10 SE +/- 0.00, N = 3 8.5 MIN: 8.44 / MAX: 8.81 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 r8g.4xlarge 0.4658 0.9316 1.3974 1.8632 2.329 SE +/- 0.00, N = 3 2.07 MIN: 2.04 / MAX: 2.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet r8g.4xlarge 0.513 1.026 1.539 2.052 2.565 SE +/- 0.05, N = 3 2.28 MIN: 2.2 / MAX: 15.93 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 r8g.4xlarge 1.2263 2.4526 3.6789 4.9052 6.1315 SE +/- 0.13, N = 3 5.45 MIN: 5.26 / MAX: 60.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r8g.4xlarge 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 5.52 MIN: 5.43 / MAX: 6.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny r8g.4xlarge 2 4 6 8 10 SE +/- 0.04, N = 3 7.69 MIN: 7.53 / MAX: 23.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd r8g.4xlarge 0.8573 1.7146 2.5719 3.4292 4.2865 SE +/- 0.01, N = 3 3.81 MIN: 3.73 / MAX: 3.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m r8g.4xlarge 1.0283 2.0566 3.0849 4.1132 5.1415 SE +/- 0.02, N = 3 4.57 MIN: 4.49 / MAX: 4.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer r8g.4xlarge 16 32 48 64 80 SE +/- 0.12, N = 3 71.99 MIN: 71.58 / MAX: 97.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet r8g.4xlarge 0.342 0.684 1.026 1.368 1.71 SE +/- 0.00, N = 3 1.52 MIN: 1.5 / MAX: 1.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 1024 CPU threads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark r8g.4xlarge 4M 8M 12M 16M 20M SE +/- 148055.91, N = 15 18498182 1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Decompression Speed r8g.4xlarge 300 600 900 1200 1500 SE +/- 4.96, N = 3 1407.3 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed r8g.4xlarge 300 600 900 1200 1500 SE +/- 5.89, N = 3 1527.6 1. (CC) gcc options: -O3 -pthread -lz -llzma
OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed r8g.4xlarge 300 600 900 1200 1500 SE +/- 6.93, N = 3 1270.1 1. (CC) gcc options: -O3 -pthread -lz -llzma
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Swirl r8g.4xlarge 40 80 120 160 200 SE +/- 0.33, N = 3 181 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Rotate r8g.4xlarge 50 100 150 200 250 SE +/- 0.33, N = 3 231 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Sharpen r8g.4xlarge 20 40 60 80 100 SE +/- 0.00, N = 3 81 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Enhanced r8g.4xlarge 15 30 45 60 75 SE +/- 0.00, N = 3 68 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Resizing r8g.4xlarge 80 160 240 320 400 SE +/- 0.67, N = 3 379 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Noise-Gaussian r8g.4xlarge 20 40 60 80 100 SE +/- 0.00, N = 3 91 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: HWB Color Space r8g.4xlarge 80 160 240 320 400 SE +/- 1.53, N = 3 353 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lzstd -llzma -lz -lm -lpthread -lgomp
OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 1080p r8g.4xlarge 30 60 90 120 150 SE +/- 0.08, N = 3 141.03 1. (CC) gcc options: -ldl -lm -lpthread -O3 -flto
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 1080p r8g.4xlarge 14 28 42 56 70 SE +/- 0.04, N = 3 60.93 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel r8g.4xlarge 7 14 21 28 35 SE +/- 0.01, N = 3 28.44 1. (CC) gcc options: -lm -lpthread -O3
VVenC VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast r8g.4xlarge 0.4194 0.8388 1.2582 1.6776 2.097 SE +/- 0.001, N = 3 1.864 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster r8g.4xlarge 1.0071 2.0142 3.0213 4.0284 5.0355 SE +/- 0.004, N = 3 4.476 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast r8g.4xlarge 2 4 6 8 10 SE +/- 0.003, N = 3 6.191 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster r8g.4xlarge 4 8 12 16 20 SE +/- 0.01, N = 3 14.38 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Smallpt Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples r8g.4xlarge 2 4 6 8 10 SE +/- 0.011, N = 3 6.487 1. (CXX) g++ options: -fopenmp -O3
BlogBench BlogBench is designed to replicate the load of a real-world busy file server by stressing the file-system with multiple threads of random reads, writes, and rewrites. The behavior is mimicked of that of a blog by creating blogs with content and pictures, modifying blog posts, adding comments to these blogs, and then reading the content of the blogs. All of these blogs generated are created locally with fake content and pictures. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read r8g.4xlarge 800K 1600K 2400K 3200K 4000K SE +/- 104173.89, N = 9 3940767 1. (CC) gcc options: -O2
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
Connections: 1
r8g.4xlarge: The test quit with a non-zero exit status.
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 20 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 141.71, N = 3 83542.50 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 100 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 28.11, N = 3 100860.57 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 200 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 179.14, N = 3 102498.57 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 34.72, N = 3 100329.07 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 32.96, N = 3 96708.89 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 4000 r8g.4xlarge 20K 40K 60K 80K 100K SE +/- 338.53, N = 3 79039.29 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 r8g.4xlarge 3000M 6000M 9000M 12000M 15000M SE +/- 23021075.87, N = 3 16105222607 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 r8g.4xlarge 2000M 4000M 6000M 8000M 10000M SE +/- 2354314.58, N = 3 9299066767 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r8g.4xlarge 600 1200 1800 2400 3000 SE +/- 0.06, N = 3 2870.6 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r8g.4xlarge 40K 80K 120K 160K 200K SE +/- 74.23, N = 3 199253.5 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 r8g.4xlarge 6000M 12000M 18000M 24000M 30000M SE +/- 1063598.05, N = 3 27604832893 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM r8g.4xlarge 16000M 32000M 48000M 64000M 80000M SE +/- 2540575.13, N = 3 76402437783 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM r8g.4xlarge 14000M 28000M 42000M 56000M 70000M SE +/- 2261581.12, N = 3 67679553673 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 r8g.4xlarge 4000M 8000M 12000M 16000M 20000M SE +/- 1518672.55, N = 3 20049221860 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 r8g.4xlarge 600K 1200K 1800K 2400K 3000K SE +/- 32100.86, N = 3 2630419.35 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 25933.12, N = 3 2228193.21 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 14254.25, N = 3 2218605.41 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 15891.92, N = 3 2412926.43 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 r8g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 5989.17, N = 3 2096293.46 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 13878.60, N = 3 2141589.45 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 r8g.4xlarge 600K 1200K 1800K 2400K 3000K SE +/- 6891.34, N = 3 2570209.93 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 141710.45, N = 15 2551128.06 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 r8g.4xlarge 600K 1200K 1800K 2400K 3000K SE +/- 70089.52, N = 15 2644681.95 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 117370.27, N = 12 2413706.65 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 r8g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 19732.75, N = 3 1977320.29 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 8994.96, N = 3 2381517.13 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 110152.98, N = 12 2410412.48 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 r8g.4xlarge 500K 1000K 1500K 2000K 2500K SE +/- 16057.89, N = 3 2468036.02 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Test: Mixed 1:1
r8g.4xlarge: The test run did not produce a result.
Test: Mixed 1:3
r8g.4xlarge: The test run did not produce a result.
r8g.4xlarge Processor: ARMv8 Neoverse-V2 (16 Cores), Motherboard: Amazon EC2 r8g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Amazon Linux 2023.4.20240401, Kernel: 6.1.82-99.168.amzn2023.aarch64 (aarch64) 20240325, Compiler: GCC 11.4.1 20230605, File-System: xfs, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madviseCompiler Notes: --build=aarch64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch=armv8.2-a+crypto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=neoverse-n1Security Notes: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 11 April 2024 02:55 by user root.