r7a.4xlarge amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite. r7a.4xlarge: Processor: AMD EPYC 9R14 (16 Cores), Motherboard: Amazon EC2 r7a.4xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 1 x 128 GB DDR5-4800MT/s, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Amazon Linux 2023.4.20240401, Kernel: 6.1.82-99.168.amzn2023.x86_64 (x86_64), Compiler: GCC 11.4.1 20230605, File-System: xfs, System Layer: amazon Stream 2013-01-17 Type: Copy MB/s > Higher Is Better r7a.4xlarge . 107123.7 |======================================================= Stream 2013-01-17 Type: Scale MB/s > Higher Is Better r7a.4xlarge . 76463.4 |======================================================== Stream 2013-01-17 Type: Triad MB/s > Higher Is Better r7a.4xlarge . 80223.5 |======================================================== Stream 2013-01-17 Type: Add MB/s > Higher Is Better r7a.4xlarge . 80140.0 |======================================================== Intel Memory Latency Checker 3.10 Test: Idle Latency ns < Lower Is Better r7a.4xlarge . 148.2 |========================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - All Reads MB/s > Higher Is Better r7a.4xlarge . 78588.43 |======================================================= Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 3:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 107844.61 |====================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 2:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 114805.61 |====================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 1:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 129618.30 |====================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - Stream-Triad Like MB/s > Higher Is Better r7a.4xlarge . 100462.96 |====================================================== Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - All Reads MB/s > Higher Is Better r7a.4xlarge . 78570.5 |======================================================== Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 3:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 107868.9 |======================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 2:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 114801.6 |======================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 1:1 Reads-Writes MB/s > Higher Is Better r7a.4xlarge . 129565.9 |======================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - Stream-Triad Like MB/s > Higher Is Better r7a.4xlarge . 100295.7 |======================================================= CacheBench Test: Read MB/s > Higher Is Better r7a.4xlarge . 9396.35 |======================================================== CacheBench Test: Write MB/s > Higher Is Better r7a.4xlarge . 54465.15 |======================================================= CacheBench Test: Read / Modify / Write MB/s > Higher Is Better r7a.4xlarge . 107392.72 |====================================================== GNU GMP GMPbench 6.2.1 GMPbench Score > Higher Is Better Zstd Compression 1.5.4 Compression Level: 3 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 2744.0 |========================================================= Zstd Compression 1.5.4 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1489.9 |========================================================= Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 671.6 |========================================================== Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1627.9 |========================================================= Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 208.1 |========================================================== Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1670.4 |========================================================= Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 13.8 |=========================================================== Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1414.4 |========================================================= Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 1034.7 |========================================================= Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1515.3 |========================================================= Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 721.9 |========================================================== Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1637.9 |========================================================= Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 7.03 |=========================================================== Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 1323.6 |========================================================= LZ4 Compression 1.9.4 Compression Level: 1 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 700.93 |========================================================= LZ4 Compression 1.9.4 Compression Level: 1 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 4496.2 |========================================================= LZ4 Compression 1.9.4 Compression Level: 3 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 108.67 |========================================================= LZ4 Compression 1.9.4 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 4068.1 |========================================================= LZ4 Compression 1.9.4 Compression Level: 9 - Compression Speed MB/s > Higher Is Better r7a.4xlarge . 35.03 |========================================================== LZ4 Compression 1.9.4 Compression Level: 9 - Decompression Speed MB/s > Higher Is Better r7a.4xlarge . 4254.9 |========================================================= OpenSSL 3.1 Algorithm: SHA256 byte/s > Higher Is Better r7a.4xlarge . 16675650953 |==================================================== OpenSSL 3.1 Algorithm: SHA512 byte/s > Higher Is Better r7a.4xlarge . 7794115687 |===================================================== OpenSSL 3.1 Algorithm: RSA4096 sign/s > Higher Is Better r7a.4xlarge . 11167.7 |======================================================== OpenSSL 3.1 Algorithm: RSA4096 verify/s > Higher Is Better r7a.4xlarge . 268824.1 |======================================================= OpenSSL 3.1 Algorithm: ChaCha20 byte/s > Higher Is Better r7a.4xlarge . 85397552697 |==================================================== OpenSSL 3.1 Algorithm: AES-128-GCM byte/s > Higher Is Better r7a.4xlarge . 185952123653 |=================================================== OpenSSL 3.1 Algorithm: AES-256-GCM byte/s > Higher Is Better r7a.4xlarge . 160337853370 |=================================================== OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 byte/s > Higher Is Better r7a.4xlarge . 60378824160 |==================================================== Botan 2.17.3 Test: KASUMI MiB/s > Higher Is Better r7a.4xlarge . 88.14 |========================================================== Botan 2.17.3 Test: KASUMI - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 85.04 |========================================================== Botan 2.17.3 Test: AES-256 MiB/s > Higher Is Better r7a.4xlarge . 5875.91 |======================================================== Botan 2.17.3 Test: AES-256 - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 5872.76 |======================================================== Botan 2.17.3 Test: Twofish MiB/s > Higher Is Better r7a.4xlarge . 327.22 |========================================================= Botan 2.17.3 Test: Twofish - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 347.04 |========================================================= Botan 2.17.3 Test: Blowfish MiB/s > Higher Is Better r7a.4xlarge . 390.85 |========================================================= Botan 2.17.3 Test: Blowfish - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 389.57 |========================================================= Botan 2.17.3 Test: CAST-256 MiB/s > Higher Is Better r7a.4xlarge . 133.04 |========================================================= Botan 2.17.3 Test: CAST-256 - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 133.05 |========================================================= Botan 2.17.3 Test: ChaCha20Poly1305 MiB/s > Higher Is Better r7a.4xlarge . 735.85 |========================================================= Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt MiB/s > Higher Is Better r7a.4xlarge . 723.31 |========================================================= x264 2022-02-22 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better r7a.4xlarge . 42.43 |========================================================== x264 2022-02-22 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better r7a.4xlarge . 174.34 |========================================================= x265 3.6 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better r7a.4xlarge . 15.00 |========================================================== x265 3.6 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better r7a.4xlarge . 80.81 |========================================================== PyPerformance 1.0.0 Benchmark: go Milliseconds < Lower Is Better r7a.4xlarge . 230 |============================================================ PyPerformance 1.0.0 Benchmark: 2to3 Milliseconds < Lower Is Better r7a.4xlarge . 282 |============================================================ PyPerformance 1.0.0 Benchmark: chaos Milliseconds < Lower Is Better r7a.4xlarge . 98.9 |=========================================================== PyPerformance 1.0.0 Benchmark: float Milliseconds < Lower Is Better r7a.4xlarge . 106 |============================================================ PyPerformance 1.0.0 Benchmark: nbody Milliseconds < Lower Is Better r7a.4xlarge . 126 |============================================================ PyPerformance 1.0.0 Benchmark: pathlib Milliseconds < Lower Is Better r7a.4xlarge . 22.1 |=========================================================== PyPerformance 1.0.0 Benchmark: raytrace Milliseconds < Lower Is Better r7a.4xlarge . 456 |============================================================ PyPerformance 1.0.0 Benchmark: json_loads Milliseconds < Lower Is Better r7a.4xlarge . 18.2 |=========================================================== PyPerformance 1.0.0 Benchmark: crypto_pyaes Milliseconds < Lower Is Better r7a.4xlarge . 102 |============================================================ PyPerformance 1.0.0 Benchmark: regex_compile Milliseconds < Lower Is Better r7a.4xlarge . 135 |============================================================ PyPerformance 1.0.0 Benchmark: python_startup Milliseconds < Lower Is Better r7a.4xlarge . 8.99 |=========================================================== PyPerformance 1.0.0 Benchmark: django_template Milliseconds < Lower Is Better r7a.4xlarge . 43.8 |=========================================================== PyPerformance 1.0.0 Benchmark: pickle_pure_python Milliseconds < Lower Is Better r7a.4xlarge . 387 |============================================================ CppPerformanceBenchmarks 9 Test: Atol Seconds < Lower Is Better r7a.4xlarge . 38.12 |========================================================== CppPerformanceBenchmarks 9 Test: Ctype Seconds < Lower Is Better r7a.4xlarge . 48.77 |========================================================== CppPerformanceBenchmarks 9 Test: Math Library Seconds < Lower Is Better r7a.4xlarge . 256.04 |========================================================= CppPerformanceBenchmarks 9 Test: Random Numbers Seconds < Lower Is Better r7a.4xlarge . 849.50 |========================================================= CppPerformanceBenchmarks 9 Test: Stepanov Vector Seconds < Lower Is Better r7a.4xlarge . 54.46 |========================================================== CppPerformanceBenchmarks 9 Test: Function Objects Seconds < Lower Is Better r7a.4xlarge . 14.20 |========================================================== CppPerformanceBenchmarks 9 Test: Stepanov Abstraction Seconds < Lower Is Better r7a.4xlarge . 24.59 |========================================================== GraphicsMagick 1.3.43 Operation: Swirl Iterations Per Minute > Higher Is Better r7a.4xlarge . 165 |============================================================ GraphicsMagick 1.3.43 Operation: Rotate Iterations Per Minute > Higher Is Better r7a.4xlarge . 124 |============================================================ GraphicsMagick 1.3.43 Operation: Sharpen Iterations Per Minute > Higher Is Better r7a.4xlarge . 43 |============================================================= GraphicsMagick 1.3.43 Operation: Enhanced Iterations Per Minute > Higher Is Better r7a.4xlarge . 53 |============================================================= GraphicsMagick 1.3.43 Operation: Resizing Iterations Per Minute > Higher Is Better r7a.4xlarge . 293 |============================================================ GraphicsMagick 1.3.43 Operation: Noise-Gaussian Iterations Per Minute > Higher Is Better r7a.4xlarge . 61 |============================================================= GraphicsMagick 1.3.43 Operation: HWB Color Space Iterations Per Minute > Higher Is Better r7a.4xlarge . 204 |============================================================ Smallpt 1.0 Global Illumination Renderer; 128 Samples Seconds < Lower Is Better r7a.4xlarge . 7.658 |========================================================== Stockfish 16.1 Chess Benchmark Nodes Per Second > Higher Is Better r7a.4xlarge . 25161360 |======================================================= C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better r7a.4xlarge . 34.17 |========================================================== SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better r7a.4xlarge . 704.65 |========================================================= SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better r7a.4xlarge . 161.44 |========================================================= SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better r7a.4xlarge . 451.45 |========================================================= SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better r7a.4xlarge . 689.07 |========================================================= SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better r7a.4xlarge . 1164.70 |======================================================== SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better r7a.4xlarge . 1056.57 |======================================================== Renaissance 0.14 Test: Scala Dotty ms < Lower Is Better Renaissance 0.14 Test: Random Forest ms < Lower Is Better Renaissance 0.14 Test: ALS Movie Lens ms < Lower Is Better Renaissance 0.14 Test: Apache Spark ALS ms < Lower Is Better Renaissance 0.14 Test: Apache Spark Bayes ms < Lower Is Better Renaissance 0.14 Test: Savina Reactors.IO ms < Lower Is Better Renaissance 0.14 Test: Apache Spark PageRank ms < Lower Is Better Renaissance 0.14 Test: Finagle HTTP Requests ms < Lower Is Better Renaissance 0.14 Test: In-Memory Database Shootout ms < Lower Is Better Renaissance 0.14 Test: Akka Unbalanced Cobwebbed Tree ms < Lower Is Better Renaissance 0.14 Test: Genetic Algorithm Using Jenetics + Futures ms < Lower Is Better DaCapo Benchmark 23.11 Java Test: Jython msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Eclipse msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: GraphChi msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Tradesoap msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Tradebeans msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Spring Boot msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Kafka msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Tomcat msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: jMonkeyEngine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Cassandra msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Xalan XSLT msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Batik SVG Toolkit msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: H2 Database Engine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: FOP Print Formatter msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: PMD Source Code Analyzer msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Lucene Search Index msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Lucene Search Engine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Avrora AVR Simulation Framework msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: BioJava Biological Data Framework msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Zxing 1D/2D Barcode Image Processing msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: H2O In-Memory Platform For Machine Learning msec < Lower Is Better BlogBench 1.1 Test: Read Final Score > Higher Is Better r7a.4xlarge . 2411119 |======================================================== BlogBench 1.1 Test: Write Final Score > Higher Is Better r7a.4xlarge . 10422 |========================================================== nginx 1.23.2 Connections: 1 Requests Per Second > Higher Is Better nginx 1.23.2 Connections: 20 Requests Per Second > Higher Is Better r7a.4xlarge . 56519.79 |======================================================= nginx 1.23.2 Connections: 100 Requests Per Second > Higher Is Better r7a.4xlarge . 63409.26 |======================================================= nginx 1.23.2 Connections: 200 Requests Per Second > Higher Is Better r7a.4xlarge . 64512.33 |======================================================= nginx 1.23.2 Connections: 500 Requests Per Second > Higher Is Better r7a.4xlarge . 65452.75 |======================================================= nginx 1.23.2 Connections: 1000 Requests Per Second > Higher Is Better r7a.4xlarge . 64799.34 |======================================================= nginx 1.23.2 Connections: 4000 Requests Per Second > Higher Is Better r7a.4xlarge . 62897.23 |======================================================= Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r7a.4xlarge . 2797827.94 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r7a.4xlarge . 2997062.31 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r7a.4xlarge . 2553261.19 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r7a.4xlarge . 2469143.21 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r7a.4xlarge . 2646865.74 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r7a.4xlarge . 2354124.76 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r7a.4xlarge . 2431915.60 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r7a.4xlarge . 2740324.85 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r7a.4xlarge . 2283141.90 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r7a.4xlarge . 2419810.58 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r7a.4xlarge . 2055619.68 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r7a.4xlarge . 2119598.94 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r7a.4xlarge . 2434639.14 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r7a.4xlarge . 2019821.66 |===================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r7a.4xlarge . 2291753.10 |===================================================== Apache Cassandra 4.1.3 Test: Writes Op/s > Higher Is Better Apache Cassandra 4.1.3 Test: Mixed 1:1 Op/s > Higher Is Better Apache Cassandra 4.1.3 Test: Mixed 1:3 Op/s > Higher Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput Megapixels/sec > Higher Is Better r7a.4xlarge . 219.12 |========================================================= VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast Frames Per Second > Higher Is Better r7a.4xlarge . 5.643 |========================================================== VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster Frames Per Second > Higher Is Better r7a.4xlarge . 12.84 |========================================================== VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast Frames Per Second > Higher Is Better r7a.4xlarge . 19.07 |========================================================== VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster Frames Per Second > Higher Is Better r7a.4xlarge . 41.83 |========================================================== NCNN 20230517 Target: CPU - Model: mobilenet ms < Lower Is Better r7a.4xlarge . 13.08 |========================================================== NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better r7a.4xlarge . 5.59 |=========================================================== NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better r7a.4xlarge . 5.56 |=========================================================== NCNN 20230517 Target: CPU - Model: shufflenet-v2 ms < Lower Is Better r7a.4xlarge . 5.79 |=========================================================== NCNN 20230517 Target: CPU - Model: mnasnet ms < Lower Is Better r7a.4xlarge . 5.03 |=========================================================== NCNN 20230517 Target: CPU - Model: efficientnet-b0 ms < Lower Is Better r7a.4xlarge . 6.65 |=========================================================== NCNN 20230517 Target: CPU - Model: blazeface ms < Lower Is Better r7a.4xlarge . 2.31 |=========================================================== NCNN 20230517 Target: CPU - Model: googlenet ms < Lower Is Better r7a.4xlarge . 14.06 |========================================================== NCNN 20230517 Target: CPU - Model: vgg16 ms < Lower Is Better r7a.4xlarge . 20.94 |========================================================== NCNN 20230517 Target: CPU - Model: resnet18 ms < Lower Is Better r7a.4xlarge . 7.69 |=========================================================== NCNN 20230517 Target: CPU - Model: alexnet ms < Lower Is Better r7a.4xlarge . 5.59 |=========================================================== NCNN 20230517 Target: CPU - Model: resnet50 ms < Lower Is Better r7a.4xlarge . 13.99 |========================================================== NCNN 20230517 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better r7a.4xlarge . 13.08 |========================================================== NCNN 20230517 Target: CPU - Model: yolov4-tiny ms < Lower Is Better r7a.4xlarge . 20.16 |========================================================== NCNN 20230517 Target: CPU - Model: squeezenet_ssd ms < Lower Is Better r7a.4xlarge . 11.84 |========================================================== NCNN 20230517 Target: CPU - Model: regnety_400m ms < Lower Is Better r7a.4xlarge . 11.63 |========================================================== NCNN 20230517 Target: CPU - Model: vision_transformer ms < Lower Is Better r7a.4xlarge . 63.91 |========================================================== NCNN 20230517 Target: CPU - Model: FastestDet ms < Lower Is Better r7a.4xlarge . 7.27 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better r7a.4xlarge . 13.12 |========================================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better r7a.4xlarge . 5.53 |=========================================================== NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better r7a.4xlarge . 5.57 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better r7a.4xlarge . 5.78 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better r7a.4xlarge . 5.06 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better r7a.4xlarge . 6.65 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better r7a.4xlarge . 2.32 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better r7a.4xlarge . 14.05 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better r7a.4xlarge . 21.00 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better r7a.4xlarge . 7.70 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better r7a.4xlarge . 5.60 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better r7a.4xlarge . 14.01 |========================================================== NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better r7a.4xlarge . 13.12 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better r7a.4xlarge . 20.14 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better r7a.4xlarge . 11.95 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better r7a.4xlarge . 11.67 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better r7a.4xlarge . 63.86 |========================================================== NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better r7a.4xlarge . 7.45 |=========================================================== libxsmm 2-1.17-3645 M N K: 128 GFLOPS/s > Higher Is Better r7a.4xlarge . 630.1 |========================================================== libxsmm 2-1.17-3645 M N K: 256 GFLOPS/s > Higher Is Better r7a.4xlarge . 762.2 |========================================================== libxsmm 2-1.17-3645 M N K: 32 GFLOPS/s > Higher Is Better r7a.4xlarge . 214.1 |========================================================== libxsmm 2-1.17-3645 M N K: 64 GFLOPS/s > Higher Is Better r7a.4xlarge . 378.5 |========================================================== Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 2000 Seconds < Lower Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, First Run / Cold Cache Queries Per Minute, Geo Mean > Higher Is Better r7a.4xlarge . 258.02 |========================================================= ClickHouse 22.12.3.5 100M Rows Hits Dataset, Second Run Queries Per Minute, Geo Mean > Higher Is Better r7a.4xlarge . 272.63 |========================================================= ClickHouse 22.12.3.5 100M Rows Hits Dataset, Third Run Queries Per Minute, Geo Mean > Higher Is Better r7a.4xlarge . 275.21 |========================================================= InfluxDB 1.8.2 Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 val/sec > Higher Is Better r7a.4xlarge . 1579445.6 |====================================================== InfluxDB 1.8.2 Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 val/sec > Higher Is Better r7a.4xlarge . 2183799.9 |====================================================== InfluxDB 1.8.2 Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000 val/sec > Higher Is Better r7a.4xlarge . 2200230.1 |======================================================