r5.4xlarge amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite. r5.4xlarge: Processor: Intel Xeon Platinum 8259CL (8 Cores / 16 Threads), Motherboard: Amazon EC2 r5.4xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 1 x 128 GB DDR4-2933MT/s, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Amazon Linux 2023.4.20240401, Kernel: 6.1.82-99.168.amzn2023.x86_64 (x86_64), Compiler: GCC 11.4.1 20230605, File-System: xfs, System Layer: amazon Stream 2013-01-17 Type: Copy MB/s > Higher Is Better r5.4xlarge . 84703.6 |========================================================= Stream 2013-01-17 Type: Scale MB/s > Higher Is Better r5.4xlarge . 64499.0 |========================================================= Stream 2013-01-17 Type: Triad MB/s > Higher Is Better r5.4xlarge . 71377.4 |========================================================= Stream 2013-01-17 Type: Add MB/s > Higher Is Better r5.4xlarge . 71470.9 |========================================================= Intel Memory Latency Checker 3.10 Test: Idle Latency ns < Lower Is Better r5.4xlarge . 90.0 |============================================================ Intel Memory Latency Checker 3.10 Test: Max Bandwidth - All Reads MB/s > Higher Is Better r5.4xlarge . 91215.65 |======================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 3:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 96351.88 |======================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 2:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 96249.85 |======================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - 1:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 97010.00 |======================================================== Intel Memory Latency Checker 3.10 Test: Max Bandwidth - Stream-Triad Like MB/s > Higher Is Better r5.4xlarge . 89342.10 |======================================================== Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - All Reads MB/s > Higher Is Better r5.4xlarge . 90835.8 |========================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 3:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 94890.6 |========================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 2:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 92818.7 |========================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - 1:1 Reads-Writes MB/s > Higher Is Better r5.4xlarge . 86504.5 |========================================================= Intel Memory Latency Checker 3.10 Test: Peak Injection Bandwidth - Stream-Triad Like MB/s > Higher Is Better r5.4xlarge . 85760.6 |========================================================= CacheBench Test: Read MB/s > Higher Is Better r5.4xlarge . 5901.42 |========================================================= CacheBench Test: Write MB/s > Higher Is Better r5.4xlarge . 40219.22 |======================================================== CacheBench Test: Read / Modify / Write MB/s > Higher Is Better r5.4xlarge . 67823.98 |======================================================== GNU GMP GMPbench 6.2.1 GMPbench Score > Higher Is Better Zstd Compression 1.5.4 Compression Level: 3 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 1273.5 |========================================================== Zstd Compression 1.5.4 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 777.9 |=========================================================== Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 322.8 |=========================================================== Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 817.7 |=========================================================== Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 126.7 |=========================================================== Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 781.5 |=========================================================== Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 10.3 |============================================================ Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 671.0 |=========================================================== Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Compression Speed MB/s > Higher Is Better r5.4xlarge . 681.6 |=========================================================== Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 807.6 |=========================================================== Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed MB/s > Higher Is Better r5.4xlarge . 342.2 |=========================================================== Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 826.6 |=========================================================== Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better r5.4xlarge . 5.37 |============================================================ Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 650.3 |=========================================================== LZ4 Compression 1.9.4 Compression Level: 1 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 431.07 |========================================================== LZ4 Compression 1.9.4 Compression Level: 1 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 2521.9 |========================================================== LZ4 Compression 1.9.4 Compression Level: 3 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 70.77 |=========================================================== LZ4 Compression 1.9.4 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 2354.7 |========================================================== LZ4 Compression 1.9.4 Compression Level: 9 - Compression Speed MB/s > Higher Is Better r5.4xlarge . 24.59 |=========================================================== LZ4 Compression 1.9.4 Compression Level: 9 - Decompression Speed MB/s > Higher Is Better r5.4xlarge . 2464.9 |========================================================== OpenSSL 3.1 Algorithm: SHA256 byte/s > Higher Is Better r5.4xlarge . 2147501110 |====================================================== OpenSSL 3.1 Algorithm: SHA512 byte/s > Higher Is Better r5.4xlarge . 2397065050 |====================================================== OpenSSL 3.1 Algorithm: RSA4096 sign/s > Higher Is Better r5.4xlarge . 1884.4 |========================================================== OpenSSL 3.1 Algorithm: RSA4096 verify/s > Higher Is Better r5.4xlarge . 124488.5 |======================================================== OpenSSL 3.1 Algorithm: ChaCha20 byte/s > Higher Is Better r5.4xlarge . 34805518700 |===================================================== OpenSSL 3.1 Algorithm: AES-128-GCM byte/s > Higher Is Better r5.4xlarge . 37949298003 |===================================================== OpenSSL 3.1 Algorithm: AES-256-GCM byte/s > Higher Is Better r5.4xlarge . 27673820453 |===================================================== OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 byte/s > Higher Is Better r5.4xlarge . 17887037850 |===================================================== Botan 2.17.3 Test: KASUMI MiB/s > Higher Is Better r5.4xlarge . 68.33 |=========================================================== Botan 2.17.3 Test: KASUMI - Decrypt MiB/s > Higher Is Better r5.4xlarge . 66.89 |=========================================================== Botan 2.17.3 Test: AES-256 MiB/s > Higher Is Better r5.4xlarge . 2887.51 |========================================================= Botan 2.17.3 Test: AES-256 - Decrypt MiB/s > Higher Is Better r5.4xlarge . 2892.63 |========================================================= Botan 2.17.3 Test: Twofish MiB/s > Higher Is Better r5.4xlarge . 246.83 |========================================================== Botan 2.17.3 Test: Twofish - Decrypt MiB/s > Higher Is Better r5.4xlarge . 245.72 |========================================================== Botan 2.17.3 Test: Blowfish MiB/s > Higher Is Better r5.4xlarge . 300.66 |========================================================== Botan 2.17.3 Test: Blowfish - Decrypt MiB/s > Higher Is Better r5.4xlarge . 296.83 |========================================================== Botan 2.17.3 Test: CAST-256 MiB/s > Higher Is Better r5.4xlarge . 102.63 |========================================================== Botan 2.17.3 Test: CAST-256 - Decrypt MiB/s > Higher Is Better r5.4xlarge . 102.69 |========================================================== Botan 2.17.3 Test: ChaCha20Poly1305 MiB/s > Higher Is Better r5.4xlarge . 565.10 |========================================================== Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt MiB/s > Higher Is Better r5.4xlarge . 560.47 |========================================================== x264 2022-02-22 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better r5.4xlarge . 16.08 |=========================================================== x264 2022-02-22 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better r5.4xlarge . 68.43 |=========================================================== x265 3.6 Video Input: Bosphorus 4K Frames Per Second > Higher Is Better r5.4xlarge . 7.70 |============================================================ x265 3.6 Video Input: Bosphorus 1080p Frames Per Second > Higher Is Better r5.4xlarge . 37.07 |=========================================================== PyPerformance 1.0.0 Benchmark: go Milliseconds < Lower Is Better r5.4xlarge . 388 |============================================================= PyPerformance 1.0.0 Benchmark: 2to3 Milliseconds < Lower Is Better r5.4xlarge . 467 |============================================================= PyPerformance 1.0.0 Benchmark: chaos Milliseconds < Lower Is Better r5.4xlarge . 174 |============================================================= PyPerformance 1.0.0 Benchmark: float Milliseconds < Lower Is Better r5.4xlarge . 187 |============================================================= PyPerformance 1.0.0 Benchmark: nbody Milliseconds < Lower Is Better r5.4xlarge . 221 |============================================================= PyPerformance 1.0.0 Benchmark: pathlib Milliseconds < Lower Is Better r5.4xlarge . 26.1 |============================================================ PyPerformance 1.0.0 Benchmark: raytrace Milliseconds < Lower Is Better r5.4xlarge . 767 |============================================================= PyPerformance 1.0.0 Benchmark: json_loads Milliseconds < Lower Is Better r5.4xlarge . 36.9 |============================================================ PyPerformance 1.0.0 Benchmark: crypto_pyaes Milliseconds < Lower Is Better r5.4xlarge . 178 |============================================================= PyPerformance 1.0.0 Benchmark: regex_compile Milliseconds < Lower Is Better r5.4xlarge . 257 |============================================================= PyPerformance 1.0.0 Benchmark: python_startup Milliseconds < Lower Is Better r5.4xlarge . 12.0 |============================================================ PyPerformance 1.0.0 Benchmark: django_template Milliseconds < Lower Is Better r5.4xlarge . 71.8 |============================================================ PyPerformance 1.0.0 Benchmark: pickle_pure_python Milliseconds < Lower Is Better r5.4xlarge . 679 |============================================================= CppPerformanceBenchmarks 9 Test: Atol Seconds < Lower Is Better r5.4xlarge . 89.15 |=========================================================== CppPerformanceBenchmarks 9 Test: Ctype Seconds < Lower Is Better r5.4xlarge . 40.43 |=========================================================== CppPerformanceBenchmarks 9 Test: Math Library Seconds < Lower Is Better r5.4xlarge . 413.67 |========================================================== CppPerformanceBenchmarks 9 Test: Random Numbers Seconds < Lower Is Better r5.4xlarge . 1283.16 |========================================================= CppPerformanceBenchmarks 9 Test: Stepanov Vector Seconds < Lower Is Better r5.4xlarge . 120.93 |========================================================== CppPerformanceBenchmarks 9 Test: Function Objects Seconds < Lower Is Better r5.4xlarge . 18.99 |=========================================================== CppPerformanceBenchmarks 9 Test: Stepanov Abstraction Seconds < Lower Is Better r5.4xlarge . 47.80 |=========================================================== GraphicsMagick 1.3.43 Operation: Swirl Iterations Per Minute > Higher Is Better r5.4xlarge . 64 |============================================================== GraphicsMagick 1.3.43 Operation: Rotate Iterations Per Minute > Higher Is Better r5.4xlarge . 116 |============================================================= GraphicsMagick 1.3.43 Operation: Sharpen Iterations Per Minute > Higher Is Better r5.4xlarge . 20 |============================================================== GraphicsMagick 1.3.43 Operation: Enhanced Iterations Per Minute > Higher Is Better r5.4xlarge . 27 |============================================================== GraphicsMagick 1.3.43 Operation: Resizing Iterations Per Minute > Higher Is Better r5.4xlarge . 135 |============================================================= GraphicsMagick 1.3.43 Operation: Noise-Gaussian Iterations Per Minute > Higher Is Better r5.4xlarge . 35 |============================================================== GraphicsMagick 1.3.43 Operation: HWB Color Space Iterations Per Minute > Higher Is Better r5.4xlarge . 131 |============================================================= Smallpt 1.0 Global Illumination Renderer; 128 Samples Seconds < Lower Is Better r5.4xlarge . 15.52 |=========================================================== Stockfish 16.1 Chess Benchmark Nodes Per Second > Higher Is Better r5.4xlarge . 9643551 |========================================================= C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better r5.4xlarge . 109.18 |========================================================== SciMark 2.0 Computational Test: Composite Mflops > Higher Is Better r5.4xlarge . 492.91 |========================================================== SciMark 2.0 Computational Test: Monte Carlo Mflops > Higher Is Better r5.4xlarge . 106.71 |========================================================== SciMark 2.0 Computational Test: Fast Fourier Transform Mflops > Higher Is Better r5.4xlarge . 226.20 |========================================================== SciMark 2.0 Computational Test: Sparse Matrix Multiply Mflops > Higher Is Better r5.4xlarge . 561.25 |========================================================== SciMark 2.0 Computational Test: Dense LU Matrix Factorization Mflops > Higher Is Better r5.4xlarge . 694.39 |========================================================== SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation Mflops > Higher Is Better r5.4xlarge . 875.98 |========================================================== Renaissance 0.14 Test: Scala Dotty ms < Lower Is Better Renaissance 0.14 Test: Random Forest ms < Lower Is Better Renaissance 0.14 Test: ALS Movie Lens ms < Lower Is Better Renaissance 0.14 Test: Apache Spark ALS ms < Lower Is Better Renaissance 0.14 Test: Apache Spark Bayes ms < Lower Is Better Renaissance 0.14 Test: Savina Reactors.IO ms < Lower Is Better Renaissance 0.14 Test: Apache Spark PageRank ms < Lower Is Better Renaissance 0.14 Test: Finagle HTTP Requests ms < Lower Is Better Renaissance 0.14 Test: In-Memory Database Shootout ms < Lower Is Better Renaissance 0.14 Test: Akka Unbalanced Cobwebbed Tree ms < Lower Is Better Renaissance 0.14 Test: Genetic Algorithm Using Jenetics + Futures ms < Lower Is Better DaCapo Benchmark 23.11 Java Test: Jython msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Eclipse msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: GraphChi msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Tradesoap msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Tradebeans msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Spring Boot msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Kafka msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Tomcat msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: jMonkeyEngine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Cassandra msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Xalan XSLT msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Batik SVG Toolkit msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: H2 Database Engine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: FOP Print Formatter msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: PMD Source Code Analyzer msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Lucene Search Index msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Apache Lucene Search Engine msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Avrora AVR Simulation Framework msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: BioJava Biological Data Framework msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: Zxing 1D/2D Barcode Image Processing msec < Lower Is Better DaCapo Benchmark 23.11 Java Test: H2O In-Memory Platform For Machine Learning msec < Lower Is Better BlogBench 1.1 Test: Read Final Score > Higher Is Better r5.4xlarge . 2410482 |========================================================= BlogBench 1.1 Test: Write Final Score > Higher Is Better r5.4xlarge . 10569 |=========================================================== nginx 1.23.2 Connections: 1 Requests Per Second > Higher Is Better nginx 1.23.2 Connections: 20 Requests Per Second > Higher Is Better r5.4xlarge . 40042.38 |======================================================== nginx 1.23.2 Connections: 100 Requests Per Second > Higher Is Better r5.4xlarge . 45638.33 |======================================================== nginx 1.23.2 Connections: 200 Requests Per Second > Higher Is Better r5.4xlarge . 46019.07 |======================================================== nginx 1.23.2 Connections: 500 Requests Per Second > Higher Is Better r5.4xlarge . 44379.39 |======================================================== nginx 1.23.2 Connections: 1000 Requests Per Second > Higher Is Better r5.4xlarge . 42862.39 |======================================================== nginx 1.23.2 Connections: 4000 Requests Per Second > Higher Is Better r5.4xlarge . 40920.26 |======================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r5.4xlarge . 1586149.75 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r5.4xlarge . 1696597.16 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r5.4xlarge . 1443386.97 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r5.4xlarge . 1468305.38 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r5.4xlarge . 1599986.41 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r5.4xlarge . 1415631.65 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r5.4xlarge . 1377985.95 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r5.4xlarge . 1541158.58 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 Ops/sec > Higher Is Better r5.4xlarge . 1338486.05 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 Ops/sec > Higher Is Better r5.4xlarge . 1410244.64 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 Ops/sec > Higher Is Better r5.4xlarge . 1239692.43 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r5.4xlarge . 1350197.45 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r5.4xlarge . 1507444.16 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 Ops/sec > Higher Is Better r5.4xlarge . 1232823.03 |====================================================== Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 Ops/sec > Higher Is Better r5.4xlarge . 1403099.37 |====================================================== Apache Cassandra 4.1.3 Test: Writes Op/s > Higher Is Better Apache Cassandra 4.1.3 Test: Mixed 1:1 Op/s > Higher Is Better Apache Cassandra 4.1.3 Test: Mixed 1:3 Op/s > Higher Is Better libjpeg-turbo tjbench 2.1.0 Test: Decompression Throughput Megapixels/sec > Higher Is Better r5.4xlarge . 139.46 |========================================================== VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast Frames Per Second > Higher Is Better r5.4xlarge . 2.341 |=========================================================== VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster Frames Per Second > Higher Is Better r5.4xlarge . 5.523 |=========================================================== VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast Frames Per Second > Higher Is Better r5.4xlarge . 7.513 |=========================================================== VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster Frames Per Second > Higher Is Better r5.4xlarge . 19.05 |=========================================================== NCNN 20230517 Target: CPU - Model: mobilenet ms < Lower Is Better r5.4xlarge . 14.62 |=========================================================== NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better r5.4xlarge . 4.28 |============================================================ NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better r5.4xlarge . 4.36 |============================================================ NCNN 20230517 Target: CPU - Model: shufflenet-v2 ms < Lower Is Better r5.4xlarge . 4.40 |============================================================ NCNN 20230517 Target: CPU - Model: mnasnet ms < Lower Is Better r5.4xlarge . 4.04 |============================================================ NCNN 20230517 Target: CPU - Model: efficientnet-b0 ms < Lower Is Better r5.4xlarge . 5.73 |============================================================ NCNN 20230517 Target: CPU - Model: blazeface ms < Lower Is Better r5.4xlarge . 1.56 |============================================================ NCNN 20230517 Target: CPU - Model: googlenet ms < Lower Is Better r5.4xlarge . 10.45 |=========================================================== NCNN 20230517 Target: CPU - Model: vgg16 ms < Lower Is Better r5.4xlarge . 36.29 |=========================================================== NCNN 20230517 Target: CPU - Model: resnet18 ms < Lower Is Better r5.4xlarge . 6.88 |============================================================ NCNN 20230517 Target: CPU - Model: alexnet ms < Lower Is Better r5.4xlarge . 4.95 |============================================================ NCNN 20230517 Target: CPU - Model: resnet50 ms < Lower Is Better r5.4xlarge . 14.24 |=========================================================== NCNN 20230517 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better r5.4xlarge . 14.62 |=========================================================== NCNN 20230517 Target: CPU - Model: yolov4-tiny ms < Lower Is Better r5.4xlarge . 25.58 |=========================================================== NCNN 20230517 Target: CPU - Model: squeezenet_ssd ms < Lower Is Better r5.4xlarge . 10.85 |=========================================================== NCNN 20230517 Target: CPU - Model: regnety_400m ms < Lower Is Better r5.4xlarge . 12.81 |=========================================================== NCNN 20230517 Target: CPU - Model: vision_transformer ms < Lower Is Better r5.4xlarge . 93.23 |=========================================================== NCNN 20230517 Target: CPU - Model: FastestDet ms < Lower Is Better r5.4xlarge . 5.30 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: mobilenet ms < Lower Is Better r5.4xlarge . 14.79 |=========================================================== NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 ms < Lower Is Better r5.4xlarge . 4.33 |============================================================ NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 ms < Lower Is Better r5.4xlarge . 4.41 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 ms < Lower Is Better r5.4xlarge . 4.39 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: mnasnet ms < Lower Is Better r5.4xlarge . 4.03 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 ms < Lower Is Better r5.4xlarge . 5.77 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: blazeface ms < Lower Is Better r5.4xlarge . 1.58 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: googlenet ms < Lower Is Better r5.4xlarge . 10.64 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: vgg16 ms < Lower Is Better r5.4xlarge . 36.42 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: resnet18 ms < Lower Is Better r5.4xlarge . 6.98 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: alexnet ms < Lower Is Better r5.4xlarge . 5.17 |============================================================ NCNN 20230517 Target: Vulkan GPU - Model: resnet50 ms < Lower Is Better r5.4xlarge . 14.46 |=========================================================== NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 ms < Lower Is Better r5.4xlarge . 14.79 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny ms < Lower Is Better r5.4xlarge . 25.60 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd ms < Lower Is Better r5.4xlarge . 11.02 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m ms < Lower Is Better r5.4xlarge . 12.97 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer ms < Lower Is Better r5.4xlarge . 93.04 |=========================================================== NCNN 20230517 Target: Vulkan GPU - Model: FastestDet ms < Lower Is Better r5.4xlarge . 5.22 |============================================================ libxsmm 2-1.17-3645 M N K: 128 GFLOPS/s > Higher Is Better r5.4xlarge . 309.2 |=========================================================== libxsmm 2-1.17-3645 M N K: 256 GFLOPS/s > Higher Is Better r5.4xlarge . 115.2 |=========================================================== libxsmm 2-1.17-3645 M N K: 32 GFLOPS/s > Higher Is Better r5.4xlarge . 179.2 |=========================================================== libxsmm 2-1.17-3645 M N K: 64 GFLOPS/s > Higher Is Better r5.4xlarge . 242.3 |=========================================================== Apache Spark 3.3 Row Count: 1000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 1000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 100 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 500 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 10000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 20000000 - Partitions: 2000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 1000 Seconds < Lower Is Better Apache Spark 3.3 Row Count: 40000000 - Partitions: 2000 Seconds < Lower Is Better