amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2404109-NE-R6G4XLARG31 r6g.4xlarge - Phoronix Test Suite r6g.4xlarge amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2404109-NE-R6G4XLARG31&grr .
r6g.4xlarge Processor Motherboard Chipset Memory Disk Network OS Kernel Compiler File-System System Layer r6g.4xlarge ARMv8 Neoverse-N1 (16 Cores) Amazon EC2 r6g.4xlarge (1.0 BIOS) Amazon Device 0200 128GB 215GB Amazon Elastic Block Store Amazon Elastic Amazon Linux 2023.4.20240401 6.1.82-99.168.amzn2023.aarch64 (aarch64) 20240325 GCC 11.4.1 20230605 xfs amazon OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=aarch64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch=armv8.2-a+crypto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=neoverse-n1 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
r6g.4xlarge libxsmm: 256 libxsmm: 128 cpp-perf-bench: Rand Numbers stockfish: Chess Benchmark cpp-perf-bench: Math Library vvenc: Bosphorus 4K - Fast memtier-benchmark: Redis - 500 - 10:1 memtier-benchmark: Redis - 500 - 1:1 memtier-benchmark: Redis - 500 - 5:1 memtier-benchmark: Redis - 500 - 1:10 compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed blogbench: Read memtier-benchmark: Redis - 100 - 1:10 compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed vvenc: Bosphorus 4K - Faster openssl: ChaCha20-Poly1305 openssl: AES-256-GCM openssl: AES-128-GCM openssl: ChaCha20 openssl: SHA512 openssl: SHA256 vvenc: Bosphorus 1080p - Fast pyperformance: raytrace cachebench: Read / Modify / Write cachebench: Write cachebench: Read pyperformance: python_startup cpp-perf-bench: Stepanov Vector pyperformance: 2to3 nginx: 4000 nginx: 1000 cpp-perf-bench: Atol nginx: 500 nginx: 200 nginx: 100 nginx: 20 pyperformance: go cpp-perf-bench: Ctype vvenc: Bosphorus 1080p - Faster x265: Bosphorus 4K memtier-benchmark: Redis - 500 - 1:5 memtier-benchmark: Redis - 100 - 10:1 memtier-benchmark: Redis - 100 - 5:1 memtier-benchmark: Redis - 100 - 1:1 memtier-benchmark: Redis - 100 - 1:5 memtier-benchmark: Redis - 50 - 10:1 memtier-benchmark: Redis - 50 - 5:1 memtier-benchmark: Redis - 50 - 1:1 memtier-benchmark: Redis - 50 - 1:10 memtier-benchmark: Redis - 50 - 1:5 compress-zstd: 12 - Decompression Speed compress-zstd: 12 - Compression Speed c-ray: Total Time - 4K, 16 Rays Per Pixel compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 3 - Decompression Speed compress-zstd: 3 - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed graphics-magick: Sharpen graphics-magick: Noise-Gaussian graphics-magick: Swirl graphics-magick: Rotate graphics-magick: HWB Color Space graphics-magick: Resizing openssl: RSA4096 openssl: RSA4096 graphics-magick: Enhanced ncnn: Vulkan GPU - FastestDet ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet ncnn: CPU - FastestDet ncnn: CPU - vision_transformer ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet libxsmm: 64 pyperformance: regex_compile pyperformance: nbody pyperformance: django_template cpp-perf-bench: Stepanov Abstraction pyperformance: float pyperformance: crypto_pyaes pyperformance: chaos libxsmm: 32 botan: AES-256 - Decrypt botan: AES-256 compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed x264: Bosphorus 4K botan: Blowfish - Decrypt botan: Blowfish botan: Twofish - Decrypt botan: Twofish botan: ChaCha20Poly1305 - Decrypt botan: ChaCha20Poly1305 botan: CAST-256 - Decrypt botan: CAST-256 botan: KASUMI - Decrypt botan: KASUMI pyperformance: pathlib scimark2: Composite compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed pyperformance: pickle_pure_python compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed pyperformance: json_loads x265: Bosphorus 1080p cpp-perf-bench: Function Objects smallpt: Global Illumination Renderer; 128 Samples stream: Copy x264: Bosphorus 1080p blogbench: Write scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo stream: Add stream: Triad stream: Scale r6g.4xlarge 195.6 193.7 1527.678 9545114 513.443 1.224 1163292.46 1244474.80 1163553.10 1239372.06 577.5 8.78 3005792 1102905.86 594.7 4.38 2.810 11687814487 32230997017 39725140013 16854830557 3617155013 10712087513 3.899 843 49943.268621 31540.112402 9525.606112 13.6 103.334 523 41597.09 46502.49 90.396 46400.13 46386.48 44934.26 39441.33 419 77.428 7.991 7.98 1216162.20 919351.79 1028622.85 1091836.03 1200393.02 1072272.83 1106619.44 1239410.54 1296678.95 1328112.15 696.9 107.9 62.018 785.6 386.7 767.5 1364.9 781.1 346.5 794.5 558.4 48 50 106 109 186 194 53955.6 662.4 35 2.74 90.76 7.13 6.17 12.88 8.82 8.13 3.52 3.33 13.66 5.52 1.68 3.58 2.13 1.98 2.06 2.42 8.82 2.70 90.23 6.97 6.45 12.83 8.79 8.30 3.53 3.32 13.70 5.52 1.67 3.57 2.29 1.99 2.07 2.42 8.79 203.9 249 227 81.4 39.506 211 192 178 171.5 2306.633 2301.176 2147.3 23.08 19.48 243.754 246.605 206.833 207.929 240.945 241.666 94.531 94.858 58.684 57.433 31.4 391.01 2112.5 68.04 763 2358.3 459.42 40.2 32.67 17.375 10.429 170463.0 82.49 10825 971.85 393.90 386.55 103.12 99.64 167210.0 167372.9 171629.7 OpenBenchmarking.org
libxsmm M N K: 256 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 256 r6g.4xlarge 40 80 120 160 200 SE +/- 0.35, N = 3 195.6 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
libxsmm M N K: 128 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 r6g.4xlarge 40 80 120 160 200 SE +/- 0.12, N = 3 193.7 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
CppPerformanceBenchmarks Test: Random Numbers OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers r6g.4xlarge 300 600 900 1200 1500 SE +/- 0.04, N = 3 1527.68 1. (CXX) g++ options: -std=c++11 -O3
Stockfish Chess Benchmark OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark r6g.4xlarge 2M 4M 6M 8M 10M SE +/- 115262.87, N = 12 9545114 1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver
CppPerformanceBenchmarks Test: Math Library OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library r6g.4xlarge 110 220 330 440 550 SE +/- 0.18, N = 3 513.44 1. (CXX) g++ options: -std=c++11 -O3
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast r6g.4xlarge 0.2754 0.5508 0.8262 1.1016 1.377 SE +/- 0.001, N = 3 1.224 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 36599.91, N = 15 1163292.46 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 31551.84, N = 15 1244474.80 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 30648.83, N = 15 1163553.10 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 11055.97, N = 15 1239372.06 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed r6g.4xlarge 120 240 360 480 600 SE +/- 10.76, N = 15 577.5 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed r6g.4xlarge 2 4 6 8 10 SE +/- 0.13, N = 15 8.78 1. (CC) gcc options: -O3 -pthread -lz
BlogBench Test: Read OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read r6g.4xlarge 600K 1200K 1800K 2400K 3000K SE +/- 37964.88, N = 3 3005792 1. (CC) gcc options: -O2
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 8568.69, N = 15 1102905.86 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed r6g.4xlarge 130 260 390 520 650 SE +/- 9.91, N = 12 594.7 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed r6g.4xlarge 0.9855 1.971 2.9565 3.942 4.9275 SE +/- 0.08, N = 12 4.38 1. (CC) gcc options: -O3 -pthread -lz
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster r6g.4xlarge 0.6323 1.2646 1.8969 2.5292 3.1615 SE +/- 0.009, N = 3 2.810 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 r6g.4xlarge 3000M 6000M 9000M 12000M 15000M SE +/- 413593.98, N = 3 11687814487 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM r6g.4xlarge 7000M 14000M 21000M 28000M 35000M SE +/- 1329248.61, N = 3 32230997017 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM r6g.4xlarge 9000M 18000M 27000M 36000M 45000M SE +/- 559313.01, N = 3 39725140013 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 r6g.4xlarge 4000M 8000M 12000M 16000M 20000M SE +/- 1709996.43, N = 3 16854830557 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 r6g.4xlarge 800M 1600M 2400M 3200M 4000M SE +/- 4271017.01, N = 3 3617155013 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 r6g.4xlarge 2000M 4000M 6000M 8000M 10000M SE +/- 5404104.00, N = 3 10712087513 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
VVenC Video Input: Bosphorus 1080p - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast r6g.4xlarge 0.8773 1.7546 2.6319 3.5092 4.3865 SE +/- 0.006, N = 3 3.899 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
PyPerformance Benchmark: raytrace OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: raytrace r6g.4xlarge 200 400 600 800 1000 SE +/- 0.33, N = 3 843
CacheBench Test: Read / Modify / Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write r6g.4xlarge 11K 22K 33K 44K 55K SE +/- 2.82, N = 3 49943.27 MIN: 49108.87 / MAX: 50609.95 1. (CC) gcc options: -O3 -lrt
CacheBench Test: Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write r6g.4xlarge 7K 14K 21K 28K 35K SE +/- 34.39, N = 3 31540.11 MIN: 28719.67 / MAX: 35222.86 1. (CC) gcc options: -O3 -lrt
CacheBench Test: Read OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read r6g.4xlarge 2K 4K 6K 8K 10K SE +/- 0.08, N = 3 9525.61 MIN: 9524.61 / MAX: 9526.78 1. (CC) gcc options: -O3 -lrt
PyPerformance Benchmark: python_startup OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: python_startup r6g.4xlarge 3 6 9 12 15 SE +/- 0.15, N = 5 13.6
CppPerformanceBenchmarks Test: Stepanov Vector OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector r6g.4xlarge 20 40 60 80 100 SE +/- 0.06, N = 3 103.33 1. (CXX) g++ options: -std=c++11 -O3
PyPerformance Benchmark: 2to3 OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: 2to3 r6g.4xlarge 110 220 330 440 550 SE +/- 6.69, N = 3 523
nginx Connections: 4000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 4000 r6g.4xlarge 9K 18K 27K 36K 45K SE +/- 107.80, N = 3 41597.09 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 r6g.4xlarge 10K 20K 30K 40K 50K SE +/- 114.89, N = 3 46502.49 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
CppPerformanceBenchmarks Test: Atol OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol r6g.4xlarge 20 40 60 80 100 SE +/- 0.25, N = 3 90.40 1. (CXX) g++ options: -std=c++11 -O3
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 r6g.4xlarge 10K 20K 30K 40K 50K SE +/- 69.93, N = 3 46400.13 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 200 r6g.4xlarge 10K 20K 30K 40K 50K SE +/- 549.92, N = 3 46386.48 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 100 r6g.4xlarge 10K 20K 30K 40K 50K SE +/- 117.08, N = 3 44934.26 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 20 r6g.4xlarge 8K 16K 24K 32K 40K SE +/- 99.73, N = 3 39441.33 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
PyPerformance Benchmark: go OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: go r6g.4xlarge 90 180 270 360 450 SE +/- 0.33, N = 3 419
CppPerformanceBenchmarks Test: Ctype OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype r6g.4xlarge 20 40 60 80 100 SE +/- 0.04, N = 3 77.43 1. (CXX) g++ options: -std=c++11 -O3
VVenC Video Input: Bosphorus 1080p - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster r6g.4xlarge 2 4 6 8 10 SE +/- 0.005, N = 3 7.991 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 4K r6g.4xlarge 2 4 6 8 10 SE +/- 0.04, N = 3 7.98 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 9046.71, N = 3 1216162.20 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 10989.25, N = 3 919351.79 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 4533.42, N = 3 1028622.85 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 3540.95, N = 3 1091836.03 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 8588.09, N = 3 1200393.02 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 1160.41, N = 3 1072272.83 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 r6g.4xlarge 200K 400K 600K 800K 1000K SE +/- 10989.87, N = 3 1106619.44 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 9673.16, N = 3 1239410.54 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 3831.65, N = 3 1296678.95 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 r6g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 3048.23, N = 3 1328112.15 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Zstd Compression Compression Level: 12 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed r6g.4xlarge 150 300 450 600 750 SE +/- 6.01, N = 3 696.9 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 12 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed r6g.4xlarge 20 40 60 80 100 SE +/- 0.19, N = 3 107.9 1. (CC) gcc options: -O3 -pthread -lz
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel r6g.4xlarge 14 28 42 56 70 SE +/- 0.01, N = 3 62.02 1. (CC) gcc options: -lm -lpthread -O3
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed r6g.4xlarge 200 400 600 800 1000 SE +/- 2.63, N = 3 785.6 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed r6g.4xlarge 80 160 240 320 400 SE +/- 1.08, N = 3 386.7 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3 - Decompression Speed r6g.4xlarge 170 340 510 680 850 SE +/- 0.47, N = 3 767.5 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3 - Compression Speed r6g.4xlarge 300 600 900 1200 1500 SE +/- 8.21, N = 3 1364.9 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed r6g.4xlarge 200 400 600 800 1000 SE +/- 0.96, N = 3 781.1 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed r6g.4xlarge 80 160 240 320 400 SE +/- 0.07, N = 3 346.5 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Decompression Speed r6g.4xlarge 200 400 600 800 1000 SE +/- 1.45, N = 3 794.5 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Compression Speed r6g.4xlarge 120 240 360 480 600 SE +/- 1.58, N = 3 558.4 1. (CC) gcc options: -O3 -pthread -lz
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Sharpen r6g.4xlarge 11 22 33 44 55 SE +/- 0.33, N = 3 48 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Noise-Gaussian r6g.4xlarge 11 22 33 44 55 SE +/- 0.00, N = 3 50 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Swirl r6g.4xlarge 20 40 60 80 100 SE +/- 0.33, N = 3 106 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Rotate r6g.4xlarge 20 40 60 80 100 SE +/- 0.33, N = 3 109 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: HWB Color Space r6g.4xlarge 40 80 120 160 200 SE +/- 1.33, N = 3 186 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Resizing r6g.4xlarge 40 80 120 160 200 SE +/- 0.33, N = 3 194 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r6g.4xlarge 12K 24K 36K 48K 60K SE +/- 2.92, N = 3 53955.6 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r6g.4xlarge 140 280 420 560 700 SE +/- 0.07, N = 3 662.4 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Enhanced r6g.4xlarge 8 16 24 32 40 SE +/- 0.00, N = 3 35 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet r6g.4xlarge 0.6165 1.233 1.8495 2.466 3.0825 SE +/- 0.09, N = 3 2.74 MIN: 2.58 / MAX: 80.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer r6g.4xlarge 20 40 60 80 100 SE +/- 0.17, N = 3 90.76 MIN: 89.09 / MAX: 125.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m r6g.4xlarge 2 4 6 8 10 SE +/- 0.09, N = 3 7.13 MIN: 6.88 / MAX: 92.16 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd r6g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 6.17 MIN: 6.09 / MAX: 6.73 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny r6g.4xlarge 3 6 9 12 15 SE +/- 0.10, N = 3 12.88 MIN: 12.63 / MAX: 34.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r6g.4xlarge 2 4 6 8 10 SE +/- 0.02, N = 3 8.82 MIN: 8.66 / MAX: 18.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 r6g.4xlarge 2 4 6 8 10 SE +/- 0.02, N = 3 8.13 MIN: 8.03 / MAX: 8.71 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet r6g.4xlarge 0.792 1.584 2.376 3.168 3.96 SE +/- 0.00, N = 3 3.52 MIN: 3.47 / MAX: 3.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 r6g.4xlarge 0.7493 1.4986 2.2479 2.9972 3.7465 SE +/- 0.02, N = 3 3.33 MIN: 3.27 / MAX: 3.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 r6g.4xlarge 4 8 12 16 20 SE +/- 0.06, N = 3 13.66 MIN: 13.39 / MAX: 19.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet r6g.4xlarge 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 5.52 MIN: 5.44 / MAX: 6.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface r6g.4xlarge 0.378 0.756 1.134 1.512 1.89 SE +/- 0.01, N = 3 1.68 MIN: 1.66 / MAX: 6.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 r6g.4xlarge 0.8055 1.611 2.4165 3.222 4.0275 SE +/- 0.01, N = 3 3.58 MIN: 3.53 / MAX: 3.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet r6g.4xlarge 0.4793 0.9586 1.4379 1.9172 2.3965 SE +/- 0.00, N = 3 2.13 MIN: 2.08 / MAX: 2.41 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 r6g.4xlarge 0.4455 0.891 1.3365 1.782 2.2275 SE +/- 0.00, N = 3 1.98 MIN: 1.95 / MAX: 2.1 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 r6g.4xlarge 0.4635 0.927 1.3905 1.854 2.3175 SE +/- 0.01, N = 3 2.06 MIN: 2.02 / MAX: 2.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 r6g.4xlarge 0.5445 1.089 1.6335 2.178 2.7225 SE +/- 0.00, N = 3 2.42 MIN: 2.35 / MAX: 2.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet r6g.4xlarge 2 4 6 8 10 SE +/- 0.02, N = 3 8.82 MIN: 8.66 / MAX: 18.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet r6g.4xlarge 0.6075 1.215 1.8225 2.43 3.0375 SE +/- 0.10, N = 3 2.70 MIN: 2.56 / MAX: 28.86 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer r6g.4xlarge 20 40 60 80 100 SE +/- 0.12, N = 3 90.23 MIN: 88.64 / MAX: 123.97 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m r6g.4xlarge 2 4 6 8 10 SE +/- 0.08, N = 3 6.97 MIN: 6.81 / MAX: 7.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd r6g.4xlarge 2 4 6 8 10 SE +/- 0.26, N = 3 6.45 MIN: 6.11 / MAX: 148.95 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny r6g.4xlarge 3 6 9 12 15 SE +/- 0.03, N = 3 12.83 MIN: 12.67 / MAX: 14.29 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r6g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 8.79 MIN: 8.67 / MAX: 9.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 r6g.4xlarge 2 4 6 8 10 SE +/- 0.07, N = 3 8.30 MIN: 8.04 / MAX: 61.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet r6g.4xlarge 0.7943 1.5886 2.3829 3.1772 3.9715 SE +/- 0.01, N = 3 3.53 MIN: 3.45 / MAX: 5.46 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 r6g.4xlarge 0.747 1.494 2.241 2.988 3.735 SE +/- 0.00, N = 3 3.32 MIN: 3.27 / MAX: 3.5 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 r6g.4xlarge 4 8 12 16 20 SE +/- 0.05, N = 3 13.70 MIN: 13.41 / MAX: 27.83 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet r6g.4xlarge 1.242 2.484 3.726 4.968 6.21 SE +/- 0.01, N = 3 5.52 MIN: 5.45 / MAX: 6.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface r6g.4xlarge 0.3758 0.7516 1.1274 1.5032 1.879 SE +/- 0.01, N = 3 1.67 MIN: 1.65 / MAX: 2.11 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 r6g.4xlarge 0.8033 1.6066 2.4099 3.2132 4.0165 SE +/- 0.02, N = 3 3.57 MIN: 3.51 / MAX: 3.78 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet r6g.4xlarge 0.5153 1.0306 1.5459 2.0612 2.5765 SE +/- 0.16, N = 3 2.29 MIN: 2.08 / MAX: 73.74 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 r6g.4xlarge 0.4478 0.8956 1.3434 1.7912 2.239 SE +/- 0.01, N = 3 1.99 MIN: 1.95 / MAX: 2.45 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 r6g.4xlarge 0.4658 0.9316 1.3974 1.8632 2.329 SE +/- 0.01, N = 3 2.07 MIN: 2.01 / MAX: 2.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 r6g.4xlarge 0.5445 1.089 1.6335 2.178 2.7225 SE +/- 0.01, N = 3 2.42 MIN: 2.34 / MAX: 2.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mobilenet r6g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 8.79 MIN: 8.67 / MAX: 9.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 r6g.4xlarge 40 80 120 160 200 SE +/- 0.07, N = 3 203.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
PyPerformance Benchmark: regex_compile OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: regex_compile r6g.4xlarge 50 100 150 200 250 SE +/- 0.00, N = 3 249
PyPerformance Benchmark: nbody OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: nbody r6g.4xlarge 50 100 150 200 250 SE +/- 0.00, N = 3 227
PyPerformance Benchmark: django_template OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: django_template r6g.4xlarge 20 40 60 80 100 SE +/- 0.03, N = 3 81.4
CppPerformanceBenchmarks Test: Stepanov Abstraction OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction r6g.4xlarge 9 18 27 36 45 SE +/- 0.03, N = 3 39.51 1. (CXX) g++ options: -std=c++11 -O3
PyPerformance Benchmark: float OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: float r6g.4xlarge 50 100 150 200 250 SE +/- 0.67, N = 3 211
PyPerformance Benchmark: crypto_pyaes OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: crypto_pyaes r6g.4xlarge 40 80 120 160 200 SE +/- 0.00, N = 3 192
PyPerformance Benchmark: chaos OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: chaos r6g.4xlarge 40 80 120 160 200 SE +/- 0.00, N = 3 178
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 r6g.4xlarge 40 80 120 160 200 SE +/- 0.64, N = 3 171.5 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt r6g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.12, N = 3 2306.63 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 r6g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.17, N = 3 2301.18 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 9 - Decompression Speed r6g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.35, N = 3 2147.3 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 9 - Compression Speed r6g.4xlarge 6 12 18 24 30 SE +/- 0.01, N = 3 23.08 1. (CC) gcc options: -O3
x264 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 4K r6g.4xlarge 5 10 15 20 25 SE +/- 0.01, N = 3 19.48 1. (CC) gcc options: -ldl -lm -lpthread -O3 -flto
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt r6g.4xlarge 50 100 150 200 250 SE +/- 0.02, N = 3 243.75 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish r6g.4xlarge 50 100 150 200 250 SE +/- 0.01, N = 3 246.61 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt r6g.4xlarge 50 100 150 200 250 SE +/- 0.31, N = 3 206.83 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish r6g.4xlarge 50 100 150 200 250 SE +/- 0.07, N = 3 207.93 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt r6g.4xlarge 50 100 150 200 250 SE +/- 0.03, N = 3 240.95 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 r6g.4xlarge 50 100 150 200 250 SE +/- 0.21, N = 3 241.67 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt r6g.4xlarge 20 40 60 80 100 SE +/- 0.57, N = 3 94.53 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 r6g.4xlarge 20 40 60 80 100 SE +/- 0.59, N = 3 94.86 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt r6g.4xlarge 13 26 39 52 65 SE +/- 0.01, N = 3 58.68 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI r6g.4xlarge 13 26 39 52 65 SE +/- 0.01, N = 3 57.43 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
PyPerformance Benchmark: pathlib OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: pathlib r6g.4xlarge 7 14 21 28 35 SE +/- 0.44, N = 3 31.4
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite r6g.4xlarge 80 160 240 320 400 SE +/- 3.32, N = 3 391.01 1. (CC) gcc options: -lm
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 3 - Decompression Speed r6g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.24, N = 3 2112.5 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 3 - Compression Speed r6g.4xlarge 15 30 45 60 75 SE +/- 0.02, N = 3 68.04 1. (CC) gcc options: -O3
PyPerformance Benchmark: pickle_pure_python OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: pickle_pure_python r6g.4xlarge 160 320 480 640 800 SE +/- 0.58, N = 3 763
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 1 - Decompression Speed r6g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.91, N = 3 2358.3 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 1 - Compression Speed r6g.4xlarge 100 200 300 400 500 SE +/- 0.14, N = 3 459.42 1. (CC) gcc options: -O3
PyPerformance Benchmark: json_loads OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: json_loads r6g.4xlarge 9 18 27 36 45 SE +/- 0.00, N = 3 40.2
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 1080p r6g.4xlarge 8 16 24 32 40 SE +/- 0.03, N = 3 32.67 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
CppPerformanceBenchmarks Test: Function Objects OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects r6g.4xlarge 4 8 12 16 20 SE +/- 0.00, N = 3 17.38 1. (CXX) g++ options: -std=c++11 -O3
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples r6g.4xlarge 3 6 9 12 15 SE +/- 0.01, N = 3 10.43 1. (CXX) g++ options: -fopenmp -O3
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy r6g.4xlarge 40K 80K 120K 160K 200K SE +/- 72.22, N = 5 170463.0 1. (CC) gcc options: -O3 -march=native -fopenmp
x264 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 1080p r6g.4xlarge 20 40 60 80 100 SE +/- 0.01, N = 3 82.49 1. (CC) gcc options: -ldl -lm -lpthread -O3 -flto
BlogBench Test: Write OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Write r6g.4xlarge 2K 4K 6K 8K 10K SE +/- 21.23, N = 3 10825 1. (CC) gcc options: -O2
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation r6g.4xlarge 200 400 600 800 1000 SE +/- 0.25, N = 3 971.85 1. (CC) gcc options: -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization r6g.4xlarge 90 180 270 360 450 SE +/- 15.69, N = 3 393.90 1. (CC) gcc options: -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply r6g.4xlarge 80 160 240 320 400 SE +/- 0.44, N = 3 386.55 1. (CC) gcc options: -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform r6g.4xlarge 20 40 60 80 100 SE +/- 1.09, N = 3 103.12 1. (CC) gcc options: -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo r6g.4xlarge 20 40 60 80 100 SE +/- 0.22, N = 3 99.64 1. (CC) gcc options: -lm
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add r6g.4xlarge 40K 80K 120K 160K 200K SE +/- 91.99, N = 5 167210.0 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad r6g.4xlarge 40K 80K 120K 160K 200K SE +/- 94.26, N = 5 167372.9 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale r6g.4xlarge 40K 80K 120K 160K 200K SE +/- 88.22, N = 5 171629.7 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.4