amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2404108-NE-R7G4XLARG76 r7g.4xlarge - Phoronix Test Suite r7g.4xlarge amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2404108-NE-R7G4XLARG76&grr .
r7g.4xlarge Processor Motherboard Chipset Memory Disk Network OS Kernel Compiler File-System System Layer r7g.4xlarge ARMv8 Neoverse-V1 (16 Cores) Amazon EC2 r7g.4xlarge (1.0 BIOS) Amazon Device 0200 128GB 215GB Amazon Elastic Block Store Amazon Elastic Amazon Linux 2023.4.20240401 6.1.82-99.168.amzn2023.aarch64 (aarch64) 20240325 GCC 11.4.1 20230605 xfs amazon OpenBenchmarking.org - Transparent Huge Pages: madvise - --build=aarch64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch=armv8.2-a+crypto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=neoverse-n1 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
r7g.4xlarge libxsmm: 256 libxsmm: 128 cpp-perf-bench: Rand Numbers blogbench: Read stockfish: Chess Benchmark memtier-benchmark: Redis - 500 - 1:5 memtier-benchmark: Redis - 500 - 10:1 memtier-benchmark: Redis - 500 - 5:1 vvenc: Bosphorus 4K - Fast memtier-benchmark: Redis - 500 - 1:1 memtier-benchmark: Redis - 500 - 1:10 cpp-perf-bench: Math Library memtier-benchmark: Redis - 100 - 1:10 openssl: ChaCha20-Poly1305 openssl: AES-256-GCM openssl: AES-128-GCM openssl: ChaCha20 openssl: SHA512 openssl: SHA256 vvenc: Bosphorus 4K - Faster cachebench: Read / Modify / Write cachebench: Write cachebench: Read vvenc: Bosphorus 1080p - Fast pyperformance: raytrace nginx: 4000 nginx: 1000 nginx: 500 nginx: 200 nginx: 100 nginx: 20 cpp-perf-bench: Stepanov Vector pyperformance: python_startup pyperformance: 2to3 memtier-benchmark: Redis - 100 - 10:1 compress-zstd: 19 - Decompression Speed compress-zstd: 19 - Compression Speed memtier-benchmark: Redis - 100 - 5:1 memtier-benchmark: Redis - 100 - 1:1 memtier-benchmark: Redis - 100 - 1:5 memtier-benchmark: Redis - 50 - 10:1 memtier-benchmark: Redis - 50 - 1:5 memtier-benchmark: Redis - 50 - 1:10 memtier-benchmark: Redis - 50 - 1:1 memtier-benchmark: Redis - 50 - 5:1 compress-zstd: 19, Long Mode - Decompression Speed compress-zstd: 19, Long Mode - Compression Speed pyperformance: go compress-zstd: 3 - Decompression Speed compress-zstd: 3 - Compression Speed cpp-perf-bench: Atol compress-zstd: 12 - Decompression Speed compress-zstd: 12 - Compression Speed compress-zstd: 8, Long Mode - Decompression Speed compress-zstd: 8, Long Mode - Compression Speed compress-zstd: 3, Long Mode - Decompression Speed compress-zstd: 3, Long Mode - Compression Speed compress-zstd: 8 - Decompression Speed compress-zstd: 8 - Compression Speed graphics-magick: Noise-Gaussian graphics-magick: Sharpen graphics-magick: Rotate graphics-magick: Enhanced graphics-magick: HWB Color Space graphics-magick: Resizing graphics-magick: Swirl openssl: RSA4096 openssl: RSA4096 vvenc: Bosphorus 1080p - Faster cpp-perf-bench: Ctype ncnn: CPU - FastestDet ncnn: CPU - vision_transformer ncnn: CPU - regnety_400m ncnn: CPU - squeezenet_ssd ncnn: CPU - yolov4-tiny ncnn: CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 ncnn: CPU - resnet50 ncnn: CPU - alexnet ncnn: CPU - resnet18 ncnn: CPU - vgg16 ncnn: CPU - googlenet ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU - mnasnet ncnn: CPU - shufflenet-v2 ncnn: CPU-v3-v3 - mobilenet-v3 ncnn: CPU-v2-v2 - mobilenet-v2 ncnn: CPU - mobilenet ncnn: Vulkan GPU - FastestDet ncnn: Vulkan GPU - vision_transformer ncnn: Vulkan GPU - regnety_400m ncnn: Vulkan GPU - squeezenet_ssd ncnn: Vulkan GPU - yolov4-tiny ncnn: Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 ncnn: Vulkan GPU - resnet50 ncnn: Vulkan GPU - alexnet ncnn: Vulkan GPU - resnet18 ncnn: Vulkan GPU - vgg16 ncnn: Vulkan GPU - googlenet ncnn: Vulkan GPU - blazeface ncnn: Vulkan GPU - efficientnet-b0 ncnn: Vulkan GPU - mnasnet ncnn: Vulkan GPU - shufflenet-v2 ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 ncnn: Vulkan GPU - mobilenet x265: Bosphorus 4K c-ray: Total Time - 4K, 16 Rays Per Pixel pyperformance: regex_compile botan: AES-256 - Decrypt botan: AES-256 pyperformance: pathlib pyperformance: pickle_pure_python libxsmm: 64 cpp-perf-bench: Stepanov Abstraction pyperformance: django_template botan: Blowfish - Decrypt botan: Blowfish botan: Twofish - Decrypt botan: Twofish botan: ChaCha20Poly1305 - Decrypt botan: ChaCha20Poly1305 pyperformance: nbody botan: CAST-256 - Decrypt botan: CAST-256 botan: KASUMI - Decrypt botan: KASUMI pyperformance: float scimark2: Composite pyperformance: crypto_pyaes pyperformance: chaos pyperformance: json_loads compress-lz4: 9 - Decompression Speed compress-lz4: 9 - Compression Speed compress-lz4: 3 - Decompression Speed compress-lz4: 3 - Compression Speed compress-lz4: 1 - Decompression Speed compress-lz4: 1 - Compression Speed x264: Bosphorus 4K libxsmm: 32 cpp-perf-bench: Function Objects x265: Bosphorus 1080p smallpt: Global Illumination Renderer; 128 Samples stream: Copy x264: Bosphorus 1080p blogbench: Write scimark2: Jacobi Successive Over-Relaxation scimark2: Dense LU Matrix Factorization scimark2: Sparse Matrix Multiply scimark2: Fast Fourier Transform scimark2: Monte Carlo stream: Add stream: Triad stream: Scale r7g.4xlarge 366.0 342.7 1165.459 4050734 14645937 1960192.69 1696476.57 1657396.37 1.526 1852520.19 1928291.79 316.623 1661985.56 18626038990 71152167780 83521036743 25868587007 7943468107 13441375783 3.660 77743.745485 38901.481612 9907.186309 4.898 668 69003.52 74965.04 77795.5 79573.43 77802.22 65205.30 89.996 10 381 1398117.78 1041.2 11.9 1497886.42 1581133.53 1729318.31 1569255.07 1888088.64 1869781.07 1786164.87 1595327.93 1069.9 6.02 329 1164.3 2050.2 62.665 1192.9 167.4 1254.7 598.4 1198.8 1064.3 1245.8 518.8 75 81 162 56 298 286 151 178204.7 2561.5 10.644 28.729 1.76 88.64 5.24 4.28 9.34 6.44 6.14 2.52 2.47 10.13 4.22 0.99 2.78 1.65 1.33 1.51 1.83 6.44 1.74 88.52 5.21 4.25 9.30 6.52 6.14 2.59 2.58 9.97 4.34 0.99 2.52 1.57 1.32 1.50 1.75 6.52 13.75 38.910 195 5453.819 5455.193 22.1 595 343.2 31.843 63.5 276.528 280.809 243.873 241.724 367.535 373.368 168 109.493 109.311 66.978 65.567 164 537.45 158 142 28.9 3011.4 31.41 2952.4 89.81 3291.0 605.47 27.53 277.9 15.232 49.63 7.676 225658.3 116.17 10598 949.49 715.06 554.77 346.81 121.15 213584.8 213691.0 225050.1 OpenBenchmarking.org
libxsmm M N K: 256 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 256 r7g.4xlarge 80 160 240 320 400 SE +/- 0.26, N = 3 366.0 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
libxsmm M N K: 128 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 128 r7g.4xlarge 70 140 210 280 350 SE +/- 0.07, N = 3 342.7 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
CppPerformanceBenchmarks Test: Random Numbers OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Random Numbers r7g.4xlarge 300 600 900 1200 1500 SE +/- 0.63, N = 3 1165.46 1. (CXX) g++ options: -std=c++11 -O3
BlogBench Test: Read OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Read r7g.4xlarge 900K 1800K 2700K 3600K 4500K SE +/- 59427.29, N = 9 4050734 1. (CC) gcc options: -O2
Stockfish Chess Benchmark OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 16.1 Chess Benchmark r7g.4xlarge 3M 6M 9M 12M 15M SE +/- 115991.72, N = 12 14645937 1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 72639.38, N = 15 1960192.69 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 55791.94, N = 15 1696476.57 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 36604.00, N = 15 1657396.37 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast r7g.4xlarge 0.3434 0.6868 1.0302 1.3736 1.717 SE +/- 0.001, N = 3 1.526 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 77306.04, N = 12 1852520.19 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 76517.75, N = 12 1928291.79 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
CppPerformanceBenchmarks Test: Math Library OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Math Library r7g.4xlarge 70 140 210 280 350 SE +/- 0.12, N = 3 316.62 1. (CXX) g++ options: -std=c++11 -O3
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 12793.25, N = 10 1661985.56 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 r7g.4xlarge 4000M 8000M 12000M 16000M 20000M SE +/- 245454.78, N = 3 18626038990 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM r7g.4xlarge 15000M 30000M 45000M 60000M 75000M SE +/- 2859108.20, N = 3 71152167780 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM r7g.4xlarge 20000M 40000M 60000M 80000M 100000M SE +/- 4647785.85, N = 3 83521036743 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 r7g.4xlarge 6000M 12000M 18000M 24000M 30000M SE +/- 218250.25, N = 3 25868587007 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 r7g.4xlarge 2000M 4000M 6000M 8000M 10000M SE +/- 4256905.92, N = 3 7943468107 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 r7g.4xlarge 3000M 6000M 9000M 12000M 15000M SE +/- 7750983.69, N = 3 13441375783 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster r7g.4xlarge 0.8235 1.647 2.4705 3.294 4.1175 SE +/- 0.004, N = 3 3.660 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
CacheBench Test: Read / Modify / Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write r7g.4xlarge 17K 34K 51K 68K 85K SE +/- 63.95, N = 3 77743.75 MIN: 73554.39 / MAX: 78862.45 1. (CC) gcc options: -O3 -lrt
CacheBench Test: Write OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write r7g.4xlarge 8K 16K 24K 32K 40K SE +/- 18.42, N = 3 38901.48 MIN: 37503.54 / MAX: 39493.98 1. (CC) gcc options: -O3 -lrt
CacheBench Test: Read OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read r7g.4xlarge 2K 4K 6K 8K 10K SE +/- 0.03, N = 3 9907.19 MIN: 9906.68 / MAX: 9907.79 1. (CC) gcc options: -O3 -lrt
VVenC Video Input: Bosphorus 1080p - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Fast r7g.4xlarge 1.1021 2.2042 3.3063 4.4084 5.5105 SE +/- 0.010, N = 3 4.898 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
PyPerformance Benchmark: raytrace OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: raytrace r7g.4xlarge 140 280 420 560 700 SE +/- 0.67, N = 3 668
nginx Connections: 4000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 4000 r7g.4xlarge 15K 30K 45K 60K 75K SE +/- 712.04, N = 3 69003.52 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 r7g.4xlarge 16K 32K 48K 64K 80K SE +/- 34.81, N = 3 74965.04 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 r7g.4xlarge 17K 34K 51K 68K 85K SE +/- 78.67, N = 3 77795.5 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 200 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 200 r7g.4xlarge 20K 40K 60K 80K 100K SE +/- 253.64, N = 3 79573.43 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 100 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 100 r7g.4xlarge 17K 34K 51K 68K 85K SE +/- 118.57, N = 3 77802.22 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 20 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 20 r7g.4xlarge 14K 28K 42K 56K 70K SE +/- 10.42, N = 3 65205.30 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
CppPerformanceBenchmarks Test: Stepanov Vector OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Vector r7g.4xlarge 20 40 60 80 100 SE +/- 0.76, N = 3 90.00 1. (CXX) g++ options: -std=c++11 -O3
PyPerformance Benchmark: python_startup OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: python_startup r7g.4xlarge 3 6 9 12 15 SE +/- 0.00, N = 3 10
PyPerformance Benchmark: 2to3 OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: 2to3 r7g.4xlarge 80 160 240 320 400 SE +/- 0.33, N = 3 381
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 r7g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 17907.73, N = 3 1398117.78 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Zstd Compression Compression Level: 19 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Decompression Speed r7g.4xlarge 200 400 600 800 1000 SE +/- 3.86, N = 3 1041.2 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 19 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19 - Compression Speed r7g.4xlarge 3 6 9 12 15 SE +/- 0.00, N = 3 11.9 1. (CC) gcc options: -O3 -pthread -lz
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 r7g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 3535.12, N = 3 1497886.42 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 r7g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 9399.79, N = 3 1581133.53 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 18961.50, N = 3 1729318.31 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 r7g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 3286.20, N = 3 1569255.07 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 13391.57, N = 3 1888088.64 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 16228.88, N = 3 1869781.07 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 r7g.4xlarge 400K 800K 1200K 1600K 2000K SE +/- 19547.74, N = 3 1786164.87 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 OpenBenchmarking.org Ops/sec, More Is Better Redis 7.0.12 + memtier_benchmark 2.0 Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 r7g.4xlarge 300K 600K 900K 1200K 1500K SE +/- 9560.28, N = 3 1595327.93 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Zstd Compression Compression Level: 19, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Decompression Speed r7g.4xlarge 200 400 600 800 1000 SE +/- 5.53, N = 3 1069.9 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 19, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 19, Long Mode - Compression Speed r7g.4xlarge 2 4 6 8 10 SE +/- 0.00, N = 3 6.02 1. (CC) gcc options: -O3 -pthread -lz
PyPerformance Benchmark: go OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: go r7g.4xlarge 70 140 210 280 350 SE +/- 0.33, N = 3 329
Zstd Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3 - Decompression Speed r7g.4xlarge 300 600 900 1200 1500 SE +/- 0.50, N = 3 1164.3 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3 - Compression Speed r7g.4xlarge 400 800 1200 1600 2000 SE +/- 17.66, N = 3 2050.2 1. (CC) gcc options: -O3 -pthread -lz
CppPerformanceBenchmarks Test: Atol OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Atol r7g.4xlarge 14 28 42 56 70 SE +/- 0.01, N = 3 62.67 1. (CXX) g++ options: -std=c++11 -O3
Zstd Compression Compression Level: 12 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Decompression Speed r7g.4xlarge 300 600 900 1200 1500 SE +/- 12.00, N = 3 1192.9 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 12 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 12 - Compression Speed r7g.4xlarge 40 80 120 160 200 SE +/- 0.96, N = 3 167.4 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Decompression Speed r7g.4xlarge 300 600 900 1200 1500 SE +/- 4.88, N = 3 1254.7 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8, Long Mode - Compression Speed r7g.4xlarge 130 260 390 520 650 SE +/- 1.63, N = 3 598.4 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3, Long Mode - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Decompression Speed r7g.4xlarge 300 600 900 1200 1500 SE +/- 2.99, N = 3 1198.8 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 3, Long Mode - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 3, Long Mode - Compression Speed r7g.4xlarge 200 400 600 800 1000 SE +/- 4.38, N = 3 1064.3 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Decompression Speed r7g.4xlarge 300 600 900 1200 1500 SE +/- 5.04, N = 3 1245.8 1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression Compression Level: 8 - Compression Speed OpenBenchmarking.org MB/s, More Is Better Zstd Compression 1.5.4 Compression Level: 8 - Compression Speed r7g.4xlarge 110 220 330 440 550 SE +/- 1.27, N = 3 518.8 1. (CC) gcc options: -O3 -pthread -lz
GraphicsMagick Operation: Noise-Gaussian OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Noise-Gaussian r7g.4xlarge 20 40 60 80 100 SE +/- 0.00, N = 3 75 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Sharpen OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Sharpen r7g.4xlarge 20 40 60 80 100 SE +/- 0.00, N = 3 81 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Rotate OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Rotate r7g.4xlarge 40 80 120 160 200 SE +/- 0.00, N = 3 162 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Enhanced OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Enhanced r7g.4xlarge 13 26 39 52 65 SE +/- 0.00, N = 3 56 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: HWB Color Space OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: HWB Color Space r7g.4xlarge 60 120 180 240 300 SE +/- 0.88, N = 3 298 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Resizing OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Resizing r7g.4xlarge 60 120 180 240 300 SE +/- 0.58, N = 3 286 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick Operation: Swirl OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.43 Operation: Swirl r7g.4xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 151 1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r7g.4xlarge 40K 80K 120K 160K 200K SE +/- 42.12, N = 3 178204.7 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 r7g.4xlarge 500 1000 1500 2000 2500 SE +/- 0.19, N = 3 2561.5 1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
VVenC Video Input: Bosphorus 1080p - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 1080p - Video Preset: Faster r7g.4xlarge 3 6 9 12 15 SE +/- 0.01, N = 3 10.64 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
CppPerformanceBenchmarks Test: Ctype OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Ctype r7g.4xlarge 7 14 21 28 35 SE +/- 0.32, N = 5 28.73 1. (CXX) g++ options: -std=c++11 -O3
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: FastestDet r7g.4xlarge 0.396 0.792 1.188 1.584 1.98 SE +/- 0.02, N = 3 1.76 MIN: 1.7 / MAX: 2.17 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vision_transformer r7g.4xlarge 20 40 60 80 100 SE +/- 0.26, N = 3 88.64 MIN: 87.71 / MAX: 105.35 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: regnety_400m r7g.4xlarge 1.179 2.358 3.537 4.716 5.895 SE +/- 0.05, N = 3 5.24 MIN: 5.13 / MAX: 7.52 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: squeezenet_ssd r7g.4xlarge 0.963 1.926 2.889 3.852 4.815 SE +/- 0.03, N = 3 4.28 MIN: 4.19 / MAX: 4.99 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: yolov4-tiny r7g.4xlarge 3 6 9 12 15 SE +/- 0.02, N = 3 9.34 MIN: 9.25 / MAX: 16.34 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r7g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 6.44 MIN: 6.3 / MAX: 6.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet50 r7g.4xlarge 2 4 6 8 10 SE +/- 0.02, N = 3 6.14 MIN: 6.06 / MAX: 6.62 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: alexnet r7g.4xlarge 0.567 1.134 1.701 2.268 2.835 SE +/- 0.00, N = 3 2.52 MIN: 2.48 / MAX: 3.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: resnet18 r7g.4xlarge 0.5558 1.1116 1.6674 2.2232 2.779 SE +/- 0.01, N = 3 2.47 MIN: 2.42 / MAX: 4.31 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: vgg16 r7g.4xlarge 3 6 9 12 15 SE +/- 0.12, N = 3 10.13 MIN: 9.9 / MAX: 34.05 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: googlenet r7g.4xlarge 0.9495 1.899 2.8485 3.798 4.7475 SE +/- 0.01, N = 3 4.22 MIN: 4.16 / MAX: 4.45 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: blazeface r7g.4xlarge 0.2228 0.4456 0.6684 0.8912 1.114 SE +/- 0.01, N = 3 0.99 MIN: 0.97 / MAX: 1.13 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: efficientnet-b0 r7g.4xlarge 0.6255 1.251 1.8765 2.502 3.1275 SE +/- 0.23, N = 3 2.78 MIN: 2.48 / MAX: 70.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mnasnet r7g.4xlarge 0.3713 0.7426 1.1139 1.4852 1.8565 SE +/- 0.06, N = 3 1.65 MIN: 1.53 / MAX: 37.68 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: shufflenet-v2 r7g.4xlarge 0.2993 0.5986 0.8979 1.1972 1.4965 SE +/- 0.01, N = 3 1.33 MIN: 1.28 / MAX: 1.75 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v3-v3 - Model: mobilenet-v3 r7g.4xlarge 0.3398 0.6796 1.0194 1.3592 1.699 SE +/- 0.01, N = 3 1.51 MIN: 1.47 / MAX: 1.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU-v2-v2 - Model: mobilenet-v2 r7g.4xlarge 0.4118 0.8236 1.2354 1.6472 2.059 SE +/- 0.08, N = 3 1.83 MIN: 1.72 / MAX: 42.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: CPU - Model: mobilenet r7g.4xlarge 2 4 6 8 10 SE +/- 0.01, N = 3 6.44 MIN: 6.3 / MAX: 6.89 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: FastestDet r7g.4xlarge 0.3915 0.783 1.1745 1.566 1.9575 SE +/- 0.02, N = 3 1.74 MIN: 1.69 / MAX: 2.08 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vision_transformer r7g.4xlarge 20 40 60 80 100 SE +/- 0.28, N = 3 88.52 MIN: 87.01 / MAX: 129.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: regnety_400m r7g.4xlarge 1.1723 2.3446 3.5169 4.6892 5.8615 SE +/- 0.11, N = 3 5.21 MIN: 5.05 / MAX: 5.8 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: squeezenet_ssd r7g.4xlarge 0.9563 1.9126 2.8689 3.8252 4.7815 SE +/- 0.03, N = 3 4.25 MIN: 4.14 / MAX: 4.49 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: yolov4-tiny r7g.4xlarge 3 6 9 12 15 SE +/- 0.02, N = 3 9.30 MIN: 9.2 / MAX: 9.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 r7g.4xlarge 2 4 6 8 10 SE +/- 0.07, N = 3 6.52 MIN: 6.27 / MAX: 32.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet50 r7g.4xlarge 2 4 6 8 10 SE +/- 0.02, N = 3 6.14 MIN: 6.07 / MAX: 6.65 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: alexnet r7g.4xlarge 0.5828 1.1656 1.7484 2.3312 2.914 SE +/- 0.07, N = 3 2.59 MIN: 2.48 / MAX: 17.38 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: resnet18 r7g.4xlarge 0.5805 1.161 1.7415 2.322 2.9025 SE +/- 0.11, N = 3 2.58 MIN: 2.41 / MAX: 30.57 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: vgg16 r7g.4xlarge 3 6 9 12 15 SE +/- 0.01, N = 3 9.97 MIN: 9.87 / MAX: 10.19 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: googlenet r7g.4xlarge 0.9765 1.953 2.9295 3.906 4.8825 SE +/- 0.13, N = 3 4.34 MIN: 4.16 / MAX: 67.77 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: blazeface r7g.4xlarge 0.2228 0.4456 0.6684 0.8912 1.114 SE +/- 0.01, N = 3 0.99 MIN: 0.97 / MAX: 1.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: efficientnet-b0 r7g.4xlarge 0.567 1.134 1.701 2.268 2.835 SE +/- 0.03, N = 3 2.52 MIN: 2.45 / MAX: 2.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mnasnet r7g.4xlarge 0.3533 0.7066 1.0599 1.4132 1.7665 SE +/- 0.01, N = 3 1.57 MIN: 1.52 / MAX: 1.64 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: shufflenet-v2 r7g.4xlarge 0.297 0.594 0.891 1.188 1.485 SE +/- 0.02, N = 3 1.32 MIN: 1.27 / MAX: 1.79 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 r7g.4xlarge 0.3375 0.675 1.0125 1.35 1.6875 SE +/- 0.02, N = 3 1.50 MIN: 1.44 / MAX: 1.6 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 r7g.4xlarge 0.3938 0.7876 1.1814 1.5752 1.969 SE +/- 0.01, N = 3 1.75 MIN: 1.71 / MAX: 2.24 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: Vulkan GPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20230517 Target: Vulkan GPU - Model: mobilenet r7g.4xlarge 2 4 6 8 10 SE +/- 0.07, N = 3 6.52 MIN: 6.27 / MAX: 32.15 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 4K r7g.4xlarge 4 8 12 16 20 SE +/- 0.12, N = 3 13.75 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
C-Ray Total Time - 4K, 16 Rays Per Pixel OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel r7g.4xlarge 9 18 27 36 45 SE +/- 0.02, N = 3 38.91 1. (CC) gcc options: -lm -lpthread -O3
PyPerformance Benchmark: regex_compile OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: regex_compile r7g.4xlarge 40 80 120 160 200 SE +/- 0.00, N = 3 195
Botan Test: AES-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 - Decrypt r7g.4xlarge 1200 2400 3600 4800 6000 SE +/- 6.95, N = 3 5453.82 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: AES-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: AES-256 r7g.4xlarge 1200 2400 3600 4800 6000 SE +/- 1.11, N = 3 5455.19 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
PyPerformance Benchmark: pathlib OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: pathlib r7g.4xlarge 5 10 15 20 25 SE +/- 0.03, N = 3 22.1
PyPerformance Benchmark: pickle_pure_python OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: pickle_pure_python r7g.4xlarge 130 260 390 520 650 SE +/- 0.67, N = 3 595
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 r7g.4xlarge 70 140 210 280 350 SE +/- 0.35, N = 3 343.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
CppPerformanceBenchmarks Test: Stepanov Abstraction OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Stepanov Abstraction r7g.4xlarge 7 14 21 28 35 SE +/- 0.02, N = 3 31.84 1. (CXX) g++ options: -std=c++11 -O3
PyPerformance Benchmark: django_template OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: django_template r7g.4xlarge 14 28 42 56 70 SE +/- 0.31, N = 3 63.5
Botan Test: Blowfish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish - Decrypt r7g.4xlarge 60 120 180 240 300 SE +/- 0.12, N = 3 276.53 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Blowfish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Blowfish r7g.4xlarge 60 120 180 240 300 SE +/- 0.05, N = 3 280.81 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish - Decrypt r7g.4xlarge 50 100 150 200 250 SE +/- 0.05, N = 3 243.87 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: Twofish OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: Twofish r7g.4xlarge 50 100 150 200 250 SE +/- 0.03, N = 3 241.72 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 - Decrypt r7g.4xlarge 80 160 240 320 400 SE +/- 0.03, N = 3 367.54 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: ChaCha20Poly1305 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: ChaCha20Poly1305 r7g.4xlarge 80 160 240 320 400 SE +/- 0.08, N = 3 373.37 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
PyPerformance Benchmark: nbody OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: nbody r7g.4xlarge 40 80 120 160 200 SE +/- 0.00, N = 3 168
Botan Test: CAST-256 - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 - Decrypt r7g.4xlarge 20 40 60 80 100 SE +/- 0.02, N = 3 109.49 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: CAST-256 OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: CAST-256 r7g.4xlarge 20 40 60 80 100 SE +/- 0.01, N = 3 109.31 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI - Decrypt OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI - Decrypt r7g.4xlarge 15 30 45 60 75 SE +/- 0.01, N = 3 66.98 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
Botan Test: KASUMI OpenBenchmarking.org MiB/s, More Is Better Botan 2.17.3 Test: KASUMI r7g.4xlarge 15 30 45 60 75 SE +/- 0.01, N = 3 65.57 1. (CXX) g++ options: -fstack-protector -pthread -lbotan-2 -ldl -lrt
PyPerformance Benchmark: float OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: float r7g.4xlarge 40 80 120 160 200 SE +/- 0.33, N = 3 164
SciMark Computational Test: Composite OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite r7g.4xlarge 120 240 360 480 600 SE +/- 0.14, N = 3 537.45 1. (CC) gcc options: -lm
PyPerformance Benchmark: crypto_pyaes OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: crypto_pyaes r7g.4xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 158
PyPerformance Benchmark: chaos OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: chaos r7g.4xlarge 30 60 90 120 150 SE +/- 0.00, N = 3 142
PyPerformance Benchmark: json_loads OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: json_loads r7g.4xlarge 7 14 21 28 35 SE +/- 0.00, N = 3 28.9
LZ4 Compression Compression Level: 9 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 9 - Decompression Speed r7g.4xlarge 600 1200 1800 2400 3000 SE +/- 0.27, N = 3 3011.4 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 9 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 9 - Compression Speed r7g.4xlarge 7 14 21 28 35 SE +/- 0.42, N = 3 31.41 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 3 - Decompression Speed r7g.4xlarge 600 1200 1800 2400 3000 SE +/- 2.30, N = 3 2952.4 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 3 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 3 - Compression Speed r7g.4xlarge 20 40 60 80 100 SE +/- 0.41, N = 3 89.81 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Decompression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 1 - Decompression Speed r7g.4xlarge 700 1400 2100 2800 3500 SE +/- 0.48, N = 3 3291.0 1. (CC) gcc options: -O3
LZ4 Compression Compression Level: 1 - Compression Speed OpenBenchmarking.org MB/s, More Is Better LZ4 Compression 1.9.4 Compression Level: 1 - Compression Speed r7g.4xlarge 130 260 390 520 650 SE +/- 0.03, N = 3 605.47 1. (CC) gcc options: -O3
x264 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 4K r7g.4xlarge 6 12 18 24 30 SE +/- 0.02, N = 3 27.53 1. (CC) gcc options: -ldl -lm -lpthread -O3 -flto
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 r7g.4xlarge 60 120 180 240 300 SE +/- 0.10, N = 3 277.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=armv8.1-a
CppPerformanceBenchmarks Test: Function Objects OpenBenchmarking.org Seconds, Fewer Is Better CppPerformanceBenchmarks 9 Test: Function Objects r7g.4xlarge 4 8 12 16 20 SE +/- 0.00, N = 3 15.23 1. (CXX) g++ options: -std=c++11 -O3
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 3.6 Video Input: Bosphorus 1080p r7g.4xlarge 11 22 33 44 55 SE +/- 0.05, N = 3 49.63 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
Smallpt Global Illumination Renderer; 128 Samples OpenBenchmarking.org Seconds, Fewer Is Better Smallpt 1.0 Global Illumination Renderer; 128 Samples r7g.4xlarge 2 4 6 8 10 SE +/- 0.003, N = 3 7.676 1. (CXX) g++ options: -fopenmp -O3
Stream Type: Copy OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Copy r7g.4xlarge 50K 100K 150K 200K 250K SE +/- 135.96, N = 5 225658.3 1. (CC) gcc options: -O3 -march=native -fopenmp
x264 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x264 2022-02-22 Video Input: Bosphorus 1080p r7g.4xlarge 30 60 90 120 150 SE +/- 0.10, N = 3 116.17 1. (CC) gcc options: -ldl -lm -lpthread -O3 -flto
BlogBench Test: Write OpenBenchmarking.org Final Score, More Is Better BlogBench 1.1 Test: Write r7g.4xlarge 2K 4K 6K 8K 10K SE +/- 10.84, N = 3 10598 1. (CC) gcc options: -O2
SciMark Computational Test: Jacobi Successive Over-Relaxation OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation r7g.4xlarge 200 400 600 800 1000 SE +/- 0.28, N = 3 949.49 1. (CC) gcc options: -lm
SciMark Computational Test: Dense LU Matrix Factorization OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization r7g.4xlarge 150 300 450 600 750 SE +/- 0.41, N = 3 715.06 1. (CC) gcc options: -lm
SciMark Computational Test: Sparse Matrix Multiply OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply r7g.4xlarge 120 240 360 480 600 SE +/- 0.08, N = 3 554.77 1. (CC) gcc options: -lm
SciMark Computational Test: Fast Fourier Transform OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform r7g.4xlarge 80 160 240 320 400 SE +/- 0.50, N = 3 346.81 1. (CC) gcc options: -lm
SciMark Computational Test: Monte Carlo OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo r7g.4xlarge 30 60 90 120 150 SE +/- 0.51, N = 3 121.15 1. (CC) gcc options: -lm
Stream Type: Add OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Add r7g.4xlarge 50K 100K 150K 200K 250K SE +/- 82.27, N = 5 213584.8 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Triad OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Triad r7g.4xlarge 50K 100K 150K 200K 250K SE +/- 123.19, N = 5 213691.0 1. (CC) gcc options: -O3 -march=native -fopenmp
Stream Type: Scale OpenBenchmarking.org MB/s, More Is Better Stream 2013-01-17 Type: Scale r7g.4xlarge 50K 100K 150K 200K 250K SE +/- 178.63, N = 5 225050.1 1. (CC) gcc options: -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.4