amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2404107-NE-R6A4XLARG93
HTML result view exported from: https://openbenchmarking.org/result/2404107-NE-R6A4XLARG93&grr .
r6a.4xlarge
  Processor: AMD EPYC 7R13 (8 Cores / 16 Threads)
  Motherboard: Amazon EC2 r6a.4xlarge (1.0 BIOS)
  Chipset: Intel 440FX 82441FX PMC
  Memory: 1 x 128 GB DDR4-3200MT/s
  Disk: 215GB Amazon Elastic Block Store
  Network: Amazon Elastic
  OS: Amazon Linux 2023.4.20240401
  Kernel: 6.1.82-99.168.amzn2023.x86_64 (x86_64)
  Compiler: GCC 11.4.1 20230605
  File-System: xfs
  System Layer: amazon
  - Transparent Huge Pages: madvise
  - --build=x86_64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
  - CPU Microcode: 0xa0011d1
  - gather_data_sampling: Not affected
    + itlb_multihit: Not affected
    + l1tf: Not affected
    + mds: Not affected
    + meltdown: Not affected
    + mmio_stale_data: Not affected
    + reg_file_data_sampling: Not affected
    + retbleed: Not affected
    + spec_rstack_overflow: Mitigation of safe RET no microcode
    + spec_store_bypass: Mitigation of SSB disabled via prctl
    + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
    + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected
    + srbds: Not affected
    + tsx_async_abort: Not affected
r6a.4xlarge     Test
220.1           libxsmm: 256
172.07          clickhouse: 100M Rows Hits Dataset, Third Run
171.01          clickhouse: 100M Rows Hits Dataset, Second Run
165.32          clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache
260.7           libxsmm: 128
870.318         cpp-perf-bench: Rand Numbers
3303161         blogbench: Read
2.66            ncnn: Vulkan GPU - FastestDet
96.79           ncnn: Vulkan GPU - vision_transformer
6.33            ncnn: Vulkan GPU - regnety_400m
6.00            ncnn: Vulkan GPU - squeezenet_ssd
15.64           ncnn: Vulkan GPU - yolov4-tiny
9.55            ncnn: Vulkan GPU - mobilenetv2-yolov3
12.86           ncnn: Vulkan GPU - resnet50
6.43            ncnn: Vulkan GPU - alexnet
5.90            ncnn: Vulkan GPU - resnet18
36.26           ncnn: Vulkan GPU - vgg16
7.79            ncnn: Vulkan GPU - googlenet
0.73            ncnn: Vulkan GPU - blazeface
4.54            ncnn: Vulkan GPU - efficientnet-b0
2.24            ncnn: Vulkan GPU - mnasnet
2.01            ncnn: Vulkan GPU - shufflenet-v2
2.05            ncnn: Vulkan GPU - mobilenet-v3
2.38            ncnn: Vulkan GPU - mobilenet-v2
9.55            ncnn: Vulkan GPU - mobilenet
293.181         cpp-perf-bench: Math Library
24046419150     openssl: ChaCha20-Poly1305
34776523753     openssl: AES-256-GCM
37945733573     openssl: AES-128-GCM
35343929890     openssl: ChaCha20
3755914720      openssl: SHA512
11165896017     openssl: SHA256
3.777           vvenc: Bosphorus 4K - Fast
45649.10        intel-mlc: Max Bandwidth - Stream-Triad Like
45554.89        intel-mlc: Max Bandwidth - 1:1 Reads-Writes
46089.25        intel-mlc: Max Bandwidth - 2:1 Reads-Writes
45156.51        intel-mlc: Max Bandwidth - 3:1 Reads-Writes
40264.71        intel-mlc: Max Bandwidth - All Reads
14950049        stockfish: Chess Benchmark
102080.807379   cachebench: Read / Modify / Write
51742.215398    cachebench: Write
9139.651439     cachebench: Read
1900547.15      memtier-benchmark: Redis - 500 - 1:1
46860.10        nginx: 4000
49459.27        nginx: 1000
51769.47        nginx: 500
53546.88        nginx: 200
53527.79        nginx: 100
43467.35        nginx: 20
528             pyperformance: raytrace
1473.7          compress-zstd: 12 - Decompression Speed
137.2           compress-zstd: 12 - Compression Speed
9.77            pyperformance: python_startup
2047586.15      memtier-benchmark: Redis - 500 - 1:5
1747045.85      memtier-benchmark: Redis - 500 - 10:1
8.746           vvenc: Bosphorus 4K - Faster
68.973          c-ray: Total Time - 4K, 16 Rays Per Pixel
1977046.15      memtier-benchmark: Redis - 500 - 1:10
1751865.28      memtier-benchmark: Redis - 500 - 5:1
2.56            ncnn: CPU - FastestDet
96.68           ncnn: CPU - vision_transformer
6.32            ncnn: CPU - regnety_400m
5.87            ncnn: CPU - squeezenet_ssd
15.02           ncnn: CPU - yolov4-tiny
9.45            ncnn: CPU - mobilenetv2-yolov3
12.68           ncnn: CPU - resnet50
6.43            ncnn: CPU - alexnet
5.75            ncnn: CPU - resnet18
36.18           ncnn: CPU - vgg16
7.71            ncnn: CPU - googlenet
0.73            ncnn: CPU - blazeface
4.53            ncnn: CPU - efficientnet-b0
2.23            ncnn: CPU - mnasnet
1.99            ncnn: CPU - shufflenet-v2
2.05            ncnn: CPU - mobilenet-v3
2.39            ncnn: CPU - mobilenet-v2
9.45            ncnn: CPU - mobilenet
1300.1          compress-zstd: 19 - Decompression Speed
11.9            compress-zstd: 19 - Compression Speed
2096259.64      memtier-benchmark: Redis - 100 - 1:10
1961016.78      memtier-benchmark: Redis - 100 - 5:1
2248697.50      memtier-benchmark: Redis - 100 - 1:5
1892964.18      memtier-benchmark: Redis - 100 - 10:1
2081724.01      memtier-benchmark: Redis - 100 - 1:1
2199721.37      memtier-benchmark: Redis - 50 - 1:10
1998837.30      memtier-benchmark: Redis - 50 - 5:1
1922744.73      memtier-benchmark: Redis - 50 - 10:1
2223332.41      memtier-benchmark: Redis - 50 - 1:1
2346107.71      memtier-benchmark: Redis - 50 - 1:5
87.8            libxsmm: 32
178.3           libxsmm: 64
1185.0          compress-zstd: 19, Long Mode - Decompression Speed
6.54            compress-zstd: 19, Long Mode - Compression Speed
1324.3          compress-zstd: 3 - Decompression Speed
1676.8          compress-zstd: 3 - Compression Speed
1297.9          compress-zstd: 3, Long Mode - Decompression Speed
925.6           compress-zstd: 3, Long Mode - Compression Speed
1449.1          compress-zstd: 8, Long Mode - Decompression Speed
398.5           compress-zstd: 8, Long Mode - Compression Speed
1442.6          compress-zstd: 8 - Decompression Speed
385.7           compress-zstd: 8 - Compression Speed
339             pyperformance: 2to3
53              graphics-magick: Noise-Gaussian
36              graphics-magick: Enhanced
22              graphics-magick: Sharpen
178             graphics-magick: HWB Color Space
109             graphics-magick: Swirl
134             graphics-magick: Rotate
177             graphics-magick: Resizing
137205.3        openssl: RSA4096
2099.8          openssl: RSA4096
57.486          cpp-perf-bench: Stepanov Vector
280             pyperformance: go
44429.7         stream: Copy
12.311          vvenc: Bosphorus 1080p - Fast
48.864          cpp-perf-bench: Ctype
48.551          cpp-perf-bench: Atol
12.44           x265: Bosphorus 4K
183             pyperformance: regex_compile
5583.548        botan: AES-256 - Decrypt
5582.962        botan: AES-256
21.6            pyperformance: pathlib
377.434         botan: Blowfish - Decrypt
376.269         botan: Blowfish
323.036         botan: Twofish - Decrypt
318.025         botan: Twofish
714.727         botan: ChaCha20Poly1305 - Decrypt
731.346         botan: ChaCha20Poly1305
129.067         botan: CAST-256 - Decrypt
129.096         botan: CAST-256
82.502          botan: KASUMI - Decrypt
85.662          botan: KASUMI
661.60          scimark2: Composite
467             pyperformance: pickle_pure_python
52.3            pyperformance: django_template
25.850          cpp-perf-bench: Stepanov Abstraction
139             pyperformance: nbody
198.272065      tjbench: Decompression Throughput
128             pyperformance: chaos
3918.1          compress-lz4: 9 - Decompression Speed
34.37           compress-lz4: 9 - Compression Speed
4071.2          compress-lz4: 1 - Decompression Speed
663.36          compress-lz4: 1 - Compression Speed
120             pyperformance: float
45531.2         intel-mlc: Peak Injection Bandwidth - Stream-Triad Like
45514.7         intel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writes
45776.6         intel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writes
44591.2         intel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writes
40032.5         intel-mlc: Peak Injection Bandwidth - All Reads
27.24           x264: Bosphorus 4K
3711.9          compress-lz4: 3 - Decompression Speed
103.79          compress-lz4: 3 - Compression Speed
118             pyperformance: crypto_pyaes
22.4            pyperformance: json_loads
30.213          vvenc: Bosphorus 1080p - Faster
15.330          cpp-perf-bench: Function Objects
11.266          smallpt: Global Illumination Renderer; 128 Samples
55.19           x265: Bosphorus 1080p
123.2           intel-mlc: Idle Latency
114.66          x264: Bosphorus 1080p
10860           blogbench: Write
1025.13         scimark2: Jacobi Successive Over-Relaxation
1139.34         scimark2: Dense LU Matrix Factorization
646.94          scimark2: Sparse Matrix Multiply
355.55          scimark2: Fast Fourier Transform
141.03          scimark2: Monte Carlo
32617.2         stream: Add
32609.0         stream: Triad
30431.7         stream: Scale
libxsmm 2-1.17-3645, M N K: 256 (GFLOPS/s, More Is Better)
  r6a.4xlarge: 220.1 (SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
ClickHouse 22.12.3.5, 100M Rows Hits Dataset (Queries Per Minute, Geo Mean, More Is Better)
  Run                     r6a.4xlarge  SE +/-  N  MIN   MAX
  Third Run               172.07       1.39    9  7.03  7500
  Second Run              171.01       0.80    9  7.04  7500
  First Run / Cold Cache  165.32       1.62    9  6.95  7500
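As a rough illustration of how the three ClickHouse runs relate, the sketch below computes how far the warm-cache runs sit above the cold-cache first run. The values are copied from the ClickHouse results above; the helper name `percent_gain` is hypothetical and is not part of the Phoronix Test Suite.

```python
# Queries per minute (geometric mean) copied from the ClickHouse results above.
clickhouse_qpm = {
    "First Run / Cold Cache": 165.32,
    "Second Run": 171.01,
    "Third Run": 172.07,
}


def percent_gain(baseline: float, value: float) -> float:
    """Return how much larger `value` is than `baseline`, in percent."""
    return (value - baseline) / baseline * 100.0


cold = clickhouse_qpm["First Run / Cold Cache"]
for run, qpm in clickhouse_qpm.items():
    # The cold-cache run compares against itself and therefore shows +0.00%.
    print(f"{run}: {qpm:.2f} QPM ({percent_gain(cold, qpm):+.2f}% vs cold cache)")
```

With standard errors between 0.80 and 1.62 QPM on N = 9, the gap between the second and third runs is within the reported noise, so only the cold-versus-warm difference is worth reading much into.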
libxsmm 2-1.17-3645, M N K: 128 (GFLOPS/s, More Is Better)
  r6a.4xlarge: 260.7 (SE +/- 0.47, N = 3)
  1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
CppPerformanceBenchmarks 9, Test: Random Numbers (Seconds, Fewer Is Better)
  r6a.4xlarge: 870.32 (SE +/- 0.00, N = 3)
  1. (CXX) g++ options: -std=c++11 -O3
BlogBench 1.1, Test: Read (Final Score, More Is Better)
  r6a.4xlarge: 3303161 (SE +/- 11890.55, N = 3)
  1. (CC) gcc options: -O2
NCNN 20230517, Target: Vulkan GPU (ms, Fewer Is Better)
  Model               r6a.4xlarge  SE +/-  N   MIN    MAX
  FastestDet          2.66         0.03    15  2.48   4.91
  vision_transformer  96.79        0.06    15  95.96  174.71
  regnety_400m        6.33         0.01    15  6.13   12.51
  squeezenet_ssd      6.00         0.06    15  5.73   24.98
  yolov4-tiny         15.64        0.15    15  14.84  25.45
  mobilenetv2-yolov3  9.55         0.09    15  9.19   12.54
  resnet50            12.86        0.12    15  12.43  16.4
  alexnet             6.43         0.00    15  6.29   8.92
  resnet18            5.90         0.07    15  5.66   15.58
  vgg16               36.26        0.04    15  35.75  55.34
  googlenet           7.79         0.07    15  7.5    18.87
  blazeface           0.73         0.00    15  0.7    2.46
  efficientnet-b0     4.54         0.01    15  4.45   23.67
  mnasnet             2.24         0.01    15  2.18   12.13
  shufflenet-v2       2.01         0.01    15  1.93   11.76
  mobilenet-v3        2.05         0.00    15  2.01   2.37
  mobilenet-v2        2.38         0.00    15  2.33   7.46
  mobilenet           9.55         0.09    15  9.19   12.54
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
CppPerformanceBenchmarks 9, Test: Math Library (Seconds, Fewer Is Better)
  r6a.4xlarge: 293.18 (SE +/- 0.08, N = 3)
  1. (CXX) g++ options: -std=c++11 -O3
OpenSSL 3.1 (byte/s, More Is Better)
  Algorithm          r6a.4xlarge  SE +/-       N
  ChaCha20-Poly1305  24046419150  18263607.95  3
  AES-256-GCM        34776523753  5090891.00   3
  AES-128-GCM        37945733573  1654552.15   3
  ChaCha20           35343929890  5373039.12   3
  SHA512             3755914720   1360255.13   3
  SHA256             11165896017  11087062.81  3
  1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
VVenC 1.11, Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better)
  r6a.4xlarge: 3.777 (SE +/- 0.009, N = 3)
  1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Intel Memory Latency Checker 3.10, Test: Max Bandwidth (MB/s, More Is Better)
  Test                r6a.4xlarge  SE +/-  N
  Stream-Triad Like   45649.10     14.23   3
  1:1 Reads-Writes    45554.89     2.23    3
  2:1 Reads-Writes    46089.25     17.51   3
  3:1 Reads-Writes    45156.51     13.04   3
  All Reads           40264.71     0.69    3
Stockfish 16.1, Chess Benchmark (Nodes Per Second, More Is Better)
  r6a.4xlarge: 14950049 (SE +/- 200886.58, N = 3)
  1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
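The "SE +/-" figures are easier to compare across tests once they are expressed relative to the result itself. The following is a minimal sketch of that arithmetic, using two (mean, SE) pairs copied from this file; the function name `relative_standard_error` is a hypothetical helper, not something the Phoronix Test Suite provides.

```python
def relative_standard_error(mean: float, standard_error: float) -> float:
    """Return the standard error as a percentage of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return abs(standard_error / mean) * 100.0


# (mean, SE) pairs copied from the results in this file.
samples = {
    "Stockfish 16.1 (Nodes Per Second)": (14950049, 200886.58),
    "OpenSSL 3.1 AES-256-GCM (byte/s)": (34776523753, 5090891.00),
}

for name, (mean, se) in samples.items():
    print(f"{name}: SE is {relative_standard_error(mean, se):.3f}% of the mean")
```

On these numbers the Stockfish result carries a relative standard error of roughly 1.3%, while the OpenSSL AES-256-GCM result is below 0.02%, so small differences between systems deserve more caution for the former.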
CacheBench (MB/s, More Is Better)
  Test                   r6a.4xlarge  SE +/-  N  MIN       MAX
  Read / Modify / Write  102080.81    71.89   3  87920.86  108481.71
  Write                  51742.22     13.75   3  45737.39  54502.64
  Read                   9139.65      0.54    3  9137.65   9141.18
  1. (CC) gcc options: -O3 -lrt
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1 (Ops/sec, More Is Better)
  r6a.4xlarge: 1900547.15 (SE +/- 20065.63, N = 5)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
nginx 1.23.2 (Requests Per Second, More Is Better)
  Connections  r6a.4xlarge  SE +/-  N
  4000         46860.10     147.89  3
  1000         49459.27     34.11   3
  500          51769.47     32.55   3
  200          53546.88     35.41   3
  100          53527.79     26.49   3
  20           43467.35     39.93   3
  1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
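To see where nginx throughput peaks on this instance and how it falls off at other connection counts, the sketch below normalizes each result to the best one. The request rates are copied from the nginx results above; the variable names are illustrative only.

```python
# Requests per second keyed by concurrent connections, copied from the nginx results above.
nginx_rps = {
    20: 43467.35,
    100: 53527.79,
    200: 53546.88,
    500: 51769.47,
    1000: 49459.27,
    4000: 46860.10,
}

# Connection count with the highest measured throughput.
peak_connections = max(nginx_rps, key=nginx_rps.get)
peak_rps = nginx_rps[peak_connections]

for connections in sorted(nginx_rps):
    share = nginx_rps[connections] / peak_rps * 100.0
    print(f"{connections:>5} connections: {nginx_rps[connections]:.2f} req/s ({share:.1f}% of peak)")
```

The 100- and 200-connection results differ by less than their standard errors, so the peak is better read as a plateau across that range than as a single point.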
PyPerformance 1.0.0, Benchmark: raytrace (Milliseconds, Fewer Is Better)
  r6a.4xlarge: 528 (SE +/- 0.88, N = 3)
Zstd Compression 1.5.4, Compression Level: 12 - Decompression Speed (MB/s, More Is Better)
  r6a.4xlarge: 1473.7 (SE +/- 5.45, N = 4)
  1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 12 - Compression Speed (MB/s, More Is Better)
  r6a.4xlarge: 137.2 (SE +/- 1.52, N = 4)
  1. (CC) gcc options: -O3 -pthread -lz
PyPerformance 1.0.0, Benchmark: python_startup (Milliseconds, Fewer Is Better)
  r6a.4xlarge: 9.77 (SE +/- 0.01, N = 3)
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5 (Ops/sec, More Is Better)
  r6a.4xlarge: 2047586.15 (SE +/- 12254.86, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1 (Ops/sec, More Is Better)
  r6a.4xlarge: 1747045.85 (SE +/- 8786.86, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
VVenC 1.11, Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better)
  r6a.4xlarge: 8.746 (SE +/- 0.003, N = 3)
  1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
  r6a.4xlarge: 68.97 (SE +/- 0.05, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
  r6a.4xlarge: 1977046.15 (SE +/- 11457.17, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1 (Ops/sec, More Is Better)
  r6a.4xlarge: 1751865.28 (SE +/- 11207.74, N = 3)
  1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
NCNN 20230517, Target: CPU (ms, Fewer Is Better)
  Model               r6a.4xlarge  SE +/-  N  MIN    MAX
  FastestDet          2.56         0.02    3  2.47   2.83
  vision_transformer  96.68        0.22    3  96.04  153.77
  regnety_400m        6.32         0.01    3  6.15   6.69
  squeezenet_ssd      5.87         0.02    3  5.73   15.39
  yolov4-tiny         15.02        0.02    3  14.81  21.08
  mobilenetv2-yolov3  9.45         0.10    3  9.22   21.29
  resnet50            12.68        0.11    3  12.43  22.54
  alexnet             6.43         0.01    3  6.32   8.67
  resnet18            5.75         0.01    3  5.66   7.06
  vgg16               36.18        0.02    3  35.78  41.48
  googlenet           7.71         0.06    3  7.52   8.04
  blazeface           0.73         0.00    3  0.71   0.88
  efficientnet-b0     4.53         0.00    3  4.45   4.83
  mnasnet             2.23         0.00    3  2.18   2.4
  shufflenet-v2       1.99         0.00    3  1.94   2.17
  mobilenet-v3        2.05         0.00    3  2.01   2.21
  mobilenet-v2        2.39         0.00    3  2.33   3.3
  mobilenet           9.45         0.10    3  9.22   21.29
  1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
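The NCNN results are reported for both a "Vulkan GPU" target and a "CPU" target, and the two sets of timings are very close. The sketch below quantifies that gap for a handful of models. The timings are copied from the NCNN results in this file, the selection of models is arbitrary, and `percent_difference` is a hypothetical helper. The result file does not say why the two targets match so closely, and the code makes no attempt to explain it.

```python
# Mean inference time in ms as (Vulkan GPU target, CPU target), copied from
# the NCNN results in this file. Only a subset of the 18 models is listed.
ncnn_ms = {
    "FastestDet": (2.66, 2.56),
    "vision_transformer": (96.79, 96.68),
    "yolov4-tiny": (15.64, 15.02),
    "resnet18": (5.90, 5.75),
    "vgg16": (36.26, 36.18),
}


def percent_difference(vulkan_ms: float, cpu_ms: float) -> float:
    """Return how much slower (+) or faster (-) the Vulkan target is than the CPU target, in percent."""
    return (vulkan_ms - cpu_ms) / cpu_ms * 100.0


differences = {
    model: percent_difference(vulkan, cpu) for model, (vulkan, cpu) in ncnn_ms.items()
}
for model, diff in differences.items():
    print(f"{model}: Vulkan GPU target is {diff:+.2f}% relative to CPU target")
```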
Zstd Compression 1.5.4, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
  r6a.4xlarge: 1300.1 (SE +/- 5.40, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
  r6a.4xlarge: 11.9 (SE +/- 0.10, N = 3)
  1. (CC) gcc options: -O3 -pthread -lz
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10 [Ops/sec, More Is Better] r6a.4xlarge: 2096259.64 (SE +/- 18308.95, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1 [Ops/sec, More Is Better] r6a.4xlarge: 1961016.78 (SE +/- 4285.87, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5 [Ops/sec, More Is Better] r6a.4xlarge: 2248697.50 (SE +/- 16543.77, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1 [Ops/sec, More Is Better] r6a.4xlarge: 1892964.18 (SE +/- 5921.27, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1 [Ops/sec, More Is Better] r6a.4xlarge: 2081724.01 (SE +/- 4812.47, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10 [Ops/sec, More Is Better] r6a.4xlarge: 2199721.37 (SE +/- 11872.60, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1 [Ops/sec, More Is Better] r6a.4xlarge: 1998837.30 (SE +/- 5206.96, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1 [Ops/sec, More Is Better] r6a.4xlarge: 1922744.73 (SE +/- 9727.66, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1 [Ops/sec, More Is Better] r6a.4xlarge: 2223332.41 (SE +/- 4452.47, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Redis 7.0.12 + memtier_benchmark 2.0, Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5 [Ops/sec, More Is Better] r6a.4xlarge: 2346107.71 (SE +/- 4040.76, N = 3). (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
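The SE figures attached to the Redis results are easier to compare when expressed as a percentage of the mean. The sketch below is not part of the Phoronix Test Suite output; it copies the mean and SE values from the Redis results above into a dictionary (the names `redis_results` and `relative_standard_error` are illustrative assumptions) and computes SE / mean * 100 for each configuration.

```python
# Illustrative sketch only: values are copied from the Redis results above;
# the names used here are hypothetical, not part of the Phoronix Test Suite.

# (clients, set:get ratio) -> (mean ops/sec, standard error), N = 3 runs each
redis_results = {
    (100, "1:10"): (2096259.64, 18308.95),
    (100, "5:1"): (1961016.78, 4285.87),
    (100, "1:5"): (2248697.50, 16543.77),
    (100, "10:1"): (1892964.18, 5921.27),
    (100, "1:1"): (2081724.01, 4812.47),
    (50, "1:10"): (2199721.37, 11872.60),
    (50, "5:1"): (1998837.30, 5206.96),
    (50, "10:1"): (1922744.73, 9727.66),
    (50, "1:1"): (2223332.41, 4452.47),
    (50, "1:5"): (2346107.71, 4040.76),
}


def relative_standard_error(mean, se):
    """Return the standard error as a percentage of the mean."""
    return 100.0 * se / mean


if __name__ == "__main__":
    for (clients, ratio), (mean, se) in sorted(redis_results.items()):
        rse = relative_standard_error(mean, se)
        print(f"clients={clients:<3} set:get={ratio:<5} mean={mean:>12.2f} RSE={rse:.2f}%")
```

With only three runs per configuration, these percentages are a rough indication of run-to-run noise rather than a confidence interval.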
libxsmm 2-1.17-3645, M N K: 32 [GFLOPS/s, More Is Better] r6a.4xlarge: 87.8 (SE +/- 0.09, N = 3). (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
libxsmm 2-1.17-3645, M N K: 64 [GFLOPS/s, More Is Better] r6a.4xlarge: 178.3 (SE +/- 0.03, N = 3). (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2
Zstd Compression 1.5.4, Compression Level: 19, Long Mode - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 1185.0 (SE +/- 23.00, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 19, Long Mode - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 6.54 (SE +/- 0.02, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 3 - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 1324.3 (SE +/- 0.33, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 3 - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 1676.8 (SE +/- 11.76, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 3, Long Mode - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 1297.9 (SE +/- 57.69, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 3, Long Mode - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 925.6 (SE +/- 2.74, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 8, Long Mode - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 1449.1 (SE +/- 0.76, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 8, Long Mode - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 398.5 (SE +/- 0.58, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 8 - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 1442.6 (SE +/- 4.36, N = 3). (CC) gcc options: -O3 -pthread -lz
Zstd Compression 1.5.4, Compression Level: 8 - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 385.7 (SE +/- 0.66, N = 3). (CC) gcc options: -O3 -pthread -lz
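To put the Zstd compression speeds above in more concrete terms, the following sketch converts each MB/s figure into the approximate time needed to compress a fixed amount of data. It assumes the reported throughput is sustained for the whole input and that it carries over to other data, which real inputs may not satisfy; the 1000 MB input size and the names `zstd_compression_mb_s` and `seconds_to_compress` are illustrative assumptions, not part of the result file.

```python
# Illustrative sketch only: speeds are the Zstd 1.5.4 compression results
# listed above (MB/s); the 1000 MB input size is an arbitrary assumption.

zstd_compression_mb_s = {
    "3": 1676.8,
    "3, Long Mode": 925.6,
    "8": 385.7,
    "8, Long Mode": 398.5,
    "19, Long Mode": 6.54,
}


def seconds_to_compress(size_mb, speed_mb_s):
    """Estimate compression time assuming constant throughput."""
    return size_mb / speed_mb_s


if __name__ == "__main__":
    size_mb = 1000.0  # hypothetical input size
    for level, speed in zstd_compression_mb_s.items():
        t = seconds_to_compress(size_mb, speed)
        print(f"level {level:<14} {speed:>8.2f} MB/s  ~{t:8.2f} s per {size_mb:.0f} MB")
```

The estimate ignores I/O and any difference between the benchmark's sample file and real data, so it is only useful for comparing levels against each other.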
PyPerformance 1.0.0, Benchmark: 2to3 [Milliseconds, Fewer Is Better] r6a.4xlarge: 339 (SE +/- 0.00, N = 3)
GraphicsMagick 1.3.43, Operation: Noise-Gaussian [Iterations Per Minute, More Is Better] r6a.4xlarge: 53 (SE +/- 0.00, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: Enhanced [Iterations Per Minute, More Is Better] r6a.4xlarge: 36 (SE +/- 0.00, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: Sharpen [Iterations Per Minute, More Is Better] r6a.4xlarge: 22 (SE +/- 0.00, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: HWB Color Space [Iterations Per Minute, More Is Better] r6a.4xlarge: 178 (SE +/- 0.33, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: Swirl [Iterations Per Minute, More Is Better] r6a.4xlarge: 109 (SE +/- 0.00, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: Rotate [Iterations Per Minute, More Is Better] r6a.4xlarge: 134 (SE +/- 0.33, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
GraphicsMagick 1.3.43, Operation: Resizing [Iterations Per Minute, More Is Better] r6a.4xlarge: 177 (SE +/- 0.00, N = 3). (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp
OpenSSL 3.1, Algorithm: RSA4096 [verify/s, More Is Better] r6a.4xlarge: 137205.3 (SE +/- 3.52, N = 3). (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: RSA4096 [sign/s, More Is Better] r6a.4xlarge: 2099.8 (SE +/- 0.59, N = 3). (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
CppPerformanceBenchmarks 9, Test: Stepanov Vector [Seconds, Fewer Is Better] r6a.4xlarge: 57.49 (SE +/- 0.01, N = 3). (CXX) g++ options: -std=c++11 -O3
PyPerformance 1.0.0, Benchmark: go [Milliseconds, Fewer Is Better] r6a.4xlarge: 280 (SE +/- 0.33, N = 3)
Stream 2013-01-17, Type: Copy [MB/s, More Is Better] r6a.4xlarge: 44429.7 (SE +/- 6.15, N = 5). (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
VVenC 1.11, Video Input: Bosphorus 1080p - Video Preset: Fast [Frames Per Second, More Is Better] r6a.4xlarge: 12.31 (SE +/- 0.01, N = 3). (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
CppPerformanceBenchmarks 9, Test: Ctype [Seconds, Fewer Is Better] r6a.4xlarge: 48.86 (SE +/- 0.11, N = 3). (CXX) g++ options: -std=c++11 -O3
CppPerformanceBenchmarks 9, Test: Atol [Seconds, Fewer Is Better] r6a.4xlarge: 48.55 (SE +/- 0.02, N = 3). (CXX) g++ options: -std=c++11 -O3
x265 3.6, Video Input: Bosphorus 4K [Frames Per Second, More Is Better] r6a.4xlarge: 12.44 (SE +/- 0.08, N = 3). (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
PyPerformance 1.0.0, Benchmark: regex_compile [Milliseconds, Fewer Is Better] r6a.4xlarge: 183 (SE +/- 0.33, N = 3)
Botan 2.17.3, Test: AES-256 - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 5583.55 (SE +/- 5.64, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: AES-256 [MiB/s, More Is Better] r6a.4xlarge: 5582.96 (SE +/- 6.18, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
PyPerformance 1.0.0, Benchmark: pathlib [Milliseconds, Fewer Is Better] r6a.4xlarge: 21.6 (SE +/- 0.00, N = 3)
Botan 2.17.3, Test: Blowfish - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 377.43 (SE +/- 0.03, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Blowfish [MiB/s, More Is Better] r6a.4xlarge: 376.27 (SE +/- 0.07, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Twofish - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 323.04 (SE +/- 0.14, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: Twofish [MiB/s, More Is Better] r6a.4xlarge: 318.03 (SE +/- 0.10, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 714.73 (SE +/- 0.93, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: ChaCha20Poly1305 [MiB/s, More Is Better] r6a.4xlarge: 731.35 (SE +/- 0.44, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: CAST-256 - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 129.07 (SE +/- 0.01, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: CAST-256 [MiB/s, More Is Better] r6a.4xlarge: 129.10 (SE +/- 0.00, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: KASUMI - Decrypt [MiB/s, More Is Better] r6a.4xlarge: 82.50 (SE +/- 0.00, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
Botan 2.17.3, Test: KASUMI [MiB/s, More Is Better] r6a.4xlarge: 85.66 (SE +/- 0.01, N = 3). (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt
SciMark 2.0, Computational Test: Composite [Mflops, More Is Better] r6a.4xlarge: 661.60 (SE +/- 1.31, N = 3). (CC) gcc options: -lm
PyPerformance 1.0.0, Benchmark: pickle_pure_python [Milliseconds, Fewer Is Better] r6a.4xlarge: 467 (SE +/- 0.58, N = 3)
PyPerformance 1.0.0, Benchmark: django_template [Milliseconds, Fewer Is Better] r6a.4xlarge: 52.3 (SE +/- 0.03, N = 3)
CppPerformanceBenchmarks 9, Test: Stepanov Abstraction [Seconds, Fewer Is Better] r6a.4xlarge: 25.85 (SE +/- 0.00, N = 3). (CXX) g++ options: -std=c++11 -O3
PyPerformance 1.0.0, Benchmark: nbody [Milliseconds, Fewer Is Better] r6a.4xlarge: 139 (SE +/- 0.33, N = 3)
libjpeg-turbo tjbench 2.1.0, Test: Decompression Throughput [Megapixels/sec, More Is Better] r6a.4xlarge: 198.27 (SE +/- 0.20, N = 3). (CC) gcc options: -O3 -rdynamic
PyPerformance 1.0.0, Benchmark: chaos [Milliseconds, Fewer Is Better] r6a.4xlarge: 128 (SE +/- 0.58, N = 3)
LZ4 Compression 1.9.4, Compression Level: 9 - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 3918.1 (SE +/- 2.08, N = 3). (CC) gcc options: -O3
LZ4 Compression 1.9.4, Compression Level: 9 - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 34.37 (SE +/- 0.00, N = 3). (CC) gcc options: -O3
LZ4 Compression 1.9.4, Compression Level: 1 - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 4071.2 (SE +/- 0.37, N = 3). (CC) gcc options: -O3
LZ4 Compression 1.9.4, Compression Level: 1 - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 663.36 (SE +/- 0.41, N = 3). (CC) gcc options: -O3
PyPerformance 1.0.0, Benchmark: float [Milliseconds, Fewer Is Better] r6a.4xlarge: 120 (SE +/- 0.00, N = 3)
Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - Stream-Triad Like [MB/s, More Is Better] r6a.4xlarge: 45531.2 (SE +/- 16.95, N = 3)
Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 1:1 Reads-Writes [MB/s, More Is Better] r6a.4xlarge: 45514.7 (SE +/- 0.95, N = 3)
Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 2:1 Reads-Writes [MB/s, More Is Better] r6a.4xlarge: 45776.6 (SE +/- 8.42, N = 3)
Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 3:1 Reads-Writes [MB/s, More Is Better] r6a.4xlarge: 44591.2 (SE +/- 28.36, N = 3)
Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - All Reads [MB/s, More Is Better] r6a.4xlarge: 40032.5 (SE +/- 4.34, N = 3)
x264 2022-02-22, Video Input: Bosphorus 4K [Frames Per Second, More Is Better] r6a.4xlarge: 27.24 (SE +/- 0.01, N = 3). (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -flto
LZ4 Compression 1.9.4, Compression Level: 3 - Decompression Speed [MB/s, More Is Better] r6a.4xlarge: 3711.9 (SE +/- 0.15, N = 3). (CC) gcc options: -O3
LZ4 Compression 1.9.4, Compression Level: 3 - Compression Speed [MB/s, More Is Better] r6a.4xlarge: 103.79 (SE +/- 0.00, N = 3). (CC) gcc options: -O3
PyPerformance 1.0.0, Benchmark: crypto_pyaes [Milliseconds, Fewer Is Better] r6a.4xlarge: 118 (SE +/- 0.00, N = 3)
PyPerformance 1.0.0, Benchmark: json_loads [Milliseconds, Fewer Is Better] r6a.4xlarge: 22.4 (SE +/- 0.03, N = 3)
VVenC 1.11, Video Input: Bosphorus 1080p - Video Preset: Faster [Frames Per Second, More Is Better] r6a.4xlarge: 30.21 (SE +/- 0.05, N = 3). (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
CppPerformanceBenchmarks 9, Test: Function Objects [Seconds, Fewer Is Better] r6a.4xlarge: 15.33 (SE +/- 0.00, N = 3). (CXX) g++ options: -std=c++11 -O3
Smallpt 1.0, Global Illumination Renderer; 128 Samples [Seconds, Fewer Is Better] r6a.4xlarge: 11.27 (SE +/- 0.02, N = 3). (CXX) g++ options: -fopenmp -O3
x265 3.6, Video Input: Bosphorus 1080p [Frames Per Second, More Is Better] r6a.4xlarge: 55.19 (SE +/- 0.26, N = 3). (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl
Intel Memory Latency Checker 3.10, Test: Idle Latency [ns, Fewer Is Better] r6a.4xlarge: 123.2 (SE +/- 0.00, N = 3)
x264 2022-02-22, Video Input: Bosphorus 1080p [Frames Per Second, More Is Better] r6a.4xlarge: 114.66 (SE +/- 0.09, N = 3). (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -flto
BlogBench 1.1, Test: Write [Final Score, More Is Better] r6a.4xlarge: 10860 (SE +/- 95.65, N = 3). (CC) gcc options: -O2
SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation [Mflops, More Is Better] r6a.4xlarge: 1025.13 (SE +/- 0.08, N = 3). (CC) gcc options: -lm
SciMark 2.0, Computational Test: Dense LU Matrix Factorization [Mflops, More Is Better] r6a.4xlarge: 1139.34 (SE +/- 0.77, N = 3). (CC) gcc options: -lm
SciMark 2.0, Computational Test: Sparse Matrix Multiply [Mflops, More Is Better] r6a.4xlarge: 646.94 (SE +/- 0.48, N = 3). (CC) gcc options: -lm
SciMark 2.0, Computational Test: Fast Fourier Transform [Mflops, More Is Better] r6a.4xlarge: 355.55 (SE +/- 6.43, N = 3). (CC) gcc options: -lm
SciMark 2.0, Computational Test: Monte Carlo [Mflops, More Is Better] r6a.4xlarge: 141.03 (SE +/- 0.39, N = 3). (CC) gcc options: -lm
Stream 2013-01-17, Type: Add [MB/s, More Is Better] r6a.4xlarge: 32617.2 (SE +/- 7.60, N = 5). (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Stream 2013-01-17, Type: Triad [MB/s, More Is Better] r6a.4xlarge: 32609.0 (SE +/- 2.87, N = 5). (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Stream 2013-01-17, Type: Scale [MB/s, More Is Better] r6a.4xlarge: 30431.7 (SE +/- 3.06, N = 5). (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp
Phoronix Test Suite v10.8.4