r7a.4xlarge

amazon testing on Amazon Linux 2023.4.20240401 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2404102-NE-R7A4XLARG55&grw.

r7a.4xlarge system details

Processor: AMD EPYC 9R14 (16 Cores)
Motherboard: Amazon EC2 r7a.4xlarge (1.0 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 1 x 128 GB DDR5-4800MT/s
Disk: 215GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Amazon Linux 2023.4.20240401
Kernel: 6.1.82-99.168.amzn2023.x86_64 (x86_64)
Compiler: GCC 11.4.1 20230605
File-System: xfs
System Layer: amazon

- Transparent Huge Pages: madvise
- --build=x86_64-amazon-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver
- CPU Microcode: 0xa10113e
- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

r7a.4xlargescimark2: Compositeintel-mlc: Peak Injection Bandwidth - All Readsscimark2: Monte Carlocompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedintel-mlc: Peak Injection Bandwidth - Stream-Triad Likescimark2: Sparse Matrix Multiplyintel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writesscimark2: Fast Fourier Transformcompress-lz4: 3 - Compression Speedintel-mlc: Idle Latencyintel-mlc: Max Bandwidth - 1:1 Reads-Writesintel-mlc: Max Bandwidth - Stream-Triad Likeintel-mlc: Max Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - 3:1 Reads-Writesintel-mlc: Max Bandwidth - All Readsscimark2: Jacobi Successive Over-Relaxationintel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writesscimark2: Dense LU Matrix Factorizationcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedcpp-perf-bench: Atolcpp-perf-bench: Ctypecpp-perf-bench: Math Librarycpp-perf-bench: Rand Numberscpp-perf-bench: Stepanov Vectorcpp-perf-bench: Function Objectscpp-perf-bench: Stepanov Abstractionstream: Copystream: Scalestream: Triadstream: Addbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptcachebench: Readcachebench: Writecachebench: Read / Modify / Writetjbench: Decompression Throughputncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDetncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - 
mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - vision_transformerncnn: Vulkan GPU - FastestDetstockfish: Chess Benchmarkcompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 12 - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacex264: Bosphorus 4Kx264: Bosphorus 1080px265: Bosphorus 4Kx265: Bosphorus 1080pc-ray: Total Time - 4K, 16 Rays Per Pixelvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fastervvenc: Bosphorus 1080p - Fastvvenc: Bosphorus 1080p - Fastersmallpt: Global Illumination Renderer; 128 Samplespyperformance: gopyperformance: 2to3pyperformance: chaospyperformance: floatpyperformance: nbodypyperformance: pathlibpyperformance: raytracepyperformance: json_loadspyperformance: crypto_pyaespyperformance: regex_compilepyperformance: python_startuppyperformance: django_templatepyperformance: pickle_pure_pythonblogbench: Readblogbench: Writenginx: 20nginx: 100nginx: 200nginx: 500nginx: 1000nginx: 
4000openssl: SHA256openssl: SHA512openssl: RSA4096openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305libxsmm: 32libxsmm: 64libxsmm: 128libxsmm: 256clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000influxdb: 1024 - 10000 - 2,5000,1 - 10000memtier-benchmark: Redis - 50 - 1:1memtier-benchmark: Redis - 50 - 1:5memtier-benchmark: Redis - 50 - 5:1memtier-benchmark: Redis - 100 - 1:1memtier-benchmark: Redis - 100 - 1:5memtier-benchmark: Redis - 100 - 5:1memtier-benchmark: Redis - 50 - 10:1memtier-benchmark: Redis - 50 - 1:10memtier-benchmark: Redis - 500 - 1:1memtier-benchmark: Redis - 500 - 1:5memtier-benchmark: Redis - 500 - 5:1memtier-benchmark: Redis - 100 - 10:1memtier-benchmark: Redis - 100 - 1:10memtier-benchmark: Redis - 500 - 10:1memtier-benchmark: Redis - 500 - 1:10gmpbench: 
r7a.4xlarge704.6578570.5161.44700.934496.2100295.7689.07114801.6451.45108.67148.2129618.30100462.96114805.61107844.6178588.431056.57107868.9129565.91164.704068.135.034254.938.11648.768256.037849.49854.46214.19624.587107123.776463.480223.580140.088.13985.0385875.9135872.756327.219347.044390.849389.570133.043133.054735.853723.3079396.34926454465.148062107392.718184219.12211613.085.595.565.795.036.652.3114.0620.947.695.5913.9913.0820.1611.8411.6363.917.2713.125.535.575.785.066.652.3214.0521.007.705.6014.0113.1220.1411.9511.6763.867.45251613602744.01489.9671.61627.9208.11670.413.81414.41034.71515.3721.91637.97.031323.616512443532936120442.43174.3415.0080.8134.1715.64312.84019.07441.8277.65823028298.910612622.145618.21021358.9943.838724111191042256519.7963409.2664512.3365452.7564799.3462897.2316675650953779411568711167.7268824.18539755269718595212365316033785337060378824160214.1378.5630.1762.2258.02272.63275.211579445.62183799.92200230.12797827.942997062.312553261.192469143.212646865.742354124.762431915.602740324.852283141.902419810.582055619.682119598.942434639.142019821.662291753.10OpenBenchmarking.org

SciMark

Computational Test: Composite

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
r7a.4xlarge: 704.65 (SE +/- 0.24, N = 3)
1. (CC) gcc options: -lm
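Each result in this export is reported as a mean together with a standard error (SE) over N runs. As a rough reading aid, the sketch below expresses the SE as a percentage of the mean so that results with very different units can be compared for run-to-run noise. The helper name `relative_se` is hypothetical and is not part of the Phoronix Test Suite; the two sample values are copied from results in this file (the SciMark composite above and the Stockfish result further down).

```python
def relative_se(mean: float, se: float) -> float:
    """Return the standard error as a percentage of the mean."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return 100.0 * se / abs(mean)


# (mean, SE) pairs copied from this result file.
results = {
    "SciMark 2.0 Composite (Mflops)": (704.65, 0.24),
    "Stockfish 16.1 (Nodes Per Second)": (25161360, 249740.35),
}

for name, (mean, se) in results.items():
    print(f"{name}: {relative_se(mean, se):.2f}% relative SE")
```

A small percentage suggests the runs agreed closely; with N = 3 this is only a coarse indicator, not a confidence interval.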

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - All Reads

Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - All Reads (MB/s, More Is Better)
r7a.4xlarge: 78570.5 (SE +/- 2.52, N = 3)

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
r7a.4xlarge: 161.44 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -lm

LZ4 Compression

Compression Level: 1 - Compression Speed

LZ4 Compression 1.9.4, Compression Level: 1 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 700.93 (SE +/- 0.40, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression 1.9.4, Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 4496.2 (SE +/- 1.82, N = 3)
1. (CC) gcc options: -O3

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - Stream-Triad Like

Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - Stream-Triad Like (MB/s, More Is Better)
r7a.4xlarge: 100295.7 (SE +/- 2.45, N = 3)

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
r7a.4xlarge: 689.07 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -lm

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 2:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 2:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 114801.6 (SE +/- 10.14, N = 3)

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
r7a.4xlarge: 451.45 (SE +/- 1.58, N = 3)
1. (CC) gcc options: -lm

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.4, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 108.67 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3

Intel Memory Latency Checker

Test: Idle Latency

Intel Memory Latency Checker 3.10, Test: Idle Latency (ns, Fewer Is Better)
r7a.4xlarge: 148.2 (SE +/- 0.03, N = 3)

Intel Memory Latency Checker

Test: Max Bandwidth - 1:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Max Bandwidth - 1:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 129618.30 (SE +/- 5.87, N = 3)

Intel Memory Latency Checker

Test: Max Bandwidth - Stream-Triad Like

Intel Memory Latency Checker 3.10, Test: Max Bandwidth - Stream-Triad Like (MB/s, More Is Better)
r7a.4xlarge: 100462.96 (SE +/- 55.45, N = 3)

Intel Memory Latency Checker

Test: Max Bandwidth - 2:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Max Bandwidth - 2:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 114805.61 (SE +/- 17.93, N = 3)

Intel Memory Latency Checker

Test: Max Bandwidth - 3:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Max Bandwidth - 3:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 107844.61 (SE +/- 25.08, N = 3)

Intel Memory Latency Checker

Test: Max Bandwidth - All Reads

Intel Memory Latency Checker 3.10, Test: Max Bandwidth - All Reads (MB/s, More Is Better)
r7a.4xlarge: 78588.43 (SE +/- 8.62, N = 3)

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
r7a.4xlarge: 1056.57 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -lm

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 3:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 3:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 107868.9 (SE +/- 2.17, N = 3)

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 1:1 Reads-Writes

Intel Memory Latency Checker 3.10, Test: Peak Injection Bandwidth - 1:1 Reads-Writes (MB/s, More Is Better)
r7a.4xlarge: 129565.9 (SE +/- 32.50, N = 3)

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
r7a.4xlarge: 1164.70 (SE +/- 0.53, N = 3)
1. (CC) gcc options: -lm

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.4, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 4068.1 (SE +/- 0.99, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.4, Compression Level: 9 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 35.03 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.4, Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 4254.9 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -O3

CppPerformanceBenchmarks

Test: Atol

CppPerformanceBenchmarks 9, Test: Atol (Seconds, Fewer Is Better)
r7a.4xlarge: 38.12 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Ctype

CppPerformanceBenchmarks 9, Test: Ctype (Seconds, Fewer Is Better)
r7a.4xlarge: 48.77 (SE +/- 0.38, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Math Library

CppPerformanceBenchmarks 9, Test: Math Library (Seconds, Fewer Is Better)
r7a.4xlarge: 256.04 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Random Numbers

CppPerformanceBenchmarks 9, Test: Random Numbers (Seconds, Fewer Is Better)
r7a.4xlarge: 849.50 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Stepanov Vector

CppPerformanceBenchmarks 9, Test: Stepanov Vector (Seconds, Fewer Is Better)
r7a.4xlarge: 54.46 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Function Objects

CppPerformanceBenchmarks 9, Test: Function Objects (Seconds, Fewer Is Better)
r7a.4xlarge: 14.20 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

CppPerformanceBenchmarks

Test: Stepanov Abstraction

CppPerformanceBenchmarks 9, Test: Stepanov Abstraction (Seconds, Fewer Is Better)
r7a.4xlarge: 24.59 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -std=c++11 -O3

Stream

Type: Copy

Stream 2013-01-17, Type: Copy (MB/s, More Is Better)
r7a.4xlarge: 107123.7 (SE +/- 64.34, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Scale

Stream 2013-01-17, Type: Scale (MB/s, More Is Better)
r7a.4xlarge: 76463.4 (SE +/- 11.62, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Triad

Stream 2013-01-17, Type: Triad (MB/s, More Is Better)
r7a.4xlarge: 80223.5 (SE +/- 6.48, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Stream

Type: Add

Stream 2013-01-17, Type: Add (MB/s, More Is Better)
r7a.4xlarge: 80140.0 (SE +/- 5.77, N = 5)
1. (CC) gcc options: -mcmodel=medium -O3 -march=native -fopenmp

Botan

Test: KASUMI

Botan 2.17.3, Test: KASUMI (MiB/s, More Is Better)
r7a.4xlarge: 88.14 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3, Test: KASUMI - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 85.04 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3, Test: AES-256 (MiB/s, More Is Better)
r7a.4xlarge: 5875.91 (SE +/- 13.42, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

Botan 2.17.3, Test: AES-256 - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 5872.76 (SE +/- 11.26, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3, Test: Twofish (MiB/s, More Is Better)
r7a.4xlarge: 327.22 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3, Test: Twofish - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 347.04 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3, Test: Blowfish (MiB/s, More Is Better)
r7a.4xlarge: 390.85 (SE +/- 0.16, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3, Test: Blowfish - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 389.57 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3, Test: CAST-256 (MiB/s, More Is Better)
r7a.4xlarge: 133.04 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3, Test: CAST-256 - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 133.05 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3, Test: ChaCha20Poly1305 (MiB/s, More Is Better)
r7a.4xlarge: 735.85 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3, Test: ChaCha20Poly1305 - Decrypt (MiB/s, More Is Better)
r7a.4xlarge: 723.31 (SE +/- 0.53, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

CacheBench

Test: Read

CacheBench, Test: Read (MB/s, More Is Better)
r7a.4xlarge: 9396.35 (SE +/- 0.12, N = 3; MIN: 9395.38 / MAX: 9397.09)
1. (CC) gcc options: -O3 -lrt

CacheBench

Test: Write

CacheBench, Test: Write (MB/s, More Is Better)
r7a.4xlarge: 54465.15 (SE +/- 146.20, N = 3; MIN: 42462.43 / MAX: 56019.17)
1. (CC) gcc options: -O3 -lrt

CacheBench

Test: Read / Modify / Write

CacheBench, Test: Read / Modify / Write (MB/s, More Is Better)
r7a.4xlarge: 107392.72 (SE +/- 37.75, N = 3; MIN: 81119.54 / MAX: 111987.81)
1. (CC) gcc options: -O3 -lrt

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.1.0, Test: Decompression Throughput (Megapixels/sec, More Is Better)
r7a.4xlarge: 219.12 (SE +/- 0.74, N = 3)
1. (CC) gcc options: -O3 -rdynamic

NCNN

Target: CPU - Model: mobilenet

NCNN 20230517, Target: CPU - Model: mobilenet (ms, Fewer Is Better)
r7a.4xlarge: 13.08 (SE +/- 0.13, N = 15; MIN: 11.76 / MAX: 23.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517, Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
r7a.4xlarge: 5.59 (SE +/- 0.04, N = 15; MIN: 5.18 / MAX: 52)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517, Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
r7a.4xlarge: 5.56 (SE +/- 0.00, N = 15; MIN: 5.41 / MAX: 7.78)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20230517, Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
r7a.4xlarge: 5.79 (SE +/- 0.02, N = 15; MIN: 5.67 / MAX: 14.87)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20230517, Target: CPU - Model: mnasnet (ms, Fewer Is Better)
r7a.4xlarge: 5.03 (SE +/- 0.00, N = 15; MIN: 4.94 / MAX: 5.52)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20230517, Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
r7a.4xlarge: 6.65 (SE +/- 0.01, N = 15; MIN: 6.52 / MAX: 16.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20230517, Target: CPU - Model: blazeface (ms, Fewer Is Better)
r7a.4xlarge: 2.31 (SE +/- 0.00, N = 15; MIN: 2.25 / MAX: 2.74)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20230517, Target: CPU - Model: googlenet (ms, Fewer Is Better)
r7a.4xlarge: 14.06 (SE +/- 0.02, N = 15; MIN: 12.15 / MAX: 32.84)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20230517, Target: CPU - Model: vgg16 (ms, Fewer Is Better)
r7a.4xlarge: 20.94 (SE +/- 0.05, N = 15; MIN: 20.49 / MAX: 39.13)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20230517, Target: CPU - Model: resnet18 (ms, Fewer Is Better)
r7a.4xlarge: 7.69 (SE +/- 0.01, N = 15; MIN: 7.49 / MAX: 17.39)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20230517, Target: CPU - Model: alexnet (ms, Fewer Is Better)
r7a.4xlarge: 5.59 (SE +/- 0.00, N = 15; MIN: 5.5 / MAX: 8.06)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20230517, Target: CPU - Model: resnet50 (ms, Fewer Is Better)
r7a.4xlarge: 13.99 (SE +/- 0.02, N = 15; MIN: 13.67 / MAX: 32.79)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3

NCNN 20230517, Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 (ms, Fewer Is Better)
r7a.4xlarge: 13.08 (SE +/- 0.13, N = 15; MIN: 11.76 / MAX: 23.59)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20230517, Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
r7a.4xlarge: 20.16 (SE +/- 0.06, N = 15; MIN: 19.34 / MAX: 29.72)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20230517, Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
r7a.4xlarge: 11.84 (SE +/- 0.10, N = 15; MIN: 10.67 / MAX: 20.85)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20230517, Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
r7a.4xlarge: 11.63 (SE +/- 0.02, N = 15; MIN: 11.4 / MAX: 30.24)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

NCNN 20230517, Target: CPU - Model: vision_transformer (ms, Fewer Is Better)
r7a.4xlarge: 63.91 (SE +/- 0.09, N = 15; MIN: 62.34 / MAX: 146)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

NCNN 20230517, Target: CPU - Model: FastestDet (ms, Fewer Is Better)
r7a.4xlarge: 7.27 (SE +/- 0.20, N = 15; MIN: 4.83 / MAX: 16.33)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20230517, Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
r7a.4xlarge: 13.12 (SE +/- 0.12, N = 15; MIN: 11.83 / MAX: 32.23)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20230517, Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
r7a.4xlarge: 5.53 (SE +/- 0.00, N = 15; MIN: 5.17 / MAX: 11.01)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20230517, Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
r7a.4xlarge: 5.57 (SE +/- 0.02, N = 15; MIN: 5.41 / MAX: 14.73)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20230517, Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
r7a.4xlarge: 5.78 (SE +/- 0.00, N = 15; MIN: 5.67 / MAX: 24.72)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20230517, Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
r7a.4xlarge: 5.06 (SE +/- 0.02, N = 15; MIN: 4.98 / MAX: 13.2)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20230517, Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
r7a.4xlarge: 6.65 (SE +/- 0.01, N = 15; MIN: 6.55 / MAX: 11.49)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20230517, Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
r7a.4xlarge: 2.32 (SE +/- 0.00, N = 15; MIN: 2.27 / MAX: 2.69)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20230517, Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
r7a.4xlarge: 14.05 (SE +/- 0.01, N = 15; MIN: 13.82 / MAX: 23.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20230517, Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
r7a.4xlarge: 21.00 (SE +/- 0.10, N = 15; MIN: 20.45 / MAX: 41.28)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20230517, Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
r7a.4xlarge: 7.70 (SE +/- 0.03, N = 15; MIN: 7.43 / MAX: 18.54)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20230517, Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
r7a.4xlarge: 5.60 (SE +/- 0.01, N = 15; MIN: 5.51 / MAX: 13.3)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20230517, Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
r7a.4xlarge: 14.01 (SE +/- 0.03, N = 15; MIN: 13.68 / MAX: 23.92)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3

NCNN 20230517, Target: Vulkan GPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 (ms, Fewer Is Better)
r7a.4xlarge: 13.12 (SE +/- 0.12, N = 15; MIN: 11.83 / MAX: 32.23)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20230517, Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
r7a.4xlarge: 20.14 (SE +/- 0.08, N = 15; MIN: 19.09 / MAX: 29.95)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20230517, Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
r7a.4xlarge: 11.95 (SE +/- 0.05, N = 15; MIN: 11.09 / MAX: 30.19)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20230517, Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
r7a.4xlarge: 11.67 (SE +/- 0.03, N = 15; MIN: 11.45 / MAX: 31.62)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vision_transformer

NCNN 20230517, Target: Vulkan GPU - Model: vision_transformer (ms, Fewer Is Better)
r7a.4xlarge: 63.86 (SE +/- 0.10, N = 15; MIN: 61.65 / MAX: 160.05)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: FastestDet

NCNN 20230517, Target: Vulkan GPU - Model: FastestDet (ms, Fewer Is Better)
r7a.4xlarge: 7.45 (SE +/- 0.20, N = 15; MIN: 5.1 / MAX: 16.82)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stockfish

Chess Benchmark

Stockfish 16.1, Chess Benchmark (Nodes Per Second, More Is Better)
r7a.4xlarge: 25161360 (SE +/- 249740.35, N = 3)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.5.4, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 2744.0 (SE +/- 5.96, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1489.9 (SE +/- 1.05, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.5.4, Compression Level: 8 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 671.6 (SE +/- 1.97, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 8 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1627.9 (SE +/- 2.02, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 12 - Compression Speed

Zstd Compression 1.5.4, Compression Level: 12 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 208.1 (SE +/- 0.43, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 12 - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 12 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1670.4 (SE +/- 3.18, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.4, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 13.8 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1414.4 (SE +/- 1.48, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.5.4, Compression Level: 3, Long Mode - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 1034.7 (SE +/- 0.75, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 3, Long Mode - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1515.3 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.5.4, Compression Level: 8, Long Mode - Compression Speed (MB/s, More Is Better)
r7a.4xlarge: 721.9 (SE +/- 0.75, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.5.4, Compression Level: 8, Long Mode - Decompression Speed (MB/s, More Is Better)
r7a.4xlarge: 1637.9 (SE +/- 1.00, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
r7a.4xlarge: 7.03 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.4 (MB/s, More Is Better)
r7a.4xlarge: 1323.6 (SE +/- 4.36, N = 3)
1. (CC) gcc options: -O3 -pthread -lz
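
Each result above reports a mean together with a standard error (SE) over N runs. One way to compare run-to-run noise across tests with very different magnitudes is to express the SE as a percentage of the mean. The following is a minimal sketch, not part of the Phoronix Test Suite output; the helper name `relative_se_percent` is hypothetical, and the numbers are copied from a subset of the Zstd results above.

```python
# Values copied from the Zstd results above: (label, mean MB/s, standard error).
zstd_results = [
    ("Level 12 - Compression", 208.1, 0.43),
    ("Level 12 - Decompression", 1670.4, 3.18),
    ("Level 19 - Compression", 13.8, 0.09),
    ("Level 19 - Decompression", 1414.4, 1.48),
    ("Level 19, Long Mode - Compression", 7.03, 0.01),
    ("Level 19, Long Mode - Decompression", 1323.6, 4.36),
]


def relative_se_percent(mean: float, se: float) -> float:
    """Standard error as a percentage of the mean (hypothetical helper)."""
    return 100.0 * se / mean


for label, mean, se in zstd_results:
    print(f"{label}: {mean} MB/s, SE = {relative_se_percent(mean, se):.2f}% of mean")
```

For the rows listed here, the relative SE stays below 1% of the mean.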

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 165 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 124 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 43 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 53 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 293 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 61 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.43 (Iterations Per Minute, More Is Better)
r7a.4xlarge: 204 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lz -lm -lpthread -lgomp

x264

Video Input: Bosphorus 4K

x264 2022-02-22 (Frames Per Second, More Is Better)
r7a.4xlarge: 42.43 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -flto

x264

Video Input: Bosphorus 1080p

x264 2022-02-22 (Frames Per Second, More Is Better)
r7a.4xlarge: 174.34 (SE +/- 0.94, N = 3)
1. (CC) gcc options: -ldl -m64 -lm -lpthread -O3 -flto

x265

Video Input: Bosphorus 4K

x265 3.6 (Frames Per Second, More Is Better)
r7a.4xlarge: 15.00 (SE +/- 0.15, N = 5)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

x265

Video Input: Bosphorus 1080p

x265 3.6 (Frames Per Second, More Is Better)
r7a.4xlarge: 80.81 (SE +/- 0.75, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
r7a.4xlarge: 34.17 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -lm -lpthread -O3

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

VVenC 1.11 (Frames Per Second, More Is Better)
r7a.4xlarge: 5.643 (SE +/- 0.007, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

VVenC 1.11 (Frames Per Second, More Is Better)
r7a.4xlarge: 12.84 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

VVenC 1.11 (Frames Per Second, More Is Better)
r7a.4xlarge: 19.07 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

VVenC 1.11 (Frames Per Second, More Is Better)
r7a.4xlarge: 41.83 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
r7a.4xlarge: 7.658 (SE +/- 0.012, N = 3)
1. (CXX) g++ options: -fopenmp -O3

PyPerformance

Benchmark: go

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 230 (SE +/- 0.33, N = 3)

PyPerformance

Benchmark: 2to3

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 282 (SE +/- 0.00, N = 3)

PyPerformance

Benchmark: chaos

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 98.9 (SE +/- 0.06, N = 3)

PyPerformance

Benchmark: float

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 106 (SE +/- 0.00, N = 3)

PyPerformance

Benchmark: nbody

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 126 (SE +/- 0.33, N = 3)

PyPerformance

Benchmark: pathlib

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 22.1 (SE +/- 0.00, N = 3)

PyPerformance

Benchmark: raytrace

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 456 (SE +/- 1.53, N = 3)

PyPerformance

Benchmark: json_loads

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 18.2 (SE +/- 0.03, N = 3)

PyPerformance

Benchmark: crypto_pyaes

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 102 (SE +/- 0.33, N = 3)

PyPerformance

Benchmark: regex_compile

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 135 (SE +/- 0.33, N = 3)

PyPerformance

Benchmark: python_startup

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 8.99 (SE +/- 0.00, N = 3)

PyPerformance

Benchmark: django_template

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 43.8 (SE +/- 0.10, N = 3)

PyPerformance

Benchmark: pickle_pure_python

PyPerformance 1.0.0 (Milliseconds, Fewer Is Better)
r7a.4xlarge: 387 (SE +/- 0.88, N = 3)

BlogBench

Test: Read

BlogBench 1.1 (Final Score, More Is Better)
r7a.4xlarge: 2411119 (SE +/- 5626.74, N = 3)
1. (CC) gcc options: -O2

BlogBench

Test: Write

BlogBench 1.1 (Final Score, More Is Better)
r7a.4xlarge: 10422 (SE +/- 53.23, N = 3)
1. (CC) gcc options: -O2

nginx

Connections: 20

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 56519.79 (SE +/- 168.76, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 100

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 63409.26 (SE +/- 111.94, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 200

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 64512.33 (SE +/- 434.30, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 500

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 65452.75 (SE +/- 895.55, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 64799.34 (SE +/- 150.44, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 4000

nginx 1.23.2 (Requests Per Second, More Is Better)
r7a.4xlarge: 62897.23 (SE +/- 256.39, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenSSL

Algorithm: SHA256

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 16675650953 (SE +/- 8087664.52, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 7794115687 (SE +/- 7636541.01, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (sign/s, More Is Better)
r7a.4xlarge: 11167.7 (SE +/- 2.23, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (verify/s, More Is Better)
r7a.4xlarge: 268824.1 (SE +/- 67.94, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 85397552697 (SE +/- 23301462.85, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 185952123653 (SE +/- 204866113.77, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 160337853370 (SE +/- 518883457.12, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1 (byte/s, More Is Better)
r7a.4xlarge: 60378824160 (SE +/- 12413256.53, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
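
The OpenSSL throughput figures above are reported in raw byte/s, which is awkward to compare by eye. The sketch below converts them to GB/s, assuming decimal gigabytes (1 GB = 10^9 bytes); that unit choice and the helper name `to_gb_per_s` are illustrative assumptions, not something the result file specifies. The values are read from the results above.

```python
# Mean throughput in byte/s, read from the OpenSSL results above.
openssl_bytes_per_s = {
    "SHA256": 16675650953,
    "SHA512": 7794115687,
    "ChaCha20": 85397552697,
    "AES-128-GCM": 185952123653,
    "AES-256-GCM": 160337853370,
    "ChaCha20-Poly1305": 60378824160,
}


def to_gb_per_s(bytes_per_s: int) -> float:
    """Convert byte/s to GB/s, assuming 1 GB = 10**9 bytes (hypothetical helper)."""
    return bytes_per_s / 1e9


for algorithm, rate in openssl_bytes_per_s.items():
    print(f"{algorithm}: {to_gb_per_s(rate):.2f} GB/s")
```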

libxsmm

M N K: 32

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
r7a.4xlarge: 214.1 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 64

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
r7a.4xlarge: 378.5 (SE +/- 0.62, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 128

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
r7a.4xlarge: 630.1 (SE +/- 0.93, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

libxsmm

M N K: 256

libxsmm 2-1.17-3645 (GFLOPS/s, More Is Better)
r7a.4xlarge: 762.2 (SE +/- 0.32, N = 3)
1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -pedantic -O2 -fopenmp -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -march=core-avx2

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
r7a.4xlarge: 258.02 (SE +/- 3.13, N = 9; MIN: 13.67 / MAX: 7500)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
r7a.4xlarge: 272.63 (SE +/- 1.21, N = 9; MIN: 13.81 / MAX: 7500)

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 (Queries Per Minute, Geo Mean, More Is Better)
r7a.4xlarge: 275.21 (SE +/- 2.66, N = 9; MIN: 13.77 / MAX: 8571.43)

InfluxDB

Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

InfluxDB 1.8.2 (val/sec, More Is Better)
r7a.4xlarge: 1579445.6 (SE +/- 5073.70, N = 3)

InfluxDB

Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

InfluxDB 1.8.2 (val/sec, More Is Better)
r7a.4xlarge: 2183799.9 (SE +/- 2898.28, N = 3)

InfluxDB

Concurrent Streams: 1024 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

InfluxDB 1.8.2 (val/sec, More Is Better)
r7a.4xlarge: 2200230.1 (SE +/- 1149.63, N = 3)

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2797827.94 (SE +/- 39697.06, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:5

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2997062.31 (SE +/- 14143.43, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 5:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2553261.19 (SE +/- 4372.17, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2469143.21 (SE +/- 14441.28, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:5

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2646865.74 (SE +/- 12102.97, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 5:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2354124.76 (SE +/- 5670.02, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 10:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2431915.60 (SE +/- 6411.23, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 50 - Set To Get Ratio: 1:10

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2740324.85 (SE +/- 3688.02, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2283141.90 (SE +/- 28998.58, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:5

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2419810.58 (SE +/- 28964.98, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 500 - Set To Get Ratio: 5:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2055619.68 (SE +/- 29876.91, N = 12)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 10:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2119598.94 (SE +/- 26111.97, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 100 - Set To Get Ratio: 1:10

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2434639.14 (SE +/- 4417.00, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 500 - Set To Get Ratio: 10:1

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2019821.66 (SE +/- 24548.64, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis 7.0.12 + memtier_benchmark

Protocol: Redis - Clients: 500 - Set To Get Ratio: 1:10

Redis 7.0.12 + memtier_benchmark 2.0 (Ops/sec, More Is Better)
r7a.4xlarge: 2291753.10 (SE +/- 21031.13, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre


Phoronix Test Suite v10.8.4