Amazon EC2 c7g.4xlarge Graviton3 Tests Graviton3 benchmarks by Michael Larabel. c7g.4xlarge: Processor: ARMv8 Neoverse-V1 (16 Cores), Motherboard: Amazon EC2 c7g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 32GB, Disk: 193GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Ubuntu 22.04, Kernel: 5.15.0-1004-aws (aarch64), Compiler: GCC 11.2.0, File-System: ext4, System Layer: amazon a1.4xlarge: Processor: ARMv8 Cortex-A72 (16 Cores), Motherboard: Amazon EC2 a1.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 32GB, Disk: 193GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Ubuntu 22.04, Kernel: 5.15.0-1004-aws (aarch64), Compiler: GCC 11.2.0, File-System: ext4, System Layer: amazon LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms ns/day > Higher Is Better c7g.4xlarge . 11.425 |========================================================= a1.4xlarge .. 2.885 |============== Timed LLVM Compilation 13.0 Build System: Ninja Seconds < Lower Is Better c7g.4xlarge . 544.93 |================= a1.4xlarge .. 1784.60 |======================================================== Timed Node.js Compilation 17.3 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 497.58 |================ a1.4xlarge .. 1765.91 |======================================================== Timed Gem5 Compilation 21.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 391.17 |=================== a1.4xlarge .. 1155.62 |======================================================== NAS Parallel Benchmarks 3.4 Test / Class: SP.C Total Mop/s > Higher Is Better c7g.4xlarge . 4467.19 |======================================================== a1.4xlarge .. 1293.80 |================ NAS Parallel Benchmarks 3.4 Test / Class: BT.C Total Mop/s > Higher Is Better c7g.4xlarge . 10339.53 |======================================================= a1.4xlarge .. 3148.18 |================= NAS Parallel Benchmarks 3.4 Test / Class: LU.C Total Mop/s > Higher Is Better c7g.4xlarge . 7730.41 |======================================================== a1.4xlarge .. 2558.12 |=================== libavif avifenc 0.10 Encoder Speed: 0 Seconds < Lower Is Better c7g.4xlarge . 256.84 |=================== a1.4xlarge .. 768.30 |========================================================= GPAW 22.1 Input: Carbon Nanotube Seconds < Lower Is Better c7g.4xlarge . 155.18 |=========== a1.4xlarge .. 769.35 |========================================================= Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better c7g.4xlarge . 251.40 |====================== a1.4xlarge .. 644.79 |========================================================= LeelaChessZero 0.28 Backend: Eigen Nodes Per Second > Higher Is Better c7g.4xlarge . 1189 |=========================================================== a1.4xlarge .. 128 |====== LeelaChessZero 0.28 Backend: BLAS Nodes Per Second > Higher Is Better c7g.4xlarge . 1103 |=========================================================== a1.4xlarge .. 135 |======= GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better c7g.4xlarge . 1.128 |========================================================== a1.4xlarge .. 0.316 |================ Ngspice 34 Circuit: C7552 Seconds < Lower Is Better c7g.4xlarge . 191.29 |======================= a1.4xlarge .. 480.79 |========================================================= Ngspice 34 Circuit: C2670 Seconds < Lower Is Better c7g.4xlarge . 198.22 |======================== a1.4xlarge .. 473.90 |========================================================= libavif avifenc 0.10 Encoder Speed: 2 Seconds < Lower Is Better c7g.4xlarge . 141.70 |================== a1.4xlarge .. 449.02 |========================================================= C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better c7g.4xlarge . 38.52 |===================== a1.4xlarge .. 104.76 |========================================================= SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better c7g.4xlarge . 183708 |========================================================= a1.4xlarge .. 74356 |======================= NAS Parallel Benchmarks 3.4 Test / Class: EP.D Total Mop/s > Higher Is Better c7g.4xlarge . 934.72 |========================================================= a1.4xlarge .. 339.20 |===================== POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better c7g.4xlarge . 37.86 |======================= a1.4xlarge .. 93.80 |========================================================== Rodinia 3.1 Test: OpenMP LavaMD Seconds < Lower Is Better c7g.4xlarge . 143.33 |======================= a1.4xlarge .. 360.30 |========================================================= Build2 0.13 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 115.02 |=================== a1.4xlarge .. 353.91 |========================================================= asmFish 2018-07-23 1024 Hash Memory, 26 Depth Nodes/second > Higher Is Better c7g.4xlarge . 32134123 |======================================================= a1.4xlarge .. 15331550 |========================== ASTC Encoder 3.2 Preset: Exhaustive Seconds < Lower Is Better c7g.4xlarge . 139.38 |============================= a1.4xlarge .. 277.77 |========================================================= High Performance Conjugate Gradient 3.1 GFLOP/s > Higher Is Better c7g.4xlarge . 26.30580 |======================================================= a1.4xlarge .. 3.77834 |======== TensorFlow Lite 2022-05-18 Model: NASNet Mobile Microseconds < Lower Is Better c7g.4xlarge . 11591.9 |===================== a1.4xlarge .. 30986.7 |======================================================== OpenSSL 3.0 Algorithm: SHA256 byte/s > Higher Is Better c7g.4xlarge . 13722045973 |==================================================== a1.4xlarge .. 6785689517 |========================== ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better c7g.4xlarge . 5.853864 |======================================================= a1.4xlarge .. 0.891391 |======== Timed PHP Compilation 7.4.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 69.48 |==================== a1.4xlarge .. 196.03 |========================================================= NAS Parallel Benchmarks 3.4 Test / Class: CG.C Total Mop/s > Higher Is Better c7g.4xlarge . 6571.95 |======================================================== a1.4xlarge .. 1213.15 |========== ONNX Runtime 1.11 Model: fcn-resnet101-11 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 38 |============================================================= a1.4xlarge .. 10 |================ NAS Parallel Benchmarks 3.4 Test / Class: IS.D Total Mop/s > Higher Is Better c7g.4xlarge . 1041.90 |======================================================== a1.4xlarge .. 197.57 |=========== ONNX Runtime 1.11 Model: GPT-2 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 7990 |=========================================================== a1.4xlarge .. 2312 |================= ONNX Runtime 1.11 Model: bertsquad-12 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 407 |============================================================ a1.4xlarge .. 115 |================= ONNX Runtime 1.11 Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 609 |============================================================ a1.4xlarge .. 165 |================ ONNX Runtime 1.11 Model: super-resolution-10 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 2817 |=========================================================== a1.4xlarge .. 757 |================ Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge . 29.13 |========= a1.4xlarge .. 182.58 |========================================================= Apache HTTP Server 2.4.48 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge . 72719.33 |======================================================= a1.4xlarge .. 19278.68 |=============== Apache HTTP Server 2.4.48 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge . 73546.32 |======================================================= a1.4xlarge .. 20133.49 |=============== Apache HTTP Server 2.4.48 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge . 73676.95 |======================================================= a1.4xlarge .. 20887.58 |================ nginx 1.21.1 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge . 346814.75 |====================================================== a1.4xlarge .. 138205.11 |====================== Apache HTTP Server 2.4.48 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge . 67231.88 |======================================================= a1.4xlarge .. 18636.43 |=============== nginx 1.21.1 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge . 345710.87 |====================================================== a1.4xlarge .. 143155.48 |====================== nginx 1.21.1 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge . 346613.34 |====================================================== a1.4xlarge .. 139414.84 |====================== nginx 1.21.1 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge . 352380.98 |====================================================== a1.4xlarge .. 141436.20 |====================== NAS Parallel Benchmarks 3.4 Test / Class: FT.C Total Mop/s > Higher Is Better c7g.4xlarge . 11791.77 |======================================================= a1.4xlarge .. 2927.16 |============== m-queens 1.2 Time To Solve Seconds < Lower Is Better c7g.4xlarge . 66.82 |=================================== a1.4xlarge .. 110.37 |========================================================= WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge . 48.21 |====================== a1.4xlarge .. 124.71 |========================================================= Stress-NG 0.14 Test: CPU Cache Bogo Ops/s > Higher Is Better c7g.4xlarge . 64.31 |======== a1.4xlarge .. 464.85 |========================================================= simdjson 1.0 Throughput Test: Kostya GB/s > Higher Is Better c7g.4xlarge . 1.94 |=========================================================== a1.4xlarge .. 0.63 |=================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3240.6 |========================================================= a1.4xlarge .. 1213.9 |===================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 39.5 |=========================================================== a1.4xlarge .. 16.0 |======================== Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3050.3 |========================================================= a1.4xlarge .. 1121.7 |===================== Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 41.2 |=========================================================== a1.4xlarge .. 16.9 |======================== simdjson 1.0 Throughput Test: PartialTweets GB/s > Higher Is Better c7g.4xlarge . 2.62 |=========================================================== a1.4xlarge .. 0.78 |================== simdjson 1.0 Throughput Test: DistinctUserID GB/s > Higher Is Better c7g.4xlarge . 2.69 |=========================================================== a1.4xlarge .. 0.80 |================== QuantLib 1.21 MFLOPS > Higher Is Better c7g.4xlarge . 2512.7 |========================================================= TensorFlow Lite 2022-05-18 Model: Inception V4 Microseconds < Lower Is Better c7g.4xlarge . 41855.1 |============ a1.4xlarge .. 188910.0 |======================================================= TensorFlow Lite 2022-05-18 Model: Inception ResNet V2 Microseconds < Lower Is Better c7g.4xlarge . 40051.3 |============= a1.4xlarge .. 171169.0 |======================================================= simdjson 1.0 Throughput Test: LargeRandom GB/s > Higher Is Better c7g.4xlarge . 0.7 |============================================================ a1.4xlarge .. 0.3 |========================== Timed ImageMagick Compilation 6.9.0 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 27.90 |================= a1.4xlarge .. 93.63 |========================================================== TensorFlow Lite 2022-05-18 Model: Mobilenet Float Microseconds < Lower Is Better c7g.4xlarge . 2156.60 |============ a1.4xlarge .. 9990.15 |======================================================== TensorFlow Lite 2022-05-18 Model: SqueezeNet Microseconds < Lower Is Better c7g.4xlarge . 3257.94 |=============== a1.4xlarge .. 12014.70 |======================================================= TensorFlow Lite 2022-05-18 Model: Mobilenet Quant Microseconds < Lower Is Better c7g.4xlarge . 1502.95 |=============== a1.4xlarge .. 5724.66 |======================================================== OpenSSL 3.0 Algorithm: RSA4096 verify/s > Higher Is Better c7g.4xlarge . 178460.4 |======================================================= a1.4xlarge .. 45328.6 |============== OpenSSL 3.0 Algorithm: RSA4096 sign/s > Higher Is Better c7g.4xlarge . 2546.4 |========================================================= a1.4xlarge .. 588.3 |============= PHPBench 0.8.1 PHP Benchmark Suite Score > Higher Is Better c7g.4xlarge . 666484 |========================================================= a1.4xlarge .. 241259 |===================== Stockfish 13 Total Time Nodes Per Second > Higher Is Better c7g.4xlarge . 27608891 |======================================================= a1.4xlarge .. 10980430 |====================== PyBench 2018-02-16 Total For Average Test Times Milliseconds < Lower Is Better c7g.4xlarge . 1185 |==================== a1.4xlarge .. 3452 |=========================================================== Timed Apache Compilation 2.4.41 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 26.94 |===================== a1.4xlarge .. 74.74 |========================================================== Rodinia 3.1 Test: OpenMP Streamcluster Seconds < Lower Is Better c7g.4xlarge . 13.30 |================ a1.4xlarge .. 47.43 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Encode Time - Seconds < Lower Is Better c7g.4xlarge . 22.77 |===================== a1.4xlarge .. 61.80 |========================================================== Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3508.5 |========================================================= a1.4xlarge .. 1364.6 |====================== Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 4639.1 |========================================================= a1.4xlarge .. 633.9 |======== 7-Zip Compression 21.06 Test: Decompression Rating MIPS > Higher Is Better c7g.4xlarge . 73054 |========================================================== a1.4xlarge .. 40891 |================================ 7-Zip Compression 21.06 Test: Compression Rating MIPS > Higher Is Better c7g.4xlarge . 97824 |========================================================== a1.4xlarge .. 32498 |=================== NAS Parallel Benchmarks 3.4 Test / Class: MG.C Total Mop/s > Higher Is Better c7g.4xlarge . 13481.61 |======================================================= a1.4xlarge .. 3266.36 |============= ASTC Encoder 3.2 Preset: Thorough Seconds < Lower Is Better c7g.4xlarge . 13.92 |======================== a1.4xlarge .. 33.52 |========================================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge . 8.01671425 |======== a1.4xlarge .. 53.77062740 |==================================================== Stress-NG 0.14 Test: CPU Stress Bogo Ops/s > Higher Is Better c7g.4xlarge . 5029.71 |======================================================== a1.4xlarge .. 2366.00 |========================== Google SynthMark 20201109 Test: VoiceMark_100 Voices > Higher Is Better c7g.4xlarge . 675.64 |========================================================= a1.4xlarge .. 331.07 |============================ Stress-NG 0.14 Test: IO_uring Bogo Ops/s > Higher Is Better c7g.4xlarge . 843015.78 |================================================== a1.4xlarge .. 918172.37 |====================================================== Stress-NG 0.14 Test: Memory Copying Bogo Ops/s > Higher Is Better c7g.4xlarge . 6693.32 |======================================================== a1.4xlarge .. 798.24 |======= Stress-NG 0.14 Test: Crypto Bogo Ops/s > Higher Is Better c7g.4xlarge . 23181.81 |======================================================= a1.4xlarge .. 11985.38 |============================ Stress-NG 0.14 Test: Vector Math Bogo Ops/s > Higher Is Better c7g.4xlarge . 55258.17 |======================================================= a1.4xlarge .. 27341.47 |=========================== Stress-NG 0.14 Test: Matrix Math Bogo Ops/s > Higher Is Better c7g.4xlarge . 80088.74 |======================================================= a1.4xlarge .. 7356.85 |===== Algebraic Multi-Grid Benchmark 1.2 Figure Of Merit > Higher Is Better c7g.4xlarge . 1258807333 |===================================================== a1.4xlarge .. 186716933 |======== DaCapo Benchmark 9.12-MR1 Java Test: Tradesoap msec < Lower Is Better c7g.4xlarge . 3524 |================== a1.4xlarge .. 11182 |========================================================== N-Queens 1.0 Elapsed Time Seconds < Lower Is Better c7g.4xlarge . 21.54 |======================================= a1.4xlarge .. 32.29 |========================================================== Rodinia 3.1 Test: OpenMP CFD Solver Seconds < Lower Is Better c7g.4xlarge . 10.48 |=============== a1.4xlarge .. 41.45 |========================================================== libavif avifenc 0.10 Encoder Speed: 6, Lossless Seconds < Lower Is Better c7g.4xlarge . 11.91 |==================== a1.4xlarge .. 33.99 |========================================================== Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better c7g.4xlarge . 405413.86 |====================================================== a1.4xlarge .. 203869.40 |=========================== DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans msec < Lower Is Better c7g.4xlarge . 3203 |===================== a1.4xlarge .. 9045 |=========================================================== Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better c7g.4xlarge . 383606667 |====================================================== a1.4xlarge .. 165513333 |======================= libavif avifenc 0.10 Encoder Speed: 6 Seconds < Lower Is Better c7g.4xlarge . 9.385 |=================== a1.4xlarge .. 28.778 |========================================================= DaCapo Benchmark 9.12-MR1 Java Test: H2 msec < Lower Is Better c7g.4xlarge . 2951 |========================== a1.4xlarge .. 6740 |=========================================================== DaCapo Benchmark 9.12-MR1 Java Test: Jython msec < Lower Is Better c7g.4xlarge . 3940 |================== a1.4xlarge .. 12997 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge . 9.346 |============================== a1.4xlarge .. 17.888 |========================================================= LULESH 2.0.3 z/s > Higher Is Better c7g.4xlarge . 10940.94 |======================================================= a1.4xlarge .. 2328.27 |============ libavif avifenc 0.10 Encoder Speed: 10, Lossless Seconds < Lower Is Better c7g.4xlarge . 5.765 |====================== a1.4xlarge .. 15.209 |========================================================= LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein ns/day > Higher Is Better c7g.4xlarge . 11.291 |========================================================= a1.4xlarge .. 3.245 |================ TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better c7g.4xlarge . 1370094 |======================================================== a1.4xlarge .. 538500 |======================