Amazon EC2 c7g.4xlarge Graviton3 Graviton3 benchmarks by Michael Larabel. c7g.4xlarge: Processor: ARMv8 Neoverse-V1 (16 Cores), Motherboard: Amazon EC2 c7g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 32GB, Disk: 193GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Ubuntu 22.04, Kernel: 5.15.0-1004-aws (aarch64), Compiler: GCC 11.2.0, File-System: ext4, System Layer: amazon QuantLib 1.21 MFLOPS > Higher Is Better c7g.4xlarge . 2512.7 |========================================================= High Performance Conjugate Gradient 3.1 GFLOP/s > Higher Is Better c7g.4xlarge . 26.31 |========================================================== NAS Parallel Benchmarks 3.4 Test / Class: BT.C Total Mop/s > Higher Is Better c7g.4xlarge . 10339.53 |======================================================= NAS Parallel Benchmarks 3.4 Test / Class: CG.C Total Mop/s > Higher Is Better c7g.4xlarge . 6571.95 |======================================================== NAS Parallel Benchmarks 3.4 Test / Class: EP.D Total Mop/s > Higher Is Better c7g.4xlarge . 934.72 |========================================================= NAS Parallel Benchmarks 3.4 Test / Class: FT.C Total Mop/s > Higher Is Better c7g.4xlarge . 11791.77 |======================================================= NAS Parallel Benchmarks 3.4 Test / Class: IS.D Total Mop/s > Higher Is Better c7g.4xlarge . 1041.90 |======================================================== NAS Parallel Benchmarks 3.4 Test / Class: LU.C Total Mop/s > Higher Is Better c7g.4xlarge . 7730.41 |======================================================== NAS Parallel Benchmarks 3.4 Test / Class: MG.C Total Mop/s > Higher Is Better c7g.4xlarge . 13481.61 |======================================================= NAS Parallel Benchmarks 3.4 Test / Class: SP.C Total Mop/s > Higher Is Better c7g.4xlarge . 4467.19 |======================================================== LeelaChessZero 0.28 Backend: BLAS Nodes Per Second > Higher Is Better c7g.4xlarge . 1103 |=========================================================== LeelaChessZero 0.28 Backend: Eigen Nodes Per Second > Higher Is Better c7g.4xlarge . 1189 |=========================================================== Rodinia 3.1 Test: OpenMP LavaMD Seconds < Lower Is Better c7g.4xlarge . 143.33 |========================================================= Rodinia 3.1 Test: OpenMP CFD Solver Seconds < Lower Is Better c7g.4xlarge . 10.48 |========================================================== Rodinia 3.1 Test: OpenMP Streamcluster Seconds < Lower Is Better c7g.4xlarge . 13.30 |========================================================== Algebraic Multi-Grid Benchmark 1.2 Figure Of Merit > Higher Is Better c7g.4xlarge . 1258807333 |===================================================== Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better c7g.4xlarge . 251.40 |========================================================= Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge . 8.01671425 |===================================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge . 29.13 |========================================================== LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: 20k Atoms ns/day > Higher Is Better c7g.4xlarge . 11.43 |========================================================== LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein ns/day > Higher Is Better c7g.4xlarge . 11.29 |========================================================== LULESH 2.0.3 z/s > Higher Is Better c7g.4xlarge . 10940.94 |======================================================= WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Encode Time - Seconds < Lower Is Better c7g.4xlarge . 22.77 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge . 9.346 |========================================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge . 48.21 |========================================================== simdjson 1.0 Throughput Test: Kostya GB/s > Higher Is Better c7g.4xlarge . 1.94 |=========================================================== simdjson 1.0 Throughput Test: LargeRandom GB/s > Higher Is Better c7g.4xlarge . 0.7 |============================================================ simdjson 1.0 Throughput Test: PartialTweets GB/s > Higher Is Better c7g.4xlarge . 2.62 |=========================================================== simdjson 1.0 Throughput Test: DistinctUserID GB/s > Higher Is Better c7g.4xlarge . 2.69 |=========================================================== DaCapo Benchmark 9.12-MR1 Java Test: H2 msec < Lower Is Better c7g.4xlarge . 2951 |=========================================================== DaCapo Benchmark 9.12-MR1 Java Test: Jython msec < Lower Is Better c7g.4xlarge . 3940 |=========================================================== DaCapo Benchmark 9.12-MR1 Java Test: Tradesoap msec < Lower Is Better c7g.4xlarge . 3524 |=========================================================== DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans msec < Lower Is Better c7g.4xlarge . 3203 |=========================================================== Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 4639.1 |========================================================= Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3508.5 |========================================================= Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 41.2 |=========================================================== Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3050.3 |========================================================= Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better c7g.4xlarge . 39.5 |=========================================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3240.6 |========================================================= TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better c7g.4xlarge . 1370094 |======================================================== ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better c7g.4xlarge . 5.853864 |======================================================= Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better c7g.4xlarge . 405413.86 |====================================================== 7-Zip Compression 21.06 Test: Compression Rating MIPS > Higher Is Better c7g.4xlarge . 97824 |========================================================== 7-Zip Compression 21.06 Test: Decompression Rating MIPS > Higher Is Better c7g.4xlarge . 73054 |========================================================== Stockfish 13 Total Time Nodes Per Second > Higher Is Better c7g.4xlarge . 27608891 |======================================================= asmFish 2018-07-23 1024 Hash Memory, 26 Depth Nodes/second > Higher Is Better c7g.4xlarge . 32134123 |======================================================= libavif avifenc 0.10 Encoder Speed: 0 Seconds < Lower Is Better c7g.4xlarge . 256.84 |========================================================= libavif avifenc 0.10 Encoder Speed: 2 Seconds < Lower Is Better c7g.4xlarge . 141.70 |========================================================= libavif avifenc 0.10 Encoder Speed: 6 Seconds < Lower Is Better c7g.4xlarge . 9.385 |========================================================== libavif avifenc 0.10 Encoder Speed: 6, Lossless Seconds < Lower Is Better c7g.4xlarge . 11.91 |========================================================== libavif avifenc 0.10 Encoder Speed: 10, Lossless Seconds < Lower Is Better c7g.4xlarge . 5.765 |========================================================== Timed Apache Compilation 2.4.41 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 26.94 |========================================================== Timed Gem5 Compilation 21.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 391.17 |========================================================= Timed ImageMagick Compilation 6.9.0 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 27.90 |========================================================== Timed LLVM Compilation 13.0 Build System: Ninja Seconds < Lower Is Better c7g.4xlarge . 544.93 |========================================================= Timed Node.js Compilation 17.3 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 497.58 |========================================================= Timed PHP Compilation 7.4.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 69.48 |========================================================== Build2 0.13 Time To Compile Seconds < Lower Is Better c7g.4xlarge . 115.02 |========================================================= C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better c7g.4xlarge . 38.52 |========================================================== POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better c7g.4xlarge . 37.86 |========================================================== m-queens 1.2 Time To Solve Seconds < Lower Is Better c7g.4xlarge . 66.82 |========================================================== N-Queens 1.0 Elapsed Time Seconds < Lower Is Better c7g.4xlarge . 21.54 |========================================================== Ngspice 34 Circuit: C2670 Seconds < Lower Is Better c7g.4xlarge . 198.22 |========================================================= Ngspice 34 Circuit: C7552 Seconds < Lower Is Better c7g.4xlarge . 191.29 |========================================================= Google SynthMark 20201109 Test: VoiceMark_100 Voices > Higher Is Better c7g.4xlarge . 675.64 |========================================================= SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better c7g.4xlarge . 183708 |========================================================= OpenSSL 3.0 Algorithm: SHA256 byte/s > Higher Is Better c7g.4xlarge . 13722045973 |==================================================== OpenSSL 3.0 Algorithm: RSA4096 sign/s > Higher Is Better c7g.4xlarge . 2546.4 |========================================================= OpenSSL 3.0 Algorithm: RSA4096 verify/s > Higher Is Better c7g.4xlarge . 178460.4 |======================================================= Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better c7g.4xlarge . 383606667 |====================================================== GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better c7g.4xlarge . 1.128 |========================================================== TensorFlow Lite 2022-05-18 Model: SqueezeNet Microseconds < Lower Is Better c7g.4xlarge . 3257.94 |======================================================== TensorFlow Lite 2022-05-18 Model: Inception V4 Microseconds < Lower Is Better c7g.4xlarge . 41855.1 |======================================================== TensorFlow Lite 2022-05-18 Model: NASNet Mobile Microseconds < Lower Is Better c7g.4xlarge . 11591.9 |======================================================== TensorFlow Lite 2022-05-18 Model: Mobilenet Float Microseconds < Lower Is Better c7g.4xlarge . 2156.60 |======================================================== TensorFlow Lite 2022-05-18 Model: Mobilenet Quant Microseconds < Lower Is Better c7g.4xlarge . 1502.95 |======================================================== TensorFlow Lite 2022-05-18 Model: Inception ResNet V2 Microseconds < Lower Is Better c7g.4xlarge . 40051.3 |======================================================== ASTC Encoder 3.2 Preset: Thorough Seconds < Lower Is Better c7g.4xlarge . 13.92 |========================================================== ASTC Encoder 3.2 Preset: Exhaustive Seconds < Lower Is Better c7g.4xlarge . 139.38 |========================================================= Stress-NG 0.14 Test: Crypto Bogo Ops/s > Higher Is Better c7g.4xlarge . 23181.81 |======================================================= Stress-NG 0.14 Test: IO_uring Bogo Ops/s > Higher Is Better c7g.4xlarge . 843015.78 |====================================================== Stress-NG 0.14 Test: CPU Cache Bogo Ops/s > Higher Is Better c7g.4xlarge . 64.31 |========================================================== Stress-NG 0.14 Test: CPU Stress Bogo Ops/s > Higher Is Better c7g.4xlarge . 5029.71 |======================================================== Stress-NG 0.14 Test: Matrix Math Bogo Ops/s > Higher Is Better c7g.4xlarge . 80088.74 |======================================================= Stress-NG 0.14 Test: Vector Math Bogo Ops/s > Higher Is Better c7g.4xlarge . 55258.17 |======================================================= Stress-NG 0.14 Test: Memory Copying Bogo Ops/s > Higher Is Better c7g.4xlarge . 6693.32 |======================================================== GPAW 22.1 Input: Carbon Nanotube Seconds < Lower Is Better c7g.4xlarge . 155.18 |========================================================= PyBench 2018-02-16 Total For Average Test Times Milliseconds < Lower Is Better c7g.4xlarge . 1185 |=========================================================== nginx 1.21.1 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge . 345710.87 |====================================================== nginx 1.21.1 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge . 352380.98 |====================================================== nginx 1.21.1 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge . 346613.34 |====================================================== nginx 1.21.1 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge . 346814.75 |====================================================== ONNX Runtime 1.11 Model: GPT-2 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 7990 |=========================================================== ONNX Runtime 1.11 Model: bertsquad-12 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 407 |============================================================ ONNX Runtime 1.11 Model: fcn-resnet101-11 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 38 |============================================================= ONNX Runtime 1.11 Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 609 |============================================================ ONNX Runtime 1.11 Model: super-resolution-10 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge . 2817 |=========================================================== Apache HTTP Server 2.4.48 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge . 67231.88 |======================================================= Apache HTTP Server 2.4.48 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge . 73676.95 |======================================================= Apache HTTP Server 2.4.48 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge . 73546.32 |======================================================= Apache HTTP Server 2.4.48 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge . 72719.33 |======================================================= PHPBench 0.8.1 PHP Benchmark Suite Score > Higher Is Better c7g.4xlarge . 666484 |=========================================================