Amazon EC2 c7g.4xlarge AWS Graviton3 Graviton3 benchmarks by Michael Larabel. c7g.4xlarge: Processor: ARMv8 Neoverse-V1 (16 Cores), Motherboard: Amazon EC2 c7g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 32GB, Disk: 193GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Ubuntu 22.04, Kernel: 5.15.0-1004-aws (aarch64), Compiler: GCC 11.2.0, File-System: ext4, System Layer: amazon c6g.4xlarge Graviton2: Processor: ARMv8 Neoverse-N1 (16 Cores), Motherboard: Amazon EC2 c6g.4xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 32GB, Disk: 193GB Amazon Elastic Block Store, Network: Amazon Elastic OS: Ubuntu 22.04, Kernel: 5.15.0-1004-aws (aarch64), Compiler: GCC 11.2.0, File-System: ext4, System Layer: amazon QuantLib 1.21 MFLOPS > Higher Is Better c7g.4xlarge ........... 2512.7 |=============================================== c6g.4xlarge Graviton2 . 1742.4 |================================= High Performance Conjugate Gradient 3.1 GFLOP/s > Higher Is Better c7g.4xlarge ........... 26.31 |================================================ c6g.4xlarge Graviton2 . 19.72 |==================================== NAS Parallel Benchmarks 3.4 Test / Class: BT.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 10339.53 |============================================= c6g.4xlarge Graviton2 . 6449.11 |============================ NAS Parallel Benchmarks 3.4 Test / Class: CG.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 6571.95 |============================================== c6g.4xlarge Graviton2 . 3520.86 |========================= NAS Parallel Benchmarks 3.4 Test / Class: EP.D Total Mop/s > Higher Is Better c7g.4xlarge ........... 934.72 |=============================================== c6g.4xlarge Graviton2 . 558.88 |============================ NAS Parallel Benchmarks 3.4 Test / Class: FT.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 11791.77 |============================================= c6g.4xlarge Graviton2 . 6244.48 |======================== NAS Parallel Benchmarks 3.4 Test / Class: IS.D Total Mop/s > Higher Is Better c7g.4xlarge ........... 1041.90 |============================================== c6g.4xlarge Graviton2 . 372.76 |================ NAS Parallel Benchmarks 3.4 Test / Class: LU.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 7730.41 |============================================== c6g.4xlarge Graviton2 . 5133.89 |=============================== NAS Parallel Benchmarks 3.4 Test / Class: MG.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 13481.61 |============================================= c6g.4xlarge Graviton2 . 6720.68 |====================== NAS Parallel Benchmarks 3.4 Test / Class: SP.C Total Mop/s > Higher Is Better c7g.4xlarge ........... 4467.19 |============================================== c6g.4xlarge Graviton2 . 2356.16 |======================== LeelaChessZero 0.28 Backend: BLAS Nodes Per Second > Higher Is Better c7g.4xlarge ........... 1103 |================================================= c6g.4xlarge Graviton2 . 864 |====================================== LeelaChessZero 0.28 Backend: Eigen Nodes Per Second > Higher Is Better c7g.4xlarge ........... 1189 |================================================= c6g.4xlarge Graviton2 . 834 |================================== Rodinia 3.1 Test: OpenMP LavaMD Seconds < Lower Is Better c7g.4xlarge ........... 143.33 |=============================== c6g.4xlarge Graviton2 . 215.67 |=============================================== Rodinia 3.1 Test: OpenMP CFD Solver Seconds < Lower Is Better c7g.4xlarge ........... 10.48 |============================== c6g.4xlarge Graviton2 . 17.04 |================================================ Rodinia 3.1 Test: OpenMP Streamcluster Seconds < Lower Is Better c7g.4xlarge ........... 13.30 |========================================= c6g.4xlarge Graviton2 . 15.48 |================================================ Algebraic Multi-Grid Benchmark 1.2 Figure Of Merit > Higher Is Better c7g.4xlarge ........... 1258807333 |=========================================== c6g.4xlarge Graviton2 . 932652900 |================================ Timed MrBayes Analysis 3.2.7 Primate Phylogeny Analysis Seconds < Lower Is Better c7g.4xlarge ........... 251.40 |=============================== c6g.4xlarge Graviton2 . 384.75 |=============================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge ........... 8.01671425 |============================= c6g.4xlarge Graviton2 . 11.57335470 |========================================== Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction Seconds < Lower Is Better c7g.4xlarge ........... 29.13 |================================== c6g.4xlarge Graviton2 . 41.02 |================================================ LAMMPS Molecular Dynamics Simulator 29Oct2020 Model: Rhodopsin Protein ns/day > Higher Is Better c7g.4xlarge ........... 11.291 |=============================================== c6g.4xlarge Graviton2 . 7.935 |================================= LULESH 2.0.3 z/s > Higher Is Better c7g.4xlarge ........... 10940.94 |============================================= c6g.4xlarge Graviton2 . 6016.16 |========================= WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless Encode Time - Seconds < Lower Is Better c7g.4xlarge ........... 22.77 |=================================== c6g.4xlarge Graviton2 . 31.08 |================================================ WebP Image Encode 1.1 Encode Settings: Quality 100, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge ........... 9.346 |==================================== c6g.4xlarge Graviton2 . 12.248 |=============================================== WebP Image Encode 1.1 Encode Settings: Quality 100, Lossless, Highest Compression Encode Time - Seconds < Lower Is Better c7g.4xlarge ........... 48.21 |=================================== c6g.4xlarge Graviton2 . 66.15 |================================================ simdjson 1.0 Throughput Test: Kostya GB/s > Higher Is Better c7g.4xlarge ........... 1.94 |================================================= c6g.4xlarge Graviton2 . 1.19 |============================== simdjson 1.0 Throughput Test: LargeRandom GB/s > Higher Is Better c7g.4xlarge ........... 0.70 |================================================= c6g.4xlarge Graviton2 . 0.49 |================================== simdjson 1.0 Throughput Test: PartialTweets GB/s > Higher Is Better c7g.4xlarge ........... 2.62 |================================================= c6g.4xlarge Graviton2 . 1.51 |============================ simdjson 1.0 Throughput Test: DistinctUserID GB/s > Higher Is Better c7g.4xlarge ........... 2.69 |================================================= c6g.4xlarge Graviton2 . 1.53 |============================ DaCapo Benchmark 9.12-MR1 Java Test: H2 msec < Lower Is Better c7g.4xlarge ........... 2951 |==================================== c6g.4xlarge Graviton2 . 3964 |================================================= DaCapo Benchmark 9.12-MR1 Java Test: Jython msec < Lower Is Better c7g.4xlarge ........... 3940 |================================== c6g.4xlarge Graviton2 . 5626 |================================================= DaCapo Benchmark 9.12-MR1 Java Test: Tradesoap msec < Lower Is Better c7g.4xlarge ........... 3524 |====================================== c6g.4xlarge Graviton2 . 4506 |================================================= DaCapo Benchmark 9.12-MR1 Java Test: Tradebeans msec < Lower Is Better c7g.4xlarge ........... 3203 |==================================== c6g.4xlarge Graviton2 . 4344 |================================================= Zstd Compression 1.5.0 Compression Level: 3 - Compression Speed MB/s > Higher Is Better c7g.4xlarge ........... 4639.1 |=============================================== c6g.4xlarge Graviton2 . 2888.3 |============================= c6g.4xlarge Graviton2 . 2878.8 |============================= Zstd Compression 1.5.0 Compression Level: 3 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge . 3508.5 |========================================================= Zstd Compression 1.5.0 Compression Level: 19 - Compression Speed MB/s > Higher Is Better c7g.4xlarge ........... 41.2 |================================================= c6g.4xlarge Graviton2 . 34.6 |========================================= Zstd Compression 1.5.0 Compression Level: 19 - Decompression Speed MB/s > Higher Is Better c7g.4xlarge ........... 3050.3 |=============================================== c6g.4xlarge Graviton2 . 2051.6 |================================ Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Compression Speed MB/s > Higher Is Better c7g.4xlarge ........... 39.5 |================================================= c6g.4xlarge Graviton2 . 31.0 |====================================== Zstd Compression 1.5.0 Compression Level: 19, Long Mode - Decompression Speed MB/s > Higher Is Better c7g.4xlarge ........... 3240.6 |=============================================== c6g.4xlarge Graviton2 . 2196.3 |================================ TSCP 1.81 AI Chess Performance Nodes Per Second > Higher Is Better c7g.4xlarge ........... 1370094 |============================================== c6g.4xlarge Graviton2 . 872313 |============================= ACES DGEMM 1.0 Sustained Floating-Point Rate GFLOP/s > Higher Is Better c7g.4xlarge ........... 5.853864 |============================================= c6g.4xlarge Graviton2 . 4.785123 |===================================== Coremark 1.0 CoreMark Size 666 - Iterations Per Second Iterations/Sec > Higher Is Better c7g.4xlarge ........... 405413.86 |============================================ c6g.4xlarge Graviton2 . 315464.34 |================================== 7-Zip Compression 21.06 Test: Compression Rating MIPS > Higher Is Better c7g.4xlarge ........... 97824 |================================================ c6g.4xlarge Graviton2 . 71285 |=================================== 7-Zip Compression 21.06 Test: Decompression Rating MIPS > Higher Is Better c7g.4xlarge ........... 73054 |================================================ c6g.4xlarge Graviton2 . 59445 |======================================= Stockfish 13 Total Time Nodes Per Second > Higher Is Better c7g.4xlarge ........... 27608891 |============================================= c6g.4xlarge Graviton2 . 21679245 |=================================== asmFish 2018-07-23 1024 Hash Memory, 26 Depth Nodes/second > Higher Is Better c7g.4xlarge ........... 32134123 |============================================= c6g.4xlarge Graviton2 . 26540482 |===================================== libavif avifenc 0.10 Encoder Speed: 0 Seconds < Lower Is Better c7g.4xlarge ........... 256.84 |============================== c6g.4xlarge Graviton2 . 406.94 |=============================================== libavif avifenc 0.10 Encoder Speed: 2 Seconds < Lower Is Better c7g.4xlarge ........... 141.70 |============================ c6g.4xlarge Graviton2 . 238.21 |=============================================== libavif avifenc 0.10 Encoder Speed: 6 Seconds < Lower Is Better c7g.4xlarge ........... 9.385 |================================== c6g.4xlarge Graviton2 . 13.046 |=============================================== libavif avifenc 0.10 Encoder Speed: 6, Lossless Seconds < Lower Is Better c7g.4xlarge ........... 11.91 |=================================== c6g.4xlarge Graviton2 . 16.52 |================================================ libavif avifenc 0.10 Encoder Speed: 10, Lossless Seconds < Lower Is Better c7g.4xlarge ........... 5.765 |================================= c6g.4xlarge Graviton2 . 8.311 |================================================ Timed Apache Compilation 2.4.41 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 26.94 |====================================== c6g.4xlarge Graviton2 . 34.20 |================================================ Timed Gem5 Compilation 21.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 391.17 |====================================== c6g.4xlarge Graviton2 . 488.81 |=============================================== Timed ImageMagick Compilation 6.9.0 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 27.90 |================================= c6g.4xlarge Graviton2 . 40.33 |================================================ Timed LLVM Compilation 13.0 Build System: Ninja Seconds < Lower Is Better c7g.4xlarge ........... 544.93 |===================================== c6g.4xlarge Graviton2 . 682.98 |=============================================== Timed Node.js Compilation 17.3 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 497.58 |===================================== c6g.4xlarge Graviton2 . 628.40 |=============================================== Timed PHP Compilation 7.4.2 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 69.48 |====================================== c6g.4xlarge Graviton2 . 88.90 |================================================ Build2 0.13 Time To Compile Seconds < Lower Is Better c7g.4xlarge ........... 115.02 |====================================== c6g.4xlarge Graviton2 . 142.28 |=============================================== C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel Seconds < Lower Is Better c7g.4xlarge ........... 38.52 |============================== c6g.4xlarge Graviton2 . 62.32 |================================================ POV-Ray 3.7.0.7 Trace Time Seconds < Lower Is Better c7g.4xlarge ........... 37.86 |==================================== c6g.4xlarge Graviton2 . 51.05 |================================================ m-queens 1.2 Time To Solve Seconds < Lower Is Better c7g.4xlarge ........... 66.82 |=========================================== c6g.4xlarge Graviton2 . 75.22 |================================================ N-Queens 1.0 Elapsed Time Seconds < Lower Is Better c7g.4xlarge ........... 21.54 |============================================= c6g.4xlarge Graviton2 . 23.14 |================================================ Ngspice 34 Circuit: C2670 Seconds < Lower Is Better c7g.4xlarge ........... 198.22 |=================================== c6g.4xlarge Graviton2 . 263.72 |=============================================== Ngspice 34 Circuit: C7552 Seconds < Lower Is Better c7g.4xlarge ........... 191.29 |=================================== c6g.4xlarge Graviton2 . 255.21 |=============================================== Google SynthMark 20201109 Test: VoiceMark_100 Voices > Higher Is Better c7g.4xlarge ........... 675.64 |=============================================== c6g.4xlarge Graviton2 . 470.39 |================================= SecureMark 1.0.4 Benchmark: SecureMark-TLS marks > Higher Is Better c7g.4xlarge ........... 183708 |=============================================== c6g.4xlarge Graviton2 . 120301 |=============================== OpenSSL 3.0 Algorithm: SHA256 byte/s > Higher Is Better c7g.4xlarge ........... 13722045973 |========================================== c6g.4xlarge Graviton2 . 10723184083 |================================= OpenSSL 3.0 Algorithm: RSA4096 sign/s > Higher Is Better c7g.4xlarge ........... 2546.4 |=============================================== c6g.4xlarge Graviton2 . 660.6 |============ OpenSSL 3.0 Algorithm: RSA4096 verify/s > Higher Is Better c7g.4xlarge ........... 178460.4 |============================================= c6g.4xlarge Graviton2 . 53951.5 |============== Liquid-DSP 2021.01.31 Threads: 16 - Buffer Length: 256 - Filter Length: 57 samples/s > Higher Is Better c7g.4xlarge ........... 383606667 |============================================ c6g.4xlarge Graviton2 . 262890000 |============================== GROMACS 2022.1 Implementation: MPI CPU - Input: water_GMX50_bare Ns Per Day > Higher Is Better c7g.4xlarge ........... 1.128 |================================================ c6g.4xlarge Graviton2 . 0.781 |================================= TensorFlow Lite 2022-05-18 Model: SqueezeNet Microseconds < Lower Is Better c7g.4xlarge ........... 3257.94 |====================================== c6g.4xlarge Graviton2 . 3969.35 |============================================== TensorFlow Lite 2022-05-18 Model: Inception V4 Microseconds < Lower Is Better c7g.4xlarge ........... 41855.1 |========================================= c6g.4xlarge Graviton2 . 46793.9 |============================================== TensorFlow Lite 2022-05-18 Model: NASNet Mobile Microseconds < Lower Is Better c7g.4xlarge ........... 11591.9 |==================================== c6g.4xlarge Graviton2 . 14985.4 |============================================== TensorFlow Lite 2022-05-18 Model: Mobilenet Float Microseconds < Lower Is Better c7g.4xlarge ........... 2156.60 |======================================== c6g.4xlarge Graviton2 . 2500.87 |============================================== TensorFlow Lite 2022-05-18 Model: Mobilenet Quant Microseconds < Lower Is Better c7g.4xlarge ........... 1502.95 |=================================== c6g.4xlarge Graviton2 . 1980.24 |============================================== TensorFlow Lite 2022-05-18 Model: Inception ResNet V2 Microseconds < Lower Is Better c7g.4xlarge ........... 40051.3 |======================================== c6g.4xlarge Graviton2 . 45955.7 |============================================== ASTC Encoder 3.2 Preset: Thorough Seconds < Lower Is Better c7g.4xlarge ........... 13.92 |======================================== c6g.4xlarge Graviton2 . 16.52 |================================================ ASTC Encoder 3.2 Preset: Exhaustive Seconds < Lower Is Better c7g.4xlarge ........... 139.38 |========================================= c6g.4xlarge Graviton2 . 159.20 |=============================================== Stress-NG 0.14 Test: Crypto Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 23181.81 |============================================= c6g.4xlarge Graviton2 . 17924.18 |=================================== Stress-NG 0.14 Test: IO_uring Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 843015.78 |============================================ c6g.4xlarge Graviton2 . 770521.81 |======================================== Stress-NG 0.14 Test: CPU Cache Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 64.31 |================================================ c6g.4xlarge Graviton2 . 37.19 |============================ Stress-NG 0.14 Test: CPU Stress Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 5029.71 |============================================== c6g.4xlarge Graviton2 . 3404.94 |=============================== Stress-NG 0.14 Test: Matrix Math Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 80088.74 |============================================= c6g.4xlarge Graviton2 . 64084.08 |==================================== Stress-NG 0.14 Test: Vector Math Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 55258.17 |============================================= c6g.4xlarge Graviton2 . 37753.89 |=============================== Stress-NG 0.14 Test: Memory Copying Bogo Ops/s > Higher Is Better c7g.4xlarge ........... 6693.32 |============================================== c6g.4xlarge Graviton2 . 2903.00 |==================== GPAW 22.1 Input: Carbon Nanotube Seconds < Lower Is Better c7g.4xlarge ........... 155.18 |================================== c6g.4xlarge Graviton2 . 215.53 |=============================================== PyBench 2018-02-16 Total For Average Test Times Milliseconds < Lower Is Better c7g.4xlarge ........... 1185 |================================= c6g.4xlarge Graviton2 . 1741 |================================================= nginx 1.21.1 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge ........... 345710.87 |============================================ c6g.4xlarge Graviton2 . 307349.36 |======================================= nginx 1.21.1 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge ........... 352380.98 |============================================ c6g.4xlarge Graviton2 . 308938.67 |======================================= nginx 1.21.1 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge ........... 346613.34 |============================================ c6g.4xlarge Graviton2 . 310596.58 |======================================= nginx 1.21.1 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge ........... 346814.75 |============================================ c6g.4xlarge Graviton2 . 308213.13 |======================================= ONNX Runtime 1.11 Model: GPT-2 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge ........... 7990 |================================================= c6g.4xlarge Graviton2 . 6948 |=========================================== ONNX Runtime 1.11 Model: bertsquad-12 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge ........... 407 |================================================== c6g.4xlarge Graviton2 . 322 |======================================== ONNX Runtime 1.11 Model: fcn-resnet101-11 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge ........... 38 |=================================================== c6g.4xlarge Graviton2 . 28 |====================================== ONNX Runtime 1.11 Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge ........... 609 |================================================== c6g.4xlarge Graviton2 . 334 |=========================== ONNX Runtime 1.11 Model: super-resolution-10 - Device: CPU - Executor: Standard Inferences Per Minute > Higher Is Better c7g.4xlarge ........... 2817 |================================================= c6g.4xlarge Graviton2 . 2072 |==================================== Apache HTTP Server 2.4.48 Concurrent Requests: 100 Requests Per Second > Higher Is Better c7g.4xlarge ........... 67231.88 |============================================= c6g.4xlarge Graviton2 . 46995.35 |=============================== Apache HTTP Server 2.4.48 Concurrent Requests: 200 Requests Per Second > Higher Is Better c7g.4xlarge ........... 73676.95 |============================================= c6g.4xlarge Graviton2 . 50059.97 |=============================== Apache HTTP Server 2.4.48 Concurrent Requests: 500 Requests Per Second > Higher Is Better c7g.4xlarge ........... 73546.32 |============================================= c6g.4xlarge Graviton2 . 50077.81 |=============================== Apache HTTP Server 2.4.48 Concurrent Requests: 1000 Requests Per Second > Higher Is Better c7g.4xlarge ........... 72719.33 |============================================= c6g.4xlarge Graviton2 . 46629.45 |============================= PHPBench 0.8.1 PHP Benchmark Suite Score > Higher Is Better c7g.4xlarge ........... 666484 |=============================================== c6g.4xlarge Graviton2 . 449855 |================================