AmpereOne A192-32X vs. AWS Graviton4 CPU Performance Benchmarks

AmpereOne versus AWS Graviton4 ARM64 CPU benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2409052-NE-GRAVITON412&gru&sor

PyTorch

Device: CPU - Batch Size: 512 - Model: ResNet-50

miniFE

Problem Size: Small

Algebraic Multi-Grid Benchmark

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

Xmrig

Variant: GhostRider - Hash Count: 1M

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick

Operation: Enhanced

GraphicsMagick

Operation: Sharpen

GraphicsMagick

Operation: Swirl

Coremark

CoreMark Size 666 - Iterations Per Second

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Thread

QuantLib

Configuration: Multi-Threaded

7-Zip Compression

Test: Compression Rating

7-Zip Compression

Test: Decompression Rating

ASKAP

Test: tConvolve MPI - Degridding

ASKAP

Test: tConvolve MPI - Gridding

ASTC Encoder

Preset: Thorough

ASTC Encoder

Preset: Very Thorough

ASTC Encoder

Preset: Exhaustive

Stockfish

Chess Benchmark

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

RocksDB

Test: Random Read

RocksDB

Test: Read While Writing

Speedb

Test: Random Read

Memcached

Set To Get Ratio: 1:100

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse

100M Rows Hits Dataset, Third Run

John The Ripper

Test: Blowfish

John The Ripper

Test: bcrypt

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 32

Numpy Benchmark

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks

Test / Class: IS.D

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

LULESH

Pennant

Test: leblancbig

Pennant

Test: sedovbig

PyBench

Total For Average Test Times

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

Timed Node.js Compilation

Time To Compile

Timed Gem5 Compilation

Time To Compile

Timed LLVM Compilation

Build System: Ninja

Timed Mesa Compilation

Time To Compile

WRF

Input: conus 2.5km

m-queens

Time To Solve

CloverLeaf

Input: clover_bm64_short

CloverLeaf

Input: clover_bm16

GPAW

Input: Carbon Nanotube

NWChem

Input: C240 Buckyball

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

Parallel BZIP2 Compression

FreeBSD-13.0-RELEASE-amd64-memstick.img Compression

Helsing

Digit Range: 14 digit

Primesieve

Length: 1e13

QMCPACK

Input: Li2_STO_ae

Phoronix Test Suite v10.8.5