AWS Graviton1 To Graviton4 Benchmarks

Benchmarks by Michael Larabel for a future article looking at AWS Graviton1, Graviton2, Graviton3, Graviton4 metal performance.

HTML result view exported from: https://openbenchmarking.org/result/2407243-NE-GRAVITON434&grr&sor.

AWS Graviton1 To Graviton4 BenchmarksProcessorMotherboardMemoryDiskNetworkOSKernelCompilerFile-SystemGraviton1 16 Cores a1.metalGraviton2 64 Cores r6g.metalGraviton3 64 Cores r7g.metalGraviton4 96 Cores r8g.metal-24xlARMv8 Cortex-A72 (16 Cores)Amazon EC2 a1.metal (1.0 BIOS)32GB429GB Amazon Elastic Block StoreAmazon ElasticUbuntu 24.046.8.0-1009-aws (aarch64)GCC 13.2.0ext4ARMv8 Neoverse-N1 (64 Cores)Amazon EC2 r6g.metal (1.0 BIOS)8 x 64GB DDR4-3200MT/sARMv8 Neoverse-V1 (64 Cores)Amazon EC2 r7g.metal (1.0 BIOS)8 x 64GB DDR5-4400MT/sARMv8 Neoverse-V2 (96 Cores)Amazon EC2 r8g.metal-24xl (1.0 BIOS)12 x 64GB DDR5-5600MT/sOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v Java Details- OpenJDK Runtime Environment (build 11.0.23+9-post-Ubuntu-1ubuntu1)Python Details- Python 3.12.3Security Details- Graviton1 16 Cores a1.metal: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Branch predictor hardening BHB + srbds: Not affected + tsx_async_abort: Not affected - Graviton2 64 Cores r6g.metal: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - Graviton3 64 Cores r7g.metal: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - Graviton4 96 Cores r8g.metal-24xl: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AWS Graviton1 To Graviton4 Benchmarksbuild-nodejs: Time To Compilestockfish: Chess Benchmarkbuild-godot: Time To Compileblender: Classroom - CPU-Onlynumpy: blender: Fishy Cat - CPU-Onlyc-ray: 5K - 16avifenc: 0pgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Onlyhpcg: 104 104 104 - 60build-php: Time To Compilespeedb: Read While Writingblender: BMW27 - CPU-Onlyopenssl: ChaCha20openssl: AES-128-GCMopenssl: SHA256openssl: ChaCha20-Poly1305openssl: AES-256-GCMopenssl: SHA512gromacs: MPI CPU - water_GMX50_bareavifenc: 2pyperformance: xml_etreec-ray: 4K - 16rocksdb: Update Randspeedb: Rand Readrocksdb: Read While Writingrocksdb: Rand Readstress-ng: CPU Cachepyperformance: python_startupsrsran: PUSCH Processor Benchmark, Throughput Threadmt-dgemm: Sustained Floating-Point Rategraphics-magick: Enhancedjohn-the-ripper: bcryptspeedb: Update Randjohn-the-ripper: Blowfishgraphics-magick: Noise-Gaussianjohn-the-ripper: HMAC-SHA512speedb: Read Rand Write Randgraphics-magick: Sharpenjohn-the-ripper: MD5rocksdb: Read Rand Write Randgraphics-magick: HWB Color Spacesrsran: PUSCH Processor Benchmark, Throughput Totalcompress-lz4: 12 - Decompression Speedcompress-lz4: 12 - Compression Speedcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedphpbench: PHP Benchmark Suitepyperformance: regex_compilec-ray: 1080p - 16pyperformance: pathlibcoremark: CoreMark Size 666 - Iterations Per Secondwebp: Quality 100, Losslesspyperformance: crypto_pyaescompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingjohn-the-ripper: WPA PSKpyperformance: json_loadsstress-ng: Power Mathstress-ng: Matrix 3D Mathstress-ng: Vector Floating Pointstress-ng: Logarithmic Mathstress-ng: Trigonometric Mathstress-ng: Floating Pointstress-ng: Memory Copyingstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Fused Multiply-Addminife: Smallsrsran: PDSCH Processor Benchmark, Throughput Totalsrsran: PDSCH Processor Benchmark, Throughput Threadavifenc: 6, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100webp: DefaultGraviton1 16 Cores a1.metalGraviton2 64 Cores r6g.metalGraviton3 64 Cores r7g.metalGraviton4 96 Cores r8g.metal-24xl4663.27624930571674.4691289.61104.59952.431008.693668.4999.8161018703.76867479.6422433641599.39828545033330455741337615174993365541467702577349645725058918770.313381.112230566.94616461623286630126865023252821631797.1325.918.70.879250207842128167800326103216675526212619042657445881169.31090.34.831102.013.72234211455140.822110192273.0337150.402274124432697789472.11609.16838.943466.253302.682998.75852.81997.897343.9928508.494928995.124159.121307.394.028.2291.433.254.63647.61943949837266.138166.70272.07146.68112.874251.3321.01598550920.5591178.110822978081.54672387510701573863098133897630800346636998930126846323187141228694632.731159.50810063.48972833518799668361288991652614992004843.7612.439.420.410800128453493635404530511764712667278103715313416673421023212727.92207.68.022200.325.5044420417816.18131.71176422.6858110.801102334932334914997536.79242.025845.8842695.4618828.2217338.174700.4811890.83284337.61165655.6333894303.5422864.28295.6146.68.9692.075.157.55555.66657635786190.349128.36400.62114.6068.577157.2750.867115367733.0799120.472875837662.63942415630732881311923334748241389369534325600250606646610307145774774.20695.40176.638.558102639328410577571643542469896983855459.6210.15334.1144581955078766456650774156920076673735110252145300044882712661054.23108.39.843088.633.236411061379.83623.71535079.5462131.0790.12857292993775273428.713073.079831.8976127.7727799.2326006.527047.6320452.57368563.70262952.3242419807.9336601.513395.4239.66.4832.646.729.87289.267118337918126.34669.40495.7864.0733.678130.2030.346289326461.619589.9991149091133.7214879168876044487114879785604846770110699450003400130558450532034182836.92680.24358.218.9271327423640938471115608195274585474735225.177.5569.460.63583929485647729398856672081582383335342911325236733370471013011961.73595.511.953573.139.808292211014.89117.62732642.2688911.3365.04946604798518628120.623418.4022886.62119727.5750252.6645810.3712455.0232525.12631796.65438910.4065585333.0470677.822062.7278.75.2373.227.8211.61OpenBenchmarking.org

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal10002000300040005000SE +/- 0.69, N = 3SE +/- 0.73, N = 3SE +/- 0.21, N = 3SE +/- 1.44, N = 3289.27555.67647.624663.28

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 16.1Chess BenchmarkGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal30M60M90M120M150MSE +/- 2918876.82, N = 15SE +/- 1588787.71, N = 12SE +/- 1121445.57, N = 12SE +/- 39945.87, N = 9118337918576357864394983724930571. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To CompileGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal400800120016002000SE +/- 1.14, N = 3SE +/- 0.44, N = 3SE +/- 0.92, N = 3SE +/- 0.31, N = 3126.35190.35266.141674.47

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Classroom - Compute: CPU-OnlyGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal30060090012001500SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.35, N = 3SE +/- 0.28, N = 369.40128.36166.701289.61

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal110220330440550SE +/- 0.76, N = 3SE +/- 0.77, N = 3SE +/- 0.29, N = 3SE +/- 0.23, N = 3495.78400.62272.07104.59

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: Fishy Cat - Compute: CPU-OnlyGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal2004006008001000SE +/- 0.26, N = 3SE +/- 0.29, N = 3SE +/- 0.35, N = 3SE +/- 0.38, N = 364.07114.60146.68952.43

C-Ray

Resolution: 5K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal2004006008001000SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 2.98, N = 333.6868.58112.871008.691. (CC) gcc options: -lpthread -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal140280420560700SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 3SE +/- 3.02, N = 3130.20157.28251.33668.501. (CXX) g++ options: -O3 -fPIC -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal3691215SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.008, N = 10SE +/- 0.010, N = 30.3460.8671.0159.8161. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal600K1200K1800K2400K3000KSE +/- 15759.90, N = 3SE +/- 3990.43, N = 3SE +/- 7623.84, N = 10SE +/- 107.42, N = 3289326411536779855091018701. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

High Performance Conjugate Gradient

X Y Z: 104 104 104 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 104 104 104 - RT: 60Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1428425670SE +/- 0.01489, N = 3SE +/- 0.01313, N = 3SE +/- 0.00037, N = 3SE +/- 0.00496, N = 361.6195033.0799020.559103.768671. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 8.3.4Time To CompileGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal100200300400500SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.59, N = 3SE +/- 0.12, N = 390.00120.47178.11479.64

Speedb

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read While WritingGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal2M4M6M8M10MSE +/- 87574.70, N = 15SE +/- 23497.61, N = 3SE +/- 74542.67, N = 15SE +/- 20302.77, N = 8114909118758376822978024336411. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0.2Blend File: BMW27 - Compute: CPU-OnlyGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal130260390520650SE +/- 0.03, N = 3SE +/- 0.18, N = 3SE +/- 0.21, N = 3SE +/- 0.63, N = 333.7262.6381.54599.39

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal30000M60000M90000M120000M150000MSE +/- 430328.22, N = 3SE +/- 22981194.52, N = 3SE +/- 157650.92, N = 3SE +/- 4780384.74, N = 3148791688760942415630736723875107082854503331. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-128-GCMGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal100000M200000M300000M400000M500000MSE +/- 2173107.96, N = 3SE +/- 2741312.53, N = 3SE +/- 27172495.86, N = 3SE +/- 35254075.37, N = 3444871148797288131192333157386309813304557413371. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA256Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20000M40000M60000M80000M100000MSE +/- 7443314.27, N = 3SE +/- 21492664.43, N = 3SE +/- 308447238.10, N = 3SE +/- 62991235.00, N = 385604846770474824138933897630800361517499331. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: ChaCha20-Poly1305Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20000M40000M60000M80000M100000MSE +/- 512363.59, N = 3SE +/- 7386579.63, N = 3SE +/- 281207.00, N = 3SE +/- 4948671.03, N = 3110699450003695343256004663699893065541467701. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: AES-256-GCMGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal90000M180000M270000M360000M450000MSE +/- 15611507.33, N = 3SE +/- 48618733.94, N = 3SE +/- 5879487.16, N = 3SE +/- 17186337.18, N = 3400130558450250606646610126846323187257734964571. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSLAlgorithm: SHA512Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal11000M22000M33000M44000M55000MSE +/- 4634061.90, N = 3SE +/- 2215754.50, N = 3SE +/- 20829920.25, N = 3SE +/- 4619831.10, N = 353203418283307145774771412286946325058918771. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2024Implementation: MPI CPU - Input: water_GMX50_bareGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal246810SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 36.9264.2062.7310.3131. (CXX) g++ options: -O3 -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal80160240320400SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.85, N = 380.2495.40159.51381.111. (CXX) g++ options: -O3 -fPIC -lm

PyPerformance

Benchmark: xml_etree

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: xml_etreeGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal50100150200250SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.58, N = 358.276.6100.0230.0

C-Ray

Resolution: 4K - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal120240360480600SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 1.65, N = 318.9338.5663.49566.951. (CC) gcc options: -lpthread -lm

RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal300K600K900K1200K1500KSE +/- 5313.65, N = 3SE +/- 13144.46, N = 3SE +/- 4819.65, N = 15SE +/- 443.99, N = 3132742310263937283351646161. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Speedb

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal140M280M420M560M700MSE +/- 229391.69, N = 3SE +/- 1029086.00, N = 3SE +/- 445171.45, N = 3SE +/- 174965.25, N = 15640938471284105775187996683232866301. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal2M4M6M8M10MSE +/- 18723.10, N = 3SE +/- 18610.37, N = 3SE +/- 57133.93, N = 15SE +/- 16876.08, N = 3115608197164354612889912686501. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal110M220M330M440M550MSE +/- 53499.62, N = 3SE +/- 86390.65, N = 3SE +/- 1103933.53, N = 15SE +/- 12274.71, N = 3527458547246989698165261499232528211. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: CPU CacheGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1000K2000K3000K4000K5000KSE +/- 57877.80, N = 15SE +/- 45276.44, N = 4SE +/- 23602.71, N = 15SE +/- 3895.03, N = 34735225.173855459.622004843.76631797.131. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

PyPerformance

Benchmark: python_startup

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: python_startupGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal612182430SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 37.5510.1012.4025.90

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput ThreadGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1530456075SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.23, N = 469.453.039.418.7MIN: 45.4MIN: 35.3MIN: 24.4MIN: 11.3 / MAX: 18.91. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1428425670SE +/- 0.164173, N = 3SE +/- 0.005519, N = 3SE +/- 0.156541, N = 3SE +/- 0.003641, N = 360.63583934.11445820.4108000.8792501. (CC) gcc options: -O3 -march=native -fopenmp

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal60120180240300SE +/- 0.67, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3294195128201. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20K40K60K80K100KSE +/- 22.45, N = 3SE +/- 1.33, N = 3SE +/- 0.67, N = 3SE +/- 94.53, N = 1585647507874534978421. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Speedb

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Update RandomGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal160K320K480K640K800KSE +/- 3844.86, N = 3SE +/- 3804.56, N = 3SE +/- 3440.64, N = 3SE +/- 302.33, N = 37293986645663635401281671. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20K40K60K80K100KSE +/- 3.00, N = 3SE +/- 13.72, N = 3SE +/- 36.69, N = 3SE +/- 106.90, N = 1585667507744530580031. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal50100150200250SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.00, N = 3208156117261. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal30M60M90M120M150MSE +/- 162829.50, N = 3SE +/- 68676.37, N = 3SE +/- 77846.72, N = 3SE +/- 9769.57, N = 31582383339200766764712667103216671. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Speedb

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read Random Write RandomGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1.1M2.2M3.3M4.4M5.5MSE +/- 4241.50, N = 3SE +/- 3662.12, N = 3SE +/- 25580.42, N = 3SE +/- 214.78, N = 35342911373511027810375526211. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal70140210280350SE +/- 0.33, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3325252153261. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal500K1000K1500K2000K2500KSE +/- 1201.85, N = 3SE +/- 1154.70, N = 3SE +/- 333.33, N = 3SE +/- 287.69, N = 32367333145300013416671904261. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

RocksDB

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1.5M3M4.5M6M7.5MSE +/- 15765.85, N = 3SE +/- 18849.94, N = 3SE +/- 5811.82, N = 3SE +/- 4162.42, N = 37047101448827134210235744581. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal70140210280350SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.00, N = 3301266212811. (CC) gcc options: -fopenmp -O2 -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lz -lm -lpthread -lgomp

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PUSCH Processor Benchmark, Throughput TotalGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal400800120016002000SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 31961.71054.2727.9169.3MIN: 1132.6 / MAX: 1961.8MIN: 599.6MIN: 4161. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

LZ4 Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 12 - Decompression SpeedGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal8001600240032004000SE +/- 1.89, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 3SE +/- 2.61, N = 33595.53108.32207.61090.31. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 12 - Compression SpeedGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 311.959.848.024.831. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 9 - Decompression SpeedGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal8001600240032004000SE +/- 0.40, N = 3SE +/- 0.20, N = 3SE +/- 0.31, N = 3SE +/- 0.06, N = 33573.13088.62200.31102.01. (CC) gcc options: -O3 -pthread

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.10Compression Level: 9 - Compression SpeedGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal918273645SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 339.8033.2325.5013.721. (CC) gcc options: -O3 -pthread

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal200K400K600K800K1000KSE +/- 273.39, N = 3SE +/- 964.81, N = 3SE +/- 2661.11, N = 3SE +/- 481.21, N = 3829221641106444204234211

PyPerformance

Benchmark: regex_compile

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: regex_compileGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal100200300400500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 3101137178455

C-Ray

Resolution: 1080p - Rays Per Pixel: 16

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 1080p - Rays Per Pixel: 16Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal306090120150SE +/- 0.007, N = 3SE +/- 0.010, N = 3SE +/- 0.010, N = 3SE +/- 0.248, N = 34.8919.83616.181140.8221. (CC) gcc options: -lpthread -lm

PyPerformance

Benchmark: pathlib

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: pathlibGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20406080100SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 317.623.731.7110.0

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal600K1200K1800K2400K3000KSE +/- 64.82, N = 3SE +/- 10334.27, N = 13SE +/- 300.99, N = 3SE +/- 24.05, N = 32732642.271535079.551176422.69192273.031. (CC) gcc options: -O2 -lrt" -lrt

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, LosslessGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal0.29930.59860.89791.19721.4965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.331.070.800.401. (CC) gcc options: -fvisibility=hidden -O2 -lm

PyPerformance

Benchmark: crypto_pyaes

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: crypto_pyaesGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal50100150200250SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.33, N = 365.090.1110.0227.0

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal110K220K330K440K550KSE +/- 158.70, N = 3SE +/- 38.31, N = 3SE +/- 32.36, N = 3SE +/- 50.81, N = 3494660285729233493412441. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal100K200K300K400K500KSE +/- 2385.17, N = 3SE +/- 865.25, N = 3SE +/- 159.81, N = 3SE +/- 28.98, N = 3479851299377233491326971. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal20K40K60K80K100KSE +/- 0.00, N = 3SE +/- 6.00, N = 3SE +/- 11.59, N = 3SE +/- 1.33, N = 386281527344997578941. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

PyPerformance

Benchmark: json_loads

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyPerformance 1.11Benchmark: json_loadsGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal1632486480SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.10, N = 320.628.736.772.1

Stress-NG

Test: Power Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Power MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal5K10K15K20K25KSE +/- 0.98, N = 3SE +/- 0.10, N = 3SE +/- 13.10, N = 3SE +/- 0.08, N = 323418.4013073.079242.021609.161. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Matrix 3D MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal5K10K15K20K25KSE +/- 50.98, N = 3SE +/- 6.42, N = 3SE +/- 4.86, N = 3SE +/- 3.67, N = 322886.629831.895845.88838.941. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Vector Floating PointGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal30K60K90K120K150KSE +/- 17.26, N = 3SE +/- 29.81, N = 3SE +/- 35.00, N = 3SE +/- 0.38, N = 3119727.5776127.7742695.463466.251. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Logarithmic Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Logarithmic MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal11K22K33K44K55KSE +/- 1.22, N = 3SE +/- 0.23, N = 3SE +/- 0.91, N = 3SE +/- 0.28, N = 350252.6627799.2318828.223302.681. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Trigonometric Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Trigonometric MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal10K20K30K40K50KSE +/- 11.48, N = 3SE +/- 1.81, N = 3SE +/- 2.00, N = 3SE +/- 0.04, N = 345810.3726006.5217338.172998.751. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Floating PointGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal3K6K9K12K15KSE +/- 2.60, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 3SE +/- 1.60, N = 312455.027047.634700.48852.801. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Memory CopyingGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal7K14K21K28K35KSE +/- 13.60, N = 3SE +/- 1.92, N = 3SE +/- 1.30, N = 3SE +/- 0.09, N = 332525.1220452.5711890.831997.891. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Matrix MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal140K280K420K560K700KSE +/- 9.55, N = 3SE +/- 12.06, N = 3SE +/- 7.14, N = 3SE +/- 74.42, N = 3631796.65368563.70284337.617343.991. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Vector MathGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal90K180K270K360K450KSE +/- 85.60, N = 3SE +/- 51.34, N = 3SE +/- 6.90, N = 3SE +/- 0.54, N = 3438910.40262952.32165655.6328508.491. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.17.08Test: Fused Multiply-AddGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal14M28M42M56M70MSE +/- 1063.93, N = 3SE +/- 3002.12, N = 3SE +/- 1364.97, N = 3SE +/- 2472.21, N = 365585333.0442419807.9333894303.544928995.121. (CXX) g++ options: -O2 -std=gnu99 -lc -lm

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal15K30K45K60K75KSE +/- 233.77, N = 3SE +/- 30.17, N = 3SE +/- 11.77, N = 3SE +/- 1.93, N = 370677.8036601.5022864.204159.121. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput TotalGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal5K10K15K20K25KSE +/- 15.31, N = 3SE +/- 189.99, N = 3SE +/- 88.68, N = 5SE +/- 10.32, N = 322062.713395.48295.61307.31. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.10.1-20240325Test: PDSCH Processor Benchmark, Throughput ThreadGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal60120180240300SE +/- 3.00, N = 3SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.21, N = 3278.7239.6146.694.01. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, LosslessGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal714212835SE +/- 0.046, N = 3SE +/- 0.086, N = 3SE +/- 0.034, N = 3SE +/- 0.098, N = 35.2376.4838.96928.2291. (CXX) g++ options: -O3 -fPIC -lm

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100, Highest CompressionGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal0.72451.4492.17352.8983.6225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 33.222.642.071.431. (CC) gcc options: -fvisibility=hidden -O2 -lm

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: Quality 100Graviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.826.725.153.251. (CC) gcc options: -fvisibility=hidden -O2 -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgMP/s, More Is BetterWebP Image Encode 1.4Encode Settings: DefaultGraviton4 96 Cores r8g.metal-24xlGraviton3 64 Cores r7g.metalGraviton2 64 Cores r6g.metalGraviton1 16 Cores a1.metal3691215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 311.619.877.554.631. (CC) gcc options: -fvisibility=hidden -O2 -lm


Phoronix Test Suite v10.8.5