Graviton4 r8g.16xlarge vs. AMD EPYC 4th Gen

Initial benchmarks by Michael Larabel

HTML result view exported from: https://openbenchmarking.org/result/2407108-NE-2407106NE99&rdt&grr.

Graviton4 r8g.16xlarge
Processor: ARMv8 Neoverse-V2 (64 Cores)
Motherboard: Amazon EC2 r8g.16xlarge (1.0 BIOS)
Chipset: Amazon Device 0200
Memory: 496GB
Disk: 429GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Ubuntu 24.04
Kernel: 6.8.0-1009-aws (aarch64)
Compiler: GCC 13.2.0
File-System: ext4
System Layer: amazon

EPYC 9R14 r7a.16xlarge
Processor: AMD EPYC 9R14 (64 Cores)
Motherboard: Amazon EC2 r7a.16xlarge (1.0 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 1 x 512GB DDR5-4800MT/s
Graphics: simpledrmdrmfb
Kernel: 6.8.0-1009-aws (x86_64)
Screen Resolution: 800x600

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- Graviton4 r8g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-dIwDw0/gcc-13-13.2.0/debian/tmp-nvptx/usr --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto --without-cuda-driver -v
- EPYC 9R14 r7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Python Details
- Python 3.12.3

Security Details
- Graviton4 r8g.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- EPYC 9R14 r7a.16xlarge: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Vulnerable: Safe RET no microcode + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- EPYC 9R14 r7a.16xlarge: CPU Microcode: 0xa101148

Test | Graviton4 r8g.16xlarge | EPYC 9R14 r7a.16xlarge
clickhouse: 100M Rows Hits Dataset, Third Run | 495.03 | 514.29
clickhouse: 100M Rows Hits Dataset, Second Run | 479.94 | 508.87
clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache | 449.44 | 488.88
build-gem5: Time To Compile | 186.768 | 213.288
xmrig: GhostRider - 1M | 5958.7 | 6899.8
blender: Barbershop - CPU-Only | 499.63 | 284.63
stockfish: Chess Benchmark | 81440801 | 85180777
build-nodejs: Time To Compile | 365.024 | 250.429
build-llvm: Ninja | 182.063 | 198.500
openssl: ChaCha20-Poly1305 | 75190548707 | 217491096737
openssl: AES-128-GCM | 299797102137 | 256101795407
openssl: AES-256-GCM | 266125150527 | 236482948213
openssl: ChaCha20 | 101898720323 | 308611390393
rocksdb: Read While Writing | 8522597 | 6304192
openfoam: drivaerFastback, Medium Mesh Size - Execution Time | 344.1676 | 374.69664
openfoam: drivaerFastback, Medium Mesh Size - Mesh Time | 91.974527 | 123.99717
pgbench: 100 - 1000 - Read Only - Average Latency | 0.514 | 0.478
pgbench: 100 - 1000 - Read Only | 1947525 | 2092710
pgbench: 100 - 1000 - Read Write - Average Latency | 226.244 | 224.323
pgbench: 100 - 1000 - Read Write | 4420 | 4458
blender: Pabellon Barcelona - CPU-Only | 202.26 | 84.52
build-godot: Time To Compile | 147.870 | 127.448
openssl: SHA512 | 35412809363 | -
openssl: SHA256 | 56971174537 | -
blender: Classroom - CPU-Only | 105.39 | 57.11
c-ray: 5K - 16 | 50.247 | 88.891
blender: Fishy Cat - CPU-Only | 95.01 | 39.16
rocksdb: Update Rand | 1274616 | 742898
john-the-ripper: MD5 | 1579667 | 8647000
john-the-ripper: HMAC-SHA512 | 108066333 | 256384000
rocksdb: Read Rand Write Rand | 5584014 | 3756585
rocksdb: Rand Read | 346532586 | 324703669
blender: BMW27 - CPU-Only | 50.64 | 30.19
c-ray: 4K - 16 | 28.323 | 49.691
gromacs: MPI CPU - water_GMX50_bare | 4.831 | 8.519
john-the-ripper: WPA PSK | 57444 | 371747
liquid-dsp: 64 - 256 - 512 | 185000000 | 759700000
john-the-ripper: bcrypt | 57038 | 91409
john-the-ripper: Blowfish | 57032 | 91221
liquid-dsp: 64 - 256 - 57 | 1929666667 | 2529533333
liquid-dsp: 64 - 256 - 32 | 3258733333 | 2261533333
srsran: PUSCH Processor Benchmark, Throughput Total | 1332.5 | 2400.2
xmrig: CryptoNight-Femto UPX2 - 1M | 21851.9 | -
xmrig: CryptoNight-Heavy - 1M | 21867.2 | -
xmrig: Monero - 1M | 21872.1 | -
xmrig: KawPow - 1M | 21906.0 | -
openfoam: drivaerFastback, Small Mesh Size - Execution Time | 41.413075 | 43.169709
openfoam: drivaerFastback, Small Mesh Size - Mesh Time | 18.384768 | 24.788236
compress-7zip: Decompression Rating | 331505 | 288408
compress-7zip: Compression Rating | 383460 | 320933
xmrig: Wownero - 1M | 28304.0 | -
incompact3d: input.i3d 193 Cells Per Direction | 7.87502066 | 10.9157661
minife: Small | 65410.4 | 41274.2
srsran: PDSCH Processor Benchmark, Throughput Total | 14402.5 | 28256.8

ClickHouse

100M Rows Hits Dataset, Third Run

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean (More Is Better)
Graviton4 r8g.16xlarge: 495.03 (SE +/- 6.71, N = 9; MIN: 43.07 / MAX: 6666.67)
EPYC 9R14 r7a.16xlarge: 514.29 (SE +/- 9.86, N = 3; MIN: 40.51 / MAX: 6000)

ClickHouse

100M Rows Hits Dataset, Second Run

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean (More Is Better)
Graviton4 r8g.16xlarge: 479.94 (SE +/- 5.05, N = 9; MIN: 43.1 / MAX: 6000)
EPYC 9R14 r7a.16xlarge: 508.87 (SE +/- 2.24, N = 3; MIN: 40.57 / MAX: 6000)

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

ClickHouse 22.12.3.5 - Queries Per Minute, Geo Mean (More Is Better)
Graviton4 r8g.16xlarge: 449.44 (SE +/- 4.96, N = 9; MIN: 42.7 / MAX: 6666.67)
EPYC 9R14 r7a.16xlarge: 488.88 (SE +/- 6.79, N = 3; MIN: 40.21 / MAX: 5000)

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 23.0.1 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 186.77 (SE +/- 1.43, N = 12)
EPYC 9R14 r7a.16xlarge: 213.29 (SE +/- 2.25, N = 3)

Xmrig

Variant: GhostRider - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 5958.7 (SE +/- 47.30, N = 12)
EPYC 9R14 r7a.16xlarge: 6899.8 (SE +/- 2.15, N = 3; additional flags: -maes)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 4.0.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 499.63 (SE +/- 0.76, N = 3)
EPYC 9R14 r7a.16xlarge: 284.63 (SE +/- 0.03, N = 3)

Stockfish

Chess Benchmark

Stockfish 16.1 - Nodes Per Second (More Is Better)
Graviton4 r8g.16xlarge: 81440801 (SE +/- 2235406.92, N = 15)
EPYC 9R14 r7a.16xlarge: 85180777 (SE +/- 927253.56, N = 12; additional flags: -m64 -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -flto -flto-partition=one -flto=jobserver

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 21.7.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 365.02 (SE +/- 0.19, N = 3)
EPYC 9R14 r7a.16xlarge: 250.43 (SE +/- 0.19, N = 3)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 16.0 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 182.06 (SE +/- 0.20, N = 3)
EPYC 9R14 r7a.16xlarge: 198.50 (SE +/- 0.91, N = 3)

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 75190548707 (SE +/- 558939.35, N = 3)
EPYC 9R14 r7a.16xlarge: 217491096737 (SE +/- 197460025.45, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: AES-128-GCM

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 299797102137 (SE +/- 7802917.99, N = 3)
EPYC 9R14 r7a.16xlarge: 256101795407 (SE +/- 358257182.84, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: AES-256-GCM

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 266125150527 (SE +/- 6417811.59, N = 3)
EPYC 9R14 r7a.16xlarge: 236482948213 (SE +/- 338102973.44, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL

Algorithm: ChaCha20

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 101898720323 (SE +/- 66408.09, N = 3)
EPYC 9R14 r7a.16xlarge: 308611390393 (SE +/- 350599170.76, N = 3)
1. Graviton4 r8g.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)
2. EPYC 9R14 r7a.16xlarge: OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

RocksDB

Test: Read While Writing

RocksDB 9.0 - Op/s (More Is Better)
Graviton4 r8g.16xlarge: 8522597 (SE +/- 26974.32, N = 3)
EPYC 9R14 r7a.16xlarge: 6304192 (SE +/- 43017.34, N = 13)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 344.17 (additional flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 374.70 (additional flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenFOAM 10 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 91.97 (additional flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 124.00 (additional flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 16 - ms (Fewer Is Better)
Graviton4 r8g.16xlarge: 0.514 (SE +/- 0.002, N = 3)
EPYC 9R14 r7a.16xlarge: 0.478 (SE +/- 0.002, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 16 - TPS (More Is Better)
Graviton4 r8g.16xlarge: 1947525 (SE +/- 6233.51, N = 3)
EPYC 9R14 r7a.16xlarge: 2092710 (SE +/- 7113.73, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency

PostgreSQL 16 - ms (Fewer Is Better)
Graviton4 r8g.16xlarge: 226.24 (SE +/- 0.74, N = 3)
EPYC 9R14 r7a.16xlarge: 224.32 (SE +/- 1.74, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

PostgreSQL 16 - TPS (More Is Better)
Graviton4 r8g.16xlarge: 4420 (SE +/- 14.53, N = 3)
EPYC 9R14 r7a.16xlarge: 4458 (SE +/- 34.82, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 4.0.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 202.26 (SE +/- 0.34, N = 3)
EPYC 9R14 r7a.16xlarge: 84.52 (SE +/- 0.11, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 147.87 (SE +/- 0.35, N = 3)
EPYC 9R14 r7a.16xlarge: 127.45 (SE +/- 0.19, N = 3)

OpenSSL

Algorithm: SHA512

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 35412809363 (SE +/- 8695334.33, N = 3)
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

OpenSSL

Algorithm: SHA256

OpenSSL - byte/s (More Is Better)
Graviton4 r8g.16xlarge: 56971174537 (SE +/- 10002237.60, N = 3)
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024)

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 4.0.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 105.39 (SE +/- 0.11, N = 3)
EPYC 9R14 r7a.16xlarge: 57.11 (SE +/- 0.14, N = 3)

C-Ray

Resolution: 5K - Rays Per Pixel: 16

C-Ray 2.0 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 50.25 (SE +/- 0.07, N = 3)
EPYC 9R14 r7a.16xlarge: 88.89 (SE +/- 0.10, N = 3)
1. (CC) gcc options: -lpthread -lm

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 4.0.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 95.01 (SE +/- 0.26, N = 3)
EPYC 9R14 r7a.16xlarge: 39.16 (SE +/- 0.06, N = 3)

RocksDB

Test: Update Random

RocksDB 9.0 - Op/s (More Is Better)
Graviton4 r8g.16xlarge: 1274616 (SE +/- 11457.12, N = 3)
EPYC 9R14 r7a.16xlarge: 742898 (SE +/- 4861.01, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

John The Ripper

Test: MD5

John The Ripper 2023.03.14 - Real C/S (More Is Better)
Graviton4 r8g.16xlarge: 1579667 (SE +/- 1201.85, N = 3)
EPYC 9R14 r7a.16xlarge: 8647000 (SE +/- 7571.88, N = 3; additional flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: HMAC-SHA512

John The Ripper 2023.03.14 - Real C/S (More Is Better)
Graviton4 r8g.16xlarge: 108066333 (SE +/- 640028.21, N = 3)
EPYC 9R14 r7a.16xlarge: 256384000 (SE +/- 730794.32, N = 3; additional flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

RocksDB

Test: Read Random Write Random

RocksDB 9.0 - Op/s (More Is Better)
Graviton4 r8g.16xlarge: 5584014 (SE +/- 11170.68, N = 3)
EPYC 9R14 r7a.16xlarge: 3756585 (SE +/- 17358.30, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

Test: Random Read

RocksDB 9.0 - Op/s (More Is Better)
Graviton4 r8g.16xlarge: 346532586 (SE +/- 1371022.17, N = 3)
EPYC 9R14 r7a.16xlarge: 324703669 (SE +/- 827628.35, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 4.0.2 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 50.64 (SE +/- 0.10, N = 3)
EPYC 9R14 r7a.16xlarge: 30.19 (SE +/- 0.08, N = 3)

C-Ray

Resolution: 4K - Rays Per Pixel: 16

C-Ray 2.0 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 28.32 (SE +/- 0.03, N = 3)
EPYC 9R14 r7a.16xlarge: 49.69 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -lpthread -lm

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2024 - Ns Per Day (More Is Better)
Graviton4 r8g.16xlarge: 4.831 (SE +/- 0.004, N = 3)
EPYC 9R14 r7a.16xlarge: 8.519 (SE +/- 0.041, N = 3)
1. (CXX) g++ options: -O3 -lm

John The Ripper

Test: WPA PSK

John The Ripper 2023.03.14 - Real C/S (More Is Better)
Graviton4 r8g.16xlarge: 57444 (SE +/- 10.97, N = 3)
EPYC 9R14 r7a.16xlarge: 371747 (SE +/- 164.01, N = 3; additional flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 - samples/s (More Is Better)
Graviton4 r8g.16xlarge: 185000000 (SE +/- 0.00, N = 3)
EPYC 9R14 r7a.16xlarge: 759700000 (SE +/- 1140935.29, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

John The Ripper

Test: bcrypt

John The Ripper 2023.03.14 - Real C/S (More Is Better)
Graviton4 r8g.16xlarge: 57038 (SE +/- 2.33, N = 3)
EPYC 9R14 r7a.16xlarge: 91409 (SE +/- 75.52, N = 3; additional flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

John The Ripper 2023.03.14 - Real C/S (More Is Better)
Graviton4 r8g.16xlarge: 57032 (SE +/- 9.29, N = 3)
EPYC 9R14 r7a.16xlarge: 91221 (SE +/- 117.95, N = 3; additional flags: -m64)
1. (CC) gcc options: -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 - samples/s (More Is Better)
Graviton4 r8g.16xlarge: 1929666667 (SE +/- 33333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 2529533333 (SE +/- 6691619.97, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 - samples/s (More Is Better)
Graviton4 r8g.16xlarge: 3258733333 (SE +/- 533333.33, N = 3)
EPYC 9R14 r7a.16xlarge: 2261533333 (SE +/- 3199131.83, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 - Mbps (More Is Better)
Graviton4 r8g.16xlarge: 1332.5 (SE +/- 0.03, N = 3; MIN: 784 / MAX: 1332.6)
EPYC 9R14 r7a.16xlarge: 2400.2 (SE +/- 0.07, N = 3; MIN: 1696.6 / MAX: 2400.3; additional flags: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl

Xmrig

Variant: CryptoNight-Femto UPX2 - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 21851.9 (SE +/- 9.16, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: CryptoNight-Heavy - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 21867.2 (SE +/- 16.81, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 21872.1 (SE +/- 40.15, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xmrig

Variant: KawPow - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 21906.0 (SE +/- 59.25, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenFOAM 10 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 41.41 (additional flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 43.17 (additional flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenFOAM 10 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 18.38 (additional flags: -mcpu=native)
EPYC 9R14 r7a.16xlarge: 24.79 (additional flags: -m64)
1. (CXX) g++ options: -std=c++14 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 24.05 - MIPS (More Is Better)
Graviton4 r8g.16xlarge: 331505 (SE +/- 107.45, N = 3)
EPYC 9R14 r7a.16xlarge: 288408 (SE +/- 148.35, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

7-Zip Compression 24.05 - MIPS (More Is Better)
Graviton4 r8g.16xlarge: 383460 (SE +/- 397.84, N = 3)
EPYC 9R14 r7a.16xlarge: 320933 (SE +/- 1508.30, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.21 - H/s (More Is Better)
Graviton4 r8g.16xlarge: 28304.0 (SE +/- 16.31, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 - Seconds (Fewer Is Better)
Graviton4 r8g.16xlarge: 7.87502066 (SE +/- 0.01884341, N = 3)
EPYC 9R14 r7a.16xlarge: 10.91576610 (SE +/- 0.01638008, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

miniFE

Problem Size: Small

miniFE 2.2 - CG Mflops (More Is Better)
Graviton4 r8g.16xlarge: 65410.4 (SE +/- 28.12, N = 3)
EPYC 9R14 r7a.16xlarge: 41274.2 (SE +/- 136.71, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

srsRAN Project

Test: PDSCH Processor Benchmark, Throughput Total

srsRAN Project 23.10.1-20240325 - Mbps (More Is Better)
Graviton4 r8g.16xlarge: 14402.5 (SE +/- 34.15, N = 3)
EPYC 9R14 r7a.16xlarge: 28256.8 (SE +/- 148.22, N = 3; additional flags: -march=native -mavx2 -mavx -msse4.1 -mfma -mavx512f -mavx512cd -mavx512bw -mavx512dq)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -ldl


Phoronix Test Suite v10.8.5