Amazon AWS

amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2306227-NE-AMAZONAWS79&grt.

Amazon AWSProcessorMotherboardChipsetMemoryDiskNetworkOSKernelCompilerFile-SystemSystem Layerm7g.16xlarge Graviton3ARMv8 Neoverse-V1 (64 Cores)Amazon EC2 m7g.16xlarge (1.0 BIOS)Amazon Device 0200256GB215GB Amazon Elastic Block StoreAmazon ElasticUbuntu 22.045.19.0-1025-aws (aarch64)GCC 11.3.0ext4amazonOpenBenchmarking.org- Transparent Huge Pages: madvise- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - Python 3.10.6- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

Amazon AWScompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingmt-dgemm: Sustained Floating-Point Rateamg: apache: 500apache: 1000brl-cad: VGR Performance Metriccoremark: CoreMark Size 666 - Iterations Per Secondgpaw: Carbon Nanotubegraph500: 26graph500: 26graph500: 26graph500: 26gromacs: MPI CPU - water_GMX50_bareheffte: c2c - FFTW - float - 128heffte: c2c - FFTW - float - 256heffte: c2c - FFTW - float - 512heffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float - 512heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double - 256heffte: c2c - FFTW - double - 512heffte: r2c - FFTW - double - 128heffte: r2c - FFTW - double - 256heffte: r2c - FFTW - double - 512hpcg: 144 144 144 - 60hpcg: 160 160 160 - 60kripke: laghos: Triple Point Problemlaghos: Sedov Blast Wave, ube_922_hex.meshlammps: 20k Atomslammps: Rhodopsin Proteinlczero: BLASlczero: Eigenliquid-dsp: 32 - 256 - 32liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 32liquid-dsp: 64 - 256 - 57liquid-dsp: 32 - 256 - 512liquid-dsp: 64 - 256 - 512lulesh: mocassin: Gas HII40mocassin: Dust 2D tau100.0npb: CG.Cnpb: EP.Dnpb: LU.Cnpb: MG.Cnpb: SP.Cnekrs: Kershawnekrs: TurboPipe Periodicnginx: 500nginx: 1000nwchem: C240 Buckyballopenssl: SHA256openssl: SHA512openssl: RSA4096openssl: RSA4096openssl: ChaCha20openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20-Poly1305pennant: sedovbigpennant: leblancbigqmcpack: Li2_STO_aeqmcpack: simple-H2Oqmcpack: FeCO6_b3lyp_gmsqmcpack: FeCO6_b3lyp_gmsremhos: Sample Remap Examplerodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclustersrsran: Downlink Processor Benchmarksrsran: PUSCH Processor Benchmark, Throughput Totalsrsran: PUSCH Processor Benchmark, Throughput Threadstockfish: Total Timestress-ng: NUMAstress-ng: CPU Cachestress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Matrix 3D Mathstress-ng: Memory Copyingstress-ng: Vector Shufflestress-ng: Wide Vector Mathstress-ng: Fused Multiply-Addstress-ng: Vector Floating Pointbuild-gem5: Time To Compilebuild-godot: Time To Compilebuild-nodejs: Time To Compileincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionm7g.16xlarge Graviton331682528554024.362353164676166771754.8960965.707837771601880.34226461.831119432000012277900002994970004197540004.223186.35681.444288.0482306.540164.873162.95657.150340.892346.2504138.01478.504984.473933.790133.8195339000400232.01410.5536.92737.558130113981136066667721493333227050000014424000008139666716275333328296.37813.57582.66921988.993738.9828341.6850126.2917244.8531506800003976300000255768.44255616.041940.2542125155803212544887010181.9713859.5103226784517332033171900283333113630742874609909.2064906.720537112.6128.041211.60205.7214.04043.7884.37511.663318.55413.895.81121197113759.103892396.34368750.67217235.5910403.9320484.2454143.401542834.9463762252.7676102.55180.247154.378237.7833.0987103813.9454180OpenBenchmarking.org

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingm7g.16xlarge Graviton370K140K210K280K350KSE +/- 154.72, N = 33168251. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingm7g.16xlarge Graviton360K120K180K240K300KSE +/- 93.51, N = 32855401. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratem7g.16xlarge Graviton3612182430SE +/- 0.17, N = 1324.361. (CC) gcc options: -O3 -march=native -fopenmp

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2m7g.16xlarge Graviton3400M800M1200M1600M2000MSE +/- 103191.30, N = 316467616671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 500m7g.16xlarge Graviton315K30K45K60K75KSE +/- 116.32, N = 371754.891. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 1000m7g.16xlarge Graviton313K26K39K52K65KSE +/- 72.21, N = 360965.701. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.34VGR Performance Metricm7g.16xlarge Graviton3200K400K600K800K1000K7837771. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm7g.16xlarge Graviton3300K600K900K1200K1500KSE +/- 11449.37, N = 151601880.341. (CC) gcc options: -O2 -lrt" -lrt

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 23.6Input: Carbon Nanotubem7g.16xlarge Graviton31428425670SE +/- 0.03, N = 361.831. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 26m7g.16xlarge Graviton3300M600M900M1200M1500M11943200001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 26m7g.16xlarge Graviton3300M600M900M1200M1500M12277900001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 26m7g.16xlarge Graviton360M120M180M240M300M2994970001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 26m7g.16xlarge Graviton390M180M270M360M450M4197540001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_barem7g.16xlarge Graviton30.95021.90042.85063.80084.751SE +/- 0.003, N = 34.2231. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128m7g.16xlarge Graviton34080120160200SE +/- 0.27, N = 3186.361. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256m7g.16xlarge Graviton320406080100SE +/- 0.01, N = 381.441. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512m7g.16xlarge Graviton320406080100SE +/- 0.02, N = 388.051. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128m7g.16xlarge Graviton370140210280350SE +/- 0.83, N = 3306.541. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256m7g.16xlarge Graviton34080120160200SE +/- 0.27, N = 3164.871. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512m7g.16xlarge Graviton34080120160200SE +/- 0.13, N = 3162.961. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128m7g.16xlarge Graviton31326395265SE +/- 0.28, N = 357.151. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256m7g.16xlarge Graviton3918273645SE +/- 0.01, N = 340.891. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512m7g.16xlarge Graviton31020304050SE +/- 0.01, N = 346.251. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128m7g.16xlarge Graviton3306090120150SE +/- 0.12, N = 3138.011. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256m7g.16xlarge Graviton320406080100SE +/- 0.02, N = 378.501. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512m7g.16xlarge Graviton320406080100SE +/- 0.02, N = 384.471. (CXX) g++ options: -O3

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 144 144 144 - RT: 60m7g.16xlarge Graviton3816243240SE +/- 0.00, N = 333.791. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1X Y Z: 160 160 160 - RT: 60m7g.16xlarge Graviton3816243240SE +/- 0.00, N = 333.821. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.6m7g.16xlarge Graviton370M140M210M280M350MSE +/- 619419.33, N = 33390004001. (CXX) g++ options: -O3 -fopenmp -ldl

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemm7g.16xlarge Graviton350100150200250SE +/- 0.28, N = 3232.011. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshm7g.16xlarge Graviton390180270360450SE +/- 0.42, N = 3410.551. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atomsm7g.16xlarge Graviton3816243240SE +/- 0.03, N = 336.931. (CXX) g++ options: -O3 -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Proteinm7g.16xlarge Graviton3918273645SE +/- 0.06, N = 337.561. (CXX) g++ options: -O3 -ldl

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASm7g.16xlarge Graviton330060090012001500SE +/- 4.67, N = 313011. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigenm7g.16xlarge Graviton330060090012001500SE +/- 8.74, N = 313981. (CXX) g++ options: -flto -pthread

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32m7g.16xlarge Graviton3200M400M600M800M1000MSE +/- 233333.33, N = 311360666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57m7g.16xlarge Graviton3150M300M450M600M750MSE +/- 3333.33, N = 37214933331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32m7g.16xlarge Graviton3500M1000M1500M2000M2500MSE +/- 435889.89, N = 322705000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57m7g.16xlarge Graviton3300M600M900M1200M1500MSE +/- 152752.52, N = 314424000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512m7g.16xlarge Graviton320M40M60M80M100MSE +/- 1855.92, N = 3813966671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512m7g.16xlarge Graviton330M60M90M120M150MSE +/- 6666.67, N = 31627533331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3m7g.16xlarge Graviton36K12K18K24K30KSE +/- 27.09, N = 328296.381. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Monte Carlo Simulations of Ionised Nebulae

Input: Gas HII40

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Gas HII40m7g.16xlarge Graviton33691215SE +/- 0.05, N = 313.581. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.orgSeconds, Fewer Is BetterMonte Carlo Simulations of Ionised Nebulae 2.02.73.3Input: Dust 2D tau100.0m7g.16xlarge Graviton320406080100SE +/- 0.01, N = 382.671. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm7g.16xlarge Graviton35K10K15K20K25KSE +/- 130.18, N = 321988.991. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm7g.16xlarge Graviton38001600240032004000SE +/- 1.69, N = 33738.981. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cm7g.16xlarge Graviton36K12K18K24K30KSE +/- 48.62, N = 328341.681. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm7g.16xlarge Graviton311K22K33K44K55KSE +/- 24.30, N = 350126.291. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cm7g.16xlarge Graviton34K8K12K16K20KSE +/- 10.19, N = 317244.851. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawm7g.16xlarge Graviton3700M1400M2100M2800M3500MSE +/- 1575066.14, N = 331506800001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicm7g.16xlarge Graviton3900M1800M2700M3600M4500MSE +/- 1199180.28, N = 339763000001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500m7g.16xlarge Graviton350K100K150K200K250KSE +/- 323.56, N = 3255768.441. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000m7g.16xlarge Graviton350K100K150K200K250KSE +/- 137.20, N = 3255616.041. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyballm7g.16xlarge Graviton34008001200160020001940.21. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256m7g.16xlarge Graviton312000M24000M36000M48000M60000MSE +/- 18610524.10, N = 3542125155801. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512m7g.16xlarge Graviton37000M14000M21000M28000M35000MSE +/- 17714077.14, N = 3321254488701. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096m7g.16xlarge Graviton32K4K6K8K10KSE +/- 1.27, N = 310181.91. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096m7g.16xlarge Graviton3150K300K450K600K750KSE +/- 21.82, N = 3713859.51. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20m7g.16xlarge Graviton320000M40000M60000M80000M100000MSE +/- 1293723.80, N = 31032267845171. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMm7g.16xlarge Graviton370000M140000M210000M280000M350000MSE +/- 81289574.27, N = 33320331719001. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMm7g.16xlarge Graviton360000M120000M180000M240000M300000MSE +/- 6411836.47, N = 32833331136301. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305m7g.16xlarge Graviton316000M32000M48000M64000M80000MSE +/- 1340503.89, N = 3742874609901. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigm7g.16xlarge Graviton33691215SE +/- 0.011347, N = 39.2064901. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigm7g.16xlarge Graviton3246810SE +/- 0.000869, N = 36.7205371. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

QMCPACK

Input: Li2_STO_ae

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: Li2_STO_aem7g.16xlarge Graviton3306090120150SE +/- 0.08, N = 3112.611. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: simple-H2Om7g.16xlarge Graviton3714212835SE +/- 0.03, N = 328.041. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: FeCO6_b3lyp_gmsm7g.16xlarge Graviton350100150200250SE +/- 0.22, N = 3211.601. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.16Input: FeCO6_b3lyp_gmsm7g.16xlarge Graviton350100150200250SE +/- 0.45, N = 3205.721. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplem7g.16xlarge Graviton348121620SE +/- 0.04, N = 314.041. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDm7g.16xlarge Graviton31020304050SE +/- 0.15, N = 343.791. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverm7g.16xlarge Graviton30.98441.96882.95323.93764.922SE +/- 0.011, N = 34.3751. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterm7g.16xlarge Graviton33691215SE +/- 0.14, N = 311.661. (CXX) g++ options: -O2 -lOpenCL

srsRAN Project

Test: Downlink Processor Benchmark

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: Downlink Processor Benchmarkm7g.16xlarge Graviton370140210280350SE +/- 0.91, N = 3318.51. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Totalm7g.16xlarge Graviton312002400360048006000SE +/- 4.08, N = 35413.81. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.5Test: PUSCH Processor Benchmark, Throughput Threadm7g.16xlarge Graviton320406080100SE +/- 0.03, N = 395.81. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total Timem7g.16xlarge Graviton320M40M60M80M100MSE +/- 2854071.93, N = 151121197111. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: NUMAm7g.16xlarge Graviton38001600240032004000SE +/- 5.17, N = 33759.101. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: CPU Cachem7g.16xlarge Graviton3800K1600K2400K3200K4000KSE +/- 57217.78, N = 153892396.341. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Matrix Mathm7g.16xlarge Graviton380K160K240K320K400KSE +/- 53.44, N = 3368750.671. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Mathm7g.16xlarge Graviton350K100K150K200K250KSE +/- 47.94, N = 3217235.591. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Matrix 3D Mathm7g.16xlarge Graviton32K4K6K8K10KSE +/- 6.38, N = 310403.931. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Memory Copyingm7g.16xlarge Graviton34K8K12K16K20KSE +/- 3.80, N = 320484.241. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Shuffle

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Shufflem7g.16xlarge Graviton312K24K36K48K60KSE +/- 21.44, N = 354143.401. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Wide Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Wide Vector Mathm7g.16xlarge Graviton3300K600K900K1200K1500KSE +/- 16116.93, N = 151542834.941. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Fused Multiply-Addm7g.16xlarge Graviton314M28M42M56M70MSE +/- 4870.19, N = 363762252.761. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.15.10Test: Vector Floating Pointm7g.16xlarge Graviton316K32K48K64K80KSE +/- 190.19, N = 376102.551. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilem7g.16xlarge Graviton34080120160200SE +/- 0.13, N = 3180.25

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 4.0Time To Compilem7g.16xlarge Graviton3306090120150SE +/- 0.32, N = 3154.38

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilem7g.16xlarge Graviton350100150200250SE +/- 0.33, N = 3237.78

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm7g.16xlarge Graviton30.69721.39442.09162.78883.486SE +/- 0.02702838, N = 33.098710381. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm7g.16xlarge Graviton348121620SE +/- 0.02, N = 313.951. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz


Phoronix Test Suite v10.8.5