Amazon AWS

Amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2306232-NE-2306227NE39&gru&sro.

m7g.16xlarge Graviton3
Processor: ARMv8 Neoverse-V1 (64 Cores)
Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS)
Chipset: Amazon Device 0200
Memory: 256GB
Disk: 215GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Ubuntu 22.04
Kernel: 5.19.0-1025-aws (aarch64)
Compiler: GCC 11.3.0
File-System: ext4
System Layer: amazon

c6g.16xlarge Graviton2 (the export lists only the fields that differ from the m7g.16xlarge column)
Processor: ARMv8 Neoverse-N1 (64 Cores)
Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS)
Memory: 128GB

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v

Python Details
- Python 3.10.6

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- mmio_stale_data: Not affected
- retbleed: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of __user pointer sanitization
- spectre_v2: Mitigation of CSV2 BHB
- srbds: Not affected
- tsx_async_abort: Not affected

Result overview

Test | m7g.16xlarge Graviton3 | c6g.16xlarge Graviton2
graph500: 26 (bfs max_TEPS) | 1227790000 | 874389000
graph500: 26 (bfs median_TEPS) | 1194320000 | 860432000
stress-ng: NUMA | 3759.10 | 2112.66
stress-ng: CPU Cache | 3892396.34 | 1921785.20
stress-ng: Matrix Math | 368750.67 | 284713.63
stress-ng: Vector Math | 217235.59 | 147886.14
stress-ng: Matrix 3D Math | 10403.93 | 5752.17
stress-ng: Memory Copying | 20484.24 | 11324.79
stress-ng: Vector Shuffle | 54143.40 | 35614.51
stress-ng: Wide Vector Math | 1542834.94 | 997272.65
stress-ng: Fused Multiply-Add | 63762252.76 | 37732190.54
stress-ng: Vector Floating Point | 76102.55 | 42850.82
openssl: SHA256 | 54212515580 | 42472798847
openssl: SHA512 | 32125448870 | 14393925490
openssl: ChaCha20 | 103226784517 | 67292541203
openssl: AES-128-GCM | 332033171900 | 158436163857
openssl: AES-256-GCM | 283333113630 | 129199593157
openssl: ChaCha20-Poly1305 | 74287460990 | 46717636807
amg | 1646761667 | 1035586333
nekrs: Kershaw | 3150680000 | 1760336667
nekrs: TurboPipe Periodic | 3976300000 | 2220190000
hpcg: 144 144 144 - 60 | 33.7901 | -
hpcg: 160 160 160 - 60 | 33.8195 | -
heffte: c2c - FFTW - float - 128 | 186.356 | 135.358
heffte: c2c - FFTW - float - 256 | 81.4442 | 41.9816
heffte: c2c - FFTW - float - 512 | 88.0482 | 42.8284
heffte: r2c - FFTW - float - 128 | 306.540 | 209.496
heffte: r2c - FFTW - float - 256 | 164.873 | 92.3996
heffte: r2c - FFTW - float - 512 | 162.956 | 81.9412
heffte: c2c - FFTW - double - 128 | 57.1503 | 32.7468
heffte: c2c - FFTW - double - 256 | 40.8923 | 20.6279
heffte: c2c - FFTW - double - 512 | 46.2504 | 24.2658
heffte: r2c - FFTW - double - 128 | 138.014 | 81.4498
heffte: r2c - FFTW - double - 256 | 78.5049 | 40.1104
heffte: r2c - FFTW - double - 512 | 84.4739 | 44.9297
mt-dgemm: Sustained Floating-Point Rate | 24.362353 | 20.417952
coremark: CoreMark Size 666 - Iterations Per Second | 1601880.342264 | 1260642.177024
laghos: Triple Point Problem | 232.01 | 180.80
laghos: Sedov Blast Wave, ube_922_hex.mesh | 410.55 | 322.37
srsran: Downlink Processor Benchmark | 318.5 | 197.2
srsran: PUSCH Processor Benchmark, Throughput Total | 5413.8 | 3938.7
srsran: PUSCH Processor Benchmark, Throughput Thread | 95.8 | 63.8
compress-7zip: Compression Rating | 316825 | 240702
compress-7zip: Decompression Rating | 285540 | 234202
lczero: BLAS | 1301 | 947
lczero: Eigen | 1398 | 891
stockfish: Total Time | 112119711 | 86609284
gromacs: MPI CPU - water_GMX50_bare | 4.223 | 2.767
lammps: 20k Atoms | 36.927 | 25.171
lammps: Rhodopsin Protein | 37.558 | 25.950
nginx: 500 | 255768.44 | 148964.69
nginx: 1000 | 255616.04 | 158676.40
apache: 500 | 71754.89 | 66640.93
apache: 1000 | 60965.70 | 67276.83
liquid-dsp: 32 - 256 - 32 | 1136066667 | 765466667
liquid-dsp: 32 - 256 - 57 | 721493333 | 489270000
liquid-dsp: 64 - 256 - 32 | 2270500000 | 1531400000
liquid-dsp: 64 - 256 - 57 | 1442400000 | 978200000
liquid-dsp: 32 - 256 - 512 | 81396667 | 67486333
liquid-dsp: 64 - 256 - 512 | 162753333 | 134926667
openssl: RSA4096 (sign/s) | 10181.9 | 2624.3
graph500: 26 (sssp max_TEPS) | 419754000 | 284689000
graph500: 26 (sssp median_TEPS) | 299497000 | 209350000
kripke | 339000400 | 220120233
npb: CG.C | 21988.99 | 13103.62
npb: EP.D | 3738.98 | 2216.26
npb: LU.C | 28341.68 | 18741.90
npb: MG.C | 50126.29 | 25671.29
npb: SP.C | 17244.85 | 9711.70
openssl: RSA4096 (verify/s) | 713859.5 | 214040.9
brl-cad: VGR Performance Metric | 783777 | 533020
lulesh | 28296.378 | 17557.485
pennant: sedovbig | 9.206490 | 16.48050
pennant: leblancbig | 6.720537 | 12.17683
rodinia: OpenMP LavaMD | 43.788 | 62.224
rodinia: OpenMP CFD Solver | 4.375 | 6.051
rodinia: OpenMP Streamcluster | 11.663 | 13.735
nwchem: C240 Buckyball | 1940.2 | 2976.9
incompact3d: input.i3d 129 Cells Per Direction | 3.09871038 | 5.63720735
incompact3d: input.i3d 193 Cells Per Direction | 13.9454180 | 25.8825658
mocassin: Gas HII40 | 13.575 | 20.758
mocassin: Dust 2D tau100.0 | 82.669 | 145.374
remhos: Sample Remap Example | 14.040 | 20.740
build-gem5: Time To Compile | 180.247 | 225.305
build-godot: Time To Compile | 154.378 | 218.276
build-nodejs: Time To Compile | 237.783 | 287.814
gpaw: Carbon Nanotube | 61.831 | 92.760
qmcpack: Li2_STO_ae | 112.61 | 165.12
qmcpack: simple-H2O | 28.041 | 45.225
qmcpack: FeCO6_b3lyp_gms | 211.60 | 302.19
qmcpack: FeCO6_b3lyp_gms | 205.72 | 297.94

Graph500

Scale: 26

Graph500 3.0: bfs max_TEPS, More Is Better
c6g.16xlarge Graviton2: 874389000
m7g.16xlarge Graviton3: 1227790000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0: bfs median_TEPS, More Is Better
c6g.16xlarge Graviton2: 860432000
m7g.16xlarge Graviton3: 1194320000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
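The charts in this export mix "More Is Better" and "Fewer Is Better" metrics, so a relative comparison has to flip the ratio for time-based results. The following is a minimal sketch of that arithmetic, not something the Phoronix Test Suite itself emits; the helper name `speedup` and its `higher_is_better` flag are hypothetical. The inputs are the Graph500 bfs median_TEPS values and the Pennant sedovbig times reported in this export.

```python
def speedup(baseline: float, candidate: float, higher_is_better: bool = True) -> float:
    """Return how many times better the candidate result is than the baseline.

    For "More Is Better" metrics the ratio is candidate / baseline;
    for "Fewer Is Better" metrics (for example seconds) it is inverted.
    """
    if baseline <= 0 or candidate <= 0:
        raise ValueError("results must be positive")
    return candidate / baseline if higher_is_better else baseline / candidate


# Graph500 bfs median_TEPS (More Is Better): Graviton2 baseline, Graviton3 candidate
bfs = speedup(860432000, 1194320000)

# Pennant sedovbig hydro cycle time in seconds (Fewer Is Better)
pennant = speedup(16.480500, 9.206490, higher_is_better=False)

print(f"Graph500 bfs median_TEPS: {bfs:.2f}x")
print(f"Pennant sedovbig:         {pennant:.2f}x")
```

A value above 1.0 means the m7g.16xlarge result is better than the c6g.16xlarge result under the metric's own direction.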

Stress-NG

Test: NUMA

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 2112.66 (SE +/- 1.53, N = 3)
m7g.16xlarge Graviton3: 3759.10 (SE +/- 5.17, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: CPU Cache

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 1921785.20 (SE +/- 21905.72, N = 15)
m7g.16xlarge Graviton3: 3892396.34 (SE +/- 57217.78, N = 15)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix Math

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 284713.63 (SE +/- 8.13, N = 3)
m7g.16xlarge Graviton3: 368750.67 (SE +/- 53.44, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Math

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 147886.14 (SE +/- 37.96, N = 3)
m7g.16xlarge Graviton3: 217235.59 (SE +/- 47.94, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix 3D Math

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 5752.17 (SE +/- 1.40, N = 3)
m7g.16xlarge Graviton3: 10403.93 (SE +/- 6.38, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Memory Copying

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 11324.79 (SE +/- 1.12, N = 3)
m7g.16xlarge Graviton3: 20484.24 (SE +/- 3.80, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Shuffle

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 35614.51 (SE +/- 74.80, N = 3)
m7g.16xlarge Graviton3: 54143.40 (SE +/- 21.44, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Wide Vector Math

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 997272.65 (SE +/- 505.84, N = 3)
m7g.16xlarge Graviton3: 1542834.94 (SE +/- 16116.93, N = 15)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Fused Multiply-Add

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 37732190.54 (SE +/- 3687.67, N = 3)
m7g.16xlarge Graviton3: 63762252.76 (SE +/- 4870.19, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Floating Point

Stress-NG 0.15.10: Bogo Ops/s, More Is Better
c6g.16xlarge Graviton2: 42850.82 (SE +/- 31.31, N = 3)
m7g.16xlarge Graviton3: 76102.55 (SE +/- 190.19, N = 3)
(CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

OpenSSL

Algorithm: SHA256

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 42472798847 (SE +/- 245440310.03, N = 3)
m7g.16xlarge Graviton3: 54212515580 (SE +/- 18610524.10, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 14393925490 (SE +/- 9173912.49, N = 3)
m7g.16xlarge Graviton3: 32125448870 (SE +/- 17714077.14, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 67292541203 (SE +/- 35952887.59, N = 3)
m7g.16xlarge Graviton3: 103226784517 (SE +/- 1293723.80, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 158436163857 (SE +/- 9833681.11, N = 3)
m7g.16xlarge Graviton3: 332033171900 (SE +/- 81289574.27, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 129199593157 (SE +/- 2312792.64, N = 3)
m7g.16xlarge Graviton3: 283333113630 (SE +/- 6411836.47, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1: byte/s, More Is Better
c6g.16xlarge Graviton2: 46717636807 (SE +/- 1132293.08, N = 3)
m7g.16xlarge Graviton3: 74287460990 (SE +/- 1340503.89, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2: Figure Of Merit, More Is Better
c6g.16xlarge Graviton2: 1035586333 (SE +/- 140169.34, N = 3)
m7g.16xlarge Graviton3: 1646761667 (SE +/- 103191.30, N = 3)
(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

nekRS

Input: Kershaw

nekRS 23.0: flops/rank, More Is Better
c6g.16xlarge Graviton2: 1760336667 (SE +/- 737119.02, N = 3)
m7g.16xlarge Graviton3: 3150680000 (SE +/- 1575066.14, N = 3)
(CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

nekRS

Input: TurboPipe Periodic

nekRS 23.0: flops/rank, More Is Better
c6g.16xlarge Graviton2: 2220190000 (SE +/- 144222.05, N = 3)
m7g.16xlarge Graviton3: 3976300000 (SE +/- 1199180.28, N = 3)
(CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1: GFLOP/s, More Is Better
m7g.16xlarge Graviton3: 33.79 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

High Performance Conjugate Gradient 3.1: GFLOP/s, More Is Better
m7g.16xlarge Graviton3: 33.82 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 135.36 (SE +/- 0.35, N = 3)
m7g.16xlarge Graviton3: 186.36 (SE +/- 0.27, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 41.98 (SE +/- 0.05, N = 3)
m7g.16xlarge Graviton3: 81.44 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 42.83 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 88.05 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 209.50 (SE +/- 0.64, N = 3)
m7g.16xlarge Graviton3: 306.54 (SE +/- 0.83, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 92.40 (SE +/- 0.19, N = 3)
m7g.16xlarge Graviton3: 164.87 (SE +/- 0.27, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 81.94 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 162.96 (SE +/- 0.13, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 32.75 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 57.15 (SE +/- 0.28, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 20.63 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 40.89 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 24.27 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 46.25 (SE +/- 0.01, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 81.45 (SE +/- 0.61, N = 3)
m7g.16xlarge Graviton3: 138.01 (SE +/- 0.12, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 40.11 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 78.50 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 44.93 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 84.47 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O3
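With twelve HeFFTe configurations reported, a single summary figure can be easier to read than twelve separate ratios. The sketch below computes per-test Graviton3/Graviton2 ratios and their geometric mean, which is a common way to average ratios. It is an illustration added here, not a figure from the export; the dictionary layout and the helper name `geometric_mean` are assumptions, and the GFLOP/s pairs are the chart values listed for the twelve HeFFTe tests.

```python
import math

# (c6g.16xlarge Graviton2, m7g.16xlarge Graviton3) GFLOP/s, from the HeFFTe charts
heffte = {
    "c2c float 128": (135.36, 186.36),
    "c2c float 256": (41.98, 81.44),
    "c2c float 512": (42.83, 88.05),
    "r2c float 128": (209.50, 306.54),
    "r2c float 256": (92.40, 164.87),
    "r2c float 512": (81.94, 162.96),
    "c2c double 128": (32.75, 57.15),
    "c2c double 256": (20.63, 40.89),
    "c2c double 512": (24.27, 46.25),
    "r2c double 128": (81.45, 138.01),
    "r2c double 256": (40.11, 78.50),
    "r2c double 512": (44.93, 84.47),
}


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("need at least one value, all positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# GFLOP/s is "More Is Better", so the ratio is Graviton3 / Graviton2.
ratios = {name: g3 / g2 for name, (g2, g3) in heffte.items()}
overall = geometric_mean(ratios.values())

for name, ratio in ratios.items():
    print(f"{name:>15}: {ratio:.2f}x")
print(f"{'geometric mean':>15}: {overall:.2f}x")
```

The geometric mean is used rather than the arithmetic mean so that no single configuration with a large ratio dominates the summary.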

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0: GFLOP/s, More Is Better
c6g.16xlarge Graviton2: 20.42 (SE +/- 0.15, N = 3)
m7g.16xlarge Graviton3: 24.36 (SE +/- 0.17, N = 13)
(CC) gcc options: -O3 -march=native -fopenmp

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0: Iterations/Sec, More Is Better
c6g.16xlarge Graviton2: 1260642.18 (SE +/- 153.60, N = 3)
m7g.16xlarge Graviton3: 1601880.34 (SE +/- 11449.37, N = 15)
(CC) gcc options: -O2 -lrt" -lrt

Laghos

Test: Triple Point Problem

Laghos 3.1: Major Kernels Total Rate, More Is Better
c6g.16xlarge Graviton2: 180.80 (SE +/- 0.48, N = 3)
m7g.16xlarge Graviton3: 232.01 (SE +/- 0.28, N = 3)
(CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

Laghos 3.1: Major Kernels Total Rate, More Is Better
c6g.16xlarge Graviton2: 322.37 (SE +/- 0.89, N = 3)
m7g.16xlarge Graviton3: 410.55 (SE +/- 0.42, N = 3)
(CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

srsRAN Project

Test: Downlink Processor Benchmark

srsRAN Project 23.5: Mbps, More Is Better
c6g.16xlarge Graviton2: 197.2 (SE +/- 0.25, N = 3)
m7g.16xlarge Graviton3: 318.5 (SE +/- 0.91, N = 3)
(CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.5: Mbps, More Is Better
c6g.16xlarge Graviton2: 3938.7 (SE +/- 2.53, N = 3)
m7g.16xlarge Graviton3: 5413.8 (SE +/- 4.08, N = 3)
(CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

srsRAN Project 23.5: Mbps, More Is Better
c6g.16xlarge Graviton2: 63.8 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 95.8 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01: MIPS, More Is Better
c6g.16xlarge Graviton2: 240702 (SE +/- 209.44, N = 3)
m7g.16xlarge Graviton3: 316825 (SE +/- 154.72, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01: MIPS, More Is Better
c6g.16xlarge Graviton2: 234202 (SE +/- 15.43, N = 3)
m7g.16xlarge Graviton3: 285540 (SE +/- 93.51, N = 3)
(CXX) g++ options: -lpthread -ldl -O2 -fPIC

LeelaChessZero

Backend: BLAS

LeelaChessZero 0.28: Nodes Per Second, More Is Better
c6g.16xlarge Graviton2: 947 (SE +/- 11.79, N = 3)
m7g.16xlarge Graviton3: 1301 (SE +/- 4.67, N = 3)
(CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.28: Nodes Per Second, More Is Better
c6g.16xlarge Graviton2: 891 (SE +/- 4.73, N = 3)
m7g.16xlarge Graviton3: 1398 (SE +/- 8.74, N = 3)
(CXX) g++ options: -flto -pthread

Stockfish

Total Time

Stockfish 15: Nodes Per Second, More Is Better
c6g.16xlarge Graviton2: 86609284 (SE +/- 2597495.37, N = 15)
m7g.16xlarge Graviton3: 112119711 (SE +/- 2854071.93, N = 15)
(CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023: Ns Per Day, More Is Better
c6g.16xlarge Graviton2: 2.767 (SE +/- 0.002, N = 3)
m7g.16xlarge Graviton3: 4.223 (SE +/- 0.003, N = 3)
(CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022: ns/day, More Is Better
c6g.16xlarge Graviton2: 25.17 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 36.93 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -ldl

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 23Jun2022: ns/day, More Is Better
c6g.16xlarge Graviton2: 25.95 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 37.56 (SE +/- 0.06, N = 3)
(CXX) g++ options: -O3 -ldl

nginx

Connections: 500

nginx 1.23.2: Requests Per Second, More Is Better
c6g.16xlarge Graviton2: 148964.69 (SE +/- 90.87, N = 3)
m7g.16xlarge Graviton3: 255768.44 (SE +/- 323.56, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

nginx 1.23.2: Requests Per Second, More Is Better
c6g.16xlarge Graviton2: 158676.40 (SE +/- 185.79, N = 3)
m7g.16xlarge Graviton3: 255616.04 (SE +/- 137.20, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

Apache HTTP Server 2.4.56: Requests Per Second, More Is Better
c6g.16xlarge Graviton2: 66640.93 (SE +/- 181.58, N = 3)
m7g.16xlarge Graviton3: 71754.89 (SE +/- 116.32, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 1000

Apache HTTP Server 2.4.56: Requests Per Second, More Is Better
c6g.16xlarge Graviton2: 67276.83 (SE +/- 107.55, N = 3)
m7g.16xlarge Graviton3: 60965.70 (SE +/- 72.21, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 765466667 (SE +/- 456520.66, N = 3)
m7g.16xlarge Graviton3: 1136066667 (SE +/- 233333.33, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 489270000 (SE +/- 23094.01, N = 3)
m7g.16xlarge Graviton3: 721493333 (SE +/- 3333.33, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid
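Each chart reports "SE +/- x, N = n" next to the mean. Assuming SE here is the standard error of the mean (sample standard deviation divided by the square root of N, which the export does not state explicitly), the sketch below shows the calculation. The three per-run values are hypothetical: the raw runs are not part of this export, and the numbers were chosen only so that the mean and SE come out to the m7g.16xlarge Graviton3 figures reported for this Liquid-DSP test (721493333 and 3333.33).

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run results in samples/s (NOT taken from the export).
runs = [721490000, 721490000, 721500000]

mean = statistics.mean(runs)
se = standard_error(runs)
print(f"mean = {mean:.0f}, SE +/- {se:.2f}, N = {len(runs)}")
```

Under this assumption a small SE relative to the mean, as in most charts here, indicates that repeated runs agreed closely; results with N = 15 are ones where more runs were recorded.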

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 1531400000 (SE +/- 251661.15, N = 3)
m7g.16xlarge Graviton3: 2270500000 (SE +/- 435889.89, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 978200000 (SE +/- 11547.01, N = 3)
m7g.16xlarge Graviton3: 1442400000 (SE +/- 152752.52, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 67486333 (SE +/- 333.33, N = 3)
m7g.16xlarge Graviton3: 81396667 (SE +/- 1855.92, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6: samples/s, More Is Better
c6g.16xlarge Graviton2: 134926667 (SE +/- 3333.33, N = 3)
m7g.16xlarge Graviton3: 162753333 (SE +/- 6666.67, N = 3)
(CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1: sign/s, More Is Better
c6g.16xlarge Graviton2: 2624.3 (SE +/- 1.71, N = 3)
m7g.16xlarge Graviton3: 10181.9 (SE +/- 1.27, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Graph500

Scale: 26

Graph500 3.0: sssp max_TEPS, More Is Better
c6g.16xlarge Graviton2: 284689000
m7g.16xlarge Graviton3: 419754000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0: sssp median_TEPS, More Is Better
c6g.16xlarge Graviton2: 209350000
m7g.16xlarge Graviton3: 299497000
(CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Kripke

Kripke 1.2.6: Throughput FoM, More Is Better
c6g.16xlarge Graviton2: 220120233 (SE +/- 102787.75, N = 3)
m7g.16xlarge Graviton3: 339000400 (SE +/- 619419.33, N = 3)
(CXX) g++ options: -O3 -fopenmp -ldl

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4: Total Mop/s, More Is Better
c6g.16xlarge Graviton2: 13103.62 (SE +/- 31.56, N = 3)
m7g.16xlarge Graviton3: 21988.99 (SE +/- 130.18, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4: Total Mop/s, More Is Better
c6g.16xlarge Graviton2: 2216.26 (SE +/- 2.22, N = 3)
m7g.16xlarge Graviton3: 3738.98 (SE +/- 1.69, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4: Total Mop/s, More Is Better
c6g.16xlarge Graviton2: 18741.90 (SE +/- 26.12, N = 3)
m7g.16xlarge Graviton3: 28341.68 (SE +/- 48.62, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4: Total Mop/s, More Is Better
c6g.16xlarge Graviton2: 25671.29 (SE +/- 7.02, N = 3)
m7g.16xlarge Graviton3: 50126.29 (SE +/- 24.30, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4: Total Mop/s, More Is Better
c6g.16xlarge Graviton2: 9711.70 (SE +/- 1.54, N = 3)
m7g.16xlarge Graviton3: 17244.85 (SE +/- 10.19, N = 3)
(F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1: verify/s, More Is Better
c6g.16xlarge Graviton2: 214040.9 (SE +/- 88.30, N = 3)
m7g.16xlarge Graviton3: 713859.5 (SE +/- 21.82, N = 3)
(CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

BRL-CAD

VGR Performance Metric

BRL-CAD 7.34: VGR Performance Metric, More Is Better
c6g.16xlarge Graviton2: 533020
m7g.16xlarge Graviton3: 783777
(CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

LULESH

LULESH 2.0.3: z/s, More Is Better
c6g.16xlarge Graviton2: 17557.49 (SE +/- 38.55, N = 3)
m7g.16xlarge Graviton3: 28296.38 (SE +/- 27.09, N = 3)
(CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Pennant

Test: sedovbig

Pennant 1.0.1: Hydro Cycle Time - Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 16.480500 (SE +/- 0.018218, N = 3)
m7g.16xlarge Graviton3: 9.206490 (SE +/- 0.011347, N = 3)
(CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

Pennant 1.0.1: Hydro Cycle Time - Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 12.176830 (SE +/- 0.018924, N = 3)
m7g.16xlarge Graviton3: 6.720537 (SE +/- 0.000869, N = 3)
(CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1: Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 62.22 (SE +/- 0.04, N = 3)
m7g.16xlarge Graviton3: 43.79 (SE +/- 0.15, N = 3)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1: Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 6.051 (SE +/- 0.016, N = 3)
m7g.16xlarge Graviton3: 4.375 (SE +/- 0.011, N = 3)
(CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1: Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 13.74 (SE +/- 0.21, N = 15)
m7g.16xlarge Graviton3: 11.66 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O2 -lOpenCL

NWChem

Input: C240 Buckyball

NWChem 7.0.2: Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 2976.9
m7g.16xlarge Graviton3: 1940.2
(F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11: Seconds, Fewer Is Better
c6g.16xlarge Graviton2: 5.63720735 (SE +/- 0.02560507, N = 3)
m7g.16xlarge Graviton3: 3.09871038 (SE +/- 0.02702838, N = 3)
(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.org | Seconds, Fewer Is Better | Xcompact3d Incompact3d 2021-03-11 | Input: input.i3d 193 Cells Per Direction
c6g.16xlarge Graviton2: 25.88 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 13.95 (SE +/- 0.02, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Monte Carlo Simulations of Ionised Nebulae

Input: Gas HII40

OpenBenchmarking.org | Seconds, Fewer Is Better | Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 | Input: Gas HII40
c6g.16xlarge Graviton2: 20.76 (SE +/- 0.17, N = 3)
m7g.16xlarge Graviton3: 13.58 (SE +/- 0.05, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

OpenBenchmarking.org | Seconds, Fewer Is Better | Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 | Input: Dust 2D tau100.0
c6g.16xlarge Graviton2: 145.37 (SE +/- 0.86, N = 3)
m7g.16xlarge Graviton3: 82.67 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

Remhos

Test: Sample Remap Example

OpenBenchmarking.org | Seconds, Fewer Is Better | Remhos 1.0 | Test: Sample Remap Example
c6g.16xlarge Graviton2: 20.74 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 14.04 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.org | Seconds, Fewer Is Better | Timed Gem5 Compilation 21.2 | Time To Compile
c6g.16xlarge Graviton2: 225.31 (SE +/- 0.35, N = 3)
m7g.16xlarge Graviton3: 180.25 (SE +/- 0.13, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.org | Seconds, Fewer Is Better | Timed Godot Game Engine Compilation 4.0 | Time To Compile
c6g.16xlarge Graviton2: 218.28 (SE +/- 0.30, N = 3)
m7g.16xlarge Graviton3: 154.38 (SE +/- 0.32, N = 3)

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.org | Seconds, Fewer Is Better | Timed Node.js Compilation 19.8.1 | Time To Compile
c6g.16xlarge Graviton2: 287.81 (SE +/- 0.16, N = 3)
m7g.16xlarge Graviton3: 237.78 (SE +/- 0.33, N = 3)

GPAW

Input: Carbon Nanotube

OpenBenchmarking.org | Seconds, Fewer Is Better | GPAW 23.6 | Input: Carbon Nanotube
c6g.16xlarge Graviton2: 92.76 (SE +/- 0.02, N = 3)
m7g.16xlarge Graviton3: 61.83 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

QMCPACK

Input: Li2_STO_ae

OpenBenchmarking.org | Total Execution Time - Seconds, Fewer Is Better | QMCPACK 3.16 | Input: Li2_STO_ae
c6g.16xlarge Graviton2: 165.12 (SE +/- 1.13, N = 3)
m7g.16xlarge Graviton3: 112.61 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: simple-H2O

OpenBenchmarking.org | Total Execution Time - Seconds, Fewer Is Better | QMCPACK 3.16 | Input: simple-H2O
c6g.16xlarge Graviton2: 45.23 (SE +/- 0.24, N = 3)
m7g.16xlarge Graviton3: 28.04 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.org | Total Execution Time - Seconds, Fewer Is Better | QMCPACK 3.16 | Input: FeCO6_b3lyp_gms
c6g.16xlarge Graviton2: 302.19 (SE +/- 0.37, N = 3)
m7g.16xlarge Graviton3: 211.60 (SE +/- 0.22, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

OpenBenchmarking.org | Total Execution Time - Seconds, Fewer Is Better | QMCPACK 3.16 | Input: FeCO6_b3lyp_gms
c6g.16xlarge Graviton2: 297.94 (SE +/- 1.75, N = 3)
m7g.16xlarge Graviton3: 205.72 (SE +/- 0.45, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl


Phoronix Test Suite v10.8.4