Amazon AWS

Amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2306239-NE-2306232NE27&sor&grr.

Amazon AWS: system details

m7g.16xlarge Graviton3
  Processor: ARMv8 Neoverse-V1 (64 Cores)
  Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS)
  Chipset: Amazon Device 0200
  Memory: 256GB
  Disk: 215GB Amazon Elastic Block Store
  Network: Amazon Elastic
  OS: Ubuntu 22.04
  Kernel: 5.19.0-1025-aws (aarch64)
  Compiler: GCC 11.3.0
  File-System: ext4
  System Layer: amazon

c6g.16xlarge Graviton2 (only these fields are listed for this system)
  Processor: ARMv8 Neoverse-N1 (64 Cores)
  Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS)
  Memory: 128GB

c7g.16xlarge Graviton3 (only these fields are listed for this system)
  Processor: ARMv8 Neoverse-V1 (64 Cores)
  Motherboard: Amazon EC2 c7g.16xlarge (1.0 BIOS)

Kernel Details
  - Transparent Huge Pages: madvise

Compiler Details
  - --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v

Python Details
  - Python 3.10.6

Security Details
  - itlb_multihit: Not affected
  - l1tf: Not affected
  - mds: Not affected
  - meltdown: Not affected
  - mmio_stale_data: Not affected
  - retbleed: Not affected
  - spec_store_bypass: Mitigation of SSB disabled via prctl
  - spectre_v1: Mitigation of __user pointer sanitization
  - spectre_v2: Mitigation of CSV2 BHB
  - srbds: Not affected
  - tsx_async_abort: Not affected

Amazon AWS: results overview

Test | m7g.16xlarge Graviton3 | c6g.16xlarge Graviton2 | c7g.16xlarge Graviton3
nwchem: C240 Buckyball | 1940.2 | 2976.9 | 1962.7
brl-cad: VGR Performance Metric | 783777 | 533020 | 789066
lczero: Eigen | 1398 | 891 | 1382
lczero: BLAS | 1301 | 947 | 1333
stockfish: Total Time | 112119711 | 86609284 | 117316476
graph500: 26 (sssp max_TEPS) | 419754000 | 284689000 | 415758000
graph500: 26 (sssp median_TEPS) | 299497000 | 209350000 | 293826000
graph500: 26 (bfs max_TEPS) | 1227790000 | 874389000 | 1206990000
graph500: 26 (bfs median_TEPS) | 1194320000 | 860432000 | 1177710000
lammps: 20k Atoms | 36.927 | 25.171 | 36.862
build-nodejs: Time To Compile | 237.783 | 287.814 | 238.543
qmcpack: FeCO6_b3lyp_gms | 211.60 | 302.19 | 211.32
qmcpack: FeCO6_b3lyp_gms | 205.72 | 297.94 | 204.77
build-gem5: Time To Compile | 180.247 | 225.305 | 181.779
openssl: SHA256 | 54212515580 | 42472798847 | 54216561263
openssl: AES-128-GCM | 332033171900 | 158436163857 | 332064349843
openssl: ChaCha20 | 103226784517 | 67292541203 | 103275516997
openssl: ChaCha20-Poly1305 | 74287460990 | 46717636807 | 74318842213
openssl: AES-256-GCM | 283333113630 | 129199593157 | 283373795737
openssl: SHA512 | 32125448870 | 14393925490 | 32145914147
build-godot: Time To Compile | 154.378 | 218.276 | 156.687
hpcg: 160 160 160 - 60 | 33.8195 | - | -
stress-ng: CPU Cache | 3892396.34 | 1921785.20 | 3844101.98
nekrs: TurboPipe Periodic | 3976300000 | 2220190000 | 3978983333
qmcpack: Li2_STO_ae | 112.61 | 165.12 | 112.64
hpcg: 144 144 144 - 60 | 33.7901 | - | -
nekrs: Kershaw | 3150680000 | 1760336667 | 3261853333
stress-ng: Wide Vector Math | 1542834.94 | 997272.65 | 1535336.57
npb: SP.C | 17244.85 | 9711.70 | 17219.95
mocassin: Dust 2D tau100.0 | 82.669 | 145.374 | 82.822
npb: EP.D | 3738.98 | 2216.26 | 3664.54
nginx: 1000 | 255616.04 | 158676.40 | 255552.05
nginx: 500 | 255768.44 | 148964.69 | 255145.52
apache: 1000 | 60965.70 | 67276.83 | 68429.83
apache: 500 | 71754.89 | 66640.93 | 69265.42
npb: LU.C | 28341.68 | 18741.90 | 28375.71
laghos: Sedov Blast Wave, ube_922_hex.mesh | 410.55 | 322.37 | 408.01
coremark: CoreMark Size 666 - Iterations Per Second | 1601880.342264 | 1260642.177024 | 1605948.674645
gpaw: Carbon Nanotube | 61.831 | 92.760 | 62.083
openssl: RSA4096 (verify/s) | 713859.5 | 214040.9 | 713945.9
openssl: RSA4096 (sign/s) | 10181.9 | 2624.3 | 10181.4
gromacs: MPI CPU - water_GMX50_bare | 4.223 | 2.767 | 4.200
srsran: Downlink Processor Benchmark | 318.5 | 197.2 | 319.7
heffte: c2c - FFTW - double - 512 | 46.2504 | 24.2658 | 46.3706
laghos: Triple Point Problem | 232.01 | 180.80 | 230.68
rodinia: OpenMP LavaMD | 43.788 | 62.224 | 43.963
srsran: PUSCH Processor Benchmark, Throughput Total | 5413.8 | 3938.7 | 5356.8
srsran: PUSCH Processor Benchmark, Throughput Thread | 95.8 | 63.8 | 95.7
rodinia: OpenMP Streamcluster | 11.663 | 13.735 | 11.625
qmcpack: simple-H2O | 28.041 | 45.225 | 27.990
liquid-dsp: 64 - 256 - 512 | 162753333 | 134926667 | 162766667
liquid-dsp: 32 - 256 - 512 | 81396667 | 67486333 | 81412000
stress-ng: Fused Multiply-Add | 63762252.76 | 37732190.54 | 63818458.61
stress-ng: Vector Shuffle | 54143.40 | 35614.51 | 54472.07
stress-ng: Vector Floating Point | 76102.55 | 42850.82 | 76178.46
stress-ng: Matrix Math | 368750.67 | 284713.63 | 368671.39
liquid-dsp: 64 - 256 - 57 | 1442400000 | 978200000 | 1442366667
stress-ng: NUMA | 3759.10 | 2112.66 | 3523.58
stress-ng: Matrix 3D Math | 10403.93 | 5752.17 | 10813.59
stress-ng: Memory Copying | 20484.24 | 11324.79 | 20478.67
liquid-dsp: 64 - 256 - 32 | 2270500000 | 1531400000 | 2271966667
stress-ng: Vector Math | 217235.59 | 147886.14 | 217446.12
liquid-dsp: 32 - 256 - 57 | 721493333 | 489270000 | 721386667
liquid-dsp: 32 - 256 - 32 | 1136066667 | 765466667 | 1136133333
heffte: r2c - FFTW - double - 512 | 84.4739 | 44.9297 | 84.7451
heffte: c2c - FFTW - float - 512 | 88.0482 | 42.8284 | 88.1842
kripke | 339000400 | 220120233 | 354442733
compress-7zip: Decompression Rating | 285540 | 234202 | 285633
compress-7zip: Compression Rating | 316825 | 240702 | 311056
amg | 1646761667 | 1035586333 | 1765277667
mt-dgemm: Sustained Floating-Point Rate | 24.362353 | 20.417952 | 24.140605
lulesh | 28296.378 | 17557.485 | 28708.656
incompact3d: input.i3d 193 Cells Per Direction | 13.9454180 | 25.8825658 | 13.8326693
remhos: Sample Remap Example | 14.040 | 20.740 | 14.120
mocassin: Gas HII40 | 13.575 | 20.758 | 13.659
heffte: r2c - FFTW - float - 512 | 162.956 | 81.9412 | 163.276
pennant: sedovbig | 9.206490 | 16.48050 | 9.422270
pennant: leblancbig | 6.720537 | 12.17683 | 6.961345
npb: CG.C | 21988.99 | 13103.62 | 21911.02
heffte: c2c - FFTW - double - 256 | 40.8923 | 20.6279 | 40.8283
rodinia: OpenMP CFD Solver | 4.375 | 6.051 | 4.442
npb: MG.C | 50126.29 | 25671.29 | 49742.30
incompact3d: input.i3d 129 Cells Per Direction | 3.09871038 | 5.63720735 | 3.14447999
heffte: r2c - FFTW - double - 256 | 78.5049 | 40.1104 | 77.7685
heffte: c2c - FFTW - float - 256 | 81.4442 | 41.9816 | 81.0096
heffte: r2c - FFTW - float - 256 | 164.873 | 92.3996 | 162.010
lammps: Rhodopsin Protein | 37.558 | 25.950 | 37.412
heffte: c2c - FFTW - double - 128 | 57.1503 | 32.7468 | 55.1055
heffte: r2c - FFTW - double - 128 | 138.014 | 81.4498 | 133.514
heffte: c2c - FFTW - float - 128 | 186.356 | 135.358 | 184.026
heffte: r2c - FFTW - float - 128 | 306.540 | 209.496 | 301.418

NWChem

Input: C240 Buckyball

NWChem 7.0.2 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 1940.2
c7g.16xlarge Graviton3: 1962.7
c6g.16xlarge Graviton2: 2976.9
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

BRL-CAD

VGR Performance Metric

BRL-CAD 7.34 (VGR Performance Metric, More Is Better)
c7g.16xlarge Graviton3: 789066
m7g.16xlarge Graviton3: 783777
c6g.16xlarge Graviton2: 533020
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

LeelaChessZero

Backend: Eigen

LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
m7g.16xlarge Graviton3: 1398 (SE +/- 8.74, N = 3)
c7g.16xlarge Graviton3: 1382 (SE +/- 15.65, N = 3)
c6g.16xlarge Graviton2: 891 (SE +/- 4.73, N = 3)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

LeelaChessZero 0.28 (Nodes Per Second, More Is Better)
c7g.16xlarge Graviton3: 1333 (SE +/- 3.53, N = 3)
m7g.16xlarge Graviton3: 1301 (SE +/- 4.67, N = 3)
c6g.16xlarge Graviton2: 947 (SE +/- 11.79, N = 3)
1. (CXX) g++ options: -flto -pthread

Stockfish

Total Time

Stockfish 15 (Nodes Per Second, More Is Better)
c7g.16xlarge Graviton3: 117316476 (SE +/- 2998209.87, N = 12)
m7g.16xlarge Graviton3: 112119711 (SE +/- 2854071.93, N = 15)
c6g.16xlarge Graviton2: 86609284 (SE +/- 2597495.37, N = 15)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Graph500

Scale: 26

Graph500 3.0 (sssp max_TEPS, More Is Better)
m7g.16xlarge Graviton3: 419754000
c7g.16xlarge Graviton3: 415758000
c6g.16xlarge Graviton2: 284689000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (sssp median_TEPS, More Is Better)
m7g.16xlarge Graviton3: 299497000
c7g.16xlarge Graviton3: 293826000
c6g.16xlarge Graviton2: 209350000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (bfs max_TEPS, More Is Better)
m7g.16xlarge Graviton3: 1227790000
c7g.16xlarge Graviton3: 1206990000
c6g.16xlarge Graviton2: 874389000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Graph500

Scale: 26

Graph500 3.0 (bfs median_TEPS, More Is Better)
m7g.16xlarge Graviton3: 1194320000
c7g.16xlarge Graviton3: 1177710000
c6g.16xlarge Graviton2: 860432000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
m7g.16xlarge Graviton3: 36.93 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 36.86 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 25.17 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -ldl

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 19.8.1 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 237.78 (SE +/- 0.33, N = 3)
c7g.16xlarge Graviton3: 238.54 (SE +/- 0.20, N = 3)
c6g.16xlarge Graviton2: 287.81 (SE +/- 0.16, N = 3)

QMCPACK

Input: FeCO6_b3lyp_gms

QMCPACK 3.16 (Total Execution Time - Seconds, Fewer Is Better)
c7g.16xlarge Graviton3: 211.32 (SE +/- 0.19, N = 3)
m7g.16xlarge Graviton3: 211.60 (SE +/- 0.22, N = 3)
c6g.16xlarge Graviton2: 302.19 (SE +/- 0.37, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

QMCPACK

Input: FeCO6_b3lyp_gms

QMCPACK 3.16 (Total Execution Time - Seconds, Fewer Is Better)
c7g.16xlarge Graviton3: 204.77 (SE +/- 0.82, N = 3)
m7g.16xlarge Graviton3: 205.72 (SE +/- 0.45, N = 3)
c6g.16xlarge Graviton2: 297.94 (SE +/- 1.75, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 21.2 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 180.25 (SE +/- 0.13, N = 3)
c7g.16xlarge Graviton3: 181.78 (SE +/- 0.26, N = 3)
c6g.16xlarge Graviton2: 225.31 (SE +/- 0.35, N = 3)

OpenSSL

Algorithm: SHA256

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 54216561263 (SE +/- 16491036.11, N = 3)
m7g.16xlarge Graviton3: 54212515580 (SE +/- 18610524.10, N = 3)
c6g.16xlarge Graviton2: 42472798847 (SE +/- 245440310.03, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 332064349843 (SE +/- 12264074.61, N = 3)
m7g.16xlarge Graviton3: 332033171900 (SE +/- 81289574.27, N = 3)
c6g.16xlarge Graviton2: 158436163857 (SE +/- 9833681.11, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 103275516997 (SE +/- 1725060.95, N = 3)
m7g.16xlarge Graviton3: 103226784517 (SE +/- 1293723.80, N = 3)
c6g.16xlarge Graviton2: 67292541203 (SE +/- 35952887.59, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 74318842213 (SE +/- 1218886.42, N = 3)
m7g.16xlarge Graviton3: 74287460990 (SE +/- 1340503.89, N = 3)
c6g.16xlarge Graviton2: 46717636807 (SE +/- 1132293.08, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 283373795737 (SE +/- 33807617.40, N = 3)
m7g.16xlarge Graviton3: 283333113630 (SE +/- 6411836.47, N = 3)
c6g.16xlarge Graviton2: 129199593157 (SE +/- 2312792.64, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenSSL 3.1 (byte/s, More Is Better)
c7g.16xlarge Graviton3: 32145914147 (SE +/- 4573992.60, N = 3)
m7g.16xlarge Graviton3: 32125448870 (SE +/- 17714077.14, N = 3)
c6g.16xlarge Graviton2: 14393925490 (SE +/- 9173912.49, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 4.0 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 154.38 (SE +/- 0.32, N = 3)
c7g.16xlarge Graviton3: 156.69 (SE +/- 0.63, N = 3)
c6g.16xlarge Graviton2: 218.28 (SE +/- 0.30, N = 3)

High Performance Conjugate Gradient

X Y Z: 160 160 160 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 33.82 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Stress-NG

Test: CPU Cache

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
m7g.16xlarge Graviton3: 3892396.34 (SE +/- 57217.78, N = 15)
c7g.16xlarge Graviton3: 3844101.98 (SE +/- 59376.56, N = 15)
c6g.16xlarge Graviton2: 1921785.20 (SE +/- 21905.72, N = 15)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

nekRS

Input: TurboPipe Periodic

nekRS 23.0 (flops/rank, More Is Better)
c7g.16xlarge Graviton3: 3978983333 (SE +/- 169148.19, N = 3)
m7g.16xlarge Graviton3: 3976300000 (SE +/- 1199180.28, N = 3)
c6g.16xlarge Graviton2: 2220190000 (SE +/- 144222.05, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

QMCPACK

Input: Li2_STO_ae

QMCPACK 3.16 (Total Execution Time - Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 112.61 (SE +/- 0.08, N = 3)
c7g.16xlarge Graviton3: 112.64 (SE +/- 0.12, N = 3)
c6g.16xlarge Graviton2: 165.12 (SE +/- 1.13, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

High Performance Conjugate Gradient

X Y Z: 144 144 144 - RT: 60

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 33.79 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

nekRS

Input: Kershaw

nekRS 23.0 (flops/rank, More Is Better)
c7g.16xlarge Graviton3: 3261853333 (SE +/- 2490845.46, N = 3)
m7g.16xlarge Graviton3: 3150680000 (SE +/- 1575066.14, N = 3)
c6g.16xlarge Graviton2: 1760336667 (SE +/- 737119.02, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Stress-NG

Test: Wide Vector Math

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
m7g.16xlarge Graviton3: 1542834.94 (SE +/- 16116.93, N = 15)
c7g.16xlarge Graviton3: 1535336.57 (SE +/- 16521.46, N = 15)
c6g.16xlarge Graviton2: 997272.65 (SE +/- 505.84, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m7g.16xlarge Graviton3: 17244.85 (SE +/- 10.19, N = 3)
c7g.16xlarge Graviton3: 17219.95 (SE +/- 7.21, N = 3)
c6g.16xlarge Graviton2: 9711.70 (SE +/- 1.54, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Monte Carlo Simulations of Ionised Nebulae

Input: Dust 2D tau100.0

Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 82.67 (SE +/- 0.01, N = 3)
c7g.16xlarge Graviton3: 82.82 (SE +/- 0.00, N = 3)
c6g.16xlarge Graviton2: 145.37 (SE +/- 0.86, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m7g.16xlarge Graviton3: 3738.98 (SE +/- 1.69, N = 3)
c7g.16xlarge Graviton3: 3664.54 (SE +/- 34.07, N = 15)
c6g.16xlarge Graviton2: 2216.26 (SE +/- 2.22, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

nginx

Connections: 1000

nginx 1.23.2 (Requests Per Second, More Is Better)
m7g.16xlarge Graviton3: 255616.04 (SE +/- 137.20, N = 3)
c7g.16xlarge Graviton3: 255552.05 (SE +/- 55.97, N = 3)
c6g.16xlarge Graviton2: 158676.40 (SE +/- 185.79, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 500

nginx 1.23.2 (Requests Per Second, More Is Better)
m7g.16xlarge Graviton3: 255768.44 (SE +/- 323.56, N = 3)
c7g.16xlarge Graviton3: 255145.52 (SE +/- 243.69, N = 3)
c6g.16xlarge Graviton2: 148964.69 (SE +/- 90.87, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 1000

Apache HTTP Server 2.4.56 (Requests Per Second, More Is Better)
c7g.16xlarge Graviton3: 68429.83 (SE +/- 66.85, N = 3)
c6g.16xlarge Graviton2: 67276.83 (SE +/- 107.55, N = 3)
m7g.16xlarge Graviton3: 60965.70 (SE +/- 72.21, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

Apache HTTP Server 2.4.56 (Requests Per Second, More Is Better)
m7g.16xlarge Graviton3: 71754.89 (SE +/- 116.32, N = 3)
c7g.16xlarge Graviton3: 69265.42 (SE +/- 184.82, N = 3)
c6g.16xlarge Graviton2: 66640.93 (SE +/- 181.58, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
c7g.16xlarge Graviton3: 28375.71 (SE +/- 36.09, N = 3)
m7g.16xlarge Graviton3: 28341.68 (SE +/- 48.62, N = 3)
c6g.16xlarge Graviton2: 18741.90 (SE +/- 26.12, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

Laghos 3.1 (Major Kernels Total Rate, More Is Better)
m7g.16xlarge Graviton3: 410.55 (SE +/- 0.42, N = 3)
c7g.16xlarge Graviton3: 408.01 (SE +/- 0.89, N = 3)
c6g.16xlarge Graviton2: 322.37 (SE +/- 0.89, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
c7g.16xlarge Graviton3: 1605948.67 (SE +/- 13274.76, N = 15)
m7g.16xlarge Graviton3: 1601880.34 (SE +/- 11449.37, N = 15)
c6g.16xlarge Graviton2: 1260642.18 (SE +/- 153.60, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

GPAW

Input: Carbon Nanotube

GPAW 23.6 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 61.83 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 62.08 (SE +/- 0.04, N = 3)
c6g.16xlarge Graviton2: 92.76 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (verify/s, More Is Better)
c7g.16xlarge Graviton3: 713945.9 (SE +/- 12.03, N = 3)
m7g.16xlarge Graviton3: 713859.5 (SE +/- 21.82, N = 3)
c6g.16xlarge Graviton2: 214040.9 (SE +/- 88.30, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (sign/s, More Is Better)
m7g.16xlarge Graviton3: 10181.9 (SE +/- 1.27, N = 3)
c7g.16xlarge Graviton3: 10181.4 (SE +/- 1.54, N = 3)
c6g.16xlarge Graviton2: 2624.3 (SE +/- 1.71, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023 (Ns Per Day, More Is Better)
m7g.16xlarge Graviton3: 4.223 (SE +/- 0.003, N = 3)
c7g.16xlarge Graviton3: 4.200 (SE +/- 0.004, N = 3)
c6g.16xlarge Graviton2: 2.767 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -O3

srsRAN Project

Test: Downlink Processor Benchmark

srsRAN Project 23.5 (Mbps, More Is Better)
c7g.16xlarge Graviton3: 319.7 (SE +/- 0.95, N = 3)
m7g.16xlarge Graviton3: 318.5 (SE +/- 0.91, N = 3)
c6g.16xlarge Graviton2: 197.2 (SE +/- 0.25, N = 3)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
c7g.16xlarge Graviton3: 46.37 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 46.25 (SE +/- 0.01, N = 3)
c6g.16xlarge Graviton2: 24.27 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

Laghos

Test: Triple Point Problem

Laghos 3.1 (Major Kernels Total Rate, More Is Better)
m7g.16xlarge Graviton3: 232.01 (SE +/- 0.28, N = 3)
c7g.16xlarge Graviton3: 230.68 (SE +/- 0.16, N = 3)
c6g.16xlarge Graviton2: 180.80 (SE +/- 0.48, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Rodinia

Test: OpenMP LavaMD

Rodinia 3.1 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 43.79 (SE +/- 0.15, N = 3)
c7g.16xlarge Graviton3: 43.96 (SE +/- 0.11, N = 3)
c6g.16xlarge Graviton2: 62.22 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

srsRAN Project 23.5 (Mbps, More Is Better)
m7g.16xlarge Graviton3: 5413.8 (SE +/- 4.08, N = 3)
c7g.16xlarge Graviton3: 5356.8 (SE +/- 1.80, N = 3)
c6g.16xlarge Graviton2: 3938.7 (SE +/- 2.53, N = 3)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

srsRAN Project 23.5 (Mbps, More Is Better)
m7g.16xlarge Graviton3: 95.8 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 95.7 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 63.8 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 (Seconds, Fewer Is Better)
c7g.16xlarge Graviton3: 11.63 (SE +/- 0.10, N = 8)
m7g.16xlarge Graviton3: 11.66 (SE +/- 0.14, N = 3)
c6g.16xlarge Graviton2: 13.74 (SE +/- 0.21, N = 15)
1. (CXX) g++ options: -O2 -lOpenCL

QMCPACK

Input: simple-H2O

QMCPACK 3.16 (Total Execution Time - Seconds, Fewer Is Better)
c7g.16xlarge Graviton3: 27.99 (SE +/- 0.02, N = 3)
m7g.16xlarge Graviton3: 28.04 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 45.23 (SE +/- 0.24, N = 3)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
c7g.16xlarge Graviton3: 162766667 (SE +/- 3333.33, N = 3)
m7g.16xlarge Graviton3: 162753333 (SE +/- 6666.67, N = 3)
c6g.16xlarge Graviton2: 134926667 (SE +/- 3333.33, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

Liquid-DSP 1.6 (samples/s, More Is Better)
c7g.16xlarge Graviton3: 81412000 (SE +/- 1000.00, N = 3)
m7g.16xlarge Graviton3: 81396667 (SE +/- 1855.92, N = 3)
c6g.16xlarge Graviton2: 67486333 (SE +/- 333.33, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Fused Multiply-Add

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
c7g.16xlarge Graviton3: 63818458.61 (SE +/- 4431.60, N = 3)
m7g.16xlarge Graviton3: 63762252.76 (SE +/- 4870.19, N = 3)
c6g.16xlarge Graviton2: 37732190.54 (SE +/- 3687.67, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Shuffle

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
c7g.16xlarge Graviton3: 54472.07 (SE +/- 139.03, N = 3)
m7g.16xlarge Graviton3: 54143.40 (SE +/- 21.44, N = 3)
c6g.16xlarge Graviton2: 35614.51 (SE +/- 74.80, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Vector Floating Point

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
c7g.16xlarge Graviton3: 76178.46 (SE +/- 71.97, N = 3)
m7g.16xlarge Graviton3: 76102.55 (SE +/- 190.19, N = 3)
c6g.16xlarge Graviton2: 42850.82 (SE +/- 31.31, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix Math

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
m7g.16xlarge Graviton3: 368750.67 (SE +/- 53.44, N = 3)
c7g.16xlarge Graviton3: 368671.39 (SE +/- 38.76, N = 3)
c6g.16xlarge Graviton2: 284713.63 (SE +/- 8.13, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
m7g.16xlarge Graviton3: 1442400000 (SE +/- 152752.52, N = 3)
c7g.16xlarge Graviton3: 1442366667 (SE +/- 284800.12, N = 3)
c6g.16xlarge Graviton2: 978200000 (SE +/- 11547.01, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: NUMA

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
m7g.16xlarge Graviton3: 3759.10 (SE +/- 5.17, N = 3)
c7g.16xlarge Graviton3: 3523.58 (SE +/- 3.39, N = 3)
c6g.16xlarge Graviton2: 2112.66 (SE +/- 1.53, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Matrix 3D Math

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
c7g.16xlarge Graviton3: 10813.59 (SE +/- 9.35, N = 3)
m7g.16xlarge Graviton3: 10403.93 (SE +/- 6.38, N = 3)
c6g.16xlarge Graviton2: 5752.17 (SE +/- 1.40, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Stress-NG

Test: Memory Copying

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
m7g.16xlarge Graviton3: 20484.24 (SE +/- 3.80, N = 3)
c7g.16xlarge Graviton3: 20478.67 (SE +/- 4.65, N = 3)
c6g.16xlarge Graviton2: 11324.79 (SE +/- 1.12, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
c7g.16xlarge Graviton3: 2271966667 (SE +/- 284800.12, N = 3)
m7g.16xlarge Graviton3: 2270500000 (SE +/- 435889.89, N = 3)
c6g.16xlarge Graviton2: 1531400000 (SE +/- 251661.15, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Vector Math

Stress-NG 0.15.10 (Bogo Ops/s, More Is Better)
c7g.16xlarge Graviton3: 217446.12 (SE +/- 20.95, N = 3)
m7g.16xlarge Graviton3: 217235.59 (SE +/- 47.94, N = 3)
c6g.16xlarge Graviton2: 147886.14 (SE +/- 37.96, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 1.6 (samples/s, More Is Better)
m7g.16xlarge Graviton3: 721493333 (SE +/- 3333.33, N = 3)
c7g.16xlarge Graviton3: 721386667 (SE +/- 168358.08, N = 3)
c6g.16xlarge Graviton2: 489270000 (SE +/- 23094.01, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

Liquid-DSP 1.6 (samples/s, More Is Better)
c7g.16xlarge Graviton3: 1136133333 (SE +/- 33333.33, N = 3)
m7g.16xlarge Graviton3: 1136066667 (SE +/- 233333.33, N = 3)
c6g.16xlarge Graviton2: 765466667 (SE +/- 456520.66, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
c7g.16xlarge Graviton3: 84.75 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 84.47 (SE +/- 0.02, N = 3)
c6g.16xlarge Graviton2: 44.93 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
c7g.16xlarge Graviton3: 88.18 (SE +/- 0.05, N = 3)
m7g.16xlarge Graviton3: 88.05 (SE +/- 0.02, N = 3)
c6g.16xlarge Graviton2: 42.83 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

Kripke

Kripke 1.2.6 (Throughput FoM, More Is Better)
c7g.16xlarge Graviton3: 354442733 (SE +/- 525406.56, N = 3)
m7g.16xlarge Graviton3: 339000400 (SE +/- 619419.33, N = 3)
c6g.16xlarge Graviton2: 220120233 (SE +/- 102787.75, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -ldl

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 (MIPS, More Is Better)
c7g.16xlarge Graviton3: 285633 (SE +/- 146.43, N = 3)
m7g.16xlarge Graviton3: 285540 (SE +/- 93.51, N = 3)
c6g.16xlarge Graviton2: 234202 (SE +/- 15.43, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01 (MIPS, More Is Better)
m7g.16xlarge Graviton3: 316825 (SE +/- 154.72, N = 3)
c7g.16xlarge Graviton3: 311056 (SE +/- 72.90, N = 3)
c6g.16xlarge Graviton2: 240702 (SE +/- 209.44, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c7g.16xlarge Graviton3: 1765277667 (SE +/- 192645.90, N = 3)
m7g.16xlarge Graviton3: 1646761667 (SE +/- 103191.30, N = 3)
c6g.16xlarge Graviton2: 1035586333 (SE +/- 140169.34, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 24.36 (SE +/- 0.17, N = 13)
c7g.16xlarge Graviton3: 24.14 (SE +/- 0.29, N = 4)
c6g.16xlarge Graviton2: 20.42 (SE +/- 0.15, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

LULESH

LULESH 2.0.3 (z/s, More Is Better)
c7g.16xlarge Graviton3: 28708.66 (SE +/- 11.81, N = 3)
m7g.16xlarge Graviton3: 28296.38 (SE +/- 27.09, N = 3)
c6g.16xlarge Graviton2: 17557.49 (SE +/- 38.55, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
c7g.16xlarge Graviton3: 13.83 (SE +/- 0.09, N = 3)
m7g.16xlarge Graviton3: 13.95 (SE +/- 0.02, N = 3)
c6g.16xlarge Graviton2: 25.88 (SE +/- 0.03, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Remhos

Test: Sample Remap Example

Remhos 1.0 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 14.04 (SE +/- 0.04, N = 3)
c7g.16xlarge Graviton3: 14.12 (SE +/- 0.04, N = 3)
c6g.16xlarge Graviton2: 20.74 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Monte Carlo Simulations of Ionised Nebulae

Input: Gas HII40

Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 13.58 (SE +/- 0.05, N = 3)
c7g.16xlarge Graviton3: 13.66 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 20.76 (SE +/- 0.17, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
c7g.16xlarge Graviton3: 163.28 (SE +/- 0.05, N = 3)
m7g.16xlarge Graviton3: 162.96 (SE +/- 0.13, N = 3)
c6g.16xlarge Graviton2: 81.94 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 9.206490 (SE +/- 0.011347, N = 3)
c7g.16xlarge Graviton3: 9.422270 (SE +/- 0.011497, N = 3)
c6g.16xlarge Graviton2: 16.480500 (SE +/- 0.018218, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 6.720537 (SE +/- 0.000869, N = 3)
c7g.16xlarge Graviton3: 6.961345 (SE +/- 0.005468, N = 3)
c6g.16xlarge Graviton2: 12.176830 (SE +/- 0.018924, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m7g.16xlarge Graviton3: 21988.99 (SE +/- 130.18, N = 3)
c7g.16xlarge Graviton3: 21911.02 (SE +/- 283.23, N = 3)
c6g.16xlarge Graviton2: 13103.62 (SE +/- 31.56, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 40.89 (SE +/- 0.01, N = 3)
c7g.16xlarge Graviton3: 40.83 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 20.63 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 4.375 (SE +/- 0.011, N = 3)
c7g.16xlarge Graviton3: 4.442 (SE +/- 0.021, N = 3)
c6g.16xlarge Graviton2: 6.051 (SE +/- 0.016, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m7g.16xlarge Graviton3: 50126.29 (SE +/- 24.30, N = 3)
c7g.16xlarge Graviton3: 49742.30 (SE +/- 32.94, N = 3)
c6g.16xlarge Graviton2: 25671.29 (SE +/- 7.02, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 3.09871038 (SE +/- 0.02702838, N = 3)
c7g.16xlarge Graviton3: 3.14447999 (SE +/- 0.03233273, N = 3)
c6g.16xlarge Graviton2: 5.63720735 (SE +/- 0.02560507, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 78.50 (SE +/- 0.02, N = 3)
c7g.16xlarge Graviton3: 77.77 (SE +/- 0.31, N = 3)
c6g.16xlarge Graviton2: 40.11 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 81.44 (SE +/- 0.01, N = 3)
c7g.16xlarge Graviton3: 81.01 (SE +/- 0.07, N = 3)
c6g.16xlarge Graviton2: 41.98 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 164.87 (SE +/- 0.27, N = 3)
c7g.16xlarge Graviton3: 162.01 (SE +/- 0.11, N = 3)
c6g.16xlarge Graviton2: 92.40 (SE +/- 0.19, N = 3)
1. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better)
m7g.16xlarge Graviton3: 37.56 (SE +/- 0.06, N = 3)
c7g.16xlarge Graviton3: 37.41 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 25.95 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -ldl

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 57.15 (SE +/- 0.28, N = 3)
c7g.16xlarge Graviton3: 55.11 (SE +/- 0.15, N = 3)
c6g.16xlarge Graviton2: 32.75 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 138.01 (SE +/- 0.12, N = 3)
c7g.16xlarge Graviton3: 133.51 (SE +/- 0.47, N = 3)
c6g.16xlarge Graviton2: 81.45 (SE +/- 0.61, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 186.36 (SE +/- 0.27, N = 3)
c7g.16xlarge Graviton3: 184.03 (SE +/- 0.47, N = 3)
c6g.16xlarge Graviton2: 135.36 (SE +/- 0.35, N = 3)
1. (CXX) g++ options: -O3

HeFFTe - Highly Efficient FFT for Exascale

Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128

HeFFTe - Highly Efficient FFT for Exascale 2.3 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 306.54 (SE +/- 0.83, N = 3)
c7g.16xlarge Graviton3: 301.42 (SE +/- 0.56, N = 3)
c6g.16xlarge Graviton2: 209.50 (SE +/- 0.64, N = 3)
1. (CXX) g++ options: -O3


Phoronix Test Suite v10.8.5