Amazon testing on Ubuntu 22.04 via the Phoronix Test Suite.
m7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
c6g.16xlarge Graviton2: Changed Processor to ARMv8 Neoverse-N1 (64 Cores).
Changed Motherboard to Amazon EC2 c6g.16xlarge (1.0 BIOS).
Changed Memory to 128GB.
[Figure: m7g.16xlarge Graviton3 vs. c6g.16xlarge Graviton2 Comparison, Phoronix Test Suite. Bar chart of per-test percentage differences relative to the baseline; the largest labeled difference is OpenSSL RSA4096 at 288%.]
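The percentages in the comparison chart appear consistent with a simple ratio of the two systems' results, inverted for tests where fewer is better. The sketch below is an illustration under that assumption, not the Phoronix Test Suite's own code; the function name `relative_gain` is hypothetical, and the input values are copied from this result file.

```python
def relative_gain(new: float, old: float, higher_is_better: bool = True) -> float:
    """Return the percentage by which `new` beats `old`.

    For "fewer is better" metrics (for example seconds) the ratio is inverted,
    so a shorter runtime still yields a positive gain.
    """
    if new <= 0 or old <= 0:
        raise ValueError("results must be positive")
    ratio = new / old if higher_is_better else old / new
    return (ratio - 1.0) * 100.0


# Values copied from this result file: Graviton3 first, Graviton2 second.
rsa_sign = relative_gain(10181.9, 2624.3)                     # sign/s, more is better
gpaw = relative_gain(61.831, 92.760, higher_is_better=False)  # seconds, fewer is better

print(f"OpenSSL RSA4096 sign/s: {rsa_sign:.1f}%")
print(f"GPAW Carbon Nanotube:   {gpaw:.1f}%")
```

With these inputs the two results round to the 288% and 50% labels listed for RSA4096 and Carbon Nanotube in the chart.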
Amazon AWS
Test | m7g.16xlarge Graviton3 | c6g.16xlarge Graviton2
compress-7zip: Compression Rating | 316825 | 240702
compress-7zip: Decompression Rating | 285540 | 234202
mt-dgemm: Sustained Floating-Point Rate | 24.362353 | 20.417952
amg: | 1646761667 | 1035586333
apache: 500 | 71754.89 | 66640.93
apache: 1000 | 60965.70 | 67276.83
brl-cad: VGR Performance Metric | 783777 | 533020
coremark: CoreMark Size 666 - Iterations Per Second | 1601880.342264 | 1260642.177024
gpaw: Carbon Nanotube | 61.831 | 92.760
graph500: 26 | 1194320000 | 860432000
graph500: 26 | 1227790000 | 874389000
graph500: 26 | 299497000 | 209350000
graph500: 26 | 419754000 | 284689000
gromacs: MPI CPU - water_GMX50_bare | 4.223 | 2.767
heffte: c2c - FFTW - float - 128 | 186.356 | 135.358
heffte: c2c - FFTW - float - 256 | 81.4442 | 41.9816
heffte: c2c - FFTW - float - 512 | 88.0482 | 42.8284
heffte: r2c - FFTW - float - 128 | 306.540 | 209.496
heffte: r2c - FFTW - float - 256 | 164.873 | 92.3996
heffte: r2c - FFTW - float - 512 | 162.956 | 81.9412
heffte: c2c - FFTW - double - 128 | 57.1503 | 32.7468
heffte: c2c - FFTW - double - 256 | 40.8923 | 20.6279
heffte: c2c - FFTW - double - 512 | 46.2504 | 24.2658
heffte: r2c - FFTW - double - 128 | 138.014 | 81.4498
heffte: r2c - FFTW - double - 256 | 78.5049 | 40.1104
heffte: r2c - FFTW - double - 512 | 84.4739 | 44.9297
hpcg: 144 144 144 - 60 | 33.7901 | -
hpcg: 160 160 160 - 60 | 33.8195 | -
kripke: | 339000400 | 220120233
laghos: Triple Point Problem | 232.01 | 180.80
laghos: Sedov Blast Wave, ube_922_hex.mesh | 410.55 | 322.37
lammps: 20k Atoms | 36.927 | 25.171
lammps: Rhodopsin Protein | 37.558 | 25.950
lczero: BLAS | 1301 | 947
lczero: Eigen | 1398 | 891
liquid-dsp: 32 - 256 - 32 | 1136066667 | 765466667
liquid-dsp: 32 - 256 - 57 | 721493333 | 489270000
liquid-dsp: 64 - 256 - 32 | 2270500000 | 1531400000
liquid-dsp: 64 - 256 - 57 | 1442400000 | 978200000
liquid-dsp: 32 - 256 - 512 | 81396667 | 67486333
liquid-dsp: 64 - 256 - 512 | 162753333 | 134926667
lulesh: | 28296.378 | 17557.485
mocassin: Gas HII40 | 13.575 | 20.758
mocassin: Dust 2D tau100.0 | 82.669 | 145.374
npb: CG.C | 21988.99 | 13103.62
npb: EP.D | 3738.98 | 2216.26
npb: LU.C | 28341.68 | 18741.90
npb: MG.C | 50126.29 | 25671.29
npb: SP.C | 17244.85 | 9711.70
nekrs: Kershaw | 3150680000 | 1760336667
nekrs: TurboPipe Periodic | 3976300000 | 2220190000
nginx: 500 | 255768.44 | 148964.69
nginx: 1000 | 255616.04 | 158676.40
nwchem: C240 Buckyball | 1940.2 | 2976.9
openssl: SHA256 | 54212515580 | 42472798847
openssl: SHA512 | 32125448870 | 14393925490
openssl: RSA4096 | 10181.9 | 2624.3
openssl: RSA4096 | 713859.5 | 214040.9
openssl: ChaCha20 | 103226784517 | 67292541203
openssl: AES-128-GCM | 332033171900 | 158436163857
openssl: AES-256-GCM | 283333113630 | 129199593157
openssl: ChaCha20-Poly1305 | 74287460990 | 46717636807
pennant: sedovbig | 9.206490 | 16.48050
pennant: leblancbig | 6.720537 | 12.17683
qmcpack: Li2_STO_ae | 112.61 | 165.12
qmcpack: simple-H2O | 28.041 | 45.225
qmcpack: FeCO6_b3lyp_gms | 211.60 | 302.19
qmcpack: FeCO6_b3lyp_gms | 205.72 | 297.94
remhos: Sample Remap Example | 14.040 | 20.740
rodinia: OpenMP LavaMD | 43.788 | 62.224
rodinia: OpenMP CFD Solver | 4.375 | 6.051
rodinia: OpenMP Streamcluster | 11.663 | 13.735
srsran: Downlink Processor Benchmark | 318.5 | 197.2
srsran: PUSCH Processor Benchmark, Throughput Total | 5413.8 | 3938.7
srsran: PUSCH Processor Benchmark, Throughput Thread | 95.8 | 63.8
stockfish: Total Time | 112119711 | 86609284
stress-ng: NUMA | 3759.10 | 2112.66
stress-ng: CPU Cache | 3892396.34 | 1921785.20
stress-ng: Matrix Math | 368750.67 | 284713.63
stress-ng: Vector Math | 217235.59 | 147886.14
stress-ng: Matrix 3D Math | 10403.93 | 5752.17
stress-ng: Memory Copying | 20484.24 | 11324.79
stress-ng: Vector Shuffle | 54143.40 | 35614.51
stress-ng: Wide Vector Math | 1542834.94 | 997272.65
stress-ng: Fused Multiply-Add | 63762252.76 | 37732190.54
stress-ng: Vector Floating Point | 76102.55 | 42850.82
build-gem5: Time To Compile | 180.247 | 225.305
build-godot: Time To Compile | 154.378 | 218.276
build-nodejs: Time To Compile | 237.783 | 287.814
incompact3d: input.i3d 129 Cells Per Direction | 3.09871038 | 5.63720735
incompact3d: input.i3d 193 Cells Per Direction | 13.9454180 | 25.8825658
7-Zip Compression 22.01 - Test: Decompression Rating (MIPS, More Is Better)
c6g.16xlarge Graviton2: 234202 (SE +/- 15.43, N = 3)
m7g.16xlarge Graviton3: 285540 (SE +/- 93.51, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
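Each result carries an "SE +/- x, N = 3" annotation, meaning the value is an average over three runs with a standard error attached. A minimal sketch of how such a figure can be computed follows, assuming the usual definition (sample standard deviation divided by the square root of N); whether the Phoronix Test Suite uses exactly this variant is an assumption, and the run values below are hypothetical, not taken from this result file.

```python
import math
import statistics


def standard_error(samples: list[float]) -> float:
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run results for one test (not from this result file).
runs = [10.0, 12.0, 14.0]
mean = statistics.mean(runs)
se = standard_error(runs)

# Same shape as the annotations in the results: average, SE, and run count.
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small standard error relative to the average suggests the runs agreed closely, which matters when comparing results that differ by only a few percent.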
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c6g.16xlarge Graviton2: 1035586333 (SE +/- 140169.34, N = 3)
m7g.16xlarge Graviton3: 1646761667 (SE +/- 103191.30, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
Apache HTTP Server This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.
Apache HTTP Server 2.4.56 - Concurrent Requests: 500 (Requests Per Second, More Is Better)
c6g.16xlarge Graviton2: 66640.93 (SE +/- 181.58, N = 3)
m7g.16xlarge Graviton3: 71754.89 (SE +/- 116.32, N = 3)
Apache HTTP Server 2.4.56 - Concurrent Requests: 1000 (Requests Per Second, More Is Better)
c6g.16xlarge Graviton2: 67276.83 (SE +/- 107.55, N = 3)
m7g.16xlarge Graviton3: 60965.70 (SE +/- 72.21, N = 3)
1. (CC) gcc options (all results above): -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
BRL-CAD 7.34 - VGR Performance Metric (VGR Performance Metric, More Is Better)
c6g.16xlarge Graviton2: 533020
m7g.16xlarge Graviton3: 783777
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
GPAW 23.6 - Input: Carbon Nanotube (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 92.76 (SE +/- 0.02, N = 3)
m7g.16xlarge Graviton3: 61.83 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
Graph500 3.0 - Scale: 26 (bfs median_TEPS, More Is Better)
c6g.16xlarge Graviton2: 860432000
m7g.16xlarge Graviton3: 1194320000
Graph500 3.0 - Scale: 26 (bfs max_TEPS, More Is Better)
c6g.16xlarge Graviton2: 874389000
m7g.16xlarge Graviton3: 1227790000
Graph500 3.0 - Scale: 26 (sssp median_TEPS, More Is Better)
c6g.16xlarge Graviton2: 209350000
m7g.16xlarge Graviton3: 299497000
Graph500 3.0 - Scale: 26 (sssp max_TEPS, More Is Better)
c6g.16xlarge Graviton2: 284689000
m7g.16xlarge Graviton3: 419754000
1. (CC) gcc options (all results above): -fcommon -O3 -lpthread -lm -lmpi
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
GROMACS 2023 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
c6g.16xlarge Graviton2: 2.767 (SE +/- 0.002, N = 3)
m7g.16xlarge Graviton3: 4.223 (SE +/- 0.003, N = 3)
1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
HeFFTe - Highly Efficient FFT for Exascale 2.3 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
c6g.16xlarge Graviton2: 135.36 (SE +/- 0.35, N = 3)
m7g.16xlarge Graviton3: 186.36 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -O3
High Performance Conjugate Gradient HPCG is the High Performance Conjugate Gradient, a scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.
High Performance Conjugate Gradient 3.1 - X Y Z: 144 144 144 - RT: 60 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 33.79 (SE +/- 0.00, N = 3)
c6g.16xlarge Graviton2: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory
High Performance Conjugate Gradient 3.1 - X Y Z: 160 160 160 - RT: 60 (GFLOP/s, More Is Better)
m7g.16xlarge Graviton3: 33.82 (SE +/- 0.00, N = 3)
c6g.16xlarge Graviton2: The test quit with a non-zero exit status. E: cat: 'HPCG-Benchmark*.txt': No such file or directory
1. (CXX) g++ options (all results above): -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
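The error text suggests that `cat` was handed a glob pattern that matched no file, which would mean no HPCG report file was present when the results were collected on the c6g.16xlarge; the result file does not say why. The sketch below only illustrates that failure mode in Python and is not the test profile's actual script: the helper name `read_results` and the demo file name are hypothetical, and only the pattern `HPCG-Benchmark*.txt` comes from the error message.

```python
import glob
import os
import tempfile


def read_results(directory: str, pattern: str = "HPCG-Benchmark*.txt"):
    """Return the concatenated contents of matching files, or None if none match.

    Checking the match list first avoids the situation seen in the error
    message, where an unmatched pattern is passed on as a literal file name.
    """
    paths = sorted(glob.glob(os.path.join(directory, pattern)))
    if not paths:
        return None
    parts = []
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            parts.append(handle.read())
    return "".join(parts)


with tempfile.TemporaryDirectory() as workdir:
    # No report file yet: the lookup reports "nothing found" instead of failing.
    missing = read_results(workdir)

    # Hypothetical report file, created only for this demonstration.
    demo_path = os.path.join(workdir, "HPCG-Benchmark_demo.txt")
    with open(demo_path, "w", encoding="utf-8") as handle:
        handle.write("placeholder output\n")
    found = read_results(workdir)

print(missing)
print(found)
```

Distinguishing "benchmark produced no report" from "report could not be parsed" in this way makes a failed run easier to diagnose than a bare non-zero exit status.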
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
Kripke 1.2.6 (Throughput FoM, More Is Better)
c6g.16xlarge Graviton2: 220120233 (SE +/- 102787.75, N = 3)
m7g.16xlarge Graviton3: 339000400 (SE +/- 619419.33, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -ldl
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1 - Test: Triple Point Problem (Major Kernels Total Rate, More Is Better)
c6g.16xlarge Graviton2: 180.80 (SE +/- 0.48, N = 3)
m7g.16xlarge Graviton3: 232.01 (SE +/- 0.28, N = 3)
Laghos 3.1 - Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Total Rate, More Is Better)
c6g.16xlarge Graviton2: 322.37 (SE +/- 0.89, N = 3)
m7g.16xlarge Graviton3: 410.55 (SE +/- 0.42, N = 3)
1. (CXX) g++ options (all results above): -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
LeelaChessZero 0.28 - Backend: BLAS (Nodes Per Second, More Is Better)
c6g.16xlarge Graviton2: 947 (SE +/- 11.79, N = 3)
m7g.16xlarge Graviton3: 1301 (SE +/- 4.67, N = 3)
LeelaChessZero 0.28 - Backend: Eigen (Nodes Per Second, More Is Better)
c6g.16xlarge Graviton2: 891 (SE +/- 4.73, N = 3)
m7g.16xlarge Graviton3: 1398 (SE +/- 8.74, N = 3)
1. (CXX) g++ options (all results above): -flto -pthread
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 765466667 (SE +/- 456520.66, N = 3)
m7g.16xlarge Graviton3: 1136066667 (SE +/- 233333.33, N = 3)
Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 489270000 (SE +/- 23094.01, N = 3)
m7g.16xlarge Graviton3: 721493333 (SE +/- 3333.33, N = 3)
Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 1531400000 (SE +/- 251661.15, N = 3)
m7g.16xlarge Graviton3: 2270500000 (SE +/- 435889.89, N = 3)
Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 978200000 (SE +/- 11547.01, N = 3)
m7g.16xlarge Graviton3: 1442400000 (SE +/- 152752.52, N = 3)
Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 67486333 (SE +/- 333.33, N = 3)
m7g.16xlarge Graviton3: 81396667 (SE +/- 1855.92, N = 3)
Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
c6g.16xlarge Graviton2: 134926667 (SE +/- 3333.33, N = 3)
m7g.16xlarge Graviton3: 162753333 (SE +/- 6666.67, N = 3)
1. (CC) gcc options (all results above): -O3 -pthread -lm -lc -lliquid
Monte Carlo Simulations of Ionised Nebulae Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 - Input: Gas HII40 (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 20.76 (SE +/- 0.17, N = 3)
m7g.16xlarge Graviton3: 13.58 (SE +/- 0.05, N = 3)
Monte Carlo Simulations of Ionised Nebulae 2.02.73.3 - Input: Dust 2D tau100.0 (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 145.37 (SE +/- 0.86, N = 3)
m7g.16xlarge Graviton3: 82.67 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options (all results above): -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
c6g.16xlarge Graviton2: 13103.62 (SE +/- 31.56, N = 3)
m7g.16xlarge Graviton3: 21988.99 (SE +/- 130.18, N = 3)
NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
c6g.16xlarge Graviton2: 2216.26 (SE +/- 2.22, N = 3)
m7g.16xlarge Graviton3: 3738.98 (SE +/- 1.69, N = 3)
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
c6g.16xlarge Graviton2: 18741.90 (SE +/- 26.12, N = 3)
m7g.16xlarge Graviton3: 28341.68 (SE +/- 48.62, N = 3)
NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
c6g.16xlarge Graviton2: 25671.29 (SE +/- 7.02, N = 3)
m7g.16xlarge Graviton3: 50126.29 (SE +/- 24.30, N = 3)
NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better)
c6g.16xlarge Graviton2: 9711.70 (SE +/- 1.54, N = 3)
m7g.16xlarge Graviton3: 17244.85 (SE +/- 10.19, N = 3)
1. (F9X) gfortran options (all results above): -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
nekRS nekRS is an open-source Navier Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 of the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
nekRS 23.0 - Input: Kershaw (flops/rank, More Is Better)
c6g.16xlarge Graviton2: 1760336667 (SE +/- 737119.02, N = 3)
m7g.16xlarge Graviton3: 3150680000 (SE +/- 1575066.14, N = 3)
nekRS 23.0 - Input: TurboPipe Periodic (flops/rank, More Is Better)
c6g.16xlarge Graviton2: 2220190000 (SE +/- 144222.05, N = 3)
m7g.16xlarge Graviton3: 3976300000 (SE +/- 1199180.28, N = 3)
1. (CXX) g++ options (all results above): -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better)
c6g.16xlarge Graviton2: 148964.69 (SE +/- 90.87, N = 3)
m7g.16xlarge Graviton3: 255768.44 (SE +/- 323.56, N = 3)
nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better)
c6g.16xlarge Graviton2: 158676.40 (SE +/- 185.79, N = 3)
m7g.16xlarge Graviton3: 255616.04 (SE +/- 137.20, N = 3)
1. (CC) gcc options (all results above): -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
NWChem 7.0.2 - Input: C240 Buckyball (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 2976.9
m7g.16xlarge Graviton3: 1940.2
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1 - Algorithm: SHA256 (byte/s, More Is Better)
c6g.16xlarge Graviton2: 42472798847 (SE +/- 245440310.03, N = 3)
m7g.16xlarge Graviton3: 54212515580 (SE +/- 18610524.10, N = 3)
OpenSSL 3.1 - Algorithm: SHA512 (byte/s, More Is Better)
c6g.16xlarge Graviton2: 14393925490 (SE +/- 9173912.49, N = 3)
m7g.16xlarge Graviton3: 32125448870 (SE +/- 17714077.14, N = 3)
OpenSSL 3.1 - Algorithm: RSA4096 (sign/s, More Is Better)
c6g.16xlarge Graviton2: 2624.3 (SE +/- 1.71, N = 3)
m7g.16xlarge Graviton3: 10181.9 (SE +/- 1.27, N = 3)
OpenSSL 3.1 - Algorithm: RSA4096 (verify/s, More Is Better)
c6g.16xlarge Graviton2: 214040.9 (SE +/- 88.30, N = 3)
m7g.16xlarge Graviton3: 713859.5 (SE +/- 21.82, N = 3)
OpenSSL 3.1 - Algorithm: ChaCha20 (byte/s, More Is Better)
c6g.16xlarge Graviton2: 67292541203 (SE +/- 35952887.59, N = 3)
m7g.16xlarge Graviton3: 103226784517 (SE +/- 1293723.80, N = 3)
OpenSSL 3.1 - Algorithm: AES-128-GCM (byte/s, More Is Better)
c6g.16xlarge Graviton2: 158436163857 (SE +/- 9833681.11, N = 3)
m7g.16xlarge Graviton3: 332033171900 (SE +/- 81289574.27, N = 3)
OpenSSL 3.1 - Algorithm: AES-256-GCM (byte/s, More Is Better)
c6g.16xlarge Graviton2: 129199593157 (SE +/- 2312792.64, N = 3)
m7g.16xlarge Graviton3: 283333113630 (SE +/- 6411836.47, N = 3)
OpenSSL 3.1 - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
c6g.16xlarge Graviton2: 46717636807 (SE +/- 1132293.08, N = 3)
m7g.16xlarge Graviton3: 74287460990 (SE +/- 1340503.89, N = 3)
1. (CC) gcc options (all results above): -pthread -O3 -lssl -lcrypto -ldl
Pennant 1.0.1 - Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 12.176830 (SE +/- 0.018924, N = 3)
m7g.16xlarge Graviton3: 6.720537 (SE +/- 0.000869, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16 - Input: Li2_STO_ae (Total Execution Time - Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 165.12 (SE +/- 1.13, N = 3)
m7g.16xlarge Graviton3: 112.61 (SE +/- 0.08, N = 3)
QMCPACK 3.16 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 45.23 (SE +/- 0.24, N = 3)
m7g.16xlarge Graviton3: 28.04 (SE +/- 0.03, N = 3)
QMCPACK 3.16 - Input: FeCO6_b3lyp_gms (Total Execution Time - Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 302.19 (SE +/- 0.37, N = 3)
m7g.16xlarge Graviton3: 211.60 (SE +/- 0.22, N = 3)
QMCPACK 3.16 - Input: FeCO6_b3lyp_gms (Total Execution Time - Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 297.94 (SE +/- 1.75, N = 3)
m7g.16xlarge Graviton3: 205.72 (SE +/- 0.45, N = 3)
1. (CXX) g++ options (all results above): -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -mcpu=native -O3 -lm -ldl
Remhos
Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

Remhos 1.0 - Test: Sample Remap Example (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 20.74 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 14.04 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 62.22 (SE +/- 0.04, N = 3)
m7g.16xlarge Graviton3: 43.79 (SE +/- 0.15, N = 3)

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 6.051 (SE +/- 0.016, N = 3)
m7g.16xlarge Graviton3: 4.375 (SE +/- 0.011, N = 3)

Rodinia 3.1 - Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 13.74 (SE +/- 0.21, N = 15)
m7g.16xlarge Graviton3: 11.66 (SE +/- 0.14, N = 3)

Compiler notes for the three Rodinia results above: (CXX) g++ options: -O2 -lOpenCL
srsRAN Project
srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.

srsRAN Project 23.5 - Test: Downlink Processor Benchmark (Mbps, More Is Better)
c6g.16xlarge Graviton2: 197.2 (SE +/- 0.25, N = 3)
m7g.16xlarge Graviton3: 318.5 (SE +/- 0.91, N = 3)

srsRAN Project 23.5 - Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
c6g.16xlarge Graviton2: 3938.7 (SE +/- 2.53, N = 3)
m7g.16xlarge Graviton3: 5413.8 (SE +/- 4.08, N = 3)

srsRAN Project 23.5 - Test: PUSCH Processor Benchmark, Throughput Thread (Mbps, More Is Better)
c6g.16xlarge Graviton2: 63.8 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 95.8 (SE +/- 0.03, N = 3)

Compiler notes for the three srsRAN Project results above: (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
Stockfish
This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 15 - Total Time (Nodes Per Second, More Is Better)
c6g.16xlarge Graviton2: 86609284 (SE +/- 2597495.37, N = 15)
m7g.16xlarge Graviton3: 112119711 (SE +/- 2854071.93, N = 15)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
Stress-NG 0.15.10 - Test: CPU Cache (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 1921785.20 (SE +/- 21905.72, N = 15)
m7g.16xlarge Graviton3: 3892396.34 (SE +/- 57217.78, N = 15)

Stress-NG 0.15.10 - Test: Matrix Math (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 284713.63 (SE +/- 8.13, N = 3)
m7g.16xlarge Graviton3: 368750.67 (SE +/- 53.44, N = 3)

Stress-NG 0.15.10 - Test: Vector Math (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 147886.14 (SE +/- 37.96, N = 3)
m7g.16xlarge Graviton3: 217235.59 (SE +/- 47.94, N = 3)

Stress-NG 0.15.10 - Test: Matrix 3D Math (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 5752.17 (SE +/- 1.40, N = 3)
m7g.16xlarge Graviton3: 10403.93 (SE +/- 6.38, N = 3)

Stress-NG 0.15.10 - Test: Memory Copying (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 11324.79 (SE +/- 1.12, N = 3)
m7g.16xlarge Graviton3: 20484.24 (SE +/- 3.80, N = 3)

Stress-NG 0.15.10 - Test: Vector Shuffle (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 35614.51 (SE +/- 74.80, N = 3)
m7g.16xlarge Graviton3: 54143.40 (SE +/- 21.44, N = 3)

Stress-NG 0.15.10 - Test: Wide Vector Math (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 997272.65 (SE +/- 505.84, N = 3)
m7g.16xlarge Graviton3: 1542834.94 (SE +/- 16116.93, N = 15)

Stress-NG 0.15.10 - Test: Fused Multiply-Add (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 37732190.54 (SE +/- 3687.67, N = 3)
m7g.16xlarge Graviton3: 63762252.76 (SE +/- 4870.19, N = 3)

Stress-NG 0.15.10 - Test: Vector Floating Point (Bogo Ops/s, More Is Better)
c6g.16xlarge Graviton2: 42850.82 (SE +/- 31.31, N = 3)
m7g.16xlarge Graviton3: 76102.55 (SE +/- 190.19, N = 3)

Compiler notes for the nine Stress-NG results above: (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
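With nine Stress-NG tests on very different scales, one common way to summarize them is a geometric mean of the per-test ratios. The sketch below is an illustration only and is not a figure reported by the result page; it takes the Bogo Ops/s values from the results above as (Graviton2, Graviton3) pairs. The variable names are assumptions made for this example.

```python
from statistics import geometric_mean

# Bogo Ops/s copied from the Stress-NG 0.15.10 results above,
# stored as (c6g.16xlarge Graviton2, m7g.16xlarge Graviton3).
stress_ng = {
    "CPU Cache": (1921785.20, 3892396.34),
    "Matrix Math": (284713.63, 368750.67),
    "Vector Math": (147886.14, 217235.59),
    "Matrix 3D Math": (5752.17, 10403.93),
    "Memory Copying": (11324.79, 20484.24),
    "Vector Shuffle": (35614.51, 54143.40),
    "Wide Vector Math": (997272.65, 1542834.94),
    "Fused Multiply-Add": (37732190.54, 63762252.76),
    "Vector Floating Point": (42850.82, 76102.55),
}

# More is better, so the ratio is Graviton3 / Graviton2.
per_test_ratio = {name: g3 / g2 for name, (g2, g3) in stress_ng.items()}

# The geometric mean keeps one large-scale test from dominating the summary.
overall = geometric_mean(per_test_ratio.values())

for name, ratio in sorted(per_test_ratio.items(), key=lambda kv: kv[1]):
    print(f"{name}: {ratio:.2f}x")
print(f"Geometric mean: {overall:.2f}x")
```

The geometric mean is used here instead of an arithmetic mean because the inputs are ratios; averaging the raw Bogo Ops/s values would be dominated by the Fused Multiply-Add test.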
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 5.63720735 (SE +/- 0.02560507, N = 3)
m7g.16xlarge Graviton3: 3.09871038 (SE +/- 0.02702838, N = 3)

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
c6g.16xlarge Graviton2: 25.88 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 13.95 (SE +/- 0.02, N = 3)

Compiler notes for the two Xcompact3d Incompact3d results above: (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
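For "Fewer Is Better" results measured in seconds, the comparison runs the other way: the speedup is the Graviton2 time divided by the Graviton3 time. A minimal sketch using the two Xcompact3d Incompact3d results above follows; the function names are hypothetical and chosen only for this illustration.

```python
# Run times in seconds copied from the Xcompact3d Incompact3d results above,
# stored as (c6g.16xlarge Graviton2, m7g.16xlarge Graviton3).
xcompact3d_seconds = {
    "129 Cells Per Direction": (5.63720735, 3.09871038),
    "193 Cells Per Direction": (25.88, 13.95),
}


def speedup(old_seconds, new_seconds):
    """Speedup for a fewer-is-better metric: old time over new time."""
    return old_seconds / new_seconds


def time_saved_percent(old_seconds, new_seconds):
    """Percentage reduction in run time relative to the old time."""
    return 100.0 * (old_seconds - new_seconds) / old_seconds


speedups = {name: speedup(g2, g3) for name, (g2, g3) in xcompact3d_seconds.items()}
savings = {name: time_saved_percent(g2, g3) for name, (g2, g3) in xcompact3d_seconds.items()}

for name in xcompact3d_seconds:
    print(f"{name}: {speedups[name]:.2f}x faster, {savings[name]:.1f}% less time")
```

Note that a speedup of about 1.8x corresponds to roughly 45% less run time, not 80%, which is why both figures are printed.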
m7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 June 2023 16:24 by user ubuntu.
c6g.16xlarge Graviton2 Processor: ARMv8 Neoverse-N1 (64 Cores), Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 June 2023 01:32 by user ubuntu.