Benchmarks by Michael Larabel for a future article on Phoronix.com.
m7g.16xlarge Graviton3
Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
c6g.16xlarge Graviton2
Changed Processor to ARMv8 Neoverse-N1 (64 Cores).
Changed Motherboard to Amazon EC2 c6g.16xlarge (1.0 BIOS).
Changed Memory to 128GB.
c7g.16xlarge Graviton3
Changed Processor to ARMv8 Neoverse-V1 (64 Cores).
Changed Motherboard to Amazon EC2 c7g.16xlarge (1.0 BIOS).
c7gn.16xlarge Graviton3E
Changed Motherboard to Amazon EC2 c7gn.16xlarge (1.0 BIOS).
c6a.16xlarge AMD Zen 3
Processor: AMD EPYC 7R13 (32 Cores / 64 Threads), Motherboard: Amazon EC2 c6a.16xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 128GB, Disk: 322GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xa0011cf
Python Notes: Python 3.10.12
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
egeo-07
Processor: 2 x Intel Xeon Silver 4208 @ 3.20GHz (16 Cores / 32 Threads), Motherboard: Dell Precision 7920 Rack 0DY2X0 (2.21.2 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 64GB, Disk: 2000GB TOSHIBA DT01ACA2, Graphics: Matrox G200eW3 15GB, Audio: NVIDIA TU104 HD Audio, Monitor: DELL 17FP, Network: 4 x Intel I350
OS: Debian 11, Kernel: 5.10.0-28-amd64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.2.138, Vulkan: 1.3.242, Compiler: GCC 10.2.1 20210110 + Clang 11.0.1-2 + CUDA 11.2, File-System: ext4, Screen Resolution: 1280x1024
Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605
Python Notes: Python 2.7.18 + Python 3.9.2
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
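The Security Notes lines above are flat lists of `name: status` pairs joined by ` + `. As a minimal sketch, assuming that separator never occurs inside a status string (it does not in the lines shown here), such a line could be split into a dictionary as follows. The function name `parse_notes` is hypothetical and not part of the Phoronix Test Suite.

```python
def parse_notes(line: str) -> dict[str, str]:
    """Split a 'name: status + name: status' line into a dict.

    Only the first ': ' in each item separates name from status, so
    statuses such as 'KVM: Mitigation of VMX disabled' stay intact.
    """
    result = {}
    for item in line.split(" + "):
        name, _, status = item.partition(": ")
        result[name.strip()] = status.strip()
    return result


# Shortened sample copied from the m7g.16xlarge Security Notes.
sample = (
    "meltdown: Not affected + "
    "spec_store_bypass: Mitigation of SSB disabled via prctl + "
    "spectre_v2: Mitigation of CSV2 BHB"
)
notes = parse_notes(sample)
print(notes["spectre_v2"])
```

A dictionary like this makes it easier to compare mitigation status across the six systems than reading the joined strings side by side.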
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks
[Chart: Logarithmic Result Overview, Phoronix Test Suite. Systems: m7g.16xlarge Graviton3, c6g.16xlarge Graviton2, c7g.16xlarge Graviton3, c7gn.16xlarge Graviton3E, c6a.16xlarge AMD Zen 3, egeo-07. Tests: Stress-NG, Pennant, BRL-CAD, Xcompact3d Incompact3d, NWChem, Graph500, OpenSSL, LAMMPS Molecular Dynamics Simulator, LULESH, HeFFTe - Highly Efficient FFT for Exascale, Remhos, Laghos, GPAW, Stockfish, 7-Zip Compression, Coremark, GROMACS, Algebraic Multi-Grid Benchmark, nginx, Rodinia, Kripke, QMCPACK, srsRAN Project, Liquid-DSP, Timed Node.js Compilation, Timed Godot Game Engine Compilation, Monte Carlo Simulations of Ionised Nebulae, Timed Gem5 Compilation, NAS Parallel Benchmarks, ACES DGEMM]
Tests, in column order: nwchem: C240 Buckyball; graph500: 26; graph500: 26; graph500: 26; graph500: 26; lammps: 20k Atoms; build-gem5: Time To Compile; brl-cad: VGR Performance Metric; stockfish: Total Time; lczero: BLAS; build-nodejs: Time To Compile; lczero: Eigen; qmcpack: FeCO6_b3lyp_gms; qmcpack: FeCO6_b3lyp_gms; build-godot: Time To Compile; mocassin: Dust 2D tau100.0; openssl: SHA256; openssl: AES-128-GCM; openssl: ChaCha20; openssl: ChaCha20-Poly1305; openssl: AES-256-GCM; openssl: SHA512; qmcpack: Li2_STO_ae; stress-ng: CPU Cache; laghos: Sedov Blast Wave, ube_922_hex.mesh; nekrs: TurboPipe Periodic; mt-dgemm: Sustained Floating-Point Rate; npb: EP.D; gpaw: Carbon Nanotube; nekrs: Kershaw; npb: SP.C; nginx: 1000; nginx: 500; stress-ng: Wide Vector Math; laghos: Triple Point Problem; rodinia: OpenMP LavaMD; heffte: c2c - FFTW - double - 512; gromacs: MPI CPU - water_GMX50_bare; npb: LU.C; openssl: RSA4096; openssl: RSA4096; coremark: CoreMark Size 666 - Iterations Per Second; rodinia: OpenMP Streamcluster; stress-ng: NUMA; qmcpack: simple-H2O; srsran: PUSCH Processor Benchmark, Throughput Total; stress-ng: Vector Floating Point; srsran: Downlink Processor Benchmark; srsran: PUSCH Processor Benchmark, Throughput Thread; heffte: r2c - FFTW - double - 512; heffte: c2c - FFTW - float - 512; kripke; compress-7zip: Decompression Rating; compress-7zip: Compression Rating; incompact3d: input.i3d 193 Cells Per Direction; liquid-dsp: 64 - 256 - 512; liquid-dsp: 32 - 256 - 512; stress-ng: Fused Multiply-Add; stress-ng: Vector Shuffle; liquid-dsp: 64 - 256 - 32; liquid-dsp: 64 - 256 - 57; stress-ng: Matrix 3D Math; stress-ng: Matrix Math; stress-ng: Memory Copying; liquid-dsp: 32 - 256 - 32; liquid-dsp: 32 - 256 - 57; stress-ng: Vector Math; remhos: Sample Remap Example; pennant: sedovbig; amg; heffte: r2c - FFTW - float - 512; mocassin: Gas HII40; lulesh; pennant: leblancbig; incompact3d: input.i3d 129 Cells Per Direction; heffte: c2c - FFTW - double - 256; npb: CG.C; rodinia: OpenMP CFD Solver; heffte: c2c - FFTW - float - 256; heffte: r2c - FFTW - double - 256; npb: MG.C; heffte: r2c - FFTW - float - 256; lammps: Rhodopsin Protein; heffte: c2c - FFTW - double - 128; heffte: r2c - FFTW - float - 128; heffte: c2c - FFTW - float - 128; heffte: r2c - FFTW - double - 128
m7g.16xlarge Graviton3: 1940.2 419754000 299497000 1227790000 1194320000 36.927 180.247 783777 112119711 1301 237.783 1398 211.60 205.72 154.378 82.669 54212515580 332033171900 103226784517 74287460990 283333113630 32125448870 112.61 3892396.34 410.55 3976300000 24.362353 3738.98 61.831 3150680000 17244.85 255616.04 255768.44 1542834.94 232.01 43.788 46.2504 4.223 28341.68 713859.5 10181.9 1601880.342264 11.663 3759.10 28.041 5413.8 76102.55 318.5 95.8 84.4739 88.0482 339000400 285540 316825 13.9454180 162753333 81396667 63762252.76 54143.40 2270500000 1442400000 10403.93 368750.67 20484.24 1136066667 721493333 217235.59 14.040 9.206490 1646761667 162.956 13.575 28296.378 6.720537 3.09871038 40.8923 21988.99 4.375 81.4442 78.5049 50126.29 164.873 37.558 57.1503 306.540 186.356 138.014
c6g.16xlarge Graviton2: 2976.9 284689000 209350000 874389000 860432000 25.171 225.305 533020 86609284 947 287.814 891 302.19 297.94 218.276 145.374 42472798847 158436163857 67292541203 46717636807 129199593157 14393925490 165.12 1921785.20 322.37 2220190000 20.417952 2216.26 92.760 1760336667 9711.70 158676.40 148964.69 997272.65 180.80 62.224 24.2658 2.767 18741.90 214040.9 2624.3 1260642.177024 13.735 2112.66 45.225 3938.7 42850.82 197.2 63.8 44.9297 42.8284 220120233 234202 240702 25.8825658 134926667 67486333 37732190.54 35614.51 1531400000 978200000 5752.17 284713.63 11324.79 765466667 489270000 147886.14 20.740 16.48050 1035586333 81.9412 20.758 17557.485 12.17683 5.63720735 20.6279 13103.62 6.051 41.9816 40.1104 25671.29 92.3996 25.950 32.7468 209.496 135.358 81.4498
c7g.16xlarge Graviton3: 1962.7 415758000 293826000 1206990000 1177710000 36.862 181.779 789066 117316476 1333 238.543 1382 211.32 204.77 156.687 82.822 54216561263 332064349843 103275516997 74318842213 283373795737 32145914147 112.64 3844101.98 408.01 3978983333 24.140605 3664.54 62.083 3261853333 17219.95 255552.05 255145.52 1535336.57 230.68 43.963 46.3706 4.200 28375.71 713945.9 10181.4 1605948.674645 11.625 3523.58 27.990 5356.8 76178.46 319.7 95.7 84.7451 88.1842 354442733 285633 311056 13.8326693 162766667 81412000 63818458.61 54472.07 2271966667 1442366667 10813.59 368671.39 20478.67 1136133333 721386667 217446.12 14.120 9.422270 1765277667 163.276 13.659 28708.656 6.961345 3.14447999 40.8283 21911.02 4.442 81.0096 77.7685 49742.30 162.010 37.412 55.1055 301.418 184.026 133.514
c7gn.16xlarge Graviton3E: 1914 411762000 296164000 1207760000 1175640000 36.838 182.471 744743 117027121 1392 238.636 1444 188.28 204.25 155.951 82.974 54154218593 411130469943 114118119423 79969465487 351152465420 32126059040 113.20 3860335.38 423.11 4141440000 24.078529 3657.67 56.440 3302823333 17163.11 256585.83 253518.51 1530043.52 236.22 44.044 46.5300 4.820 28369.11 713754.8 10183.3 1611801.559265 10.690 3525.17 27.999 5431.2 76911.74 323.2 97.4 85.0060 88.4551 354234067 285677 312009 13.7606726 162756667 81394000 63723431.55 54695.04 2266833333 1442666667 10882.02 369258.89 20475.96 1136000000 721380000 217567.10 14.082 9.340953 1765966333 163.559 13.525 28736.226 6.839998 3.11489828 40.9708 22155.36 4.429 81.1671 78.1658 49860.68 162.361 37.482 55.1038 300.396 184.110 133.422
c6a.16xlarge AMD Zen 3: 3440.4 204550000 157688000 417777000 410571000 20.342 192.118 485038 96905609 1316 230.423 1152 184.10 187.32 147.737 194.435 45857534777 151449269317 138389378753 92522999373 138457889450 15291283297 123.95 1447265.35 275.92 4337536667 9.388050 3061.42 89.818 4308810000 34025.35 163178.67 165847.75 1380146.63 227.40 64.179 23.5212 3.965 95221.40 548396.5 8392.4 1466587.036580 8.396 552.68 26.867 6479.1 96529.51 691.3 215.9 42.4394 44.3176 237087650 235787 230970 30.3145288 460076667 274803333 30920910.92 22255.84 2184866667 1710800000 4571.96 147576.41 8080.43 1193966667 1444266667 221776.15 22.104 16.53050 836999300 82.7584 12.669 16708.258 9.917565 7.01975288 20.8719 20210.00 9.342 43.5907 41.5868 45946.81 102.652 19.563 48.9432 158.858 98.7026 86.3730
egeo-07: 10984 85755800 69070600 210572000 208288000 7.539 462.599 102684 26092344 641.867 608.86 552.72 391.737 281.249 3466925203 61131320253 56213494627 26507041907 44619128560 3878566000 408.45 1525522.46 70.77 2.094206 1329.20 257.464 11726.55 70348.77 73654.19 399851.85 61.58 212.545 10.6653 1.116 36556.34 200342.7 3041.8 362563.288370 23.247 0.89 77.523 1129.5 21347.32 342.3 88.5 18.9902 19.6715 109107233 59055 75840 84.6642202 129560000 128250000 10084119.21 7162.56 641243333 477896667 1841.58 55962.00 3209.83 633483333 429596667 38515.89 68.956 89.00490 444456133 34.3674 26.920 5676.7762 43.45880 21.2392737 9.32379 9661.91 18.630 16.9971 17.1170 19477.23 32.4287 7.176 9.62296 58.2012 26.7367 23.0160
Note: the egeo-07 list has fewer values than the test list because that system has no result for some tests (for example, both LeelaChessZero runs failed there), so its values do not line up one-to-one with the test order.
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
NWChem 7.0.2, Input: C240 Buckyball (Seconds, Fewer Is Better)
c7gn.16xlarge Graviton3E: 1914.0
m7g.16xlarge Graviton3: 1940.2
c7g.16xlarge Graviton3: 1962.7
c6g.16xlarge Graviton2: 2976.9
c6a.16xlarge AMD Zen 3: 3440.4
egeo-07: 10984.0
Additional flags listed with the results: -m64 -ldl -lutil -m64
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
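Because NWChem is a fewer-is-better result, relative performance is the baseline time divided by each system's time. The following is a minimal sketch using the seconds reported above; the choice of Graviton2 as the baseline and the helper name `speedup_over` are assumptions made for illustration.

```python
# Seconds for NWChem 7.0.2 (C240 Buckyball), copied from the result above.
nwchem_seconds = {
    "c7gn.16xlarge Graviton3E": 1914.0,
    "m7g.16xlarge Graviton3": 1940.2,
    "c7g.16xlarge Graviton3": 1962.7,
    "c6g.16xlarge Graviton2": 2976.9,
    "c6a.16xlarge AMD Zen 3": 3440.4,
    "egeo-07": 10984.0,
}


def speedup_over(baseline: str, times: dict[str, float]) -> dict[str, float]:
    """For a fewer-is-better metric, speedup = baseline time / system time."""
    base = times[baseline]
    return {name: base / t for name, t in times.items()}


# Hypothetical choice of baseline: the Graviton2 instance.
speedups = speedup_over("c6g.16xlarge Graviton2", nwchem_seconds)
for name, factor in sorted(speedups.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {factor:.2f}x")
```

Values above 1.0 mean the system finished faster than the baseline; values below 1.0 mean it was slower.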
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
Graph500 3.0, Scale: 26 (sssp max_TEPS, More Is Better)
m7g.16xlarge Graviton3: 419754000
c7g.16xlarge Graviton3: 415758000
c7gn.16xlarge Graviton3E: 411762000
c6g.16xlarge Graviton2: 284689000
c6a.16xlarge AMD Zen 3: 204550000
egeo-07: 85755800
Additional flags listed with the results: -pthread
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0, Scale: 26 (sssp median_TEPS, More Is Better)
m7g.16xlarge Graviton3: 299497000
c7gn.16xlarge Graviton3E: 296164000
c7g.16xlarge Graviton3: 293826000
c6g.16xlarge Graviton2: 209350000
c6a.16xlarge AMD Zen 3: 157688000
egeo-07: 69070600
Additional flags listed with the results: -pthread
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0, Scale: 26 (bfs max_TEPS, More Is Better)
m7g.16xlarge Graviton3: 1227790000
c7gn.16xlarge Graviton3E: 1207760000
c7g.16xlarge Graviton3: 1206990000
c6g.16xlarge Graviton2: 874389000
c6a.16xlarge AMD Zen 3: 417777000
egeo-07: 210572000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0, Scale: 26 (bfs median_TEPS, More Is Better)
m7g.16xlarge Graviton3: 1194320000
c7g.16xlarge Graviton3: 1177710000
c7gn.16xlarge Graviton3E: 1175640000
c6g.16xlarge Graviton2: 860432000
c6a.16xlarge AMD Zen 3: 410571000
egeo-07: 208288000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
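One common way to condense several more-is-better results into a single figure is the geometric mean of per-metric ratios. The sketch below applies that to the four Graph500 TEPS results above, comparing c7gn.16xlarge Graviton3E with c6g.16xlarge Graviton2. The pairing and the use of a geometric mean are assumptions made for illustration, not something the result page itself reports.

```python
import math

# (Graviton3E, Graviton2) TEPS pairs copied from the four Graph500 results.
graph500 = {
    "sssp max_TEPS": (411762000, 284689000),
    "sssp median_TEPS": (296164000, 209350000),
    "bfs max_TEPS": (1207760000, 874389000),
    "bfs median_TEPS": (1175640000, 860432000),
}


def geometric_mean(values: list[float]) -> float:
    """n-th root of the product, computed via logs for numerical stability."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# More-is-better metric: ratio = Graviton3E result / Graviton2 result.
ratios = [g3e / g2 for g3e, g2 in graph500.values()]
overall = geometric_mean(ratios)
print(f"Graviton3E vs. Graviton2 on Graph500: {overall:.2f}x")
```

The geometric mean is preferred over the arithmetic mean for ratios because it treats a 2x gain and a 2x loss symmetrically.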
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: 20k Atoms (ns/day, More Is Better)
m7g.16xlarge Graviton3: 36.927 (SE +/- 0.034, N = 3)
c7g.16xlarge Graviton3: 36.862 (SE +/- 0.025, N = 3)
c7gn.16xlarge Graviton3E: 36.838 (SE +/- 0.018, N = 3)
c6g.16xlarge Graviton2: 25.171 (SE +/- 0.009, N = 3)
c6a.16xlarge AMD Zen 3: 20.342 (SE +/- 0.066, N = 3)
egeo-07: 7.539 (SE +/- 0.006, N = 3)
Additional flags listed with the results: -lm -pthread -lm
1. (CXX) g++ options: -O3 -ldl
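The "SE +/- x, N = n" figures give a sense of run-to-run noise. As a rough sketch, assuming SE here is the standard error of the mean (sample standard deviation divided by the square root of N), the relative error and the implied standard deviation can be derived as follows. The helper names are hypothetical.

```python
import math

# (mean ns/day, SE, N) copied from the LAMMPS 20k Atoms result above.
lammps = {
    "m7g.16xlarge Graviton3": (36.927, 0.034, 3),
    "c7g.16xlarge Graviton3": (36.862, 0.025, 3),
    "c7gn.16xlarge Graviton3E": (36.838, 0.018, 3),
    "c6g.16xlarge Graviton2": (25.171, 0.009, 3),
    "c6a.16xlarge AMD Zen 3": (20.342, 0.066, 3),
    "egeo-07": (7.539, 0.006, 3),
}


def relative_se_percent(mean: float, se: float) -> float:
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def sample_std_dev(se: float, n: int) -> float:
    """Invert SE = s / sqrt(n) to recover the sample standard deviation s."""
    return se * math.sqrt(n)


for name, (mean, se, n) in lammps.items():
    print(f"{name}: {relative_se_percent(mean, se):.3f}% of mean, "
          f"s = {sample_std_dev(se, n):.3f}")
```

With relative errors well under one percent, the small gaps among the three Graviton3-family instances in this test are of the same order as the measurement noise, while the gap to Graviton2 is not.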
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation 21.2, Time To Compile (Seconds, Fewer Is Better)
m7g.16xlarge Graviton3: 180.25 (SE +/- 0.13, N = 3)
c7g.16xlarge Graviton3: 181.78 (SE +/- 0.26, N = 3)
c7gn.16xlarge Graviton3E: 182.47 (SE +/- 0.38, N = 3)
c6a.16xlarge AMD Zen 3: 192.12 (SE +/- 0.26, N = 3)
c6g.16xlarge Graviton2: 225.31 (SE +/- 0.35, N = 3)
egeo-07: 462.60 (SE +/- 9.98, N = 9)
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
BRL-CAD 7.34, VGR Performance Metric (More Is Better)
c7g.16xlarge Graviton3: 789066
m7g.16xlarge Graviton3: 783777
c7gn.16xlarge Graviton3E: 744743
c6g.16xlarge Graviton2: 533020
c6a.16xlarge AMD Zen 3: 485038
egeo-07: 102684
Additional flags listed with the results: -m64
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
Stockfish 15, Total Time (Nodes Per Second, More Is Better)
c7g.16xlarge Graviton3: 117316476 (SE +/- 2998209.87, N = 12)
c7gn.16xlarge Graviton3E: 117027121 (SE +/- 1531345.46, N = 15)
m7g.16xlarge Graviton3: 112119711 (SE +/- 2854071.93, N = 15)
c6a.16xlarge AMD Zen 3: 96905609 (SE +/- 1430593.84, N = 15)
c6g.16xlarge Graviton2: 86609284 (SE +/- 2597495.37, N = 15)
egeo-07: 26092344 (SE +/- 349749.32, N = 12)
Additional flags listed with the results: -m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
LeelaChessZero 0.28, Backend: BLAS (Nodes Per Second, More Is Better)
c7gn.16xlarge Graviton3E: 1392 (SE +/- 7.22, N = 3)
c7g.16xlarge Graviton3: 1333 (SE +/- 3.53, N = 3)
c6a.16xlarge AMD Zen 3: 1316 (SE +/- 13.29, N = 5)
m7g.16xlarge Graviton3: 1301 (SE +/- 4.67, N = 3)
c6g.16xlarge Graviton2: 947 (SE +/- 11.79, N = 3)
1. (CXX) g++ options: -flto -pthread
Backend: BLAS
egeo-07: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript runtime built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.
Timed Node.js Compilation 19.8.1, Time To Compile (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 230.42 (SE +/- 0.40, N = 3)
m7g.16xlarge Graviton3: 237.78 (SE +/- 0.33, N = 3)
c7g.16xlarge Graviton3: 238.54 (SE +/- 0.20, N = 3)
c7gn.16xlarge Graviton3E: 238.64 (SE +/- 0.32, N = 3)
c6g.16xlarge Graviton2: 287.81 (SE +/- 0.16, N = 3)
egeo-07: 641.87 (SE +/- 1.11, N = 3)
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
LeelaChessZero 0.28, Backend: Eigen (Nodes Per Second, More Is Better)
c7gn.16xlarge Graviton3E: 1444 (SE +/- 14.88, N = 3)
m7g.16xlarge Graviton3: 1398 (SE +/- 8.74, N = 3)
c7g.16xlarge Graviton3: 1382 (SE +/- 15.65, N = 3)
c6a.16xlarge AMD Zen 3: 1152 (SE +/- 7.37, N = 3)
c6g.16xlarge Graviton2: 891 (SE +/- 4.73, N = 3)
1. (CXX) g++ options: -flto -pthread
Backend: Eigen
egeo-07: The test quit with a non-zero exit status. E: ./lczero: line 4: ./lc0: No such file or directory
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16, Input: FeCO6_b3lyp_gms
Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 184.10 (SE +/- 1.03, N = 3)
  c7gn.16xlarge Graviton3E: 188.28 (SE +/- 0.29, N = 3)
  c7g.16xlarge Graviton3: 211.32 (SE +/- 0.19, N = 3)
  m7g.16xlarge Graviton3: 211.60 (SE +/- 0.22, N = 3)
  c6g.16xlarge Graviton2: 302.19 (SE +/- 0.37, N = 3)
  egeo-07: 608.86 (SE +/- 0.11, N = 3)
Per-system options shown on the chart: -march=native -mcpu=native -mcpu=native -mcpu=native -mcpu=native -march=native -pthread
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
QMCPACK 3.16, Input: FeCO6_b3lyp_gms
Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 187.32 (SE +/- 2.30, N = 3)
  c7gn.16xlarge Graviton3E: 204.25 (SE +/- 0.21, N = 3)
  c7g.16xlarge Graviton3: 204.77 (SE +/- 0.82, N = 3)
  m7g.16xlarge Graviton3: 205.72 (SE +/- 0.45, N = 3)
  c6g.16xlarge Graviton2: 297.94 (SE +/- 1.75, N = 3)
  egeo-07: 552.72 (SE +/- 7.86, N = 3)
Per-system options shown on the chart: -march=native -mcpu=native -mcpu=native -mcpu=native -mcpu=native -march=native -pthread
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
Timed Godot Game Engine Compilation This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine; it is built here using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.
Timed Godot Game Engine Compilation 4.0, Time To Compile
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 147.74 (SE +/- 0.12, N = 3)
  m7g.16xlarge Graviton3: 154.38 (SE +/- 0.32, N = 3)
  c7gn.16xlarge Graviton3E: 155.95 (SE +/- 0.45, N = 3)
  c7g.16xlarge Graviton3: 156.69 (SE +/- 0.63, N = 3)
  c6g.16xlarge Graviton2: 218.28 (SE +/- 0.30, N = 3)
  egeo-07: 391.74 (SE +/- 0.36, N = 3)
Monte Carlo Simulations of Ionised Nebulae Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
Monte Carlo Simulations of Ionised Nebulae 2.02.73.3, Input: Dust 2D tau100.0
Seconds, Fewer Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 82.67 (SE +/- 0.01, N = 3)
  c7g.16xlarge Graviton3: 82.82 (SE +/- 0.00, N = 3)
  c7gn.16xlarge Graviton3E: 82.97 (SE +/- 0.07, N = 3)
  c6g.16xlarge Graviton2: 145.37 (SE +/- 0.86, N = 3)
  c6a.16xlarge AMD Zen 3: 194.44 (SE +/- 1.84, N = 7)
  egeo-07: 281.25 (SE +/- 0.37, N = 3)
Per-system options shown on the chart: -pthread -ldl -lutil -lrt
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1, Algorithm: SHA256
byte/s, More Is Better (OpenBenchmarking.org)
  c7g.16xlarge Graviton3: 54216561263 (SE +/- 16491036.11, N = 3)
  m7g.16xlarge Graviton3: 54212515580 (SE +/- 18610524.10, N = 3)
  c7gn.16xlarge Graviton3E: 54154218593 (SE +/- 19542665.92, N = 3)
  c6a.16xlarge AMD Zen 3: 45857534777 (SE +/- 26770675.21, N = 3)
  c6g.16xlarge Graviton2: 42472798847 (SE +/- 245440310.03, N = 3)
  egeo-07: 3466925203 (SE +/- 404619.57, N = 3)
Per-system options shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: AES-128-GCM
byte/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 411130469943 (SE +/- 11273100.69, N = 3)
  c7g.16xlarge Graviton3: 332064349843 (SE +/- 12264074.61, N = 3)
  m7g.16xlarge Graviton3: 332033171900 (SE +/- 81289574.27, N = 3)
  c6g.16xlarge Graviton2: 158436163857 (SE +/- 9833681.11, N = 3)
  c6a.16xlarge AMD Zen 3: 151449269317 (SE +/- 4227452.23, N = 3)
  egeo-07: 61131320253 (SE +/- 11737066.92, N = 3)
Per-system options shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: ChaCha20
byte/s, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 138389378753 (SE +/- 36376378.52, N = 3)
  c7gn.16xlarge Graviton3E: 114118119423 (SE +/- 771581.87, N = 3)
  c7g.16xlarge Graviton3: 103275516997 (SE +/- 1725060.95, N = 3)
  m7g.16xlarge Graviton3: 103226784517 (SE +/- 1293723.80, N = 3)
  c6g.16xlarge Graviton2: 67292541203 (SE +/- 35952887.59, N = 3)
  egeo-07: 56213494627 (SE +/- 13595278.49, N = 3)
Per-system options shown on the chart: -m64 -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: ChaCha20-Poly1305
byte/s, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 92522999373 (SE +/- 232372675.93, N = 3)
  c7gn.16xlarge Graviton3E: 79969465487 (SE +/- 1769561.47, N = 3)
  c7g.16xlarge Graviton3: 74318842213 (SE +/- 1218886.42, N = 3)
  m7g.16xlarge Graviton3: 74287460990 (SE +/- 1340503.89, N = 3)
  c6g.16xlarge Graviton2: 46717636807 (SE +/- 1132293.08, N = 3)
  egeo-07: 26507041907 (SE +/- 1523000.86, N = 3)
Per-system options shown on the chart: -m64 -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: AES-256-GCM
byte/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 351152465420 (SE +/- 24279491.44, N = 3)
  c7g.16xlarge Graviton3: 283373795737 (SE +/- 33807617.40, N = 3)
  m7g.16xlarge Graviton3: 283333113630 (SE +/- 6411836.47, N = 3)
  c6a.16xlarge AMD Zen 3: 138457889450 (SE +/- 41584947.90, N = 3)
  c6g.16xlarge Graviton2: 129199593157 (SE +/- 2312792.64, N = 3)
  egeo-07: 44619128560 (SE +/- 2585526.42, N = 3)
Per-system options shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: SHA512
byte/s, More Is Better (OpenBenchmarking.org)
  c7g.16xlarge Graviton3: 32145914147 (SE +/- 4573992.60, N = 3)
  c7gn.16xlarge Graviton3E: 32126059040 (SE +/- 16155877.53, N = 3)
  m7g.16xlarge Graviton3: 32125448870 (SE +/- 17714077.14, N = 3)
  c6a.16xlarge AMD Zen 3: 15291283297 (SE +/- 207279.55, N = 3)
  c6g.16xlarge Graviton2: 14393925490 (SE +/- 9173912.49, N = 3)
  egeo-07: 3878566000 (SE +/- 1513929.31, N = 3)
Per-system options shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
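To give a feel for what a byte/s hashing figure measures, here is a minimal single-threaded sketch using Python's standard hashlib. It is not how "openssl speed" is implemented and its numbers are not comparable to the results above; the buffer size and the measurement duration are assumptions chosen only for illustration.

```python
import hashlib
import time

def sha256_throughput(buffer_size=16384, duration=0.2):
    """Hash a fixed buffer repeatedly and return bytes processed per second."""
    data = bytes(buffer_size)  # zero-filled buffer; content does not matter here
    processed = 0
    start = time.perf_counter()
    while time.perf_counter() - start < duration:
        hashlib.sha256(data).digest()
        processed += buffer_size
    elapsed = time.perf_counter() - start
    return processed / elapsed

rate = sha256_throughput()
print(f"SHA256: {rate:,.0f} byte/s (single thread)")
```

The results above were gathered on 64-thread systems, so a one-thread sketch like this will report a much lower figure on the same hardware.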
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16, Input: Li2_STO_ae
Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 112.61 (SE +/- 0.08, N = 3)
  c7g.16xlarge Graviton3: 112.64 (SE +/- 0.12, N = 3)
  c7gn.16xlarge Graviton3E: 113.20 (SE +/- 0.31, N = 3)
  c6a.16xlarge AMD Zen 3: 123.95 (SE +/- 0.13, N = 3)
  c6g.16xlarge Graviton2: 165.12 (SE +/- 1.13, N = 3)
  egeo-07: 408.45 (SE +/- 4.45, N = 3)
Per-system options shown on the chart: -mcpu=native -mcpu=native -mcpu=native -march=native -mcpu=native -march=native -pthread
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: CPU Cache
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 3892396.34 (SE +/- 57217.78, N = 15)
  c7gn.16xlarge Graviton3E: 3860335.38 (SE +/- 40698.46, N = 15)
  c7g.16xlarge Graviton3: 3844101.98 (SE +/- 59376.56, N = 15)
  c6g.16xlarge Graviton2: 1921785.20 (SE +/- 21905.72, N = 15)
  egeo-07: 1525522.46 (SE +/- 22640.51, N = 15)
  c6a.16xlarge AMD Zen 3: 1447265.35 (SE +/- 30785.49, N = 12)
Per-system options shown on the chart: -laio -lbsd -lEGL -lGLESv2 -lmd
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1, Test: Sedov Blast Wave, ube_922_hex.mesh
Major Kernels Total Rate, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 423.11 (SE +/- 0.79, N = 3)
  m7g.16xlarge Graviton3: 410.55 (SE +/- 0.42, N = 3)
  c7g.16xlarge Graviton3: 408.01 (SE +/- 0.89, N = 3)
  c6g.16xlarge Graviton2: 322.37 (SE +/- 0.89, N = 3)
  c6a.16xlarge AMD Zen 3: 275.92 (SE +/- 0.48, N = 3)
  egeo-07: 70.77 (SE +/- 0.27, N = 3)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
nekRS nekRS is an open-source Navier-Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
nekRS 23.0, Input: TurboPipe Periodic
flops/rank, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 4337536667 (SE +/- 12801180.07, N = 3)
  c7gn.16xlarge Graviton3E: 4141440000 (SE +/- 1394740.12, N = 3)
  c7g.16xlarge Graviton3: 3978983333 (SE +/- 169148.19, N = 3)
  m7g.16xlarge Graviton3: 3976300000 (SE +/- 1199180.28, N = 3)
  c6g.16xlarge Graviton2: 2220190000 (SE +/- 144222.05, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
Input: TurboPipe Periodic
egeo-07: The test quit with a non-zero exit status. E: [egeo-07.qteorica.unal.edu.co:290233] PMIX ERROR: UNREACHABLE in file ../../../src/server/pmix_server.c at line 2795
ACES DGEMM This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.
ACES DGEMM 1.0, Sustained Floating-Point Rate
GFLOP/s, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 24.362353 (SE +/- 0.171001, N = 13)
  c7g.16xlarge Graviton3: 24.140605 (SE +/- 0.285590, N = 4)
  c7gn.16xlarge Graviton3E: 24.078529 (SE +/- 0.297525, N = 4)
  c6g.16xlarge Graviton2: 20.417952 (SE +/- 0.154503, N = 3)
  c6a.16xlarge AMD Zen 3: 9.388050 (SE +/- 0.038051, N = 3)
  egeo-07: 2.094206 (SE +/- 0.035680, N = 15)
1. (CC) gcc options: -O3 -march=native -fopenmp
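A GFLOP/s figure for a matrix multiply is usually derived by dividing the operation count by the elapsed time. The sketch below assumes the common convention of 2*n^3 floating-point operations for an n x n multiply and uses a naive single-threaded pure-Python loop; it is not the ACES DGEMM source, and the matrix size is an arbitrary small value picked so it finishes quickly.

```python
import time

def dgemm_gflops(n=64):
    """Time a naive n x n matrix multiply and convert the rate to GFLOP/s."""
    a = [[1.0] * n for _ in range(n)]
    b = [[2.0] * n for _ in range(n)]
    c = [[0.0] * n for _ in range(n)]
    start = time.perf_counter()
    for i in range(n):
        row_a, row_c = a[i], c[i]
        for k in range(n):
            aik, row_b = row_a[k], b[k]
            for j in range(n):
                row_c[j] += aik * row_b[j]
    elapsed = time.perf_counter() - start
    flops = 2.0 * n ** 3  # one multiply and one add per inner-loop step
    return flops / elapsed / 1e9, c

gflops, c = dgemm_gflops()
print(f"{gflops:.4f} GFLOP/s (naive, single thread)")
```

An interpreted loop like this will land orders of magnitude below the compiled, multi-threaded figures above; the point is only the flops-over-time arithmetic.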
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4, Test / Class: EP.D
Total Mop/s, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 3738.98 (SE +/- 1.69, N = 3)
  c7g.16xlarge Graviton3: 3664.54 (SE +/- 34.07, N = 15)
  c7gn.16xlarge Graviton3E: 3657.67 (SE +/- 32.06, N = 15)
  c6a.16xlarge AMD Zen 3: 3061.42 (SE +/- 4.77, N = 3)
  c6g.16xlarge Graviton2: 2216.26 (SE +/- 2.22, N = 3)
  egeo-07: 1329.20 (SE +/- 0.38, N = 3)
Per-system options shown on the chart: -pthread -ldl -lutil -lrt
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. c6a.16xlarge AMD Zen 3: Open MPI 4.1.2
3. egeo-07: Open MPI 4.1.0
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
GPAW 23.6, Input: Carbon Nanotube
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 56.44 (SE +/- 0.04, N = 3)
  m7g.16xlarge Graviton3: 61.83 (SE +/- 0.03, N = 3)
  c7g.16xlarge Graviton3: 62.08 (SE +/- 0.04, N = 3)
  c6a.16xlarge AMD Zen 3: 89.82 (SE +/- 0.13, N = 3)
  c6g.16xlarge Graviton2: 92.76 (SE +/- 0.02, N = 3)
  egeo-07: 257.46 (SE +/- 0.19, N = 3)
Per-system options shown on the chart: -pthread
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
nekRS nekRS is an open-source Navier-Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
nekRS 23.0, Input: Kershaw
flops/rank, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 4308810000 (SE +/- 22342148.51, N = 3)
  c7gn.16xlarge Graviton3E: 3302823333 (SE +/- 5414395.42, N = 3)
  c7g.16xlarge Graviton3: 3261853333 (SE +/- 2490845.46, N = 3)
  m7g.16xlarge Graviton3: 3150680000 (SE +/- 1575066.14, N = 3)
  c6g.16xlarge Graviton2: 1760336667 (SE +/- 737119.02, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
Input: Kershaw
egeo-07: The test quit with a non-zero exit status. E: [egeo-07.qteorica.unal.edu.co:290025] PMIX ERROR: UNREACHABLE in file ../../../src/server/pmix_server.c at line 2795
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4, Test / Class: SP.C
Total Mop/s, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 34025.35 (SE +/- 20.85, N = 3)
  m7g.16xlarge Graviton3: 17244.85 (SE +/- 10.19, N = 3)
  c7g.16xlarge Graviton3: 17219.95 (SE +/- 7.21, N = 3)
  c7gn.16xlarge Graviton3E: 17163.11 (SE +/- 31.31, N = 3)
  egeo-07: 11726.55 (SE +/- 15.52, N = 3)
  c6g.16xlarge Graviton2: 9711.70 (SE +/- 1.54, N = 3)
Per-system options shown on the chart: -pthread -ldl -lutil -lrt
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. c6a.16xlarge AMD Zen 3: Open MPI 4.1.2
3. egeo-07: Open MPI 4.1.0
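Because some results here are "More Is Better" and others "Fewer Is Better", comparing systems is easier after normalizing against a baseline. The sketch below is one possible way to do that; the helper name is hypothetical, the choice of c6g.16xlarge as baseline is arbitrary, and the SP.C values are copied from the result above on the assumption that they are listed in the same order as the system names.

```python
def relative_to_baseline(results, baseline, more_is_better=True):
    """Return each system's result as a multiple of the baseline's performance."""
    base = results[baseline]
    if more_is_better:
        return {name: value / base for name, value in results.items()}
    # For timings, a smaller value is faster, so invert the ratio.
    return {name: base / value for name, value in results.items()}

# NAS Parallel Benchmarks SP.C, Total Mop/s (more is better).
sp_c = {
    "c6a.16xlarge AMD Zen 3": 34025.35,
    "m7g.16xlarge Graviton3": 17244.85,
    "c7g.16xlarge Graviton3": 17219.95,
    "c7gn.16xlarge Graviton3E": 17163.11,
    "egeo-07": 11726.55,
    "c6g.16xlarge Graviton2": 9711.70,
}

ratios = relative_to_baseline(sp_c, "c6g.16xlarge Graviton2")
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

For a "Fewer Is Better" result such as a compile time, passing more_is_better=False keeps the convention that a ratio above 1.0 means faster than the baseline.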
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
nginx 1.23.2, Connections: 1000
Requests Per Second, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 256585.83 (SE +/- 402.16, N = 3)
  m7g.16xlarge Graviton3: 255616.04 (SE +/- 137.20, N = 3)
  c7g.16xlarge Graviton3: 255552.05 (SE +/- 55.97, N = 3)
  c6a.16xlarge AMD Zen 3: 163178.67 (SE +/- 136.82, N = 3)
  c6g.16xlarge Graviton2: 158676.40 (SE +/- 185.79, N = 3)
  egeo-07: 70348.77 (SE +/- 141.39, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx 1.23.2, Connections: 500
Requests Per Second, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 255768.44 (SE +/- 323.56, N = 3)
  c7g.16xlarge Graviton3: 255145.52 (SE +/- 243.69, N = 3)
  c7gn.16xlarge Graviton3E: 253518.51 (SE +/- 317.05, N = 3)
  c6a.16xlarge AMD Zen 3: 165847.75 (SE +/- 60.38, N = 3)
  c6g.16xlarge Graviton2: 148964.69 (SE +/- 90.87, N = 3)
  egeo-07: 73654.19 (SE +/- 47.73, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: Wide Vector Math
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 1542834.94 (SE +/- 16116.93, N = 15)
  c7g.16xlarge Graviton3: 1535336.57 (SE +/- 16521.46, N = 15)
  c7gn.16xlarge Graviton3E: 1530043.52 (SE +/- 16444.95, N = 15)
  c6a.16xlarge AMD Zen 3: 1380146.63 (SE +/- 2507.18, N = 3)
  c6g.16xlarge Graviton2: 997272.65 (SE +/- 505.84, N = 3)
  egeo-07: 399851.85 (SE +/- 641.34, N = 3)
Per-system options shown on the chart: -laio -lbsd -lEGL -lGLESv2 -lmd
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1, Test: Triple Point Problem
Major Kernels Total Rate, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 236.22 (SE +/- 0.27, N = 3)
  m7g.16xlarge Graviton3: 232.01 (SE +/- 0.28, N = 3)
  c7g.16xlarge Graviton3: 230.68 (SE +/- 0.16, N = 3)
  c6a.16xlarge AMD Zen 3: 227.40 (SE +/- 1.06, N = 3)
  c6g.16xlarge Graviton2: 180.80 (SE +/- 0.48, N = 3)
  egeo-07: 61.58 (SE +/- 0.71, N = 4)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenMP LavaMD
Seconds, Fewer Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 43.79 (SE +/- 0.15, N = 3)
  c7g.16xlarge Graviton3: 43.96 (SE +/- 0.11, N = 3)
  c7gn.16xlarge Graviton3E: 44.04 (SE +/- 0.15, N = 3)
  c6g.16xlarge Graviton2: 62.22 (SE +/- 0.04, N = 3)
  c6a.16xlarge AMD Zen 3: 64.18 (SE +/- 0.53, N = 3)
  egeo-07: 212.55 (SE +/- 0.02, N = 3)
Per-system options shown on the chart: -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
1. (CXX) g++ options:
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512
GFLOP/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 46.53 (SE +/- 0.01, N = 3)
  c7g.16xlarge Graviton3: 46.37 (SE +/- 0.01, N = 3)
  m7g.16xlarge Graviton3: 46.25 (SE +/- 0.01, N = 3)
  c6g.16xlarge Graviton2: 24.27 (SE +/- 0.01, N = 3)
  c6a.16xlarge AMD Zen 3: 23.52 (SE +/- 0.05, N = 3)
  egeo-07: 10.67 (SE +/- 0.02, N = 3)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3
GROMACS This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare
Ns Per Day, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 4.820 (SE +/- 0.003, N = 3)
  m7g.16xlarge Graviton3: 4.223 (SE +/- 0.003, N = 3)
  c7g.16xlarge Graviton3: 4.200 (SE +/- 0.004, N = 3)
  c6a.16xlarge AMD Zen 3: 3.965 (SE +/- 0.013, N = 3)
  c6g.16xlarge Graviton2: 2.767 (SE +/- 0.002, N = 3)
  egeo-07: 1.116 (SE +/- 0.002, N = 3)
Per-system options shown on the chart: -lm
1. (CXX) g++ options: -O3
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
NAS Parallel Benchmarks 3.4, Test / Class: LU.C
Total Mop/s, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 95221.40 (SE +/- 90.22, N = 3)
  egeo-07: 36556.34 (SE +/- 21.23, N = 3)
  c7g.16xlarge Graviton3: 28375.71 (SE +/- 36.09, N = 3)
  c7gn.16xlarge Graviton3E: 28369.11 (SE +/- 43.73, N = 3)
  m7g.16xlarge Graviton3: 28341.68 (SE +/- 48.62, N = 3)
  c6g.16xlarge Graviton2: 18741.90 (SE +/- 26.12, N = 3)
Per-system options shown on the chart: -pthread -ldl -lutil -lrt
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. c6a.16xlarge AMD Zen 3: Open MPI 4.1.2
3. egeo-07: Open MPI 4.1.0
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1, Algorithm: RSA4096
verify/s, More Is Better (OpenBenchmarking.org)
  c7g.16xlarge Graviton3: 713945.9 (SE +/- 12.03, N = 3)
  m7g.16xlarge Graviton3: 713859.5 (SE +/- 21.82, N = 3)
  c7gn.16xlarge Graviton3E: 713754.8 (SE +/- 198.10, N = 3)
  c6a.16xlarge AMD Zen 3: 548396.5 (SE +/- 34.73, N = 3)
  c6g.16xlarge Graviton2: 214040.9 (SE +/- 88.30, N = 3)
  egeo-07: 200342.7 (SE +/- 170.27, N = 3)
Per-system options shown on the chart: -m64 -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: RSA4096
sign/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 10183.3 (SE +/- 0.84, N = 3)
  m7g.16xlarge Graviton3: 10181.9 (SE +/- 1.27, N = 3)
  c7g.16xlarge Graviton3: 10181.4 (SE +/- 1.54, N = 3)
  c6a.16xlarge AMD Zen 3: 8392.4 (SE +/- 3.06, N = 3)
  egeo-07: 3041.8 (SE +/- 5.58, N = 3)
  c6g.16xlarge Graviton2: 2624.3 (SE +/- 1.71, N = 3)
Per-system options shown on the chart: -m64 -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Coremark This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
Coremark 1.0, CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 1611801.56 (SE +/- 14869.41, N = 7)
  c7g.16xlarge Graviton3: 1605948.67 (SE +/- 13274.76, N = 15)
  m7g.16xlarge Graviton3: 1601880.34 (SE +/- 11449.37, N = 15)
  c6a.16xlarge AMD Zen 3: 1466587.04 (SE +/- 6710.50, N = 3)
  c6g.16xlarge Graviton2: 1260642.18 (SE +/- 153.60, N = 3)
  egeo-07: 362563.29 (SE +/- 4039.35, N = 3)
1. (CC) gcc options: -O2 -lrt -lrt
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenMP Streamcluster
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 8.396 (SE +/- 0.101, N = 15)
  c7gn.16xlarge Graviton3E: 10.690 (SE +/- 0.233, N = 12)
  c7g.16xlarge Graviton3: 11.625 (SE +/- 0.099, N = 8)
  m7g.16xlarge Graviton3: 11.663 (SE +/- 0.138, N = 3)
  c6g.16xlarge Graviton2: 13.735 (SE +/- 0.211, N = 15)
  egeo-07: 23.247 (SE +/- 0.397, N = 15)
Per-system options shown on the chart: -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -O2 -lOpenCL -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
1. (CXX) g++ options:
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: NUMA
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 3759.10 (SE +/- 5.17, N = 3)
  c7gn.16xlarge Graviton3E: 3525.17 (SE +/- 7.31, N = 3)
  c7g.16xlarge Graviton3: 3523.58 (SE +/- 3.39, N = 3)
  c6g.16xlarge Graviton2: 2112.66 (SE +/- 1.53, N = 3)
  c6a.16xlarge AMD Zen 3: 552.68 (SE +/- 9.75, N = 15)
  egeo-07: 0.89 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16, Input: simple-H2O
Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 26.87 (SE +/- 0.08, N = 3)
  c7g.16xlarge Graviton3: 27.99 (SE +/- 0.02, N = 3)
  c7gn.16xlarge Graviton3E: 28.00 (SE +/- 0.04, N = 3)
  m7g.16xlarge Graviton3: 28.04 (SE +/- 0.03, N = 3)
  c6g.16xlarge Graviton2: 45.23 (SE +/- 0.24, N = 3)
  egeo-07: 77.52 (SE +/- 0.86, N = 5)
Per-system options shown on the chart: -march=native -mcpu=native -mcpu=native -mcpu=native -mcpu=native -march=native -pthread
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
srsRAN Project 23.5, Test: PUSCH Processor Benchmark, Throughput Total
Mbps, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 6479.1 (SE +/- 21.76, N = 3)
  c7gn.16xlarge Graviton3E: 5431.2 (SE +/- 3.32, N = 3)
  m7g.16xlarge Graviton3: 5413.8 (SE +/- 4.08, N = 3)
  c7g.16xlarge Graviton3: 5356.8 (SE +/- 1.80, N = 3)
  c6g.16xlarge Graviton2: 3938.7 (SE +/- 2.53, N = 3)
  egeo-07: 1129.5 (SE +/- 6.96, N = 3)
Per-system options shown on the chart: -march=native -mfma -march=native -mfma -lpthread
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: Vector Floating Point
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 96529.51 (SE +/- 864.23, N = 13)
  c7gn.16xlarge Graviton3E: 76911.74 (SE +/- 1.74, N = 3)
  c7g.16xlarge Graviton3: 76178.46 (SE +/- 71.97, N = 3)
  m7g.16xlarge Graviton3: 76102.55 (SE +/- 190.19, N = 3)
  c6g.16xlarge Graviton2: 42850.82 (SE +/- 31.31, N = 3)
  egeo-07: 21347.32 (SE +/- 160.53, N = 3)
Per-system options shown on the chart: -laio -lbsd -lEGL -lGLESv2 -lmd
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
srsRAN Project 23.5, Test: Downlink Processor Benchmark
Mbps, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 691.3 (SE +/- 1.26, N = 3)
  egeo-07: 342.3 (SE +/- 4.00, N = 4)
  c7gn.16xlarge Graviton3E: 323.2 (SE +/- 0.06, N = 3)
  c7g.16xlarge Graviton3: 319.7 (SE +/- 0.95, N = 3)
  m7g.16xlarge Graviton3: 318.5 (SE +/- 0.91, N = 3)
  c6g.16xlarge Graviton2: 197.2 (SE +/- 0.25, N = 3)
Per-system options shown on the chart: -march=native -mfma -march=native -mfma -lpthread
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
srsRAN Project 23.5, Test: PUSCH Processor Benchmark, Throughput Thread
Mbps, More Is Better (OpenBenchmarking.org)
  c6a.16xlarge AMD Zen 3: 215.9 (SE +/- 0.55, N = 3)
  c7gn.16xlarge Graviton3E: 97.4 (SE +/- 0.06, N = 3)
  m7g.16xlarge Graviton3: 95.8 (SE +/- 0.03, N = 3)
  c7g.16xlarge Graviton3: 95.7 (SE +/- 0.03, N = 3)
  egeo-07: 88.5 (SE +/- 0.68, N = 10)
  c6g.16xlarge Graviton2: 63.8 (SE +/- 0.03, N = 3)
Per-system options shown on the chart: -march=native -mfma -march=native -mfma -lpthread
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512
GFLOP/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 85.01 (SE +/- 0.02, N = 3)
  c7g.16xlarge Graviton3: 84.75 (SE +/- 0.01, N = 3)
  m7g.16xlarge Graviton3: 84.47 (SE +/- 0.02, N = 3)
  c6g.16xlarge Graviton2: 44.93 (SE +/- 0.03, N = 3)
  c6a.16xlarge AMD Zen 3: 42.44 (SE +/- 0.05, N = 3)
  egeo-07: 18.99 (SE +/- 0.01, N = 3)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512
GFLOP/s, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 88.46 (SE +/- 0.02, N = 3)
  c7g.16xlarge Graviton3: 88.18 (SE +/- 0.05, N = 3)
  m7g.16xlarge Graviton3: 88.05 (SE +/- 0.02, N = 3)
  c6a.16xlarge AMD Zen 3: 44.32 (SE +/- 0.14, N = 3)
  c6g.16xlarge Graviton2: 42.83 (SE +/- 0.01, N = 3)
  egeo-07: 19.67 (SE +/- 0.02, N = 3)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
Kripke 1.2.6
Throughput FoM, More Is Better (OpenBenchmarking.org)
  c7g.16xlarge Graviton3: 354442733 (SE +/- 525406.56, N = 3)
  c7gn.16xlarge Graviton3E: 354234067 (SE +/- 445212.18, N = 3)
  m7g.16xlarge Graviton3: 339000400 (SE +/- 619419.33, N = 3)
  c6a.16xlarge AMD Zen 3: 237087650 (SE +/- 2932840.19, N = 4)
  c6g.16xlarge Graviton2: 220120233 (SE +/- 102787.75, N = 3)
  egeo-07: 109107233 (SE +/- 523405.33, N = 3)
Per-system options shown on the chart: -pthread
1. (CXX) g++ options: -O3 -fopenmp -ldl
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
7-Zip Compression 22.01, Test: Decompression Rating
MIPS, More Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 285677 (SE +/- 54.90, N = 3)
  c7g.16xlarge Graviton3: 285633 (SE +/- 146.43, N = 3)
  m7g.16xlarge Graviton3: 285540 (SE +/- 93.51, N = 3)
  c6a.16xlarge AMD Zen 3: 235787 (SE +/- 1190.65, N = 3)
  c6g.16xlarge Graviton2: 234202 (SE +/- 15.43, N = 3)
  egeo-07: 59055 (SE +/- 265.06, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression 22.01, Test: Compression Rating
MIPS, More Is Better (OpenBenchmarking.org)
  m7g.16xlarge Graviton3: 316825 (SE +/- 154.72, N = 3)
  c7gn.16xlarge Graviton3E: 312009 (SE +/- 308.14, N = 3)
  c7g.16xlarge Graviton3: 311056 (SE +/- 72.90, N = 3)
  c6g.16xlarge Graviton2: 240702 (SE +/- 209.44, N = 3)
  c6a.16xlarge AMD Zen 3: 230970 (SE +/- 670.46, N = 3)
  egeo-07: 75840 (SE +/- 414.62, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7gn.16xlarge Graviton3E: 13.76 (SE +/- 0.05, N = 3)
  c7g.16xlarge Graviton3: 13.83 (SE +/- 0.09, N = 3)
  m7g.16xlarge Graviton3: 13.95 (SE +/- 0.02, N = 3)
  c6g.16xlarge Graviton2: 25.88 (SE +/- 0.03, N = 3)
  c6a.16xlarge AMD Zen 3: 30.31 (SE +/- 0.28, N = 3)
  egeo-07: 84.66 (SE +/- 0.03, N = 3)
Per-system options shown on the chart: -pthread -ldl -lutil -lrt
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Liquid-DSP
LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage.
Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 460076667 (SE +/- 392527.42, N = 3)
  c7g.16xlarge Graviton3: 162766667 (SE +/- 3333.33, N = 3)
  c7gn.16xlarge Graviton3E: 162756667 (SE +/- 8819.17, N = 3)
  m7g.16xlarge Graviton3: 162753333 (SE +/- 6666.67, N = 3)
  c6g.16xlarge Graviton2: 134926667 (SE +/- 3333.33, N = 3)
  egeo-07: 129560000 (SE +/- 92915.73, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 274803333 (SE +/- 193419.52, N = 3)
  egeo-07: 128250000 (SE +/- 120554.28, N = 3)
  c7g.16xlarge Graviton3: 81412000 (SE +/- 1000.00, N = 3)
  m7g.16xlarge Graviton3: 81396667 (SE +/- 1855.92, N = 3)
  c7gn.16xlarge Graviton3E: 81394000 (SE +/- 577.35, N = 3)
  c6g.16xlarge Graviton2: 67486333 (SE +/- 333.33, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
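The two Liquid-DSP charts above share the same buffer and filter length and differ only in thread count, so they can be compared directly to see how each system scales from 32 to 64 threads. This is a minimal sketch rather than anything from the result file: the dictionary and function names are hypothetical, and the values are copied from the charts on the assumption that they are listed in the same order as the system labels.

```python
# Liquid-DSP 1.6, buffer length 256, filter length 512 (samples/s),
# copied from the two charts above.
samples_32_threads = {
    "c6a.16xlarge AMD Zen 3": 274803333,
    "egeo-07": 128250000,
    "c7g.16xlarge Graviton3": 81412000,
    "m7g.16xlarge Graviton3": 81396667,
    "c7gn.16xlarge Graviton3E": 81394000,
    "c6g.16xlarge Graviton2": 67486333,
}
samples_64_threads = {
    "c6a.16xlarge AMD Zen 3": 460076667,
    "c7g.16xlarge Graviton3": 162766667,
    "c7gn.16xlarge Graviton3E": 162756667,
    "m7g.16xlarge Graviton3": 162753333,
    "c6g.16xlarge Graviton2": 134926667,
    "egeo-07": 129560000,
}


def thread_scaling(low, high):
    """Ratio of the higher-thread-count result to the lower one, per system."""
    return {name: high[name] / low[name] for name in low}


scaling = thread_scaling(samples_32_threads, samples_64_threads)
for name, factor in scaling.items():
    print(f"{name}: {factor:.2f}x from 32 to 64 threads")
```

A factor near 2.0 indicates near-linear scaling when the thread count doubles. According to the system details, egeo-07 exposes 32 hardware threads, so little gain from 64 software threads would be expected there.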
Stress-NG
Stress-NG is a Linux stress tool developed by Colin Ian King.
Stress-NG 0.15.10, Test: Fused Multiply-Add (Bogo Ops/s, More Is Better)
  c7g.16xlarge Graviton3: 63818458.61 (SE +/- 4431.60, N = 3)
  m7g.16xlarge Graviton3: 63762252.76 (SE +/- 4870.19, N = 3)
  c7gn.16xlarge Graviton3E: 63723431.55 (SE +/- 10061.51, N = 3)
  c6g.16xlarge Graviton2: 37732190.54 (SE +/- 3687.67, N = 3)
  c6a.16xlarge AMD Zen 3: 30920910.92 (SE +/- 32747.05, N = 3)
  egeo-07: 10084119.21 (SE +/- 16948.60, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stress-NG 0.15.10, Test: Vector Shuffle (Bogo Ops/s, More Is Better)
  c7gn.16xlarge Graviton3E: 54695.04 (SE +/- 294.96, N = 3)
  c7g.16xlarge Graviton3: 54472.07 (SE +/- 139.03, N = 3)
  m7g.16xlarge Graviton3: 54143.40 (SE +/- 21.44, N = 3)
  c6g.16xlarge Graviton2: 35614.51 (SE +/- 74.80, N = 3)
  c6a.16xlarge AMD Zen 3: 22255.84 (SE +/- 0.50, N = 3)
  egeo-07: 7162.56 (SE +/- 0.40, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Liquid-DSP
LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage.
Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
  c7g.16xlarge Graviton3: 2271966667 (SE +/- 284800.12, N = 3)
  m7g.16xlarge Graviton3: 2270500000 (SE +/- 435889.89, N = 3)
  c7gn.16xlarge Graviton3E: 2266833333 (SE +/- 2915666.50, N = 3)
  c6a.16xlarge AMD Zen 3: 2184866667 (SE +/- 218581.28, N = 3)
  c6g.16xlarge Graviton2: 1531400000 (SE +/- 251661.15, N = 3)
  egeo-07: 641243333 (SE +/- 707515.21, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 1710800000 (SE +/- 1014889.16, N = 3)
  c7gn.16xlarge Graviton3E: 1442666667 (SE +/- 88191.71, N = 3)
  m7g.16xlarge Graviton3: 1442400000 (SE +/- 152752.52, N = 3)
  c7g.16xlarge Graviton3: 1442366667 (SE +/- 284800.12, N = 3)
  c6g.16xlarge Graviton2: 978200000 (SE +/- 11547.01, N = 3)
  egeo-07: 477896667 (SE +/- 851162.60, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Stress-NG
Stress-NG is a Linux stress tool developed by Colin Ian King.
Stress-NG 0.15.10, Test: Matrix 3D Math (Bogo Ops/s, More Is Better)
  c7gn.16xlarge Graviton3E: 10882.02 (SE +/- 19.16, N = 3)
  c7g.16xlarge Graviton3: 10813.59 (SE +/- 9.35, N = 3)
  m7g.16xlarge Graviton3: 10403.93 (SE +/- 6.38, N = 3)
  c6g.16xlarge Graviton2: 5752.17 (SE +/- 1.40, N = 3)
  c6a.16xlarge AMD Zen 3: 4571.96 (SE +/- 1.96, N = 3)
  egeo-07: 1841.58 (SE +/- 9.17, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
  Extra flags listed in the chart: -laio -lbsd -lEGL -lGLESv2 -lmd
Stress-NG 0.15.10, Test: Matrix Math (Bogo Ops/s, More Is Better)
  c7gn.16xlarge Graviton3E: 369258.89 (SE +/- 28.60, N = 3)
  m7g.16xlarge Graviton3: 368750.67 (SE +/- 53.44, N = 3)
  c7g.16xlarge Graviton3: 368671.39 (SE +/- 38.76, N = 3)
  c6g.16xlarge Graviton2: 284713.63 (SE +/- 8.13, N = 3)
  c6a.16xlarge AMD Zen 3: 147576.41 (SE +/- 167.77, N = 3)
  egeo-07: 55962.00 (SE +/- 6.88, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stress-NG 0.15.10, Test: Memory Copying (Bogo Ops/s, More Is Better)
  m7g.16xlarge Graviton3: 20484.24 (SE +/- 3.80, N = 3)
  c7g.16xlarge Graviton3: 20478.67 (SE +/- 4.65, N = 3)
  c7gn.16xlarge Graviton3E: 20475.96 (SE +/- 1.36, N = 3)
  c6g.16xlarge Graviton2: 11324.79 (SE +/- 1.12, N = 3)
  c6a.16xlarge AMD Zen 3: 8080.43 (SE +/- 0.46, N = 3)
  egeo-07: 3209.83 (SE +/- 1.93, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
  Extra flags listed in the chart: -laio -lbsd -lEGL -lGLESv2 -lmd
Liquid-DSP
LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage.
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 32 (samples/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 1193966667 (SE +/- 578311.72, N = 3)
  c7g.16xlarge Graviton3: 1136133333 (SE +/- 33333.33, N = 3)
  m7g.16xlarge Graviton3: 1136066667 (SE +/- 233333.33, N = 3)
  c7gn.16xlarge Graviton3E: 1136000000 (SE +/- 57735.03, N = 3)
  c6g.16xlarge Graviton2: 765466667 (SE +/- 456520.66, N = 3)
  egeo-07: 633483333 (SE +/- 571382.34, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 1444266667 (SE +/- 9533333.33, N = 3)
  m7g.16xlarge Graviton3: 721493333 (SE +/- 3333.33, N = 3)
  c7g.16xlarge Graviton3: 721386667 (SE +/- 168358.08, N = 3)
  c7gn.16xlarge Graviton3E: 721380000 (SE +/- 150111.07, N = 3)
  c6g.16xlarge Graviton2: 489270000 (SE +/- 23094.01, N = 3)
  egeo-07: 429596667 (SE +/- 846666.67, N = 3)
  1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Stress-NG
Stress-NG is a Linux stress tool developed by Colin Ian King.
Stress-NG 0.15.10, Test: Vector Math (Bogo Ops/s, More Is Better)
  c6a.16xlarge AMD Zen 3: 221776.15 (SE +/- 100.78, N = 3)
  c7gn.16xlarge Graviton3E: 217567.10 (SE +/- 27.00, N = 3)
  c7g.16xlarge Graviton3: 217446.12 (SE +/- 20.95, N = 3)
  m7g.16xlarge Graviton3: 217235.59 (SE +/- 47.94, N = 3)
  c6g.16xlarge Graviton2: 147886.14 (SE +/- 37.96, N = 3)
  egeo-07: 38515.89 (SE +/- 9.74, N = 3)
  1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Remhos
Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations.
Remhos 1.0, Test: Sample Remap Example (Seconds, Fewer Is Better)
  m7g.16xlarge Graviton3: 14.04 (SE +/- 0.04, N = 3)
  c7gn.16xlarge Graviton3E: 14.08 (SE +/- 0.02, N = 3)
  c7g.16xlarge Graviton3: 14.12 (SE +/- 0.04, N = 3)
  c6g.16xlarge Graviton2: 20.74 (SE +/- 0.08, N = 3)
  c6a.16xlarge AMD Zen 3: 22.10 (SE +/- 0.11, N = 3)
  egeo-07: 68.96 (SE +/- 0.44, N = 3)
  1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
  Extra flags listed in the chart: -pthread
Pennant
Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D.
Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  m7g.16xlarge Graviton3: 9.206490 (SE +/- 0.011347, N = 3)
  c7gn.16xlarge Graviton3E: 9.340953 (SE +/- 0.003721, N = 3)
  c7g.16xlarge Graviton3: 9.422270 (SE +/- 0.011497, N = 3)
  c6g.16xlarge Graviton2: 16.480500 (SE +/- 0.018218, N = 3)
  c6a.16xlarge AMD Zen 3: 16.530500 (SE +/- 0.036687, N = 3)
  egeo-07: 89.004900 (SE +/- 0.050055, N = 3)
  1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
  Extra flags listed in the chart: -pthread
Algebraic Multi-Grid Benchmark
AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems.
Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
  c7gn.16xlarge Graviton3E: 1765966333 (SE +/- 488508.39, N = 3)
  c7g.16xlarge Graviton3: 1765277667 (SE +/- 192645.90, N = 3)
  m7g.16xlarge Graviton3: 1646761667 (SE +/- 103191.30, N = 3)
  c6g.16xlarge Graviton2: 1035586333 (SE +/- 140169.34, N = 3)
  c6a.16xlarge AMD Zen 3: 836999300 (SE +/- 1055539.30, N = 3)
  egeo-07: 444456133 (SE +/- 394420.25, N = 3)
  1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
  Extra flags listed in the chart: -pthread
HeFFTe - Highly Efficient FFT for Exascale
HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better)
  c7gn.16xlarge Graviton3E: 163.56 (SE +/- 0.03, N = 3)
  c7g.16xlarge Graviton3: 163.28 (SE +/- 0.05, N = 3)
  m7g.16xlarge Graviton3: 162.96 (SE +/- 0.13, N = 3)
  c6a.16xlarge AMD Zen 3: 82.76 (SE +/- 0.08, N = 3)
  c6g.16xlarge Graviton2: 81.94 (SE +/- 0.03, N = 3)
  egeo-07: 34.37 (SE +/- 0.05, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
Monte Carlo Simulations of Ionised Nebulae
Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution.
Monte Carlo Simulations of Ionised Nebulae 2.02.73.3, Input: Gas HII40 (Seconds, Fewer Is Better)
  c6a.16xlarge AMD Zen 3: 12.67 (SE +/- 0.02, N = 3)
  c7gn.16xlarge Graviton3E: 13.53 (SE +/- 0.03, N = 3)
  m7g.16xlarge Graviton3: 13.58 (SE +/- 0.05, N = 3)
  c7g.16xlarge Graviton3: 13.66 (SE +/- 0.03, N = 3)
  c6g.16xlarge Graviton2: 20.76 (SE +/- 0.17, N = 3)
  egeo-07: 26.92 (SE +/- 0.04, N = 3)
  1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
  Extra flags listed in the chart: -pthread -ldl -lutil -lrt
LULESH
LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics.
LULESH 2.0.3 (z/s, More Is Better)
  c7gn.16xlarge Graviton3E: 28736.23 (SE +/- 12.73, N = 3)
  c7g.16xlarge Graviton3: 28708.66 (SE +/- 11.81, N = 3)
  m7g.16xlarge Graviton3: 28296.38 (SE +/- 27.09, N = 3)
  c6g.16xlarge Graviton2: 17557.49 (SE +/- 38.55, N = 3)
  c6a.16xlarge AMD Zen 3: 16708.26 (SE +/- 90.11, N = 3)
  egeo-07: 5676.78 (SE +/- 5.42, N = 3)
  1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
  Extra flags listed in the chart: -pthread
Pennant
Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D.
Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
  m7g.16xlarge Graviton3: 6.720537 (SE +/- 0.000869, N = 3)
  c7gn.16xlarge Graviton3E: 6.839998 (SE +/- 0.000467, N = 3)
  c7g.16xlarge Graviton3: 6.961345 (SE +/- 0.005468, N = 3)
  c6a.16xlarge AMD Zen 3: 9.917565 (SE +/- 0.013289, N = 3)
  c6g.16xlarge Graviton2: 12.176830 (SE +/- 0.018924, N = 3)
  egeo-07: 43.458800 (SE +/- 0.025073, N = 3)
  1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
  Extra flags listed in the chart: -pthread
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed.
Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
  m7g.16xlarge Graviton3: 3.09871038 (SE +/- 0.02702838, N = 3)
  c7gn.16xlarge Graviton3E: 3.11489828 (SE +/- 0.01738352, N = 3)
  c7g.16xlarge Graviton3: 3.14447999 (SE +/- 0.03233273, N = 3)
  c6g.16xlarge Graviton2: 5.63720735 (SE +/- 0.02560507, N = 3)
  c6a.16xlarge AMD Zen 3: 7.01975288 (SE +/- 0.08686597, N = 15)
  egeo-07: 21.23927370 (SE +/- 0.12655798, N = 3)
  1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  Extra flags listed in the chart: -pthread -ldl -lutil -lrt
HeFFTe - Highly Efficient FFT for Exascale
HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better)
  c7gn.16xlarge Graviton3E: 40.97080 (SE +/- 0.02971, N = 3)
  m7g.16xlarge Graviton3: 40.89230 (SE +/- 0.01031, N = 3)
  c7g.16xlarge Graviton3: 40.82830 (SE +/- 0.02659, N = 3)
  c6a.16xlarge AMD Zen 3: 20.87190 (SE +/- 0.17467, N = 3)
  c6g.16xlarge Graviton2: 20.62790 (SE +/- 0.01033, N = 3)
  egeo-07: 9.32379 (SE +/- 0.01615, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
NAS Parallel Benchmarks
NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes.
NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
  c7gn.16xlarge Graviton3E: 22155.36 (SE +/- 125.21, N = 3)
  m7g.16xlarge Graviton3: 21988.99 (SE +/- 130.18, N = 3)
  c7g.16xlarge Graviton3: 21911.02 (SE +/- 283.23, N = 3)
  c6a.16xlarge AMD Zen 3: 20210.00 (SE +/- 14.83, N = 3)
  c6g.16xlarge Graviton2: 13103.62 (SE +/- 31.56, N = 3)
  egeo-07: 9661.91 (SE +/- 12.05, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. c6a.16xlarge AMD Zen 3: Open MPI 4.1.2
  3. egeo-07: Open MPI 4.1.0
  Extra flags listed in the chart: -pthread -ldl -lutil -lrt
Rodinia
Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment.
Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
  m7g.16xlarge Graviton3: 4.375 (SE +/- 0.011, N = 3)
  c7gn.16xlarge Graviton3E: 4.429 (SE +/- 0.027, N = 3)
  c7g.16xlarge Graviton3: 4.442 (SE +/- 0.021, N = 3)
  c6g.16xlarge Graviton2: 6.051 (SE +/- 0.016, N = 3)
  c6a.16xlarge AMD Zen 3: 9.342 (SE +/- 0.002, N = 3)
  egeo-07: 18.630 (SE +/- 0.208, N = 4)
  1. (CXX) g++ options: (none listed)
  Extra flags listed in the chart: -O2 -lOpenCL (five times); -m64 -lm -lcuda -lcudart -lcudadevrt -lcudart_static -lrt -lpthread -ldl
HeFFTe - Highly Efficient FFT for Exascale
HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 81.44 (SE +/- 0.01, N = 3)
  c7gn.16xlarge Graviton3E: 81.17 (SE +/- 0.09, N = 3)
  c7g.16xlarge Graviton3: 81.01 (SE +/- 0.07, N = 3)
  c6a.16xlarge AMD Zen 3: 43.59 (SE +/- 0.42, N = 6)
  c6g.16xlarge Graviton2: 41.98 (SE +/- 0.05, N = 3)
  egeo-07: 17.00 (SE +/- 0.10, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 78.50 (SE +/- 0.02, N = 3)
  c7gn.16xlarge Graviton3E: 78.17 (SE +/- 0.03, N = 3)
  c7g.16xlarge Graviton3: 77.77 (SE +/- 0.31, N = 3)
  c6a.16xlarge AMD Zen 3: 41.59 (SE +/- 0.17, N = 3)
  c6g.16xlarge Graviton2: 40.11 (SE +/- 0.01, N = 3)
  egeo-07: 17.12 (SE +/- 0.10, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
NAS Parallel Benchmarks
NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes.
NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
  m7g.16xlarge Graviton3: 50126.29 (SE +/- 24.30, N = 3)
  c7gn.16xlarge Graviton3E: 49860.68 (SE +/- 14.65, N = 3)
  c7g.16xlarge Graviton3: 49742.30 (SE +/- 32.94, N = 3)
  c6a.16xlarge AMD Zen 3: 45946.81 (SE +/- 167.32, N = 3)
  c6g.16xlarge Graviton2: 25671.29 (SE +/- 7.02, N = 3)
  egeo-07: 19477.23 (SE +/- 35.78, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. c6a.16xlarge AMD Zen 3: Open MPI 4.1.2
  3. egeo-07: Open MPI 4.1.0
  Extra flags listed in the chart: -pthread -ldl -lutil -lrt
HeFFTe - Highly Efficient FFT for Exascale
HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 164.87 (SE +/- 0.27, N = 3)
  c7gn.16xlarge Graviton3E: 162.36 (SE +/- 0.04, N = 3)
  c7g.16xlarge Graviton3: 162.01 (SE +/- 0.11, N = 3)
  c6a.16xlarge AMD Zen 3: 102.65 (SE +/- 1.28, N = 3)
  c6g.16xlarge Graviton2: 92.40 (SE +/- 0.19, N = 3)
  egeo-07: 32.43 (SE +/- 0.04, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
LAMMPS Molecular Dynamics Simulator
LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator.
LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: Rhodopsin Protein (ns/day, More Is Better)
  m7g.16xlarge Graviton3: 37.558 (SE +/- 0.057, N = 3)
  c7gn.16xlarge Graviton3E: 37.482 (SE +/- 0.026, N = 3)
  c7g.16xlarge Graviton3: 37.412 (SE +/- 0.033, N = 3)
  c6g.16xlarge Graviton2: 25.950 (SE +/- 0.083, N = 3)
  c6a.16xlarge AMD Zen 3: 19.563 (SE +/- 0.257, N = 12)
  egeo-07: 7.176 (SE +/- 0.024, N = 3)
  1. (CXX) g++ options: -O3 -ldl
  Extra flags listed in the chart: -lm -pthread -lm
HeFFTe - Highly Efficient FFT for Exascale
HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 57.15030 (SE +/- 0.28294, N = 3)
  c7g.16xlarge Graviton3: 55.10550 (SE +/- 0.14885, N = 3)
  c7gn.16xlarge Graviton3E: 55.10380 (SE +/- 0.32202, N = 3)
  c6a.16xlarge AMD Zen 3: 48.94320 (SE +/- 0.84547, N = 15)
  c6g.16xlarge Graviton2: 32.74680 (SE +/- 0.08221, N = 3)
  egeo-07: 9.62296 (SE +/- 0.03519, N = 3)
  1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 306.54 (SE +/- 0.83, N = 3)
  c7g.16xlarge Graviton3: 301.42 (SE +/- 0.56, N = 3)
  c7gn.16xlarge Graviton3E: 300.40 (SE +/- 1.62, N = 3)
  c6g.16xlarge Graviton2: 209.50 (SE +/- 0.64, N = 3)
  c6a.16xlarge AMD Zen 3: 158.86 (SE +/- 1.94, N = 3)
  egeo-07: 58.20 (SE +/- 0.85, N = 15)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 186.36 (SE +/- 0.27, N = 3)
  c7gn.16xlarge Graviton3E: 184.11 (SE +/- 0.20, N = 3)
  c7g.16xlarge Graviton3: 184.03 (SE +/- 0.47, N = 3)
  c6g.16xlarge Graviton2: 135.36 (SE +/- 0.35, N = 3)
  c6a.16xlarge AMD Zen 3: 98.70 (SE +/- 1.25, N = 14)
  egeo-07: 26.74 (SE +/- 0.04, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better)
  m7g.16xlarge Graviton3: 138.01 (SE +/- 0.12, N = 3)
  c7g.16xlarge Graviton3: 133.51 (SE +/- 0.47, N = 3)
  c7gn.16xlarge Graviton3E: 133.42 (SE +/- 0.04, N = 3)
  c6a.16xlarge AMD Zen 3: 86.37 (SE +/- 1.46, N = 12)
  c6g.16xlarge Graviton2: 81.45 (SE +/- 0.61, N = 3)
  egeo-07: 23.02 (SE +/- 0.06, N = 3)
  1. (CXX) g++ options: -O3
  Extra flags listed in the chart: -pthread
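Each chart reports an "SE +/- x, N = n" figure next to the result. Assuming SE is the standard error of the mean over N runs (SE = s / sqrt(N), with s the sample standard deviation), the run-to-run noise can be put on a common scale. The sketch below uses the last HeFFTe chart above (r2c, FFTW, double, 128); the function and variable names are hypothetical, and the pairing of SE values with systems assumes both are listed in the same order.

```python
import math

# (mean GFLOP/s, SE, N) copied from the HeFFTe r2c / FFTW / double / 128 chart.
heffte_r2c_double_128 = {
    "m7g.16xlarge Graviton3": (138.01, 0.12, 3),
    "c7g.16xlarge Graviton3": (133.51, 0.47, 3),
    "c7gn.16xlarge Graviton3E": (133.42, 0.04, 3),
    "c6a.16xlarge AMD Zen 3": (86.37, 1.46, 12),
    "c6g.16xlarge Graviton2": (81.45, 0.61, 3),
    "egeo-07": (23.02, 0.06, 3),
}


def relative_se_percent(mean, se):
    """Standard error expressed as a percentage of the mean."""
    return 100.0 * se / mean


def implied_std_dev(se, n):
    """Sample standard deviation implied by SE = s / sqrt(N)."""
    return se * math.sqrt(n)


for name, (mean, se, n) in heffte_r2c_double_128.items():
    print(
        f"{name}: SE = {relative_se_percent(mean, se):.2f}% of mean, "
        f"implied std dev = {implied_std_dev(se, n):.2f} GFLOP/s (N = {n})"
    )
```

A larger N alongside a larger relative SE, as on c6a.16xlarge here, is consistent with the test having been repeated additional times because of run-to-run variance, though the result file does not state the reason.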
m7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 June 2023 16:24 by user ubuntu.
c6g.16xlarge Graviton2 Processor: ARMv8 Neoverse-N1 (64 Cores), Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 June 2023 01:32 by user ubuntu.
c7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 c7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 June 2023 10:31 by user ubuntu.
c7gn.16xlarge Graviton3E Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 c7gn.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 July 2023 15:05 by user ubuntu.
c6a.16xlarge AMD Zen 3 Processor: AMD EPYC 7R13 (32 Cores / 64 Threads), Motherboard: Amazon EC2 c6a.16xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 128GB, Disk: 322GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xa0011cf
Python Notes: Python 3.10.12
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 11 August 2023 14:59 by user ubuntu.
egeo-07 Processor: 2 x Intel Xeon Silver 4208 @ 3.20GHz (16 Cores / 32 Threads), Motherboard: Dell Precision 7920 Rack 0DY2X0 (2.21.2 BIOS), Chipset: Intel Sky Lake-E DMI3 Registers, Memory: 64GB, Disk: 2000GB TOSHIBA DT01ACA2, Graphics: Matrox G200eW3 15GB, Audio: NVIDIA TU104 HD Audio, Monitor: DELL 17FP, Network: 4 x Intel I350
OS: Debian 11, Kernel: 5.10.0-28-amd64 (x86_64), Display Server: X Server, Display Driver: NVIDIA, OpenCL: OpenCL 3.0 CUDA 12.2.138, Vulkan: 1.3.242, Compiler: GCC 10.2.1 20210110 + Clang 11.0.1-2 + CUDA 11.2, File-System: ext4, Screen Resolution: 1280x1024
Kernel Notes: Transparent Huge Pages: always
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-mutex --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-Km9U7s/gcc-10-10.2.1/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x5003605
Python Notes: Python 2.7.18 + Python 3.9.2
Security Notes: gather_data_sampling: Mitigation of Microcode + itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Mitigation of Clear buffers; SMT vulnerable + retbleed: Mitigation of Enhanced IBRS + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Mitigation of TSX disabled
Testing initiated at 28 May 2024 01:22 by user root.
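The Security Notes fields in the system details above follow a "name: status + name: status" layout, which makes them easy to compare across systems once split apart. The following is a minimal sketch, not part of the result file: `parse_security_notes` is a hypothetical helper, and the sample string is the Security Notes value reported for the Graviton instances.

```python
def parse_security_notes(notes):
    """Split a 'name: status + name: status' string into a dict.

    Only the first ': ' in each item separates name from status, so a
    status that itself contains a colon is kept whole.
    """
    parsed = {}
    for item in notes.split(" + "):
        name, _, status = item.partition(": ")
        parsed[name.strip()] = status.strip()
    return parsed


# Security Notes value as reported for the Graviton systems above.
graviton_notes = (
    "itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + "
    "meltdown: Not affected + mmio_stale_data: Not affected + "
    "retbleed: Not affected + "
    "spec_store_bypass: Mitigation of SSB disabled via prctl + "
    "spectre_v1: Mitigation of __user pointer sanitization + "
    "spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + "
    "tsx_async_abort: Not affected"
)

parsed = parse_security_notes(graviton_notes)

# Entries whose status is anything other than "Not affected".
mitigated = [name for name, status in parsed.items() if status != "Not affected"]
print(mitigated)
```

Because mitigations can carry a performance cost, listing the entries that differ between two systems is one way to check whether a comparison was run under like-for-like settings.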