Benchmarks by Michael Larabel for a future article on Phoronix.com.
m7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
c6g.16xlarge Graviton2 Changed Processor to ARMv8 Neoverse-N1 (64 Cores).
Changed Motherboard to Amazon EC2 c6g.16xlarge (1.0 BIOS).
Changed Memory to 128GB.
c7g.16xlarge Graviton3 Changed Processor to ARMv8 Neoverse-V1 (64 Cores).
Changed Motherboard to Amazon EC2 c7g.16xlarge (1.0 BIOS).
c7gn.16xlarge Graviton3E Changed Motherboard to Amazon EC2 c7gn.16xlarge (1.0 BIOS).
c6a.16xlarge AMD Zen 3 Processor: AMD EPYC 7R13 (32 Cores / 64 Threads), Motherboard: Amazon EC2 c6a.16xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 128GB, Disk: 322GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xa0011cf
Python Notes: Python 3.10.12
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks

System | Processor | Motherboard | Chipset | Memory | Disk | Kernel | Compiler | Vulkan
m7g.16xlarge Graviton3 | ARMv8 Neoverse-V1 (64 Cores) | Amazon EC2 m7g.16xlarge (1.0 BIOS) | Amazon Device 0200 | 256GB | 215GB Amazon Elastic Block Store | 5.19.0-1025-aws (aarch64) | GCC 11.3.0 |
c6g.16xlarge Graviton2 | ARMv8 Neoverse-N1 (64 Cores) | Amazon EC2 c6g.16xlarge (1.0 BIOS) | Amazon Device 0200 | 128GB | 215GB Amazon Elastic Block Store | 5.19.0-1025-aws (aarch64) | GCC 11.3.0 |
c7g.16xlarge Graviton3 | ARMv8 Neoverse-V1 (64 Cores) | Amazon EC2 c7g.16xlarge (1.0 BIOS) | Amazon Device 0200 | 128GB | 215GB Amazon Elastic Block Store | 5.19.0-1025-aws (aarch64) | GCC 11.3.0 |
c7gn.16xlarge Graviton3E | ARMv8 Neoverse-V1 (64 Cores) | Amazon EC2 c7gn.16xlarge (1.0 BIOS) | Amazon Device 0200 | 128GB | 215GB Amazon Elastic Block Store | 5.19.0-1025-aws (aarch64) | GCC 11.3.0 |
c6a.16xlarge AMD Zen 3 | AMD EPYC 7R13 (32 Cores / 64 Threads) | Amazon EC2 c6a.16xlarge (1.0 BIOS) | Intel 440FX 82441FX PMC | 128GB | 322GB Amazon Elastic Block Store | 5.19.0-1025-aws (x86_64) | GCC 11.4.0 | 1.3.238

Common to all systems: Network: Amazon Elastic, OS: Ubuntu 22.04, File-System: ext4, System Layer: amazon

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- m7g.16xlarge Graviton3, c6g.16xlarge Graviton2, c7g.16xlarge Graviton3, c7gn.16xlarge Graviton3E: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- c6a.16xlarge AMD Zen 3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Python Details
- m7g.16xlarge Graviton3, c6g.16xlarge Graviton2, c7g.16xlarge Graviton3, c7gn.16xlarge Graviton3E: Python 3.10.6
- c6a.16xlarge AMD Zen 3: Python 3.10.12

Security Details
- m7g.16xlarge Graviton3, c6g.16xlarge Graviton2, c7g.16xlarge Graviton3, c7gn.16xlarge Graviton3E: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- c6a.16xlarge AMD Zen 3: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- c6a.16xlarge AMD Zen 3: CPU Microcode: 0xa0011cf
[Chart: Result Overview (Phoronix Test Suite), relative performance axis from 100% to 269%. Systems: m7g.16xlarge Graviton3, c6g.16xlarge Graviton2, c7g.16xlarge Graviton3, c7gn.16xlarge Graviton3E, c6a.16xlarge AMD Zen 3. Tests shown: srsRAN Project, ACES DGEMM, Graph500, Liquid-DSP, NAS Parallel Benchmarks, OpenSSL, Xcompact3d Incompact3d, nekRS, Algebraic Multi-Grid Benchmark, Stress-NG, LAMMPS Molecular Dynamics Simulator, HeFFTe - Highly Efficient FFT for Exascale, Pennant, NWChem, GROMACS, LULESH, nginx, GPAW, Monte Carlo Simulations of Ionised Nebulae, BRL-CAD, Kripke, Remhos, QMCPACK, LeelaChessZero, Timed Godot Game Engine Compilation, Stockfish, Rodinia, Laghos, 7-Zip Compression, Coremark, Timed Node.js Compilation, Timed Gem5 Compilation.]
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks

Test | m7g.16xlarge Graviton3 | c6g.16xlarge Graviton2 | c7g.16xlarge Graviton3 | c7gn.16xlarge Graviton3E | c6a.16xlarge AMD Zen 3
npb: LU.C | 28341.68 | 18741.90 | 28375.71 | 28369.11 | 95221.40
liquid-dsp: 32 - 256 - 512 | 81396667 | 67486333 | 81412000 | 81394000 | 274803333
openssl: RSA4096 | 10181.9 | 2624.3 | 10181.4 | 10183.3 | 8392.4
srsran: Downlink Processor Benchmark | 318.5 | 197.2 | 319.7 | 323.2 | 691.3
npb: SP.C | 17244.85 | 9711.70 | 17219.95 | 17163.11 | 34025.35
liquid-dsp: 64 - 256 - 512 | 162753333 | 134926667 | 162766667 | 162756667 | 460076667
srsran: PUSCH Processor Benchmark, Throughput Thread | 95.8 | 63.8 | 95.7 | 97.4 | 215.9
openssl: RSA4096 | 713859.5 | 214040.9 | 713945.9 | 713754.8 | 548396.5
liquid-dsp: 32 - 256 - 57 | 721493333 | 489270000 | 721386667 | 721380000 | 1444266667
graph500: 26 | 1227790000 | 874389000 | 1206990000 | 1207760000 | 417777000
graph500: 26 | 1194320000 | 860432000 | 1177710000 | 1175640000 | 410571000
openssl: AES-256-GCM | 283333113630 | 129199593157 | 283373795737 | 351152465420 | 138457889450
openssl: AES-128-GCM | 332033171900 | 158436163857 | 332064349843 | 411130469943 | 151449269317
mt-dgemm: Sustained Floating-Point Rate | 24.362353 | 20.417952 | 24.140605 | 24.078529 | 9.388050
stress-ng: Memory Copying | 20484.24 | 11324.79 | 20478.67 | 20475.96 | 8080.43
stress-ng: Matrix Math | 368750.67 | 284713.63 | 368671.39 | 369258.89 | 147576.41
stress-ng: Vector Shuffle | 54143.40 | 35614.51 | 54472.07 | 54695.04 | 22255.84
nekrs: Kershaw | 3150680000 | 1760336667 | 3261853333 | 3302823333 | 4308810000
stress-ng: Matrix 3D Math | 10403.93 | 5752.17 | 10813.59 | 10882.02 | 4571.96
mocassin: Dust 2D tau100.0 | 82.669 | 145.374 | 82.822 | 82.974 | 194.435
incompact3d: input.i3d 129 Cells Per Direction | 3.09871038 | 5.63720735 | 3.14447999 | 3.11489828 | 7.01975288
stress-ng: Vector Floating Point | 76102.55 | 42850.82 | 76178.46 | 76911.74 | 96529.51
openssl: SHA512 | 32125448870 | 14393925490 | 32145914147 | 32126059040 | 15291283297
incompact3d: input.i3d 193 Cells Per Direction | 13.9454180 | 25.8825658 | 13.8326693 | 13.7606726 | 30.3145288
rodinia: OpenMP CFD Solver | 4.375 | 6.051 | 4.442 | 4.429 | 9.342
amg: | 1646761667 | 1035586333 | 1765277667 | 1765966333 | 836999300
heffte: c2c - FFTW - float - 512 | 88.0482 | 42.8284 | 88.1842 | 88.4551 | 44.3176
stress-ng: Fused Multiply-Add | 63762252.76 | 37732190.54 | 63818458.61 | 63723431.55 | 30920910.92
openssl: ChaCha20 | 103226784517 | 67292541203 | 103275516997 | 114118119423 | 138389378753
graph500: 26 | 419754000 | 284689000 | 415758000 | 411762000 | 204550000
heffte: r2c - FFTW - double - 512 | 84.4739 | 44.9297 | 84.7451 | 85.0060 | 42.4394
heffte: r2c - FFTW - float - 512 | 162.956 | 81.9412 | 163.276 | 163.559 | 82.7584
heffte: c2c - FFTW - double - 256 | 40.8923 | 20.6279 | 40.8283 | 40.9708 | 20.8719
openssl: ChaCha20-Poly1305 | 74287460990 | 46717636807 | 74318842213 | 79969465487 | 92522999373
heffte: c2c - FFTW - double - 512 | 46.2504 | 24.2658 | 46.3706 | 46.5300 | 23.5212
heffte: r2c - FFTW - double - 256 | 78.5049 | 40.1104 | 77.7685 | 78.1658 | 41.5868
nekrs: TurboPipe Periodic | 3976300000 | 2220190000 | 3978983333 | 4141440000 | 4337536667
npb: MG.C | 50126.29 | 25671.29 | 49742.30 | 49860.68 | 45946.81
heffte: c2c - FFTW - float - 256 | 81.4442 | 41.9816 | 81.0096 | 81.1671 | 43.5907
heffte: r2c - FFTW - float - 128 | 306.540 | 209.496 | 301.418 | 300.396 | 158.858
lammps: Rhodopsin Protein | 37.558 | 25.950 | 37.412 | 37.482 | 19.563
graph500: 26 | 299497000 | 209350000 | 293826000 | 296164000 | 157688000
heffte: c2c - FFTW - float - 128 | 186.356 | 135.358 | 184.026 | 184.110 | 98.7026
lammps: 20k Atoms | 36.927 | 25.171 | 36.862 | 36.838 | 20.342
pennant: leblancbig | 6.720537 | 12.17683 | 6.961345 | 6.839998 | 9.917565
nwchem: C240 Buckyball | 1940.2 | 2976.9 | 1962.7 | 1914 | 3440.4
pennant: sedovbig | 9.206490 | 16.48050 | 9.422270 | 9.340953 | 16.53050
heffte: r2c - FFTW - float - 256 | 164.873 | 92.3996 | 162.010 | 162.361 | 102.652
liquid-dsp: 64 - 256 - 57 | 1442400000 | 978200000 | 1442366667 | 1442666667 | 1710800000
gromacs: MPI CPU - water_GMX50_bare | 4.223 | 2.767 | 4.200 | 4.820 | 3.965
lulesh: | 28296.378 | 17557.485 | 28708.656 | 28736.226 | 16708.258
nginx: 500 | 255768.44 | 148964.69 | 255145.52 | 253518.51 | 165847.75
heffte: r2c - FFTW - double - 128 | 138.014 | 81.4498 | 133.514 | 133.422 | 86.3730
npb: CG.C | 21988.99 | 13103.62 | 21911.02 | 22155.36 | 20210.00
npb: EP.D | 3738.98 | 2216.26 | 3664.54 | 3657.67 | 3061.42
qmcpack: simple-H2O | 28.041 | 45.225 | 27.990 | 27.999 | 26.867
srsran: PUSCH Processor Benchmark, Throughput Total | 5413.8 | 3938.7 | 5356.8 | 5431.2 | 6479.1
gpaw: Carbon Nanotube | 61.831 | 92.760 | 62.083 | 56.440 | 89.818
qmcpack: FeCO6_b3lyp_gms | 211.60 | 302.19 | 211.32 | 188.28 | 184.10
mocassin: Gas HII40 | 13.575 | 20.758 | 13.659 | 13.525 | 12.669
brl-cad: VGR Performance Metric | 783777 | 533020 | 789066 | 744743 | 485038
lczero: Eigen | 1398 | 891 | 1382 | 1444 | 1152
nginx: 1000 | 255616.04 | 158676.40 | 255552.05 | 256585.83 | 163178.67
kripke: | 339000400 | 220120233 | 354442733 | 354234067 | 237087650
qmcpack: FeCO6_b3lyp_gms | 205.72 | 297.94 | 204.77 | 204.25 | 187.32
remhos: Sample Remap Example | 14.040 | 20.740 | 14.120 | 14.082 | 22.104
liquid-dsp: 32 - 256 - 32 | 1136066667 | 765466667 | 1136133333 | 1136000000 | 1193966667
stress-ng: Wide Vector Math | 1542834.94 | 997272.65 | 1535336.57 | 1530043.52 | 1380146.63
laghos: Sedov Blast Wave, ube_922_hex.mesh | 410.55 | 322.37 | 408.01 | 423.11 | 275.92
stress-ng: Vector Math | 217235.59 | 147886.14 | 217446.12 | 217567.10 | 221776.15
liquid-dsp: 64 - 256 - 32 | 2270500000 | 1531400000 | 2271966667 | 2266833333 | 2184866667
build-godot: Time To Compile | 154.378 | 218.276 | 156.687 | 155.951 | 147.737
lczero: BLAS | 1301 | 947 | 1333 | 1392 | 1316
qmcpack: Li2_STO_ae | 112.61 | 165.12 | 112.64 | 113.20 | 123.95
rodinia: OpenMP LavaMD | 43.788 | 62.224 | 43.963 | 44.044 | 64.179
compress-7zip: Compression Rating | 316825 | 240702 | 311056 | 312009 | 230970
laghos: Triple Point Problem | 232.01 | 180.80 | 230.68 | 236.22 | 227.40
coremark: CoreMark Size 666 - Iterations Per Second | 1601880.342264 | 1260642.177024 | 1605948.674645 | 1611801.559265 | 1466587.036580
openssl: SHA256 | 54212515580 | 42472798847 | 54216561263 | 54154218593 | 45857534777
build-gem5: Time To Compile | 180.247 | 225.305 | 181.779 | 182.471 | 192.118
build-nodejs: Time To Compile | 237.783 | 287.814 | 238.543 | 238.636 | 230.423
compress-7zip: Decompression Rating | 285540 | 234202 | 285633 | 285677 | 235787
stress-ng: CPU Cache | 3892396.34 | 1921785.20 | 3844101.98 | 3860335.38 | 1447265.35
stress-ng: NUMA | 3759.10 | 2112.66 | 3523.58 | 3525.17 | 552.68
stockfish: Total Time | 112119711 | 86609284 | 117316476 | 117027121 | 96905609
heffte: c2c - FFTW - double - 128 | 57.1503 | 32.7468 | 55.1055 | 55.1038 | 48.9432
rodinia: OpenMP Streamcluster | 11.663 | 13.735 | 11.625 | 10.690 | 8.396
NAS Parallel Benchmarks
NPB (NAS Parallel Benchmarks) is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes.
NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
c6a.16xlarge AMD Zen 3: 95221.40 (SE +/- 90.22, N = 3)
c7gn.16xlarge Graviton3E: 28369.11 (SE +/- 43.73, N = 3)
c7g.16xlarge Graviton3: 28375.71 (SE +/- 36.09, N = 3)
c6g.16xlarge Graviton2: 18741.90 (SE +/- 26.12, N = 3)
m7g.16xlarge Graviton3: 28341.68 (SE +/- 48.62, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
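Each chart reports a standard error ("SE +/- ...") over N runs. The per-run values are not part of this export, so the following is only a minimal sketch, using made-up sample values, of how a standard error of the mean is conventionally computed. Whether the Phoronix Test Suite uses exactly this formula is an assumption, and the `standard_error` helper is hypothetical.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two runs")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run results (Total Mop/s); NOT taken from the benchmark export.
runs = [28300.0, 28350.0, 28400.0]
mean = statistics.mean(runs)
se = standard_error(runs)
print(f"{mean:.2f} SE +/- {se:.2f}, N = {len(runs)}")
```

A small SE relative to the mean, as in most charts here, suggests the runs were tightly clustered; a larger N (such as N = 13 or N = 15 on some charts) typically means extra runs were needed to reach that stability.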
Liquid-DSP
LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage.
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
c6a.16xlarge AMD Zen 3: 274803333 (SE +/- 193419.52, N = 3)
c7gn.16xlarge Graviton3E: 81394000 (SE +/- 577.35, N = 3)
c7g.16xlarge Graviton3: 81412000 (SE +/- 1000.00, N = 3)
c6g.16xlarge Graviton2: 67486333 (SE +/- 333.33, N = 3)
m7g.16xlarge Graviton3: 81396667 (SE +/- 1855.92, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities.
OpenSSL 3.1, Algorithm: RSA4096 (sign/s, More Is Better)
c6a.16xlarge AMD Zen 3: 8392.4 (SE +/- 3.06, N = 3)
c7gn.16xlarge Graviton3E: 10183.3 (SE +/- 0.84, N = 3)
c7g.16xlarge Graviton3: 10181.4 (SE +/- 1.54, N = 3)
c6g.16xlarge Graviton2: 2624.3 (SE +/- 1.71, N = 3)
m7g.16xlarge Graviton3: 10181.9 (SE +/- 1.27, N = 3)
Additional flag shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
srsRAN Project
srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network.
srsRAN Project 23.5, Test: Downlink Processor Benchmark (Mbps, More Is Better)
c6a.16xlarge AMD Zen 3: 691.3 (SE +/- 1.26, N = 3)
c7gn.16xlarge Graviton3E: 323.2 (SE +/- 0.06, N = 3)
c7g.16xlarge Graviton3: 319.7 (SE +/- 0.95, N = 3)
c6g.16xlarge Graviton2: 197.2 (SE +/- 0.25, N = 3)
m7g.16xlarge Graviton3: 318.5 (SE +/- 0.91, N = 3)
Additional flags shown on the chart: -march=native -mfma
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
c6a.16xlarge AMD Zen 3: 34025.35 (SE +/- 20.85, N = 3)
c7gn.16xlarge Graviton3E: 17163.11 (SE +/- 31.31, N = 3)
c7g.16xlarge Graviton3: 17219.95 (SE +/- 7.21, N = 3)
c6g.16xlarge Graviton2: 9711.70 (SE +/- 1.54, N = 3)
m7g.16xlarge Graviton3: 17244.85 (SE +/- 10.19, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
c6a.16xlarge AMD Zen 3: 460076667 (SE +/- 392527.42, N = 3)
c7gn.16xlarge Graviton3E: 162756667 (SE +/- 8819.17, N = 3)
c7g.16xlarge Graviton3: 162766667 (SE +/- 3333.33, N = 3)
c6g.16xlarge Graviton2: 134926667 (SE +/- 3333.33, N = 3)
m7g.16xlarge Graviton3: 162753333 (SE +/- 6666.67, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
srsRAN Project 23.5, Test: PUSCH Processor Benchmark, Throughput Thread (Mbps, More Is Better)
c6a.16xlarge AMD Zen 3: 215.9 (SE +/- 0.55, N = 3)
c7gn.16xlarge Graviton3E: 97.4 (SE +/- 0.06, N = 3)
c7g.16xlarge Graviton3: 95.7 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 63.8 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 95.8 (SE +/- 0.03, N = 3)
Additional flags shown on the chart: -march=native -mfma
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
OpenSSL 3.1, Algorithm: RSA4096 (verify/s, More Is Better)
c6a.16xlarge AMD Zen 3: 548396.5 (SE +/- 34.73, N = 3)
c7gn.16xlarge Graviton3E: 713754.8 (SE +/- 198.10, N = 3)
c7g.16xlarge Graviton3: 713945.9 (SE +/- 12.03, N = 3)
c6g.16xlarge Graviton2: 214040.9 (SE +/- 88.30, N = 3)
m7g.16xlarge Graviton3: 713859.5 (SE +/- 21.82, N = 3)
Additional flag shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c6a.16xlarge AMD Zen 3: 1444266667 (SE +/- 9533333.33, N = 3)
c7gn.16xlarge Graviton3E: 721380000 (SE +/- 150111.07, N = 3)
c7g.16xlarge Graviton3: 721386667 (SE +/- 168358.08, N = 3)
c6g.16xlarge Graviton2: 489270000 (SE +/- 23094.01, N = 3)
m7g.16xlarge Graviton3: 721493333 (SE +/- 3333.33, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Graph500
This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data-intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test.
Graph500 3.0, Scale: 26 (bfs max_TEPS, More Is Better)
c6a.16xlarge AMD Zen 3: 417777000
c7gn.16xlarge Graviton3E: 1207760000
c7g.16xlarge Graviton3: 1206990000
c6g.16xlarge Graviton2: 874389000
m7g.16xlarge Graviton3: 1227790000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Graph500 3.0, Scale: 26 (bfs median_TEPS, More Is Better)
c6a.16xlarge AMD Zen 3: 410571000
c7gn.16xlarge Graviton3E: 1175640000
c7g.16xlarge Graviton3: 1177710000
c6g.16xlarge Graviton2: 860432000
m7g.16xlarge Graviton3: 1194320000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
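The Graph500 charts above report TEPS (traversed edges per second) for breadth-first search. As a rough illustration of what that unit measures, here is a minimal single-threaded sketch on a toy graph. The `bfs_teps` helper, the toy adjacency list, and the choice to count each undirected edge of the reached component once are assumptions made for illustration; the reference implementation is an MPI code with its own edge accounting and timing rules.

```python
import time
from collections import deque


def bfs_teps(adjacency, root):
    """Run one BFS from `root`; return (undirected edges in the reached component, TEPS)."""
    parent = {root: root}
    queue = deque([root])
    scanned = 0  # adjacency entries visited; each undirected edge is seen twice
    start = time.perf_counter()
    while queue:
        u = queue.popleft()
        for v in adjacency[u]:
            scanned += 1
            if v not in parent:
                parent[v] = u
                queue.append(v)
    elapsed = time.perf_counter() - start
    edges = scanned // 2
    teps = edges / elapsed if elapsed > 0 else float("inf")
    return edges, teps


# Hypothetical toy graph: edges 0-1, 0-2, 1-2, 2-3, plus a separate component 4-5.
adjacency = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2], 4: [5], 5: [4]}
edges, teps = bfs_teps(adjacency, root=0)
print(f"{edges} edges traversed, {teps:.0f} TEPS")
```

The "max" and "median" variants in the charts presumably summarize TEPS over multiple BFS runs from different roots; this sketch performs a single run only.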
OpenSSL 3.1, Algorithm: AES-256-GCM (byte/s, More Is Better)
c6a.16xlarge AMD Zen 3: 138457889450 (SE +/- 41584947.90, N = 3)
c7gn.16xlarge Graviton3E: 351152465420 (SE +/- 24279491.44, N = 3)
c7g.16xlarge Graviton3: 283373795737 (SE +/- 33807617.40, N = 3)
c6g.16xlarge Graviton2: 129199593157 (SE +/- 2312792.64, N = 3)
m7g.16xlarge Graviton3: 283333113630 (SE +/- 6411836.47, N = 3)
Additional flag shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
OpenSSL 3.1, Algorithm: AES-128-GCM (byte/s, More Is Better)
c6a.16xlarge AMD Zen 3: 151449269317 (SE +/- 4227452.23, N = 3)
c7gn.16xlarge Graviton3E: 411130469943 (SE +/- 11273100.69, N = 3)
c7g.16xlarge Graviton3: 332064349843 (SE +/- 12264074.61, N = 3)
c6g.16xlarge Graviton2: 158436163857 (SE +/- 9833681.11, N = 3)
m7g.16xlarge Graviton3: 332033171900 (SE +/- 81289574.27, N = 3)
Additional flag shown on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
ACES DGEMM
This is a multi-threaded DGEMM benchmark.
ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 9.388050 (SE +/- 0.038051, N = 3)
c7gn.16xlarge Graviton3E: 24.078529 (SE +/- 0.297525, N = 4)
c7g.16xlarge Graviton3: 24.140605 (SE +/- 0.285590, N = 4)
c6g.16xlarge Graviton2: 20.417952 (SE +/- 0.154503, N = 3)
m7g.16xlarge Graviton3: 24.362353 (SE +/- 0.171001, N = 13)
1. (CC) gcc options: -O3 -march=native -fopenmp
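For readers unfamiliar with the GFLOP/s unit used by the DGEMM chart, the sketch below shows the conventional way a floating-point rate is derived for a dense n x n matrix multiply: roughly 2·n³ floating-point operations divided by the elapsed time. This is an illustration only; the helper names are hypothetical, the pure-Python loop is orders of magnitude slower than the OpenMP C benchmark, and the benchmark's own operation count may include extra terms.

```python
import time


def dgemm_gflops(n, seconds):
    """GFLOP/s for an n x n x n multiply, counting 2*n^3 operations (one multiply + one add per step)."""
    return (2.0 * n ** 3) / seconds / 1e9


def naive_matmul(a, b, n):
    """Plain triple-loop matrix multiply on lists of lists (illustrative, not optimized)."""
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        row_c = c[i]
        for k in range(n):
            aik = a[i][k]
            row_b = b[k]
            for j in range(n):
                row_c[j] += aik * row_b[j]
    return c


n = 64  # hypothetical small size so the example finishes quickly
a = [[1.0] * n for _ in range(n)]
b = [[2.0] * n for _ in range(n)]
start = time.perf_counter()
c = naive_matmul(a, b, n)
elapsed = time.perf_counter() - start
print(f"{dgemm_gflops(n, elapsed):.6f} GFLOP/s (pure Python, illustrative only)")
```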
Stress-NG
Stress-NG is a Linux stress tool developed by Colin Ian King.
Stress-NG 0.15.10, Test: Memory Copying (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 8080.43 (SE +/- 0.46, N = 3)
c7gn.16xlarge Graviton3E: 20475.96 (SE +/- 1.36, N = 3)
c7g.16xlarge Graviton3: 20478.67 (SE +/- 4.65, N = 3)
c6g.16xlarge Graviton2: 11324.79 (SE +/- 1.12, N = 3)
m7g.16xlarge Graviton3: 20484.24 (SE +/- 3.80, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stress-NG 0.15.10, Test: Matrix Math (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 147576.41 (SE +/- 167.77, N = 3)
c7gn.16xlarge Graviton3E: 369258.89 (SE +/- 28.60, N = 3)
c7g.16xlarge Graviton3: 368671.39 (SE +/- 38.76, N = 3)
c6g.16xlarge Graviton2: 284713.63 (SE +/- 8.13, N = 3)
m7g.16xlarge Graviton3: 368750.67 (SE +/- 53.44, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stress-NG 0.15.10, Test: Vector Shuffle (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 22255.84 (SE +/- 0.50, N = 3)
c7gn.16xlarge Graviton3E: 54695.04 (SE +/- 294.96, N = 3)
c7g.16xlarge Graviton3: 54472.07 (SE +/- 139.03, N = 3)
c6g.16xlarge Graviton2: 35614.51 (SE +/- 74.80, N = 3)
m7g.16xlarge Graviton3: 54143.40 (SE +/- 21.44, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
nekRS
nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems.
nekRS 23.0, Input: Kershaw (flops/rank, More Is Better)
c6a.16xlarge AMD Zen 3: 4308810000 (SE +/- 22342148.51, N = 3)
c7gn.16xlarge Graviton3E: 3302823333 (SE +/- 5414395.42, N = 3)
c7g.16xlarge Graviton3: 3261853333 (SE +/- 2490845.46, N = 3)
c6g.16xlarge Graviton2: 1760336667 (SE +/- 737119.02, N = 3)
m7g.16xlarge Graviton3: 3150680000 (SE +/- 1575066.14, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
Stress-NG 0.15.10, Test: Matrix 3D Math (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 4571.96 (SE +/- 1.96, N = 3)
c7gn.16xlarge Graviton3E: 10882.02 (SE +/- 19.16, N = 3)
c7g.16xlarge Graviton3: 10813.59 (SE +/- 9.35, N = 3)
c6g.16xlarge Graviton2: 5752.17 (SE +/- 1.40, N = 3)
m7g.16xlarge Graviton3: 10403.93 (SE +/- 6.38, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Monte Carlo Simulations of Ionised Nebulae
MOCASSIN (Monte Carlo Simulations of Ionised Nebulae) is a fully 3D or 2D photoionisation and dust radiative transfer code that employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution.
Monte Carlo Simulations of Ionised Nebulae 2.02.73.3, Input: Dust 2D tau100.0 (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 194.44 (SE +/- 1.84, N = 7)
c7gn.16xlarge Graviton3E: 82.97 (SE +/- 0.07, N = 3)
c7g.16xlarge Graviton3: 82.82 (SE +/- 0.00, N = 3)
c6g.16xlarge Graviton2: 145.37 (SE +/- 0.86, N = 3)
m7g.16xlarge Graviton3: 82.67 (SE +/- 0.01, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
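Because some charts are "More Is Better" and others, like the one above, are "Fewer Is Better", comparing systems across tests requires flipping the ratio for time-based results. The sketch below shows one simple way to do that with two pairs of values copied from the charts in this document; the `speedup` helper is hypothetical and is not necessarily how the Phoronix Test Suite normalizes its overview.

```python
def speedup(candidate, baseline, higher_is_better):
    """How many times better `candidate` is than `baseline` (a value above 1 favors candidate)."""
    if candidate <= 0 or baseline <= 0:
        raise ValueError("results must be positive")
    return candidate / baseline if higher_is_better else baseline / candidate


# NPB LU.C (Total Mop/s, More Is Better): c6a.16xlarge vs. m7g.16xlarge
lu = speedup(95221.40, 28341.68, higher_is_better=True)

# Mocassin Dust 2D tau100.0 (Seconds, Fewer Is Better): m7g.16xlarge vs. c6a.16xlarge
dust = speedup(82.67, 194.44, higher_is_better=False)

print(f"LU.C: c6a.16xlarge is {lu:.2f}x m7g.16xlarge")
print(f"Dust 2D tau100.0: m7g.16xlarge is {dust:.2f}x c6a.16xlarge")
```

Expressing every result as a "times better" ratio puts throughput and runtime tests on the same footing, which is a prerequisite for any aggregate such as a geometric mean.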
Xcompact3d Incompact3d
Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference high-performance code for solving the incompressible Navier-Stokes equations together with as many scalar transport equations as needed.
OpenBenchmarking.org: Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 7.01975288 (SE +/- 0.08686597, N = 15)
c7gn.16xlarge Graviton3E: 3.11489828 (SE +/- 0.01738352, N = 3)
c7g.16xlarge Graviton3: 3.14447999 (SE +/- 0.03233273, N = 3)
c6g.16xlarge Graviton2: 5.63720735 (SE +/- 0.02560507, N = 3)
m7g.16xlarge Graviton3: 3.09871038 (SE +/- 0.02702838, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
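Because some charts are "More Is Better" and others "Fewer Is Better", comparing instances requires flipping the ratio for time-based results. A minimal sketch of that arithmetic, using the Graviton2 and Graviton3 times from the 129-cells chart above; the function name is hypothetical and chosen for illustration only.

```python
def relative_performance(result, baseline, higher_is_better):
    """Return how many times better `result` is than `baseline`.

    For throughput-style metrics the ratio is result / baseline;
    for time-style metrics (fewer is better) it is inverted.
    """
    if higher_is_better:
        return result / baseline
    return baseline / result


# Seconds from the Xcompact3d 129-cells chart (Fewer Is Better).
c6g_seconds = 5.63720735  # c6g.16xlarge Graviton2
c7g_seconds = 3.14447999  # c7g.16xlarge Graviton3

speedup = relative_performance(c7g_seconds, c6g_seconds, higher_is_better=False)
print(f"c7g vs c6g: {speedup:.2f}x")
```

The same helper applies unchanged to throughput charts by passing `higher_is_better=True`.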
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Stress-NG 0.15.10, Test: Vector Floating Point (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 96529.51 (SE +/- 864.23, N = 13)
c7gn.16xlarge Graviton3E: 76911.74 (SE +/- 1.74, N = 3)
c7g.16xlarge Graviton3: 76178.46 (SE +/- 71.97, N = 3)
c6g.16xlarge Graviton2: 42850.82 (SE +/- 31.31, N = 3)
m7g.16xlarge Graviton3: 76102.55 (SE +/- 190.19, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: OpenSSL 3.1, Algorithm: SHA512 (byte/s, More Is Better)
c6a.16xlarge AMD Zen 3: 15291283297 (SE +/- 207279.55, N = 3)
c7gn.16xlarge Graviton3E: 32126059040 (SE +/- 16155877.53, N = 3)
c7g.16xlarge Graviton3: 32145914147 (SE +/- 4573992.60, N = 3)
c6g.16xlarge Graviton2: 14393925490 (SE +/- 9173912.49, N = 3)
m7g.16xlarge Graviton3: 32125448870 (SE +/- 17714077.14, N = 3)
Additional flag listed on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Xcompact3d Incompact3d Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 30.31 (SE +/- 0.28, N = 3)
c7gn.16xlarge Graviton3E: 13.76 (SE +/- 0.05, N = 3)
c7g.16xlarge Graviton3: 13.83 (SE +/- 0.09, N = 3)
c6g.16xlarge Graviton2: 25.88 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 13.95 (SE +/- 0.02, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 9.342 (SE +/- 0.002, N = 3)
c7gn.16xlarge Graviton3E: 4.429 (SE +/- 0.027, N = 3)
c7g.16xlarge Graviton3: 4.442 (SE +/- 0.021, N = 3)
c6g.16xlarge Graviton2: 6.051 (SE +/- 0.016, N = 3)
m7g.16xlarge Graviton3: 4.375 (SE +/- 0.011, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
Algebraic Multi-Grid Benchmark AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c6a.16xlarge AMD Zen 3: 836999300 (SE +/- 1055539.30, N = 3)
c7gn.16xlarge Graviton3E: 1765966333 (SE +/- 488508.39, N = 3)
c7g.16xlarge Graviton3: 1765277667 (SE +/- 192645.90, N = 3)
c6g.16xlarge Graviton2: 1035586333 (SE +/- 140169.34, N = 3)
m7g.16xlarge Graviton3: 1646761667 (SE +/- 103191.30, N = 3)
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 44.32 (SE +/- 0.14, N = 3)
c7gn.16xlarge Graviton3E: 88.46 (SE +/- 0.02, N = 3)
c7g.16xlarge Graviton3: 88.18 (SE +/- 0.05, N = 3)
c6g.16xlarge Graviton2: 42.83 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 88.05 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Stress-NG 0.15.10, Test: Fused Multiply-Add (Bogo Ops/s, More Is Better)
c6a.16xlarge AMD Zen 3: 30920910.92 (SE +/- 32747.05, N = 3)
c7gn.16xlarge Graviton3E: 63723431.55 (SE +/- 10061.51, N = 3)
c7g.16xlarge Graviton3: 63818458.61 (SE +/- 4431.60, N = 3)
c6g.16xlarge Graviton2: 37732190.54 (SE +/- 3687.67, N = 3)
m7g.16xlarge Graviton3: 63762252.76 (SE +/- 4870.19, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: OpenSSL 3.1, Algorithm: ChaCha20 (byte/s, More Is Better)
c6a.16xlarge AMD Zen 3: 138389378753 (SE +/- 36376378.52, N = 3)
c7gn.16xlarge Graviton3E: 114118119423 (SE +/- 771581.87, N = 3)
c7g.16xlarge Graviton3: 103275516997 (SE +/- 1725060.95, N = 3)
c6g.16xlarge Graviton2: 67292541203 (SE +/- 35952887.59, N = 3)
m7g.16xlarge Graviton3: 103226784517 (SE +/- 1293723.80, N = 3)
Additional flag listed on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Graph500 3.0, Scale: 26 (sssp max_TEPS, More Is Better)
c6a.16xlarge AMD Zen 3: 204550000
c7gn.16xlarge Graviton3E: 411762000
c7g.16xlarge Graviton3: 415758000
c6g.16xlarge Graviton2: 284689000
m7g.16xlarge Graviton3: 419754000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
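TEPS stands for traversed edges per second, and the "Scale" setting fixes the problem size. As a rough sketch of what Scale 26 implies, the snippet below assumes the Graph500 default edge factor of 16 (not stated on the chart) and treats every generated edge as traversed, which only approximates what the benchmark counts; the helper name is hypothetical.

```python
SCALE = 26
EDGE_FACTOR = 16  # assumption: Graph500 default, not stated on the chart

vertices = 2 ** SCALE            # number of vertices in the generated graph
edges = EDGE_FACTOR * vertices   # number of generated edges


def implied_seconds(teps, edge_count=edges):
    """Approximate time for one search if all generated edges are traversed."""
    return edge_count / teps


m7g_max_teps = 419754000  # m7g.16xlarge Graviton3, sssp max_TEPS from the chart
print(f"vertices: {vertices}")
print(f"edges:    {edges}")
print(f"approx. seconds per search at max TEPS: {implied_seconds(m7g_max_teps):.2f}")
```

This is only an order-of-magnitude reading: the benchmark counts edges within the component actually reached from each search root, so real per-search times will differ.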
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 42.44 (SE +/- 0.05, N = 3)
c7gn.16xlarge Graviton3E: 85.01 (SE +/- 0.02, N = 3)
c7g.16xlarge Graviton3: 84.75 (SE +/- 0.01, N = 3)
c6g.16xlarge Graviton2: 44.93 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 84.47 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 82.76 (SE +/- 0.08, N = 3)
c7gn.16xlarge Graviton3E: 163.56 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 163.28 (SE +/- 0.05, N = 3)
c6g.16xlarge Graviton2: 81.94 (SE +/- 0.03, N = 3)
m7g.16xlarge Graviton3: 162.96 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 20.87 (SE +/- 0.17, N = 3)
c7gn.16xlarge Graviton3E: 40.97 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 40.83 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 20.63 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 40.89 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: OpenSSL 3.1, Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better)
c6a.16xlarge AMD Zen 3: 92522999373 (SE +/- 232372675.93, N = 3)
c7gn.16xlarge Graviton3E: 79969465487 (SE +/- 1769561.47, N = 3)
c7g.16xlarge Graviton3: 74318842213 (SE +/- 1218886.42, N = 3)
c6g.16xlarge Graviton2: 46717636807 (SE +/- 1132293.08, N = 3)
m7g.16xlarge Graviton3: 74287460990 (SE +/- 1340503.89, N = 3)
Additional flag listed on the chart: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 23.52 (SE +/- 0.05, N = 3)
c7gn.16xlarge Graviton3E: 46.53 (SE +/- 0.01, N = 3)
c7g.16xlarge Graviton3: 46.37 (SE +/- 0.01, N = 3)
c6g.16xlarge Graviton2: 24.27 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 46.25 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 41.59 (SE +/- 0.17, N = 3)
c7gn.16xlarge Graviton3E: 78.17 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 77.77 (SE +/- 0.31, N = 3)
c6g.16xlarge Graviton2: 40.11 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 78.50 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3
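When several related results are compared, a geometric mean of the per-test ratios is a common way to summarize them without letting any single test dominate. A minimal sketch, using the c7g (Graviton3) and c6g (Graviton2) GFLOP/s figures from the five HeFFTe charts immediately above; the choice of these five tests and the variable names are illustrative assumptions, not part of the original result file.

```python
import statistics

# (c7g.16xlarge Graviton3, c6g.16xlarge Graviton2) GFLOP/s pairs
# copied from the five HeFFTe charts directly above.
heffte_results = {
    "r2c double 512": (84.75, 44.93),
    "r2c float 512": (163.28, 81.94),
    "c2c double 256": (40.83, 20.63),
    "c2c double 512": (46.37, 24.27),
    "r2c double 256": (77.77, 40.11),
}

# More Is Better, so the speedup is simply new / old.
ratios = [c7g / c6g for c7g, c6g in heffte_results.values()]
geo_mean = statistics.geometric_mean(ratios)

for name, ratio in zip(heffte_results, ratios):
    print(f"{name}: {ratio:.2f}x")
print(f"geometric mean: {geo_mean:.2f}x")
```

For "Fewer Is Better" charts the ratio would need to be inverted before being fed into the mean.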
nekRS nekRS is an open-source Navier-Stokes solver based on the spectral element method. nekRS supports both CPU and GPU/accelerator execution, though this test profile is currently configured for CPU execution. nekRS is part of Nek5000 from the Mathematics and Computer Science (MCS) division at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and may otherwise be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: nekRS 23.0, Input: TurboPipe Periodic (flops/rank, More Is Better)
c6a.16xlarge AMD Zen 3: 4337536667 (SE +/- 12801180.07, N = 3)
c7gn.16xlarge Graviton3E: 4141440000 (SE +/- 1394740.12, N = 3)
c7g.16xlarge Graviton3: 3978983333 (SE +/- 169148.19, N = 3)
c6g.16xlarge Graviton2: 2220190000 (SE +/- 144222.05, N = 3)
m7g.16xlarge Graviton3: 3976300000 (SE +/- 1199180.28, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, More Is Better)
c6a.16xlarge AMD Zen 3: 45946.81 (SE +/- 167.32, N = 3)
c7gn.16xlarge Graviton3E: 49860.68 (SE +/- 14.65, N = 3)
c7g.16xlarge Graviton3: 49742.30 (SE +/- 32.94, N = 3)
c6g.16xlarge Graviton2: 25671.29 (SE +/- 7.02, N = 3)
m7g.16xlarge Graviton3: 50126.29 (SE +/- 24.30, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 43.59 (SE +/- 0.42, N = 6)
c7gn.16xlarge Graviton3E: 81.17 (SE +/- 0.09, N = 3)
c7g.16xlarge Graviton3: 81.01 (SE +/- 0.07, N = 3)
c6g.16xlarge Graviton2: 41.98 (SE +/- 0.05, N = 3)
m7g.16xlarge Graviton3: 81.44 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 158.86 (SE +/- 1.94, N = 3)
c7gn.16xlarge Graviton3E: 300.40 (SE +/- 1.62, N = 3)
c7g.16xlarge Graviton3: 301.42 (SE +/- 0.56, N = 3)
c6g.16xlarge Graviton2: 209.50 (SE +/- 0.64, N = 3)
m7g.16xlarge Graviton3: 306.54 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O3
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: Rhodopsin Protein (ns/day, More Is Better)
c6a.16xlarge AMD Zen 3: 19.56 (SE +/- 0.26, N = 12)
c7gn.16xlarge Graviton3E: 37.48 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 37.41 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 25.95 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 37.56 (SE +/- 0.06, N = 3)
Additional flag listed on the chart: -lm
1. (CXX) g++ options: -O3 -ldl
Graph500 This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Graph500 3.0, Scale: 26 (sssp median_TEPS, More Is Better)
c6a.16xlarge AMD Zen 3: 157688000
c7gn.16xlarge Graviton3E: 296164000
c7g.16xlarge Graviton3: 293826000
c6g.16xlarge Graviton2: 209350000
m7g.16xlarge Graviton3: 299497000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 98.70 (SE +/- 1.25, N = 14)
c7gn.16xlarge Graviton3E: 184.11 (SE +/- 0.20, N = 3)
c7g.16xlarge Graviton3: 184.03 (SE +/- 0.47, N = 3)
c6g.16xlarge Graviton2: 135.36 (SE +/- 0.35, N = 3)
m7g.16xlarge Graviton3: 186.36 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -O3
LAMMPS Molecular Dynamics Simulator LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LAMMPS Molecular Dynamics Simulator 23Jun2022, Model: 20k Atoms (ns/day, More Is Better)
c6a.16xlarge AMD Zen 3: 20.34 (SE +/- 0.07, N = 3)
c7gn.16xlarge Graviton3E: 36.84 (SE +/- 0.02, N = 3)
c7g.16xlarge Graviton3: 36.86 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 25.17 (SE +/- 0.01, N = 3)
m7g.16xlarge Graviton3: 36.93 (SE +/- 0.03, N = 3)
Additional flag listed on the chart: -lm
1. (CXX) g++ options: -O3 -ldl
Pennant Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Pennant 1.0.1, Test: leblancbig (Hydro Cycle Time - Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 9.917565 (SE +/- 0.013289, N = 3)
c7gn.16xlarge Graviton3E: 6.839998 (SE +/- 0.000467, N = 3)
c7g.16xlarge Graviton3: 6.961345 (SE +/- 0.005468, N = 3)
c6g.16xlarge Graviton2: 12.176830 (SE +/- 0.018924, N = 3)
m7g.16xlarge Graviton3: 6.720537 (SE +/- 0.000869, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
NWChem NWChem is an open-source high performance computational chemistry package. Per NWChem's documentation, "NWChem aims to provide its users with computational chemistry tools that are scalable both in their ability to treat large scientific computational chemistry problems efficiently, and in their use of available parallel computing resources from high-performance parallel supercomputers to conventional workstation clusters." Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NWChem 7.0.2, Input: C240 Buckyball (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 3440.4
c7gn.16xlarge Graviton3E: 1914.0
c7g.16xlarge Graviton3: 1962.7
c6g.16xlarge Graviton2: 2976.9
m7g.16xlarge Graviton3: 1940.2
Additional flag listed on the chart: -m64
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
Pennant Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Pennant 1.0.1, Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 16.530500 (SE +/- 0.036687, N = 3)
c7gn.16xlarge Graviton3E: 9.340953 (SE +/- 0.003721, N = 3)
c7g.16xlarge Graviton3: 9.422270 (SE +/- 0.011497, N = 3)
c6g.16xlarge Graviton2: 16.480500 (SE +/- 0.018218, N = 3)
m7g.16xlarge Graviton3: 9.206490 (SE +/- 0.011347, N = 3)
1. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 102.65 (SE +/- 1.28, N = 3)
c7gn.16xlarge Graviton3E: 162.36 (SE +/- 0.04, N = 3)
c7g.16xlarge Graviton3: 162.01 (SE +/- 0.11, N = 3)
c6g.16xlarge Graviton2: 92.40 (SE +/- 0.19, N = 3)
m7g.16xlarge Graviton3: 164.87 (SE +/- 0.27, N = 3)
1. (CXX) g++ options: -O3
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c6a.16xlarge AMD Zen 3: 1710800000 (SE +/- 1014889.16, N = 3)
c7gn.16xlarge Graviton3E: 1442666667 (SE +/- 88191.71, N = 3)
c7g.16xlarge Graviton3: 1442366667 (SE +/- 284800.12, N = 3)
c6g.16xlarge Graviton2: 978200000 (SE +/- 11547.01, N = 3)
m7g.16xlarge Graviton3: 1442400000 (SE +/- 152752.52, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
GROMACS The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package, tested with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: GROMACS 2023, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
c6a.16xlarge AMD Zen 3: 3.965 (SE +/- 0.013, N = 3)
c7gn.16xlarge Graviton3E: 4.820 (SE +/- 0.003, N = 3)
c7g.16xlarge Graviton3: 4.200 (SE +/- 0.004, N = 3)
c6g.16xlarge Graviton2: 2.767 (SE +/- 0.002, N = 3)
m7g.16xlarge Graviton3: 4.223 (SE +/- 0.003, N = 3)
1. (CXX) g++ options: -O3
LULESH LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LULESH 2.0.3 (z/s, More Is Better)
c6a.16xlarge AMD Zen 3: 16708.26 (SE +/- 90.11, N = 3)
c7gn.16xlarge Graviton3E: 28736.23 (SE +/- 12.73, N = 3)
c7g.16xlarge Graviton3: 28708.66 (SE +/- 11.81, N = 3)
c6g.16xlarge Graviton2: 17557.49 (SE +/- 38.55, N = 3)
m7g.16xlarge Graviton3: 28296.38 (SE +/- 27.09, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: nginx 1.23.2, Connections: 500 (Requests Per Second, More Is Better)
c6a.16xlarge AMD Zen 3: 165847.75 (SE +/- 60.38, N = 3)
c7gn.16xlarge Graviton3E: 253518.51 (SE +/- 317.05, N = 3)
c7g.16xlarge Graviton3: 255145.52 (SE +/- 243.69, N = 3)
c6g.16xlarge Graviton2: 148964.69 (SE +/- 90.87, N = 3)
m7g.16xlarge Graviton3: 255768.44 (SE +/- 323.56, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better)
c6a.16xlarge AMD Zen 3: 86.37 (SE +/- 1.46, N = 12)
c7gn.16xlarge Graviton3E: 133.42 (SE +/- 0.04, N = 3)
c7g.16xlarge Graviton3: 133.51 (SE +/- 0.47, N = 3)
c6g.16xlarge Graviton2: 81.45 (SE +/- 0.61, N = 3)
m7g.16xlarge Graviton3: 138.01 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3
NAS Parallel Benchmarks NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, More Is Better)
c6a.16xlarge AMD Zen 3: 20210.00 (SE +/- 14.83, N = 3)
c7gn.16xlarge Graviton3E: 22155.36 (SE +/- 125.21, N = 3)
c7g.16xlarge Graviton3: 21911.02 (SE +/- 283.23, N = 3)
c6g.16xlarge Graviton2: 13103.62 (SE +/- 31.56, N = 3)
m7g.16xlarge Graviton3: 21988.99 (SE +/- 130.18, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
OpenBenchmarking.org: NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
c6a.16xlarge AMD Zen 3: 3061.42 (SE +/- 4.77, N = 3)
c7gn.16xlarge Graviton3E: 3657.67 (SE +/- 32.06, N = 15)
c7g.16xlarge Graviton3: 3664.54 (SE +/- 34.07, N = 15)
c6g.16xlarge Graviton2: 2216.26 (SE +/- 2.22, N = 3)
m7g.16xlarge Graviton3: 3738.98 (SE +/- 1.69, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: QMCPACK 3.16, Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 26.87 (SE +/- 0.08, N = 3)
c7gn.16xlarge Graviton3E: 28.00 (SE +/- 0.04, N = 3)
c7g.16xlarge Graviton3: 27.99 (SE +/- 0.02, N = 3)
c6g.16xlarge Graviton2: 45.23 (SE +/- 0.24, N = 3)
m7g.16xlarge Graviton3: 28.04 (SE +/- 0.03, N = 3)
Per-system flags, in the order listed on the chart: -march=native, -mcpu=native, -mcpu=native, -mcpu=native, -mcpu=native
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
srsRAN Project srsRAN Project is a complete ORAN-native 5G RAN solution created by Software Radio Systems (SRS). The srsRAN Project radio suite was formerly known as srsLTE and can be used for building your own software-defined radio (SDR) 4G/5G mobile network. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: srsRAN Project 23.5, Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
c6a.16xlarge AMD Zen 3: 6479.1 (SE +/- 21.76, N = 3)
c7gn.16xlarge Graviton3E: 5431.2 (SE +/- 3.32, N = 3)
c7g.16xlarge Graviton3: 5356.8 (SE +/- 1.80, N = 3)
c6g.16xlarge Graviton2: 3938.7 (SE +/- 2.53, N = 3)
m7g.16xlarge Graviton3: 5413.8 (SE +/- 4.08, N = 3)
Additional flags listed on the chart: -march=native -mfma
1. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -lgtest
GPAW GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: GPAW 23.6, Input: Carbon Nanotube (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 89.82 (SE +/- 0.13, N = 3)
c7gn.16xlarge Graviton3E: 56.44 (SE +/- 0.04, N = 3)
c7g.16xlarge Graviton3: 62.08 (SE +/- 0.04, N = 3)
c6g.16xlarge Graviton2: 92.76 (SE +/- 0.02, N = 3)
m7g.16xlarge Graviton3: 61.83 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: QMCPACK 3.16, Input: FeCO6_b3lyp_gms (Total Execution Time - Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 184.10 (SE +/- 1.03, N = 3)
c7gn.16xlarge Graviton3E: 188.28 (SE +/- 0.29, N = 3)
c7g.16xlarge Graviton3: 211.32 (SE +/- 0.19, N = 3)
c6g.16xlarge Graviton2: 302.19 (SE +/- 0.37, N = 3)
m7g.16xlarge Graviton3: 211.60 (SE +/- 0.22, N = 3)
Per-system flags, in the order listed on the chart: -march=native, -mcpu=native, -mcpu=native, -mcpu=native, -mcpu=native
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
Monte Carlo Simulations of Ionised Nebulae Mocassin is the Monte Carlo Simulations of Ionised Nebulae. MOCASSIN is a fully 3D or 2D photoionisation and dust radiative transfer code which employs a Monte Carlo approach to the transfer of radiation through media of arbitrary geometry and density distribution. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Monte Carlo Simulations of Ionised Nebulae 2.02.73.3, Input: Gas HII40 (Seconds, Fewer Is Better)
c6a.16xlarge AMD Zen 3: 12.67 (SE +/- 0.02, N = 3)
c7gn.16xlarge Graviton3E: 13.53 (SE +/- 0.03, N = 3)
c7g.16xlarge Graviton3: 13.66 (SE +/- 0.03, N = 3)
c6g.16xlarge Graviton2: 20.76 (SE +/- 0.17, N = 3)
m7g.16xlarge Graviton3: 13.58 (SE +/- 0.05, N = 3)
1. (F9X) gfortran options: -cpp -Jsource/ -ffree-line-length-0 -lm -std=legacy -O2 -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lz
BRL-CAD BRL-CAD is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: BRL-CAD 7.34, VGR Performance Metric (VGR Performance Metric, More Is Better)
c6a.16xlarge AMD Zen 3: 485038
c7gn.16xlarge Graviton3E: 744743
c7g.16xlarge Graviton3: 789066
c6g.16xlarge Graviton2: 533020
m7g.16xlarge Graviton3: 783777
Additional flag listed on the chart: -m64
1. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: LeelaChessZero 0.28, Backend: Eigen (Nodes Per Second, More Is Better)
c6a.16xlarge AMD Zen 3: 1152 (SE +/- 7.37, N = 3)
c7gn.16xlarge Graviton3E: 1444 (SE +/- 14.88, N = 3)
c7g.16xlarge Graviton3: 1382 (SE +/- 15.65, N = 3)
c6g.16xlarge Graviton2: 891 (SE +/- 4.73, N = 3)
m7g.16xlarge Graviton3: 1398 (SE +/- 8.74, N = 3)
1. (CXX) g++ options: -flto -pthread
nginx This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: nginx 1.23.2, Connections: 1000 (Requests Per Second, More Is Better)
c6a.16xlarge AMD Zen 3: 163178.67 (SE +/- 136.82, N = 3)
c7gn.16xlarge Graviton3E: 256585.83 (SE +/- 402.16, N = 3)
c7g.16xlarge Graviton3: 255552.05 (SE +/- 55.97, N = 3)
c6g.16xlarge Graviton2: 158676.40 (SE +/- 185.79, N = 3)
m7g.16xlarge Graviton3: 255616.04 (SE +/- 137.20, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Kripke Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures affect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org: Kripke 1.2.6 (Throughput FoM, More Is Better)
c6a.16xlarge AMD Zen 3: 237087650 (SE +/- 2932840.19, N = 4)
c7gn.16xlarge Graviton3E: 354234067 (SE +/- 445212.18, N = 3)
c7g.16xlarge Graviton3: 354442733 (SE +/- 525406.56, N = 3)
c6g.16xlarge Graviton2: 220120233 (SE +/- 102787.75, N = 3)
m7g.16xlarge Graviton3: 339000400 (SE +/- 619419.33, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -ldl
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16, Input: FeCO6_b3lyp_gms. Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 187.32 (SE +/- 2.30, N = 3; -march=native)
c7gn.16xlarge Graviton3E: 204.25 (SE +/- 0.21, N = 3; -mcpu=native)
c7g.16xlarge Graviton3: 204.77 (SE +/- 0.82, N = 3; -mcpu=native)
c6g.16xlarge Graviton2: 297.94 (SE +/- 1.75, N = 3; -mcpu=native)
m7g.16xlarge Graviton3: 205.72 (SE +/- 0.45, N = 3; -mcpu=native)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
Remhos Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.
Remhos 1.0, Test: Sample Remap Example. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 22.10 (SE +/- 0.11, N = 3)
c7gn.16xlarge Graviton3E: 14.08 (SE +/- 0.02, N = 3)
c7g.16xlarge Graviton3: 14.12 (SE +/- 0.04, N = 3)
c6g.16xlarge Graviton2: 20.74 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 14.04 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
Liquid-DSP 1.6, Threads: 32 - Buffer Length: 256 - Filter Length: 32. samples/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 1193966667 (SE +/- 578311.72, N = 3)
c7gn.16xlarge Graviton3E: 1136000000 (SE +/- 57735.03, N = 3)
c7g.16xlarge Graviton3: 1136133333 (SE +/- 33333.33, N = 3)
c6g.16xlarge Graviton2: 765466667 (SE +/- 456520.66, N = 3)
m7g.16xlarge Graviton3: 1136066667 (SE +/- 233333.33, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: Wide Vector Math. Bogo Ops/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 1380146.63 (SE +/- 2507.18, N = 3)
c7gn.16xlarge Graviton3E: 1530043.52 (SE +/- 16444.95, N = 15)
c7g.16xlarge Graviton3: 1535336.57 (SE +/- 16521.46, N = 15)
c6g.16xlarge Graviton2: 997272.65 (SE +/- 505.84, N = 3)
m7g.16xlarge Graviton3: 1542834.94 (SE +/- 16116.93, N = 15)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
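The "SE +/-" figures are easier to compare across instances when expressed as a percentage of the reported result. The sketch below does this for the Wide Vector Math numbers above, assuming SE denotes the standard error of the mean over N runs (so the sample standard deviation can be recovered as SE times the square root of N). The helper names are hypothetical.

```python
import math

# (result, standard error, run count) for Stress-NG 0.15.10 Wide Vector Math,
# copied from the result above.
wide_vector_math = {
    "c6a.16xlarge AMD Zen 3": (1380146.63, 2507.18, 3),
    "c7gn.16xlarge Graviton3E": (1530043.52, 16444.95, 15),
    "c7g.16xlarge Graviton3": (1535336.57, 16521.46, 15),
    "c6g.16xlarge Graviton2": (997272.65, 505.84, 3),
    "m7g.16xlarge Graviton3": (1542834.94, 16116.93, 15),
}


def relative_se_percent(mean, se):
    """Standard error as a percentage of the mean."""
    return 100.0 * se / mean


def implied_std_dev(se, n):
    """Sample standard deviation implied by SE = s / sqrt(N) (assumed definition)."""
    return se * math.sqrt(n)


for name, (mean, se, n) in wide_vector_math.items():
    print(
        f"{name}: SE is {relative_se_percent(mean, se):.2f}% of the mean, "
        f"implied std dev {implied_std_dev(se, n):.0f} over {n} runs"
    )
```

On these numbers the three Graviton3-class instances show a relative SE of roughly 1%, against well under 0.2% for the other two, which lines up with their higher run counts.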
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1, Test: Sedov Blast Wave, ube_922_hex.mesh. Major Kernels Total Rate, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 275.92 (SE +/- 0.48, N = 3)
c7gn.16xlarge Graviton3E: 423.11 (SE +/- 0.79, N = 3)
c7g.16xlarge Graviton3: 408.01 (SE +/- 0.89, N = 3)
c6g.16xlarge Graviton2: 322.37 (SE +/- 0.89, N = 3)
m7g.16xlarge Graviton3: 410.55 (SE +/- 0.42, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: Vector Math. Bogo Ops/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 221776.15 (SE +/- 100.78, N = 3)
c7gn.16xlarge Graviton3E: 217567.10 (SE +/- 27.00, N = 3)
c7g.16xlarge Graviton3: 217446.12 (SE +/- 20.95, N = 3)
c6g.16xlarge Graviton2: 147886.14 (SE +/- 37.96, N = 3)
m7g.16xlarge Graviton3: 217235.59 (SE +/- 47.94, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Liquid-DSP LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.
Liquid-DSP 1.6, Threads: 64 - Buffer Length: 256 - Filter Length: 32. samples/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 2184866667 (SE +/- 218581.28, N = 3)
c7gn.16xlarge Graviton3E: 2266833333 (SE +/- 2915666.50, N = 3)
c7g.16xlarge Graviton3: 2271966667 (SE +/- 284800.12, N = 3)
c6g.16xlarge Graviton2: 1531400000 (SE +/- 251661.15, N = 3)
m7g.16xlarge Graviton3: 2270500000 (SE +/- 435889.89, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Timed Godot Game Engine Compilation This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system, targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.
Timed Godot Game Engine Compilation 4.0, Time To Compile. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 147.74 (SE +/- 0.12, N = 3)
c7gn.16xlarge Graviton3E: 155.95 (SE +/- 0.45, N = 3)
c7g.16xlarge Graviton3: 156.69 (SE +/- 0.63, N = 3)
c6g.16xlarge Graviton2: 218.28 (SE +/- 0.30, N = 3)
m7g.16xlarge Graviton3: 154.38 (SE +/- 0.32, N = 3)
LeelaChessZero LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.
LeelaChessZero 0.28, Backend: BLAS. Nodes Per Second, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 1316 (SE +/- 13.29, N = 5)
c7gn.16xlarge Graviton3E: 1392 (SE +/- 7.22, N = 3)
c7g.16xlarge Graviton3: 1333 (SE +/- 3.53, N = 3)
c6g.16xlarge Graviton2: 947 (SE +/- 11.79, N = 3)
m7g.16xlarge Graviton3: 1301 (SE +/- 4.67, N = 3)
1. (CXX) g++ options: -flto -pthread
QMCPACK QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H2O example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.
QMCPACK 3.16, Input: Li2_STO_ae. Total Execution Time - Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 123.95 (SE +/- 0.13, N = 3; -march=native)
c7gn.16xlarge Graviton3E: 113.20 (SE +/- 0.31, N = 3; -mcpu=native)
c7g.16xlarge Graviton3: 112.64 (SE +/- 0.12, N = 3; -mcpu=native)
c6g.16xlarge Graviton2: 165.12 (SE +/- 1.13, N = 3; -mcpu=native)
m7g.16xlarge Graviton3: 112.61 (SE +/- 0.08, N = 3; -mcpu=native)
1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -O3 -lm -ldl
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenMP LavaMD. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 64.18 (SE +/- 0.53, N = 3)
c7gn.16xlarge Graviton3E: 44.04 (SE +/- 0.15, N = 3)
c7g.16xlarge Graviton3: 43.96 (SE +/- 0.11, N = 3)
c6g.16xlarge Graviton2: 62.22 (SE +/- 0.04, N = 3)
m7g.16xlarge Graviton3: 43.79 (SE +/- 0.15, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
7-Zip Compression 22.01, Test: Compression Rating. MIPS, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 230970 (SE +/- 670.46, N = 3)
c7gn.16xlarge Graviton3E: 312009 (SE +/- 308.14, N = 3)
c7g.16xlarge Graviton3: 311056 (SE +/- 72.90, N = 3)
c6g.16xlarge Graviton2: 240702 (SE +/- 209.44, N = 3)
m7g.16xlarge Graviton3: 316825 (SE +/- 154.72, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Laghos Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.
Laghos 3.1, Test: Triple Point Problem. Major Kernels Total Rate, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 227.40 (SE +/- 1.06, N = 3)
c7gn.16xlarge Graviton3E: 236.22 (SE +/- 0.27, N = 3)
c7g.16xlarge Graviton3: 230.68 (SE +/- 0.16, N = 3)
c6g.16xlarge Graviton2: 180.80 (SE +/- 0.48, N = 3)
m7g.16xlarge Graviton3: 232.01 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Coremark This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.
Coremark 1.0, CoreMark Size 666 - Iterations Per Second. Iterations/Sec, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 1466587.04 (SE +/- 6710.50, N = 3)
c7gn.16xlarge Graviton3E: 1611801.56 (SE +/- 14869.41, N = 7)
c7g.16xlarge Graviton3: 1605948.67 (SE +/- 13274.76, N = 15)
c6g.16xlarge Graviton2: 1260642.18 (SE +/- 153.60, N = 3)
m7g.16xlarge Graviton3: 1601880.34 (SE +/- 11449.37, N = 15)
1. (CC) gcc options: -O2 -lrt" -lrt
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
OpenSSL 3.1, Algorithm: SHA256. byte/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 45857534777 (SE +/- 26770675.21, N = 3; -m64)
c7gn.16xlarge Graviton3E: 54154218593 (SE +/- 19542665.92, N = 3)
c7g.16xlarge Graviton3: 54216561263 (SE +/- 16491036.11, N = 3)
c6g.16xlarge Graviton2: 42472798847 (SE +/- 245440310.03, N = 3)
m7g.16xlarge Graviton3: 54212515580 (SE +/- 18610524.10, N = 3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
Timed Gem5 Compilation This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.
Timed Gem5 Compilation 21.2, Time To Compile. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 192.12 (SE +/- 0.26, N = 3)
c7gn.16xlarge Graviton3E: 182.47 (SE +/- 0.38, N = 3)
c7g.16xlarge Graviton3: 181.78 (SE +/- 0.26, N = 3)
c6g.16xlarge Graviton2: 225.31 (SE +/- 0.35, N = 3)
m7g.16xlarge Graviton3: 180.25 (SE +/- 0.13, N = 3)
Timed Node.js Compilation This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.
Timed Node.js Compilation 19.8.1, Time To Compile. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 230.42 (SE +/- 0.40, N = 3)
c7gn.16xlarge Graviton3E: 238.64 (SE +/- 0.32, N = 3)
c7g.16xlarge Graviton3: 238.54 (SE +/- 0.20, N = 3)
c6g.16xlarge Graviton2: 287.81 (SE +/- 0.16, N = 3)
m7g.16xlarge Graviton3: 237.78 (SE +/- 0.33, N = 3)
7-Zip Compression This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.
7-Zip Compression 22.01, Test: Decompression Rating. MIPS, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 235787 (SE +/- 1190.65, N = 3)
c7gn.16xlarge Graviton3E: 285677 (SE +/- 54.90, N = 3)
c7g.16xlarge Graviton3: 285633 (SE +/- 146.43, N = 3)
c6g.16xlarge Graviton2: 234202 (SE +/- 15.43, N = 3)
m7g.16xlarge Graviton3: 285540 (SE +/- 93.51, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Stress-NG Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.
Stress-NG 0.15.10, Test: CPU Cache. Bogo Ops/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 1447265.35 (SE +/- 30785.49, N = 12)
c7gn.16xlarge Graviton3E: 3860335.38 (SE +/- 40698.46, N = 15)
c7g.16xlarge Graviton3: 3844101.98 (SE +/- 59376.56, N = 15)
c6g.16xlarge Graviton2: 1921785.20 (SE +/- 21905.72, N = 15)
m7g.16xlarge Graviton3: 3892396.34 (SE +/- 57217.78, N = 15)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stress-NG 0.15.10, Test: NUMA. Bogo Ops/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 552.68 (SE +/- 9.75, N = 15)
c7gn.16xlarge Graviton3E: 3525.17 (SE +/- 7.31, N = 3)
c7g.16xlarge Graviton3: 3523.58 (SE +/- 3.39, N = 3)
c6g.16xlarge Graviton2: 2112.66 (SE +/- 1.53, N = 3)
m7g.16xlarge Graviton3: 3759.10 (SE +/- 5.17, N = 3)
1. (CXX) g++ options: -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lpthread -lrt -lsctp -lz
Stockfish This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.
Stockfish 15, Total Time. Nodes Per Second, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 96905609 (SE +/- 1430593.84, N = 15; -m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2)
c7gn.16xlarge Graviton3E: 117027121 (SE +/- 1531345.46, N = 15)
c7g.16xlarge Graviton3: 117316476 (SE +/- 2998209.87, N = 12)
c6g.16xlarge Graviton2: 86609284 (SE +/- 2597495.37, N = 15)
m7g.16xlarge Graviton3: 112119711 (SE +/- 2854071.93, N = 15)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
HeFFTe - Highly Efficient FFT for Exascale HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently caters to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.
HeFFTe - Highly Efficient FFT for Exascale 2.3, Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128. GFLOP/s, More Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 48.94 (SE +/- 0.85, N = 15)
c7gn.16xlarge Graviton3E: 55.10 (SE +/- 0.32, N = 3)
c7g.16xlarge Graviton3: 55.11 (SE +/- 0.15, N = 3)
c6g.16xlarge Graviton2: 32.75 (SE +/- 0.08, N = 3)
m7g.16xlarge Graviton3: 57.15 (SE +/- 0.28, N = 3)
1. (CXX) g++ options: -O3
Rodinia Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.
Rodinia 3.1, Test: OpenMP Streamcluster. Seconds, Fewer Is Better (OpenBenchmarking.org)
c6a.16xlarge AMD Zen 3: 8.396 (SE +/- 0.101, N = 15)
c7gn.16xlarge Graviton3E: 10.690 (SE +/- 0.233, N = 12)
c7g.16xlarge Graviton3: 11.625 (SE +/- 0.099, N = 8)
c6g.16xlarge Graviton2: 13.735 (SE +/- 0.211, N = 15)
m7g.16xlarge Graviton3: 11.663 (SE +/- 0.138, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL
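Because the results above mix "More Is Better" and "Fewer Is Better" metrics, any combined comparison first has to bring every test to the same direction. The sketch below does that for three of the results reported above and summarizes them with a geometric mean. The selection of tests, the Graviton2 baseline, and the function name `geomean_vs_baseline` are arbitrary illustrative choices, so the output is not an official overall score.

```python
from statistics import geometric_mean

BASELINE = "c6g.16xlarge Graviton2"

# (test name, higher_is_better, {instance: result}), copied from the results above.
tests = [
    ("nginx 1.23.2, 1000 connections (requests/s)", True, {
        "c6a.16xlarge AMD Zen 3": 163178.67,
        "c7g.16xlarge Graviton3": 255552.05,
        "c6g.16xlarge Graviton2": 158676.40,
    }),
    ("Timed Gem5 Compilation 21.2 (seconds)", False, {
        "c6a.16xlarge AMD Zen 3": 192.12,
        "c7g.16xlarge Graviton3": 181.78,
        "c6g.16xlarge Graviton2": 225.31,
    }),
    ("Rodinia 3.1 OpenMP Streamcluster (seconds)", False, {
        "c6a.16xlarge AMD Zen 3": 8.396,
        "c7g.16xlarge Graviton3": 11.625,
        "c6g.16xlarge Graviton2": 13.735,
    }),
]


def geomean_vs_baseline(tests, instance, baseline=BASELINE):
    """Geometric mean of per-test speedups of `instance` over `baseline`."""
    ratios = []
    for _name, higher_is_better, results in tests:
        if higher_is_better:
            ratios.append(results[instance] / results[baseline])
        else:
            # For time-based metrics a smaller value is the better one.
            ratios.append(results[baseline] / results[instance])
    return geometric_mean(ratios)


for instance in ("c6a.16xlarge AMD Zen 3", "c7g.16xlarge Graviton3", BASELINE):
    print(f"{instance}: {geomean_vs_baseline(tests, instance):.3f}x")
```

A geometric mean is used here so that no single test with a large ratio dominates the summary; a different subset of tests would give different figures.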
m7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 m7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 256GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 22 June 2023 16:24 by user ubuntu.
c6g.16xlarge Graviton2 Processor: ARMv8 Neoverse-N1 (64 Cores), Motherboard: Amazon EC2 c6g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 June 2023 01:32 by user ubuntu.
c7g.16xlarge Graviton3 Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 c7g.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 23 June 2023 10:31 by user ubuntu.
c7gn.16xlarge Graviton3E Processor: ARMv8 Neoverse-V1 (64 Cores), Motherboard: Amazon EC2 c7gn.16xlarge (1.0 BIOS), Chipset: Amazon Device 0200, Memory: 128GB, Disk: 215GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (aarch64), Compiler: GCC 11.3.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
Python Notes: Python 3.10.6
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 10 July 2023 15:05 by user ubuntu.
c6a.16xlarge AMD Zen 3 Processor: AMD EPYC 7R13 (32 Cores / 64 Threads), Motherboard: Amazon EC2 c6a.16xlarge (1.0 BIOS), Chipset: Intel 440FX 82441FX PMC, Memory: 128GB, Disk: 322GB Amazon Elastic Block Store, Network: Amazon Elastic
OS: Ubuntu 22.04, Kernel: 5.19.0-1025-aws (x86_64), Vulkan: 1.3.238, Compiler: GCC 11.4.0, File-System: ext4, System Layer: amazon
Kernel Notes: Transparent Huge Pages: madvise
Compiler Notes: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Notes: CPU Microcode: 0xa0011cf
Python Notes: Python 3.10.12
Security Notes: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
Testing initiated at 11 August 2023 14:59 by user ubuntu.