Benchmarks by Michael Larabel for a future article on Phoronix.com.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2308110-NE-2307106NE96
Amazon AWS Graviton3E vs. Graviton 2/3 benchmarks
Benchmarks by Michael Larabel for a future article on Phoronix.com.
,,"m7g.16xlarge Graviton3","c6g.16xlarge Graviton2","c7g.16xlarge Graviton3","c7gn.16xlarge Graviton3E","c6a.16xlarge AMD Zen 3"
Processor,,ARMv8 Neoverse-V1 (64 Cores),ARMv8 Neoverse-N1 (64 Cores),ARMv8 Neoverse-V1 (64 Cores),ARMv8 Neoverse-V1 (64 Cores),AMD EPYC 7R13 (32 Cores / 64 Threads)
Motherboard,,Amazon EC2 m7g.16xlarge (1.0 BIOS),Amazon EC2 c6g.16xlarge (1.0 BIOS),Amazon EC2 c7g.16xlarge (1.0 BIOS),Amazon EC2 c7gn.16xlarge (1.0 BIOS),Amazon EC2 c6a.16xlarge (1.0 BIOS)
Chipset,,Amazon Device 0200,Amazon Device 0200,Amazon Device 0200,Amazon Device 0200,Intel 440FX 82441FX PMC
Memory,,256GB,128GB,128GB,128GB,128GB
Disk,,215GB Amazon Elastic Block Store,215GB Amazon Elastic Block Store,215GB Amazon Elastic Block Store,215GB Amazon Elastic Block Store,322GB Amazon Elastic Block Store
Network,,Amazon Elastic,Amazon Elastic,Amazon Elastic,Amazon Elastic,Amazon Elastic
OS,,Ubuntu 22.04,Ubuntu 22.04,Ubuntu 22.04,Ubuntu 22.04,Ubuntu 22.04
Kernel,,5.19.0-1025-aws (aarch64),5.19.0-1025-aws (aarch64),5.19.0-1025-aws (aarch64),5.19.0-1025-aws (aarch64),5.19.0-1025-aws (x86_64)
Compiler,,GCC 11.3.0,GCC 11.3.0,GCC 11.3.0,GCC 11.3.0,GCC 11.4.0
File-System,,ext4,ext4,ext4,ext4,ext4
System Layer,,amazon,amazon,amazon,amazon,amazon
Vulkan,,,,,,1.3.238
,,"m7g.16xlarge Graviton3","c6g.16xlarge Graviton2","c7g.16xlarge Graviton3","c7gn.16xlarge Graviton3E","c6a.16xlarge AMD Zen 3"
"7-Zip Compression - Test: Compression Rating (MIPS)",HIB,316825,240702,311056,312009,230970
"7-Zip Compression - Test: Decompression Rating (MIPS)",HIB,285540,234202,285633,285677,235787
"ACES DGEMM - Sustained Floating-Point Rate (GFLOP/s)",HIB,24.362353,20.417952,24.140605,24.078529,9.388050
"Algebraic Multi-Grid Benchmark - (Figure Of Merit)",HIB,1646761667,1035586333,1765277667,1765966333,836999300
"BRL-CAD - VGR Performance Metric (VGR Performance Metric)",HIB,783777,533020,789066,744743,485038
"Coremark - CoreMark Size 666 - Iterations Per Second (Iterations/Sec)",HIB,1601880.342264,1260642.177024,1605948.674645,1611801.559265,1466587.036580
"GPAW - Input: Carbon Nanotube (sec)",LIB,61.831,92.760,62.083,56.440,89.818
"Graph500 - Scale: 26 (bfs median_TEPS)",HIB,1194320000,860432000,1177710000,1175640000,410571000
"Graph500 - Scale: 26 (bfs max_TEPS)",HIB,1227790000,874389000,1206990000,1207760000,417777000
"Graph500 - Scale: 26 (sssp median_TEPS)",HIB,299497000,209350000,293826000,296164000,157688000
"Graph500 - Scale: 26 (sssp max_TEPS)",HIB,419754000,284689000,415758000,411762000,204550000
"GROMACS - Implementation: MPI CPU - Input: water_GMX50_bare (Ns/Day)",HIB,4.223,2.767,4.200,4.820,3.965
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s)",HIB,186.356,135.358,184.026,184.110,98.7026
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s)",HIB,81.4442,41.9816,81.0096,81.1671,43.5907
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s)",HIB,88.0482,42.8284,88.1842,88.4551,44.3176
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 (GFLOP/s)",HIB,306.540,209.496,301.418,300.396,158.858
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s)",HIB,164.873,92.3996,162.010,162.361,102.652
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s)",HIB,162.956,81.9412,163.276,163.559,82.7584
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s)",HIB,57.1503,32.7468,55.1055,55.1038,48.9432
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s)",HIB,40.8923,20.6279,40.8283,40.9708,20.8719
"HeFFTe - Highly Efficient FFT for Exascale - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s)",HIB,46.2504,24.2658,46.3706,46.5300,23.5212
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s)",HIB,138.014,81.4498,133.514,133.422,86.3730
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s)",HIB,78.5049,40.1104,77.7685,78.1658,41.5868
"HeFFTe - Highly Efficient FFT for Exascale - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s)",HIB,84.4739,44.9297,84.7451,85.0060,42.4394
"Kripke - (Throughput FoM)",HIB,339000400,220120233,354442733,354234067,237087650
"Laghos - Test: Triple Point Problem (Major Kernels Rate)",HIB,232.01,180.80,230.68,236.22,227.40
"Laghos - Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Rate)",HIB,410.55,322.37,408.01,423.11,275.92
"LAMMPS Molecular Dynamics Simulator - Model: 20k Atoms (ns/day)",HIB,36.927,25.171,36.862,36.838,20.342
"LAMMPS Molecular Dynamics Simulator - Model: Rhodopsin Protein (ns/day)",HIB,37.558,25.950,37.412,37.482,19.563
"LeelaChessZero - Backend: BLAS (Nodes/s)",HIB,1301,947,1333,1392,1316
"LeelaChessZero - Backend: Eigen (Nodes/s)",HIB,1398,891,1382,1444,1152
"Liquid-DSP - Threads: 32 - Buffer Length: 256 - Filter Length: 32 (samples/s)",HIB,1136066667,765466667,1136133333,1136000000,1193966667
"Liquid-DSP - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s)",HIB,721493333,489270000,721386667,721380000,1444266667
"Liquid-DSP - Threads: 64 - Buffer Length: 256 - Filter Length: 32 (samples/s)",HIB,2270500000,1531400000,2271966667,2266833333,2184866667
"Liquid-DSP - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s)",HIB,1442400000,978200000,1442366667,1442666667,1710800000
"Liquid-DSP - Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s)",HIB,81396667,67486333,81412000,81394000,274803333
"Liquid-DSP - Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s)",HIB,162753333,134926667,162766667,162756667,460076667
"LULESH - (z/s)",HIB,28296.378,17557.485,28708.656,28736.226,16708.258
"Monte Carlo Simulations of Ionised Nebulae - Input: Gas HII40 (sec)",LIB,13.575,20.758,13.659,13.525,12.669
"Monte Carlo Simulations of Ionised Nebulae - Input: Dust 2D tau100.0 (sec)",LIB,82.669,145.374,82.822,82.974,194.435
"NAS Parallel Benchmarks - Test / Class: CG.C (Mop/s)",HIB,21988.99,13103.62,21911.02,22155.36,20210.00
"NAS Parallel Benchmarks - Test / Class: EP.D (Mop/s)",HIB,3738.98,2216.26,3664.54,3657.67,3061.42
"NAS Parallel Benchmarks - Test / Class: LU.C (Mop/s)",HIB,28341.68,18741.90,28375.71,28369.11,95221.40
"NAS Parallel Benchmarks - Test / Class: MG.C (Mop/s)",HIB,50126.29,25671.29,49742.30,49860.68,45946.81
"NAS Parallel Benchmarks - Test / Class: SP.C (Mop/s)",HIB,17244.85,9711.70,17219.95,17163.11,34025.35
"nekRS - Input: Kershaw (flops/rank)",HIB,3150680000,1760336667,3261853333,3302823333,4308810000
"nekRS - Input: TurboPipe Periodic (flops/rank)",HIB,3976300000,2220190000,3978983333,4141440000,4337536667
"nginx - Connections: 500 (Reqs/sec)",HIB,255768.44,148964.69,255145.52,253518.51,165847.75
"nginx - Connections: 1000 (Reqs/sec)",HIB,255616.04,158676.40,255552.05,256585.83,163178.67
"NWChem - Input: C240 Buckyball (sec)",LIB,1940.2,2976.9,1962.7,1914,3440.4
"OpenSSL - Algorithm: SHA256 (byte/s)",HIB,54212515580,42472798847,54216561263,54154218593,45857534777
"OpenSSL - Algorithm: SHA512 (byte/s)",HIB,32125448870,14393925490,32145914147,32126059040,15291283297
"OpenSSL - Algorithm: RSA4096 (sign/s)",HIB,10181.9,2624.3,10181.4,10183.3,8392.4
"OpenSSL - Algorithm: RSA4096 (verify/s)",HIB,713859.5,214040.9,713945.9,713754.8,548396.5
"OpenSSL - Algorithm: ChaCha20 (byte/s)",HIB,103226784517,67292541203,103275516997,114118119423,138389378753
"OpenSSL - Algorithm: AES-128-GCM (byte/s)",HIB,332033171900,158436163857,332064349843,411130469943,151449269317
"OpenSSL - Algorithm: AES-256-GCM (byte/s)",HIB,283333113630,129199593157,283373795737,351152465420,138457889450
"OpenSSL - Algorithm: ChaCha20-Poly1305 (byte/s)",HIB,74287460990,46717636807,74318842213,79969465487,92522999373
"Pennant - Test: sedovbig (Hydro Cycle Time - sec)",LIB,9.206490,16.48050,9.422270,9.340953,16.53050
"Pennant - Test: leblancbig (Hydro Cycle Time - sec)",LIB,6.720537,12.17683,6.961345,6.839998,9.917565
"QMCPACK - Input: Li2_STO_ae (Execution Time - sec)",LIB,112.61,165.12,112.64,113.20,123.95
"QMCPACK - Input: simple-H2O (Execution Time - sec)",LIB,28.041,45.225,27.990,27.999,26.867
"QMCPACK - Input: FeCO6_b3lyp_gms (Execution Time - sec)",LIB,211.60,302.19,211.32,188.28,184.10
"QMCPACK - Input: FeCO6_b3lyp_gms (Execution Time - sec)",LIB,205.72,297.94,204.77,204.25,187.32
"Remhos - Test: Sample Remap Example (sec)",LIB,14.040,20.740,14.120,14.082,22.104
"Rodinia - Test: OpenMP LavaMD (sec)",LIB,43.788,62.224,43.963,44.044,64.179
"Rodinia - Test: OpenMP CFD Solver (sec)",LIB,4.375,6.051,4.442,4.429,9.342
"Rodinia - Test: OpenMP Streamcluster (sec)",LIB,11.663,13.735,11.625,10.690,8.396
"srsRAN Project - Test: Downlink Processor Benchmark (Mbps)",HIB,318.5,197.2,319.7,323.2,691.3
"srsRAN Project - Test: PUSCH Processor Benchmark, Throughput Total (Mbps)",HIB,5413.8,3938.7,5356.8,5431.2,6479.1
"srsRAN Project - Test: PUSCH Processor Benchmark, Throughput Thread (Mbps)",HIB,95.8,63.8,95.7,97.4,215.9
"Stockfish - Total Time (Nodes/s)",HIB,112119711,86609284,117316476,117027121,96905609
"Stress-NG - Test: NUMA (Bogo Ops/s)",HIB,3759.10,2112.66,3523.58,3525.17,552.68
"Stress-NG - Test: CPU Cache (Bogo Ops/s)",HIB,3892396.34,1921785.20,3844101.98,3860335.38,1447265.35
"Stress-NG - Test: Matrix Math (Bogo Ops/s)",HIB,368750.67,284713.63,368671.39,369258.89,147576.41
"Stress-NG - Test: Vector Math (Bogo Ops/s)",HIB,217235.59,147886.14,217446.12,217567.10,221776.15
"Stress-NG - Test: Matrix 3D Math (Bogo Ops/s)",HIB,10403.93,5752.17,10813.59,10882.02,4571.96
"Stress-NG - Test: Memory Copying (Bogo Ops/s)",HIB,20484.24,11324.79,20478.67,20475.96,8080.43
"Stress-NG - Test: Vector Shuffle (Bogo Ops/s)",HIB,54143.40,35614.51,54472.07,54695.04,22255.84
"Stress-NG - Test: Wide Vector Math (Bogo Ops/s)",HIB,1542834.94,997272.65,1535336.57,1530043.52,1380146.63
"Stress-NG - Test: Fused Multiply-Add (Bogo Ops/s)",HIB,63762252.76,37732190.54,63818458.61,63723431.55,30920910.92
"Stress-NG - Test: Vector Floating Point (Bogo Ops/s)",HIB,76102.55,42850.82,76178.46,76911.74,96529.51
"Timed Gem5 Compilation - Time To Compile (sec)",LIB,180.247,225.305,181.779,182.471,192.118
"Timed Godot Game Engine Compilation - Time To Compile (sec)",LIB,154.378,218.276,156.687,155.951,147.737
"Timed Node.js Compilation - Time To Compile (sec)",LIB,237.783,287.814,238.543,238.636,230.423
"Xcompact3d Incompact3d - Input: input.i3d 129 Cells Per Direction (sec)",LIB,3.09871038,5.63720735,3.14447999,3.11489828,7.01975288
"Xcompact3d Incompact3d - Input: input.i3d 193 Cells Per Direction (sec)",LIB,13.9454180,25.8825658,13.8326693,13.7606726,30.3145288