Intel vs. Graviton2 Amazon EC2 Benchmarks

KVM testing on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2108196-TJ-2108197TJ41
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 4 Tests
CPU Massive 6 Tests
Fortran Tests 3 Tests
HPC - High Performance Computing 7 Tests
Molecular Dynamics 4 Tests
MPI Benchmarks 5 Tests
Multi-Core 9 Tests
OpenMPI Tests 6 Tests
Scientific Computing 4 Tests
Server CPU Tests 5 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
m6g.metal
August 19 2021
  1 Hour, 9 Minutes
m5.24xlarge
August 19 2021
  1 Hour, 40 Minutes
m6i.24xlarge
August 19 2021
  51 Minutes
Invert Hiding All Results Option
  1 Hour, 13 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel vs. Graviton2 Amazon EC2 BenchmarksProcessorMotherboardMemoryDiskNetworkChipsetOSKernelVulkanCompilerFile-SystemSystem Layerm6g.metalm5.24xlargem6i.24xlargeARMv8 Neoverse-N1 (64 Cores)Amazon EC2 m6g.metal v1.0252GB107GB Amazon Elastic Block StoreAmazon ElasticUbuntu 20.045.4.0-1045-aws (aarch64)1.0.2GCC 9.3.0ext42 x Intel Xeon Platinum 8259CL (48 Cores / 96 Threads)Amazon EC2 m5.24xlarge (1.0 BIOS)Intel 440FX 82441FX PMC374GB5.4.0-1045-aws (x86_64)KVM2 x Intel Xeon Platinum 8375C (48 Cores / 96 Threads)Amazon EC2 m6i.24xlarge (1.0 BIOS)372GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- m6g.metal: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m5.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6i.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Java Details- m6g.metal, m5.24xlarge: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)Security Details- m6g.metal: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - m5.24xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6i.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Processor Details- m5.24xlarge: CPU Microcode: 0x5003005- m6i.24xlarge: CPU Microcode: 0xd0002b1

m6g.metalm5.24xlargem6i.24xlargeLogarithmic Result OverviewPhoronix Test SuitePOV-RayNAS Parallel BenchmarksHigh Performance Conjugate GradientminiFEXcompact3d Incompact3dPennantStockfishm-queensFacebook RocksDBLULESHasmFishCoremarkN-QueensTNN

Intel vs. Graviton2 Amazon EC2 Benchmarkshpcg: npb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: MG.Cminife: Smallpennant: sedovbigpennant: leblancbigincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionlulesh: coremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthpovray: Trace Timem-queens: Time To Solven-queens: Elapsed Timetnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1rocksdb: Rand Readm6g.metalm5.24xlargem6i.24xlarge21.457024464.8213438.712218.082233.1421850.2825872.7723848.215.4130111.297265.1838054723.248034816867.3701236555.8037529665744910486848257.43919.4303.7613288.829365.839105.072341.31527033261426.8884104533.1530206.034777.134875.4750800.7465732.2214007.125.2223710.030264.7651316321.468287816272.5871451630.51904910565856111516018542.96422.3413.8923797.589426.09493.266394.50819457607437.2245136431.1133146.766426.326752.3870031.7188248.7319946.417.245136.4139283.4929897014.905858022519.1151607068.54333413679081613665690010.63116.0683.1443522.581350.37870.932357.721231109408OpenBenchmarking.org

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1m6g.metalm5.24xlargem6i.24xlarge918273645SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.01, N = 321.4626.8937.221. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1m6g.metalm5.24xlargem6i.24xlarge816243240Min: 21.45 / Avg: 21.46 / Max: 21.46Min: 26.69 / Avg: 26.89 / Max: 27.22Min: 37.22 / Avg: 37.22 / Max: 37.241. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cm6g.metalm5.24xlargem6i.24xlarge30K60K90K120K150KSE +/- 12.96, N = 3SE +/- 119.93, N = 3SE +/- 147.64, N = 324464.82104533.15136431.111. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cm6g.metalm5.24xlargem6i.24xlarge20K40K60K80K100KMin: 24440.31 / Avg: 24464.82 / Max: 24484.38Min: 104293.52 / Avg: 104533.15 / Max: 104662.12Min: 136147.74 / Avg: 136431.11 / Max: 136644.711. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm6g.metalm5.24xlargem6i.24xlarge7K14K21K28K35KSE +/- 27.23, N = 3SE +/- 40.34, N = 3SE +/- 54.87, N = 313438.7130206.0333146.761. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm6g.metalm5.24xlargem6i.24xlarge6K12K18K24K30KMin: 13398.53 / Avg: 13438.71 / Max: 13490.64Min: 30135.39 / Avg: 30206.03 / Max: 30275.11Min: 33043.37 / Avg: 33146.76 / Max: 33230.31. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cm6g.metalm5.24xlargem6i.24xlarge14002800420056007000SE +/- 9.83, N = 3SE +/- 188.76, N = 15SE +/- 82.22, N = 32218.084777.136426.321. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cm6g.metalm5.24xlargem6i.24xlarge11002200330044005500Min: 2199.81 / Avg: 2218.08 / Max: 2233.49Min: 2987.67 / Avg: 4777.13 / Max: 5214.38Min: 6318.99 / Avg: 6426.32 / Max: 6587.891. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm6g.metalm5.24xlargem6i.24xlarge14002800420056007000SE +/- 1.71, N = 3SE +/- 257.35, N = 12SE +/- 78.27, N = 152233.144875.476752.381. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm6g.metalm5.24xlargem6i.24xlarge12002400360048006000Min: 2229.78 / Avg: 2233.14 / Max: 2235.36Min: 2988 / Avg: 4875.47 / Max: 5437.97Min: 6446.65 / Avg: 6752.38 / Max: 7141.171. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cm6g.metalm5.24xlargem6i.24xlarge15K30K45K60K75KSE +/- 2.47, N = 3SE +/- 441.43, N = 15SE +/- 54.84, N = 321850.2850800.7470031.711. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cm6g.metalm5.24xlargem6i.24xlarge12K24K36K48K60KMin: 21846.38 / Avg: 21850.28 / Max: 21854.86Min: 47922.53 / Avg: 50800.74 / Max: 52912.65Min: 69923.35 / Avg: 70031.71 / Max: 70100.591. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm6g.metalm5.24xlargem6i.24xlarge20K40K60K80K100KSE +/- 37.90, N = 3SE +/- 462.23, N = 3SE +/- 6.50, N = 325872.7765732.2288248.731. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3
OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm6g.metalm5.24xlargem6i.24xlarge15K30K45K60K75KMin: 25797.71 / Avg: 25872.77 / Max: 25919.42Min: 64807.79 / Avg: 65732.22 / Max: 66201.01Min: 88235.75 / Avg: 88248.73 / Max: 88255.61. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

miniFE

MiniFE Finite Element is an application for unstructured implicit finite element codes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Smallm5.24xlargem6i.24xlargem6g.metal5K10K15K20K25KSE +/- 284.82, N = 15SE +/- 817.97, N = 15SE +/- 5.77, N = 314007.119946.423848.21. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Smallm5.24xlargem6i.24xlargem6g.metal4K8K12K16K20KMin: 12751 / Avg: 14007.1 / Max: 16376.2Min: 15625.4 / Avg: 19946.41 / Max: 26002.3Min: 23841.7 / Avg: 23848.2 / Max: 23859.71. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigm5.24xlargem6i.24xlargem6g.metal612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 325.2217.2515.411. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigm5.24xlargem6i.24xlargem6g.metal612182430Min: 25.18 / Avg: 25.22 / Max: 25.28Min: 17.23 / Avg: 17.25 / Max: 17.26Min: 15.4 / Avg: 15.41 / Max: 15.421. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigm6g.metalm5.24xlargem6i.24xlarge3691215SE +/- 0.003475, N = 3SE +/- 0.019405, N = 3SE +/- 0.016794, N = 311.29726010.0302606.4139281. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigm6g.metalm5.24xlargem6i.24xlarge3691215Min: 11.29 / Avg: 11.3 / Max: 11.3Min: 10 / Avg: 10.03 / Max: 10.07Min: 6.38 / Avg: 6.41 / Max: 6.441. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm6g.metalm5.24xlargem6i.24xlarge1.16642.33283.49924.66565.832SE +/- 0.00359720, N = 3SE +/- 0.01271715, N = 3SE +/- 0.02606307, N = 35.183805474.765131633.492989701. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm6g.metalm5.24xlargem6i.24xlarge246810Min: 5.18 / Avg: 5.18 / Max: 5.19Min: 4.74 / Avg: 4.77 / Max: 4.78Min: 3.46 / Avg: 3.49 / Max: 3.541. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm6g.metalm5.24xlargem6i.24xlarge612182430SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 323.2521.4714.911. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm6g.metalm5.24xlargem6i.24xlarge510152025Min: 23.21 / Avg: 23.25 / Max: 23.28Min: 21.31 / Avg: 21.47 / Max: 21.69Min: 14.79 / Avg: 14.91 / Max: 15.041. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3m5.24xlargem6g.metalm6i.24xlarge5K10K15K20K25KSE +/- 6.88, N = 3SE +/- 6.43, N = 3SE +/- 64.35, N = 316272.5916867.3722519.121. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3m5.24xlargem6g.metalm6i.24xlarge4K8K12K16K20KMin: 16262.09 / Avg: 16272.59 / Max: 16285.53Min: 16856.51 / Avg: 16867.37 / Max: 16878.76Min: 22399.85 / Avg: 22519.12 / Max: 22620.631. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm6g.metalm5.24xlargem6i.24xlarge300K600K900K1200K1500KSE +/- 279.74, N = 3SE +/- 5103.90, N = 3SE +/- 8144.03, N = 31236555.801451630.521607068.541. (CC) gcc options: -O2 -lrt" -lrt
OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm6g.metalm5.24xlargem6i.24xlarge300K600K900K1200K1500KMin: 1236237.2 / Avg: 1236555.8 / Max: 1237113.4Min: 1442578.61 / Avg: 1451630.52 / Max: 1460242.61Min: 1592501.97 / Avg: 1607068.54 / Max: 1620663.461. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timem6g.metalm5.24xlargem6i.24xlarge30M60M90M120M150MSE +/- 692846.14, N = 3SE +/- 1176206.14, N = 15SE +/- 185784.28, N = 396657449105658561136790816-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver
OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timem6g.metalm5.24xlargem6i.24xlarge20M40M60M80M100MMin: 95675673 / Avg: 96657448.67 / Max: 97995210Min: 100080401 / Avg: 105658560.6 / Max: 117406519Min: 136485231 / Avg: 136790816.33 / Max: 1371266681. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthm6g.metalm5.24xlargem6i.24xlarge30M60M90M120M150MSE +/- 1056350.97, N = 3SE +/- 806502.14, N = 12SE +/- 1425879.59, N = 3104868482115160185136656900
OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthm6g.metalm5.24xlargem6i.24xlarge20M40M60M80M100MMin: 102806769 / Avg: 104868481.67 / Max: 106298885Min: 111523183 / Avg: 115160184.58 / Max: 120419819Min: 133912717 / Avg: 136656899.67 / Max: 138700924

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timem6g.metalm5.24xlargem6i.24xlarge1326395265SE +/- 0.92, N = 15SE +/- 4.19, N = 15SE +/- 0.06, N = 357.4442.9610.63-march=native-march=native1. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system
OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timem6g.metalm5.24xlargem6i.24xlarge1122334455Min: 52.22 / Avg: 57.44 / Max: 63.6Min: 27.54 / Avg: 42.96 / Max: 86.67Min: 10.53 / Avg: 10.63 / Max: 10.741. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvem5.24xlargem6g.metalm6i.24xlarge510152025SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.19, N = 322.3419.4316.071. (CXX) g++ options: -fopenmp -O2 -march=native
OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvem5.24xlargem6g.metalm6i.24xlarge510152025Min: 22.12 / Avg: 22.34 / Max: 22.57Min: 19.41 / Avg: 19.43 / Max: 19.46Min: 15.87 / Avg: 16.07 / Max: 16.451. (CXX) g++ options: -fopenmp -O2 -march=native

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timem5.24xlargem6g.metalm6i.24xlarge0.87571.75142.62713.50284.3785SE +/- 0.028, N = 3SE +/- 0.001, N = 3SE +/- 0.041, N = 33.8923.7613.1441. (CC) gcc options: -static -fopenmp -O3 -march=native
OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timem5.24xlargem6g.metalm6i.24xlarge246810Min: 3.85 / Avg: 3.89 / Max: 3.95Min: 3.76 / Avg: 3.76 / Max: 3.76Min: 3.06 / Avg: 3.14 / Max: 3.191. (CC) gcc options: -static -fopenmp -O3 -march=native

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetm5.24xlargem6i.24xlargem6g.metal8001600240032004000SE +/- 2.50, N = 3SE +/- 3.59, N = 3SE +/- 4.12, N = 33797.593522.583288.83MIN: 3754.49 / MAX: 4081.73MIN: 3492.16 / MAX: 3621.39MIN: 3237.1 / MAX: 3327.591. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetm5.24xlargem6i.24xlargem6g.metal7001400210028003500Min: 3794.3 / Avg: 3797.59 / Max: 3802.5Min: 3517.19 / Avg: 3522.58 / Max: 3529.39Min: 3282.02 / Avg: 3288.83 / Max: 3296.261. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2m5.24xlargem6g.metalm6i.24xlarge90180270360450SE +/- 0.52, N = 3SE +/- 0.26, N = 3SE +/- 0.24, N = 3426.09365.84350.38MIN: 422.84 / MAX: 477.54MIN: 364.34 / MAX: 367.32MIN: 348.46 / MAX: 395.911. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2m5.24xlargem6g.metalm6i.24xlarge80160240320400Min: 425.15 / Avg: 426.09 / Max: 426.95Min: 365.34 / Avg: 365.84 / Max: 366.19Min: 349.94 / Avg: 350.38 / Max: 350.761. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2m6g.metalm5.24xlargem6i.24xlarge20406080100SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.53, N = 3105.0793.2770.93MIN: 104.55 / MAX: 105.78MIN: 93.03 / MAX: 93.7MIN: 70.08 / MAX: 72.981. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2m6g.metalm5.24xlargem6i.24xlarge20406080100Min: 104.92 / Avg: 105.07 / Max: 105.34Min: 93.24 / Avg: 93.27 / Max: 93.28Min: 70.37 / Avg: 70.93 / Max: 71.991. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1m5.24xlargem6i.24xlargem6g.metal90180270360450SE +/- 0.14, N = 3SE +/- 0.23, N = 3SE +/- 0.83, N = 3394.51357.72341.32MIN: 393.65 / MAX: 397.65MIN: 356.98 / MAX: 361.71MIN: 338.67 / MAX: 344.111. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1m5.24xlargem6i.24xlargem6g.metal70140210280350Min: 394.27 / Avg: 394.51 / Max: 394.75Min: 357.38 / Avg: 357.72 / Max: 358.17Min: 340.42 / Avg: 341.32 / Max: 342.981. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random Readm5.24xlargem6i.24xlargem6g.metal60M120M180M240M300MSE +/- 128495.22, N = 3SE +/- 1027109.62, N = 3SE +/- 1242136.86, N = 31945760742311094082703326141. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread
OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random Readm5.24xlargem6i.24xlargem6g.metal50M100M150M200M250MMin: 194354482 / Avg: 194576073.67 / Max: 194799590Min: 229171479 / Avg: 231109408.33 / Max: 232668445Min: 268540515 / Avg: 270332613.67 / Max: 2727186291. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread