Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarks

Amazon EC2 benchmarking for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2112046-TJ-2108199TJ92&grr&sro.

m6g.metal
Processor: ARMv8 Neoverse-N1 (64 Cores)
Motherboard: Amazon EC2 m6g.metal v1.0
Memory: 252GB
Disk: 107GB Amazon Elastic Block Store
Network: Amazon Elastic
OS: Ubuntu 20.04
Kernel: 5.4.0-1045-aws (aarch64)
Vulkan: 1.0.2
Compiler: GCC 9.3.0
File-System: ext4

m5.24xlarge
Processor: 2 x Intel Xeon Platinum 8259CL (48 Cores / 96 Threads)
Motherboard: Amazon EC2 m5.24xlarge (1.0 BIOS)
Chipset: Intel 440FX 82441FX PMC
Memory: 374GB
Kernel: 5.4.0-1045-aws (x86_64)
System Layer: KVM

m6i.24xlarge
Processor: 2 x Intel Xeon Platinum 8375C (48 Cores / 96 Threads)
Motherboard: Amazon EC2 m6i.24xlarge (1.0 BIOS)
Memory: 372GB

m6i.32xlarge
Processor: 2 x Intel Xeon Platinum 8375C (64 Cores / 128 Threads)
Motherboard: Amazon EC2 m6i.32xlarge (1.0 BIOS)
Memory: 496GB

m6a.24xlarge
Processor: AMD EPYC 7R13 (48 Cores / 96 Threads)
Motherboard: Amazon EC2 m6a.24xlarge (1.0 BIOS)
Memory: 370GB
Kernel: 5.11.0-1020-aws (x86_64)

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- m6g.metal: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- m5.24xlarge, m6i.24xlarge, m6i.32xlarge, m6a.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Java Details
- m6g.metal, m5.24xlarge: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)

Security Details
- m6g.metal: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- m5.24xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected
- m6i.24xlarge, m6i.32xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
- m6a.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- m5.24xlarge: CPU Microcode: 0x5003005
- m6i.24xlarge: CPU Microcode: 0xd0002b1
- m6i.32xlarge: CPU Microcode: 0xd0002b1
- m6a.24xlarge: CPU Microcode: 0xa001143

Test                                                 m6g.metal       m5.24xlarge     m6i.24xlarge    m6i.32xlarge    m6a.24xlarge
tnn: CPU - DenseNet                                  3288.829        3797.589        3522.581        3524.690        2866.455
asmfish: 1024 Hash Memory, 26 Depth                  104868482       115160185       136656900       169329043       130814939
hpcg                                                 21.4570         26.8884         37.2245         39.1328         17.0281
povray: Trace Time                                   57.439          42.964          10.631          9.015           10.726
minife: Small                                        23848.2         14007.1         19946.4         18797.6         11197.8
npb: EP.D                                            2233.14         4875.47         6752.38         8765.66         2780.04
rocksdb: Rand Read                                   270332614       194576074       231109408       298073130       269589717
stockfish: Total Time                                96657449        105658561       136790816       169762583       127232305
npb: BT.C                                            24464.82        104533.15       136431.11       202455.31       92962.40
coremark: CoreMark Size 666 - Iterations Per Second  1236555.803752  1451630.519049  1607068.543334  2128843.094210  1850380.040001
tnn: CPU - MobileNet v2                              365.839         426.094         350.378         349.167         312.780
tnn: CPU - SqueezeNet v1.1                           341.315         394.508         357.721         357.689         282.809
incompact3d: input.i3d 193 Cells Per Direction       23.2480348      21.4682878      14.9058580      12.5943028      24.9592756
pennant: sedovbig                                    15.41301        25.22237        17.24513        15.14226        12.27951
m-queens: Time To Solve                              19.430          22.341          16.068          12.334          12.334
npb: FT.C                                            21850.28        50800.74        70031.71        102661.18       54008.74
lulesh                                               16867.370       16272.587       22519.115       35739.821       16745.120
pennant: leblancbig                                  11.29726        10.03026        6.413928        5.105541        7.508991
npb: CG.C                                            13438.71        30206.03        33146.76        38736.50        23714.82
tnn: CPU - SqueezeNet v2                             105.072         93.266          70.932          70.518          78.819
npb: EP.C                                            2218.08         4777.13         6426.32         8107.02         2759.76
n-queens: Elapsed Time                               3.761           3.892           3.144           2.312           2.382
incompact3d: input.i3d 129 Cells Per Direction       5.18380547      4.76513163      3.49298970      2.95411030      5.42248488
npb: MG.C                                            25872.77        65732.22        88248.73        117771.92       49638.45
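The results mix "more is better" and "fewer is better" metrics, so raw numbers cannot be compared across tests directly. A minimal Python sketch of one common way to normalize them follows: it converts each result to a speedup ratio and summarizes with a geometric mean. The helper names `speedup` and `geometric_mean`, the choice of four tests, and the choice of m6g.metal as baseline are illustrative assumptions, not something this export prescribes. The values are copied from the results in this document.

```python
from math import prod


def speedup(baseline, candidate, higher_is_better):
    """Return how many times better the candidate is than the baseline."""
    if higher_is_better:
        return candidate / baseline
    # For time-like metrics a smaller value is better, so invert the ratio.
    return baseline / candidate


def geometric_mean(values):
    """Geometric mean, the usual way to average ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


# (test, higher_is_better, m6g.metal, m6i.24xlarge)
# An arbitrary subset of the tests, chosen for illustration only.
SAMPLE = [
    ("hpcg", True, 21.4570, 37.2245),
    ("povray: Trace Time", False, 57.439, 10.631),
    ("minife: Small", True, 23848.2, 19946.4),
    ("tnn: CPU - DenseNet", False, 3288.829, 3522.581),
]

ratios = {name: speedup(base, cand, hib) for name, hib, base, cand in SAMPLE}

if __name__ == "__main__":
    for name, ratio in ratios.items():
        print(f"{name}: {ratio:.2f}x")
    print(f"geometric mean: {geometric_mean(ratios.values()):.2f}x")
```

A ratio above 1 means m6i.24xlarge did better than m6g.metal on that test, and below 1 means it did worse. A summary over only four tests says little about the full set.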

TNN

Target: CPU - Model: DenseNet

TNN 0.3 (ms, Fewer Is Better)
m5.24xlarge: 3797.59 (SE +/- 2.50, N = 3; MIN: 3754.49 / MAX: 4081.73)
m6a.24xlarge: 2866.46 (SE +/- 1.59, N = 3; MIN: 2825.11 / MAX: 2989.74)
m6g.metal: 3288.83 (SE +/- 4.12, N = 3; MIN: 3237.1 / MAX: 3327.59)
m6i.24xlarge: 3522.58 (SE +/- 3.59, N = 3; MIN: 3492.16 / MAX: 3621.39)
m6i.32xlarge: 3524.69 (SE +/- 0.73, N = 3; MIN: 3482.31 / MAX: 3882.56)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl
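Each result is reported with a standard error (SE) and a run count (N), which helps judge whether two instances really differ. The Python sketch below shows one rough check: treat two means as distinguishable only when their gap exceeds a multiple of the combined standard error. The function name `differs_beyond_noise` and the threshold `k = 2.0` are assumptions made for illustration. With N = 3 runs the estimate is coarse, so this is a sanity check rather than a formal significance test.

```python
from math import sqrt


def differs_beyond_noise(mean_a, se_a, mean_b, se_b, k=2.0):
    """Return True when |mean_a - mean_b| exceeds k combined standard errors.

    k = 2.0 is a rule-of-thumb threshold (an assumption, not taken from the
    result file).
    """
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) > k * combined_se


if __name__ == "__main__":
    # TNN DenseNet (ms): m6i.24xlarge vs. m6i.32xlarge
    print(differs_beyond_noise(3522.58, 3.59, 3524.69, 0.73))
    # TNN DenseNet (ms): m6a.24xlarge vs. m6g.metal
    print(differs_beyond_noise(2866.46, 1.59, 3288.83, 4.12))
```

Under this check the two m6i sizes are indistinguishable on DenseNet, while the gap between m6a.24xlarge and m6g.metal is far larger than the run-to-run noise.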

asmFish

1024 Hash Memory, 26 Depth

asmFish 2018-07-23 (Nodes/second, More Is Better)
m5.24xlarge: 115160185 (SE +/- 806502.14, N = 12)
m6a.24xlarge: 130814939 (SE +/- 280698.15, N = 3)
m6g.metal: 104868482 (SE +/- 1056350.97, N = 3)
m6i.24xlarge: 136656900 (SE +/- 1425879.59, N = 3)
m6i.32xlarge: 169329043 (SE +/- 870392.07, N = 3)

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
m5.24xlarge: 26.89 (SE +/- 0.17, N = 3)
m6a.24xlarge: 17.03 (SE +/- 0.01, N = 3)
m6g.metal: 21.46 (SE +/- 0.01, N = 3)
m6i.24xlarge: 37.22 (SE +/- 0.01, N = 3)
m6i.32xlarge: 39.13 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

POV-Ray

Trace Time

POV-Ray 3.7.0.7 (Seconds, Fewer Is Better)
m5.24xlarge: 42.964 (SE +/- 4.195, N = 15; -march=native -lSM -lICE -lX11)
m6a.24xlarge: 10.726 (SE +/- 0.016, N = 3; -march=native)
m6g.metal: 57.439 (SE +/- 0.915, N = 15; -lSM -lICE -lX11)
m6i.24xlarge: 10.631 (SE +/- 0.059, N = 3; -march=native -lSM -lICE -lX11)
m6i.32xlarge: 9.015 (SE +/- 0.105, N = 3; -march=native)
1. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

miniFE

Problem Size: Small

miniFE 2.2 (CG Mflops, More Is Better)
m5.24xlarge: 14007.1 (SE +/- 284.82, N = 15)
m6a.24xlarge: 11197.8 (SE +/- 149.74, N = 15)
m6g.metal: 23848.2 (SE +/- 5.77, N = 3)
m6i.24xlarge: 19946.4 (SE +/- 817.97, N = 15)
m6i.32xlarge: 18797.6 (SE +/- 590.35, N = 15)
1. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 4875.47 (SE +/- 257.35, N = 12)
m6a.24xlarge: 2780.04 (SE +/- 4.54, N = 3)
m6g.metal: 2233.14 (SE +/- 1.71, N = 3)
m6i.24xlarge: 6752.38 (SE +/- 78.27, N = 15)
m6i.32xlarge: 8765.66 (SE +/- 77.50, N = 15)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Facebook RocksDB

Test: Random Read

Facebook RocksDB 6.22.1 (Op/s, More Is Better)
m5.24xlarge: 194576074 (SE +/- 128495.22, N = 3)
m6a.24xlarge: 269589717 (SE +/- 749179.54, N = 3)
m6g.metal: 270332614 (SE +/- 1242136.86, N = 3)
m6i.24xlarge: 231109408 (SE +/- 1027109.62, N = 3)
m6i.32xlarge: 298073130 (SE +/- 2848830.44, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
m5.24xlarge: 105658561 (SE +/- 1176206.14, N = 15; -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -msse4.1 -mssse3 -msse2 -mbmi2)
m6a.24xlarge: 127232305 (SE +/- 1261758.38, N = 6; -m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2)
m6g.metal: 96657449 (SE +/- 692846.14, N = 3)
m6i.24xlarge: 136790816 (SE +/- 185784.28, N = 3; -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2)
m6i.32xlarge: 169762583 (SE +/- 554216.71, N = 3; -m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

NAS Parallel Benchmarks

Test / Class: BT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 104533.15 (SE +/- 119.93, N = 3)
m6a.24xlarge: 92962.40 (SE +/- 140.47, N = 3)
m6g.metal: 24464.82 (SE +/- 12.96, N = 3)
m6i.24xlarge: 136431.11 (SE +/- 147.64, N = 3)
m6i.32xlarge: 202455.31 (SE +/- 156.09, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
m5.24xlarge: 1451630.52 (SE +/- 5103.90, N = 3)
m6a.24xlarge: 1850380.04 (SE +/- 16150.98, N = 3)
m6g.metal: 1236555.80 (SE +/- 279.74, N = 3)
m6i.24xlarge: 1607068.54 (SE +/- 8144.03, N = 3)
m6i.32xlarge: 2128843.09 (SE +/- 2056.46, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

TNN

Target: CPU - Model: MobileNet v2

TNN 0.3 (ms, Fewer Is Better)
m5.24xlarge: 426.09 (SE +/- 0.52, N = 3; MIN: 422.84 / MAX: 477.54)
m6a.24xlarge: 312.78 (SE +/- 1.81, N = 3; MIN: 309.33 / MAX: 382.03)
m6g.metal: 365.84 (SE +/- 0.26, N = 3; MIN: 364.34 / MAX: 367.32)
m6i.24xlarge: 350.38 (SE +/- 0.24, N = 3; MIN: 348.46 / MAX: 395.91)
m6i.32xlarge: 349.17 (SE +/- 0.19, N = 3; MIN: 346.78 / MAX: 378.54)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.3 (ms, Fewer Is Better)
m5.24xlarge: 394.51 (SE +/- 0.14, N = 3; MIN: 393.65 / MAX: 397.65)
m6a.24xlarge: 282.81 (SE +/- 0.32, N = 3; MIN: 281.46 / MAX: 315.41)
m6g.metal: 341.32 (SE +/- 0.83, N = 3; MIN: 338.67 / MAX: 344.11)
m6i.24xlarge: 357.72 (SE +/- 0.23, N = 3; MIN: 356.98 / MAX: 361.71)
m6i.32xlarge: 357.69 (SE +/- 0.00, N = 3; MIN: 357.05 / MAX: 360.09)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
m5.24xlarge: 21.47 (SE +/- 0.11, N = 3)
m6a.24xlarge: 24.96 (SE +/- 0.25, N = 3)
m6g.metal: 23.25 (SE +/- 0.02, N = 3)
m6i.24xlarge: 14.91 (SE +/- 0.08, N = 3)
m6i.32xlarge: 12.59 (SE +/- 0.02, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Pennant

Test: sedovbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
m5.24xlarge: 25.22 (SE +/- 0.03, N = 3)
m6a.24xlarge: 12.28 (SE +/- 0.04, N = 3)
m6g.metal: 15.41 (SE +/- 0.00, N = 3)
m6i.24xlarge: 17.25 (SE +/- 0.01, N = 3)
m6i.32xlarge: 15.14 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

m-queens

Time To Solve

m-queens 1.2 (Seconds, Fewer Is Better)
m5.24xlarge: 22.34 (SE +/- 0.13, N = 3)
m6a.24xlarge: 12.33 (SE +/- 0.02, N = 3)
m6g.metal: 19.43 (SE +/- 0.01, N = 3)
m6i.24xlarge: 16.07 (SE +/- 0.19, N = 3)
m6i.32xlarge: 12.33 (SE +/- 0.14, N = 4)
1. (CXX) g++ options: -fopenmp -O2 -march=native

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 50800.74 (SE +/- 441.43, N = 15)
m6a.24xlarge: 54008.74 (SE +/- 84.37, N = 3)
m6g.metal: 21850.28 (SE +/- 2.47, N = 3)
m6i.24xlarge: 70031.71 (SE +/- 54.84, N = 3)
m6i.32xlarge: 102661.18 (SE +/- 1220.06, N = 4)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

LULESH

LULESH 2.0.3 (z/s, More Is Better)
m5.24xlarge: 16272.59 (SE +/- 6.88, N = 3)
m6a.24xlarge: 16745.12 (SE +/- 110.64, N = 3)
m6g.metal: 16867.37 (SE +/- 6.43, N = 3)
m6i.24xlarge: 22519.12 (SE +/- 64.35, N = 3)
m6i.32xlarge: 35739.82 (SE +/- 50.85, N = 3)
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

Pennant 1.0.1 (Hydro Cycle Time - Seconds, Fewer Is Better)
m5.24xlarge: 10.030260 (SE +/- 0.019405, N = 3)
m6a.24xlarge: 7.508991 (SE +/- 0.005942, N = 3)
m6g.metal: 11.297260 (SE +/- 0.003475, N = 3)
m6i.24xlarge: 6.413928 (SE +/- 0.016794, N = 3)
m6i.32xlarge: 5.105541 (SE +/- 0.014585, N = 3)
1. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 30206.03 (SE +/- 40.34, N = 3)
m6a.24xlarge: 23714.82 (SE +/- 167.20, N = 3)
m6g.metal: 13438.71 (SE +/- 27.23, N = 3)
m6i.24xlarge: 33146.76 (SE +/- 54.87, N = 3)
m6i.32xlarge: 38736.50 (SE +/- 259.41, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

TNN

Target: CPU - Model: SqueezeNet v2

TNN 0.3 (ms, Fewer Is Better)
m5.24xlarge: 93.27 (SE +/- 0.01, N = 3; MIN: 93.03 / MAX: 93.7)
m6a.24xlarge: 78.82 (SE +/- 0.09, N = 3; MIN: 78.48 / MAX: 84.22)
m6g.metal: 105.07 (SE +/- 0.14, N = 3; MIN: 104.55 / MAX: 105.78)
m6i.24xlarge: 70.93 (SE +/- 0.53, N = 3; MIN: 70.08 / MAX: 72.98)
m6i.32xlarge: 70.52 (SE +/- 0.10, N = 3; MIN: 70.09 / MAX: 71.46)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 4777.13 (SE +/- 188.76, N = 15)
m6a.24xlarge: 2759.76 (SE +/- 6.49, N = 3)
m6g.metal: 2218.08 (SE +/- 9.83, N = 3)
m6i.24xlarge: 6426.32 (SE +/- 82.22, N = 3)
m6i.32xlarge: 8107.02 (SE +/- 60.61, N = 11)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3

N-Queens

Elapsed Time

N-Queens 1.0 (Seconds, Fewer Is Better)
m5.24xlarge: 3.892 (SE +/- 0.028, N = 3)
m6a.24xlarge: 2.382 (SE +/- 0.007, N = 3)
m6g.metal: 3.761 (SE +/- 0.001, N = 3)
m6i.24xlarge: 3.144 (SE +/- 0.041, N = 3)
m6i.32xlarge: 2.312 (SE +/- 0.052, N = 15)
1. (CC) gcc options: -static -fopenmp -O3 -march=native

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11 (Seconds, Fewer Is Better)
m5.24xlarge: 4.76513163 (SE +/- 0.01271715, N = 3)
m6a.24xlarge: 5.42248488 (SE +/- 0.01191253, N = 3)
m6g.metal: 5.18380547 (SE +/- 0.00359720, N = 3)
m6i.24xlarge: 3.49298970 (SE +/- 0.02606307, N = 3)
m6i.32xlarge: 2.95411030 (SE +/- 0.01051693, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
m5.24xlarge: 65732.22 (SE +/- 462.23, N = 3)
m6a.24xlarge: 49638.45 (SE +/- 36.70, N = 3)
m6g.metal: 25872.77 (SE +/- 37.90, N = 3)
m6i.24xlarge: 88248.73 (SE +/- 6.50, N = 3)
m6i.32xlarge: 117771.92 (SE +/- 270.09, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. Open MPI 4.0.3


Phoronix Test Suite v10.8.4