Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarks

Amazon EC2 benchmarking for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2112046-TJ-2108199TJ92
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 4 Tests
CPU Massive 6 Tests
Fortran Tests 3 Tests
HPC - High Performance Computing 7 Tests
Molecular Dynamics 4 Tests
MPI Benchmarks 5 Tests
Multi-Core 9 Tests
OpenMPI Tests 6 Tests
Scientific Computing 4 Tests
Server CPU Tests 5 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
m6g.metal
August 19 2021
  1 Hour, 9 Minutes
m5.24xlarge
August 19 2021
  1 Hour, 40 Minutes
m6i.24xlarge
August 19 2021
  51 Minutes
m6i.32xlarge
August 19 2021
  50 Minutes
m6a.24xlarge
December 04 2021
  56 Minutes
Invert Hiding All Results Option
  1 Hour, 5 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarks - Phoronix Test Suite

Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarks

Amazon EC2 benchmarking for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2112046-TJ-2108199TJ92&sro&grt.

Intel M6i Ice Lake vs. Graviton2 Amazon EC2 BenchmarksProcessorMotherboardMemoryDiskNetworkChipsetOSKernelVulkanCompilerFile-SystemSystem Layerm6g.metalm5.24xlargem6i.24xlargem6i.32xlargem6a.24xlargeARMv8 Neoverse-N1 (64 Cores)Amazon EC2 m6g.metal v1.0252GB107GB Amazon Elastic Block StoreAmazon ElasticUbuntu 20.045.4.0-1045-aws (aarch64)1.0.2GCC 9.3.0ext42 x Intel Xeon Platinum 8259CL (48 Cores / 96 Threads)Amazon EC2 m5.24xlarge (1.0 BIOS)Intel 440FX 82441FX PMC374GB5.4.0-1045-aws (x86_64)KVM2 x Intel Xeon Platinum 8375C (48 Cores / 96 Threads)Amazon EC2 m6i.24xlarge (1.0 BIOS)372GB2 x Intel Xeon Platinum 8375C (64 Cores / 128 Threads)Amazon EC2 m6i.32xlarge (1.0 BIOS)496GBAMD EPYC 7R13 (48 Cores / 96 Threads)Amazon EC2 m6a.24xlarge (1.0 BIOS)370GB5.11.0-1020-aws (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- m6g.metal: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m5.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6i.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6i.32xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6a.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Java Details- m6g.metal, m5.24xlarge: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)Security Details- m6g.metal: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - m5.24xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6i.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6i.32xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6a.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Processor Details- m5.24xlarge: CPU Microcode: 0x5003005- m6i.24xlarge: CPU Microcode: 0xd0002b1- m6i.32xlarge: CPU Microcode: 0xd0002b1- m6a.24xlarge: CPU Microcode: 0xa001143

Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarksasmfish: 1024 Hash Memory, 26 Depthcoremark: CoreMark Size 666 - Iterations Per Secondrocksdb: Rand Readhpcg: lulesh: m-queens: Time To Solveminife: Smalln-queens: Elapsed Timenpb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: MG.Cpennant: sedovbigpennant: leblancbigpovray: Trace Timestockfish: Total Timetnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1incompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionm6g.metalm5.24xlargem6i.24xlargem6i.32xlargem6a.24xlarge1048684821236555.80375227033261421.457016867.37019.43023848.23.76124464.8213438.712218.082233.1421850.2825872.7715.4130111.2972657.439966574493288.829365.839105.072341.3155.1838054723.24803481151601851451630.51904919457607426.888416272.58722.34114007.13.892104533.1530206.034777.134875.4750800.7465732.2225.2223710.0302642.9641056585613797.589426.09493.266394.5084.7651316321.46828781366569001607068.54333423110940837.224522519.11516.06819946.43.144136431.1133146.766426.326752.3870031.7188248.7317.245136.41392810.6311367908163522.581350.37870.932357.7213.4929897014.90585801693290432128843.09421029807313039.132835739.82112.33418797.62.312202455.3138736.508107.028765.66102661.18117771.9215.142265.1055419.0151697625833524.690349.16770.518357.6892.9541103012.59430281308149391850380.04000126958971717.028116745.12012.33411197.82.38292962.4023714.822759.762780.0454008.7449638.4512.279517.50899110.7261272323052866.455312.78078.819282.8095.4224848824.9592756OpenBenchmarking.org

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge40M80M120M160M200MSE +/- 806502.14, N = 12SE +/- 280698.15, N = 3SE +/- 1056350.97, N = 3SE +/- 1425879.59, N = 3SE +/- 870392.07, N = 3115160185130814939104868482136656900169329043

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge500K1000K1500K2000K2500KSE +/- 5103.90, N = 3SE +/- 16150.98, N = 3SE +/- 279.74, N = 3SE +/- 8144.03, N = 3SE +/- 2056.46, N = 31451630.521850380.041236555.801607068.542128843.091. (CC) gcc options: -O2 -lrt" -lrt

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random Readm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge60M120M180M240M300MSE +/- 128495.22, N = 3SE +/- 749179.54, N = 3SE +/- 1242136.86, N = 3SE +/- 1027109.62, N = 3SE +/- 2848830.44, N = 31945760742695897172703326142311094082980731301. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1m5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge918273645SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 326.8917.0321.4637.2239.131. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3m5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge8K16K24K32K40KSE +/- 6.88, N = 3SE +/- 110.64, N = 3SE +/- 6.43, N = 3SE +/- 64.35, N = 3SE +/- 50.85, N = 316272.5916745.1216867.3722519.1235739.821. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvem5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge510152025SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 422.3412.3319.4316.0712.331. (CXX) g++ options: -fopenmp -O2 -march=native

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Smallm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge5K10K15K20K25KSE +/- 284.82, N = 15SE +/- 149.74, N = 15SE +/- 5.77, N = 3SE +/- 817.97, N = 15SE +/- 590.35, N = 1514007.111197.823848.219946.418797.61. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timem5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge0.87571.75142.62713.50284.3785SE +/- 0.028, N = 3SE +/- 0.007, N = 3SE +/- 0.001, N = 3SE +/- 0.041, N = 3SE +/- 0.052, N = 153.8922.3823.7613.1442.3121. (CC) gcc options: -static -fopenmp -O3 -march=native

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge40K80K120K160K200KSE +/- 119.93, N = 3SE +/- 140.47, N = 3SE +/- 12.96, N = 3SE +/- 147.64, N = 3SE +/- 156.09, N = 3104533.1592962.4024464.82136431.11202455.311. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge8K16K24K32K40KSE +/- 40.34, N = 3SE +/- 167.20, N = 3SE +/- 27.23, N = 3SE +/- 54.87, N = 3SE +/- 259.41, N = 330206.0323714.8213438.7133146.7638736.501. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge2K4K6K8K10KSE +/- 188.76, N = 15SE +/- 6.49, N = 3SE +/- 9.83, N = 3SE +/- 82.22, N = 3SE +/- 60.61, N = 114777.132759.762218.086426.328107.021. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge2K4K6K8K10KSE +/- 257.35, N = 12SE +/- 4.54, N = 3SE +/- 1.71, N = 3SE +/- 78.27, N = 15SE +/- 77.50, N = 154875.472780.042233.146752.388765.661. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge20K40K60K80K100KSE +/- 441.43, N = 15SE +/- 84.37, N = 3SE +/- 2.47, N = 3SE +/- 54.84, N = 3SE +/- 1220.06, N = 450800.7454008.7421850.2870031.71102661.181. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge30K60K90K120K150KSE +/- 462.23, N = 3SE +/- 36.70, N = 3SE +/- 37.90, N = 3SE +/- 6.50, N = 3SE +/- 270.09, N = 365732.2249638.4525872.7788248.73117771.921. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge612182430SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 325.2212.2815.4117.2515.141. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge3691215SE +/- 0.019405, N = 3SE +/- 0.005942, N = 3SE +/- 0.003475, N = 3SE +/- 0.016794, N = 3SE +/- 0.014585, N = 310.0302607.50899111.2972606.4139285.1055411. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timem5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge1326395265SE +/- 4.195, N = 15SE +/- 0.016, N = 3SE +/- 0.915, N = 15SE +/- 0.059, N = 3SE +/- 0.105, N = 342.96410.72657.43910.6319.015-march=native -lSM -lICE -lX11-march=native-lSM -lICE -lX11-march=native -lSM -lICE -lX11-march=native1. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timem5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge40M80M120M160M200MSE +/- 1176206.14, N = 15SE +/- 1261758.38, N = 6SE +/- 692846.14, N = 3SE +/- 185784.28, N = 3SE +/- 554216.71, N = 310565856112723230596657449136790816169762583-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge8001600240032004000SE +/- 2.50, N = 3SE +/- 1.59, N = 3SE +/- 4.12, N = 3SE +/- 3.59, N = 3SE +/- 0.73, N = 33797.592866.463288.833522.583524.69MIN: 3754.49 / MAX: 4081.73MIN: 2825.11 / MAX: 2989.74MIN: 3237.1 / MAX: 3327.59MIN: 3492.16 / MAX: 3621.39MIN: 3482.31 / MAX: 3882.561. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2m5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge90180270360450SE +/- 0.52, N = 3SE +/- 1.81, N = 3SE +/- 0.26, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3426.09312.78365.84350.38349.17MIN: 422.84 / MAX: 477.54MIN: 309.33 / MAX: 382.03MIN: 364.34 / MAX: 367.32MIN: 348.46 / MAX: 395.91MIN: 346.78 / MAX: 378.541. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2m5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge20406080100SE +/- 0.01, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 3SE +/- 0.53, N = 3SE +/- 0.10, N = 393.2778.82105.0770.9370.52MIN: 93.03 / MAX: 93.7MIN: 78.48 / MAX: 84.22MIN: 104.55 / MAX: 105.78MIN: 70.08 / MAX: 72.98MIN: 70.09 / MAX: 71.461. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1m5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge90180270360450SE +/- 0.14, N = 3SE +/- 0.32, N = 3SE +/- 0.83, N = 3SE +/- 0.23, N = 3SE +/- 0.00, N = 3394.51282.81341.32357.72357.69MIN: 393.65 / MAX: 397.65MIN: 281.46 / MAX: 315.41MIN: 338.67 / MAX: 344.11MIN: 356.98 / MAX: 361.71MIN: 357.05 / MAX: 360.091. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge1.22012.44023.66034.88046.1005SE +/- 0.01271715, N = 3SE +/- 0.01191253, N = 3SE +/- 0.00359720, N = 3SE +/- 0.02606307, N = 3SE +/- 0.01051693, N = 34.765131635.422484885.183805473.492989702.954110301. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm5.24xlargem6a.24xlargem6g.metalm6i.24xlargem6i.32xlarge612182430SE +/- 0.11, N = 3SE +/- 0.25, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 321.4724.9623.2514.9112.591. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi


Phoronix Test Suite v10.8.4