Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarks

Amazon EC2 benchmarking for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2112046-TJ-2108199TJ92&sor&gru.

Intel M6i Ice Lake vs. Graviton2 Amazon EC2 BenchmarksProcessorMotherboardMemoryDiskNetworkChipsetOSKernelVulkanCompilerFile-SystemSystem Layerm6g.metalm5.24xlargem6i.24xlargem6i.32xlargem6a.24xlargeARMv8 Neoverse-N1 (64 Cores)Amazon EC2 m6g.metal v1.0252GB107GB Amazon Elastic Block StoreAmazon ElasticUbuntu 20.045.4.0-1045-aws (aarch64)1.0.2GCC 9.3.0ext42 x Intel Xeon Platinum 8259CL (48 Cores / 96 Threads)Amazon EC2 m5.24xlarge (1.0 BIOS)Intel 440FX 82441FX PMC374GB5.4.0-1045-aws (x86_64)KVM2 x Intel Xeon Platinum 8375C (48 Cores / 96 Threads)Amazon EC2 m6i.24xlarge (1.0 BIOS)372GB2 x Intel Xeon Platinum 8375C (64 Cores / 128 Threads)Amazon EC2 m6i.32xlarge (1.0 BIOS)496GBAMD EPYC 7R13 (48 Cores / 96 Threads)Amazon EC2 m6a.24xlarge (1.0 BIOS)370GB5.11.0-1020-aws (x86_64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- m6g.metal: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m5.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6i.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6i.32xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - m6a.24xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Java Details- m6g.metal, m5.24xlarge: OpenJDK Runtime Environment (build 11.0.11+9-Ubuntu-0ubuntu2.20.04)Security Details- m6g.metal: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - m5.24xlarge: itlb_multihit: KVM: Vulnerable + l1tf: Mitigation of PTE Inversion + mds: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6i.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6i.32xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected - m6a.24xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Processor Details- m5.24xlarge: CPU Microcode: 0x5003005- m6i.24xlarge: CPU Microcode: 0xd0002b1- m6i.32xlarge: CPU Microcode: 0xd0002b1- m6a.24xlarge: CPU Microcode: 0xa001143

Intel M6i Ice Lake vs. Graviton2 Amazon EC2 Benchmarksminife: Smallhpcg: coremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthrocksdb: Rand Readnpb: BT.Cnpb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: MG.Clulesh: pennant: sedovbigpennant: leblancbigtnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v2tnn: CPU - SqueezeNet v1.1incompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionpovray: Trace Timem-queens: Time To Solven-queens: Elapsed Timem6g.metalm5.24xlargem6i.24xlargem6i.32xlargem6a.24xlarge23848.221.45701236555.8037529665744910486848227033261424464.8213438.712218.082233.1421850.2825872.7716867.37015.4130111.297263288.829365.839105.072341.3155.1838054723.248034857.43919.4303.76114007.126.88841451630.519049105658561115160185194576074104533.1530206.034777.134875.4750800.7465732.2216272.58725.2223710.030263797.589426.09493.266394.5084.7651316321.468287842.96422.3413.89219946.437.22451607068.543334136790816136656900231109408136431.1133146.766426.326752.3870031.7188248.7322519.11517.245136.4139283522.581350.37870.932357.7213.4929897014.905858010.63116.0683.14418797.639.13282128843.094210169762583169329043298073130202455.3138736.508107.028765.66102661.18117771.9235739.82115.142265.1055413524.690349.16770.518357.6892.9541103012.59430289.01512.3342.31211197.817.02811850380.04000112723230513081493926958971792962.4023714.822759.762780.0454008.7449638.4516745.12012.279517.5089912866.455312.78078.819282.8095.4224848824.959275610.72612.3342.382OpenBenchmarking.org

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: Smallm6g.metalm6i.24xlargem6i.32xlargem5.24xlargem6a.24xlarge5K10K15K20K25KSE +/- 5.77, N = 3SE +/- 817.97, N = 15SE +/- 590.35, N = 15SE +/- 284.82, N = 15SE +/- 149.74, N = 1523848.219946.418797.614007.111197.81. (CXX) g++ options: -O3 -fopenmp -pthread -lmpi_cxx -lmpi

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1m6i.32xlargem6i.24xlargem5.24xlargem6g.metalm6a.24xlarge918273645SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 339.1337.2226.8921.4617.031. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondm6i.32xlargem6a.24xlargem6i.24xlargem5.24xlargem6g.metal500K1000K1500K2000K2500KSE +/- 2056.46, N = 3SE +/- 16150.98, N = 3SE +/- 8144.03, N = 3SE +/- 5103.90, N = 3SE +/- 279.74, N = 32128843.091850380.041607068.541451630.521236555.801. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timem6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal40M80M120M160M200MSE +/- 554216.71, N = 3SE +/- 185784.28, N = 3SE +/- 1261758.38, N = 6SE +/- 1176206.14, N = 15SE +/- 692846.14, N = 316976258313679081612723230510565856196657449-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthm6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal40M80M120M160M200MSE +/- 870392.07, N = 3SE +/- 1425879.59, N = 3SE +/- 280698.15, N = 3SE +/- 806502.14, N = 12SE +/- 1056350.97, N = 3169329043136656900130814939115160185104868482

Facebook RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 6.22.1Test: Random Readm6i.32xlargem6g.metalm6a.24xlargem6i.24xlargem5.24xlarge60M120M180M240M300MSE +/- 2848830.44, N = 3SE +/- 1242136.86, N = 3SE +/- 749179.54, N = 3SE +/- 1027109.62, N = 3SE +/- 128495.22, N = 32980731302703326142695897172311094081945760741. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cm6i.32xlargem6i.24xlargem5.24xlargem6a.24xlargem6g.metal40K80K120K160K200KSE +/- 156.09, N = 3SE +/- 147.64, N = 3SE +/- 119.93, N = 3SE +/- 140.47, N = 3SE +/- 12.96, N = 3202455.31136431.11104533.1592962.4024464.821. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cm6i.32xlargem6i.24xlargem5.24xlargem6a.24xlargem6g.metal8K16K24K32K40KSE +/- 259.41, N = 3SE +/- 54.87, N = 3SE +/- 40.34, N = 3SE +/- 167.20, N = 3SE +/- 27.23, N = 338736.5033146.7630206.0323714.8213438.711. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Cm6i.32xlargem6i.24xlargem5.24xlargem6a.24xlargem6g.metal2K4K6K8K10KSE +/- 60.61, N = 11SE +/- 82.22, N = 3SE +/- 188.76, N = 15SE +/- 6.49, N = 3SE +/- 9.83, N = 38107.026426.324777.132759.762218.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dm6i.32xlargem6i.24xlargem5.24xlargem6a.24xlargem6g.metal2K4K6K8K10KSE +/- 77.50, N = 15SE +/- 78.27, N = 15SE +/- 257.35, N = 12SE +/- 4.54, N = 3SE +/- 1.71, N = 38765.666752.384875.472780.042233.141. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cm6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal20K40K60K80K100KSE +/- 1220.06, N = 4SE +/- 54.84, N = 3SE +/- 84.37, N = 3SE +/- 441.43, N = 15SE +/- 2.47, N = 3102661.1870031.7154008.7450800.7421850.281. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cm6i.32xlargem6i.24xlargem5.24xlargem6a.24xlargem6g.metal30K60K90K120K150KSE +/- 270.09, N = 3SE +/- 6.50, N = 3SE +/- 462.23, N = 3SE +/- 36.70, N = 3SE +/- 37.90, N = 3117771.9288248.7365732.2249638.4525872.771. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 4.0.3

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3m6i.32xlargem6i.24xlargem6g.metalm6a.24xlargem5.24xlarge8K16K24K32K40KSE +/- 50.85, N = 3SE +/- 64.35, N = 3SE +/- 6.43, N = 3SE +/- 110.64, N = 3SE +/- 6.88, N = 335739.8222519.1216867.3716745.1216272.591. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigm6a.24xlargem6i.32xlargem6g.metalm6i.24xlargem5.24xlarge612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 312.2815.1415.4117.2525.221. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigm6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal3691215SE +/- 0.014585, N = 3SE +/- 0.016794, N = 3SE +/- 0.005942, N = 3SE +/- 0.019405, N = 3SE +/- 0.003475, N = 35.1055416.4139287.50899110.03026011.2972601. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

TNN

Target: CPU - Model: DenseNet

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNetm6a.24xlargem6g.metalm6i.24xlargem6i.32xlargem5.24xlarge8001600240032004000SE +/- 1.59, N = 3SE +/- 4.12, N = 3SE +/- 3.59, N = 3SE +/- 0.73, N = 3SE +/- 2.50, N = 32866.463288.833522.583524.693797.59MIN: 2825.11 / MAX: 2989.74MIN: 3237.1 / MAX: 3327.59MIN: 3492.16 / MAX: 3621.39MIN: 3482.31 / MAX: 3882.56MIN: 3754.49 / MAX: 4081.731. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v2m6a.24xlargem6i.32xlargem6i.24xlargem6g.metalm5.24xlarge90180270360450SE +/- 1.81, N = 3SE +/- 0.19, N = 3SE +/- 0.24, N = 3SE +/- 0.26, N = 3SE +/- 0.52, N = 3312.78349.17350.38365.84426.09MIN: 309.33 / MAX: 382.03MIN: 346.78 / MAX: 378.54MIN: 348.46 / MAX: 395.91MIN: 364.34 / MAX: 367.32MIN: 422.84 / MAX: 477.541. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v2m6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal20406080100SE +/- 0.10, N = 3SE +/- 0.53, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 370.5270.9378.8293.27105.07MIN: 70.09 / MAX: 71.46MIN: 70.08 / MAX: 72.98MIN: 78.48 / MAX: 84.22MIN: 93.03 / MAX: 93.7MIN: 104.55 / MAX: 105.781. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: SqueezeNet v1.1m6a.24xlargem6g.metalm6i.32xlargem6i.24xlargem5.24xlarge90180270360450SE +/- 0.32, N = 3SE +/- 0.83, N = 3SE +/- 0.00, N = 3SE +/- 0.23, N = 3SE +/- 0.14, N = 3282.81341.32357.69357.72394.51MIN: 281.46 / MAX: 315.41MIN: 338.67 / MAX: 344.11MIN: 357.05 / MAX: 360.09MIN: 356.98 / MAX: 361.71MIN: 393.65 / MAX: 397.651. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionm6i.32xlargem6i.24xlargem5.24xlargem6g.metalm6a.24xlarge1.22012.44023.66034.88046.1005SE +/- 0.01051693, N = 3SE +/- 0.02606307, N = 3SE +/- 0.01271715, N = 3SE +/- 0.00359720, N = 3SE +/- 0.01191253, N = 32.954110303.492989704.765131635.183805475.422484881. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionm6i.32xlargem6i.24xlargem5.24xlargem6g.metalm6a.24xlarge612182430SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.25, N = 312.5914.9121.4723.2524.961. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timem6i.32xlargem6i.24xlargem6a.24xlargem5.24xlargem6g.metal1326395265SE +/- 0.105, N = 3SE +/- 0.059, N = 3SE +/- 0.016, N = 3SE +/- 4.195, N = 15SE +/- 0.915, N = 159.01510.63110.72642.96457.439-march=native-march=native -lSM -lICE -lX11-march=native-march=native -lSM -lICE -lX11-lSM -lICE -lX111. (CXX) g++ options: -pipe -O3 -ffast-math -pthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvem6i.32xlargem6a.24xlargem6i.24xlargem6g.metalm5.24xlarge510152025SE +/- 0.14, N = 4SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 312.3312.3316.0719.4322.341. (CXX) g++ options: -fopenmp -O2 -march=native

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timem6i.32xlargem6a.24xlargem6i.24xlargem6g.metalm5.24xlarge0.87571.75142.62713.50284.3785SE +/- 0.052, N = 15SE +/- 0.007, N = 3SE +/- 0.041, N = 3SE +/- 0.001, N = 3SE +/- 0.028, N = 32.3122.3823.1443.7613.8921. (CC) gcc options: -static -fopenmp -O3 -march=native


Phoronix Test Suite v10.8.4