Amazon EC2 c7g.4xlarge AWS Graviton3

Graviton3 benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2205269-NE-2205259NE55&sor&grr.

Amazon EC2 c7g.4xlarge AWS Graviton3ProcessorMotherboardChipsetMemoryDiskNetworkOSKernelCompilerFile-SystemSystem LayerVulkanc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge XeonARMv8 Neoverse-V1 (16 Cores)Amazon EC2 c7g.4xlarge (1.0 BIOS)Amazon Device 020032GB193GB Amazon Elastic Block StoreAmazon ElasticUbuntu 22.045.15.0-1004-aws (aarch64)GCC 11.2.0ext4amazonARMv8 Neoverse-N1 (16 Cores)Amazon EC2 c6g.4xlarge (1.0 BIOS)Intel Xeon Platinum 8375C (8 Cores / 16 Threads)Amazon EC2 c6i.4xlarge (1.0 BIOS)Intel 440FX 82441FX PMC5.15.0-1004-aws (x86_64)1.2.204OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- c7g.4xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - c6g.4xlarge Graviton2: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - c6i.4xlarge Xeon: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Java Details- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)Python Details- Python 3.10.4Security Details- c7g.4xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - c6g.4xlarge Graviton2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - c6i.4xlarge Xeon: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected Processor Details- c6i.4xlarge Xeon: CPU Microcode: 0xd000331

Amazon EC2 c7g.4xlarge AWS Graviton3lczero: BLASbuild-llvm: Ninjabuild-nodejs: Time To Compilebuild-gem5: Time To Compilenpb: SP.Clczero: Eigenngspice: C7552npb: BT.Ctensorflow-lite: NASNet Mobileavifenc: 0securemark: SecureMark-TLSmrbayes: Primate Phylogeny Analysisnpb: EP.Donnx: GPT-2 - CPU - Standardonnx: bertsquad-12 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardnpb: LU.Cngspice: C2670rodinia: OpenMP LavaMDgpaw: Carbon Nanotubeopenssl: SHA256gromacs: MPI CPU - water_GMX50_bareasmfish: 1024 Hash Memory, 26 Depthavifenc: 2stress-ng: CPU Cachebuild2: Time To Compileapache: 500astcenc: Exhaustiveonnx: fcn-resnet101-11 - CPU - Standardtensorflow-lite: Mobilenet Quantonnx: super-resolution-10 - CPU - Standardhpcg: apache: 1000nginx: 100nginx: 500nginx: 200nginx: 1000apache: 200apache: 100m-queens: Time To Solvebuild-php: Time To Compilenpb: IS.Dc-ray: Total Time - 4K, 16 Rays Per Pixelsimdjson: PartialTweetsmt-dgemm: Sustained Floating-Point Ratesimdjson: DistinctUserIDtensorflow-lite: Inception V4tensorflow-lite: Inception ResNet V2tensorflow-lite: Mobilenet Floattensorflow-lite: SqueezeNetopenssl: RSA4096openssl: RSA4096simdjson: Kostyasimdjson: LargeRandwebp: Quality 100, Lossless, Highest Compressioncompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedrodinia: OpenMP Streamclusterpovray: Trace Timecompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedincompact3d: input.i3d 193 Cells Per Directionnpb: FT.Cstockfish: Total Timecompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedquantlib: phpbench: PHP Benchmark Suitebuild-imagemagick: Time To Compilestress-ng: CPU Stresssynthmark: VoiceMark_100stress-ng: IO_uringstress-ng: Memory Copyingstress-ng: Cryptostress-ng: Vector Mathstress-ng: Matrix Mathpybench: Total For Average Test Timescompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingdacapobench: Tradebeansbuild-apache: Time To Compilenpb: CG.Cwebp: Quality 100, Losslesscoremark: CoreMark Size 666 - Iterations Per Secondn-queens: Elapsed Timeliquid-dsp: 16 - 256 - 57astcenc: Thoroughrodinia: OpenMP CFD Solveravifenc: 6, Losslessnpb: MG.Cdacapobench: Tradesoapincompact3d: input.i3d 129 Cells Per Directionavifenc: 6dacapobench: H2webp: Quality 100, Highest Compressionamg: dacapobench: Jythonavifenc: 10, Losslesslulesh: lammps: Rhodopsin Proteintscp: AI Chess Performancec7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon1103544.929497.579391.1714467.191189191.28610339.5311591.9256.841183708251.397934.7279904076097730.41198.224143.334155.180137220459731.12832134123141.69864.31115.02073546.32139.3797381502.95281726.305872719.33345710.87346613.34352380.98346814.7573676.9567231.8866.82269.4831041.9038.5172.625.8538642.6941855.140051.32156.603257.94178460.42546.41.940.748.2083240.639.513.29637.8633050.341.229.125857011791.77276088914639.13508.52512.766648427.9045029.71675.635843015.786693.3223181.8155258.1780088.7411857305497824320326.9406571.9522.769405413.86055421.53638360666713.924810.47811.90813481.6135248.016714259.38529519.346125880733339405.76510940.93911.2911370094864682.981628.401488.8052356.16834255.2056449.1114985.4406.937120301384.753558.8869483223345133.89263.724215.666215.528107231840830.78126540482238.20537.19142.27750077.81159.2039281980.24207219.721846629.45307349.36310596.58308938.67308213.1350059.9746995.3575.22488.897372.7662.3231.514.7851231.5346793.945955.72500.873969.3553951.5660.61.190.4966.1472196.331.015.48451.0472051.634.641.02408356244.48216792452878.81742.444985540.3333404.94470.389770521.812903.0017924.1837753.8964084.0817415944571285434434.2013520.8631.082315464.33980023.13626289000016.522217.03516.5186720.68450611.573354713.046396412.24893265290056268.3116016.16277.9358723131397685.704604.620469.9409563.221466161.08113888.4010900.6204.994230549134.9241103.227944773137438136.77147.893281.389202.10670969939371.4522374620097.73517.40136.80191746.5769.63871393967.3934508.6603179830.96356302.84351672.92356829.93347345.4994458.2286545.5791.23164.337861.5792.5453.712.2305454.3041185.741179.71965.072983.93140964.42161.32.460.8641.8052666.133.823.51252.7842582.038.169.216997820423.57220819613440.62996.82533.082818629.73712527.16565.6901037943.373150.4910210.3440140.3039878.869974565366631292822.5279522.8221.122285378.84166118.8393731000007.262520.44617.52926298.81381517.868277212.98429218.27066136476740138.2658112.37156.2201272596OpenBenchmarking.org

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton230060090012001500SE +/- 12.41, N = 9SE +/- 6.44, N = 3SE +/- 10.22, N = 4139711038641. (CXX) g++ options: -flto -pthread

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninjac7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon150300450600750SE +/- 5.19, N = 3SE +/- 0.49, N = 3SE +/- 0.12, N = 3544.93682.98685.70

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 17.3Time To Compilec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2140280420560700SE +/- 2.06, N = 3SE +/- 0.42, N = 3SE +/- 0.37, N = 3497.58604.62628.40

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2110220330440550SE +/- 1.33, N = 3SE +/- 0.59, N = 3SE +/- 0.53, N = 3391.17469.94488.81

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton22K4K6K8K10KSE +/- 73.65, N = 3SE +/- 9.61, N = 3SE +/- 0.57, N = 39563.224467.192356.161. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigenc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton230060090012001500SE +/- 13.37, N = 3SE +/- 9.70, N = 3SE +/- 12.00, N = 3146611898341. (CXX) g++ options: -flto -pthread

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton260120180240300SE +/- 0.33, N = 3SE +/- 1.94, N = 3SE +/- 2.40, N = 7161.08191.29255.211. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton23K6K9K12K15KSE +/- 22.04, N = 3SE +/- 7.36, N = 3SE +/- 3.20, N = 313888.4010339.536449.111. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: NASNet Mobilec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton23K6K9K12K15KSE +/- 166.62, N = 14SE +/- 121.56, N = 15SE +/- 203.15, N = 1510900.611591.914985.4

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 0c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton290180270360450SE +/- 0.33, N = 3SE +/- 0.18, N = 3SE +/- 0.13, N = 3204.99256.84406.941. (CXX) g++ options: -O3 -fPIC -lm

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton250K100K150K200K250KSE +/- 864.34, N = 3SE +/- 773.26, N = 3SE +/- 23.07, N = 32305491837081203011. (CC) gcc options: -pedantic -O3

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton280160240320400SE +/- 1.43, N = 3SE +/- 0.24, N = 3SE +/- 0.11, N = 3134.92251.40384.75-mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm1. (CC) gcc options: -O3 -std=c99 -pedantic -lm

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton22004006008001000SE +/- 19.93, N = 9SE +/- 0.39, N = 3SE +/- 0.23, N = 31103.22934.72558.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: GPT-2 - Device: CPU - Executor: Standardc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton22K4K6K8K10KSE +/- 2.40, N = 3SE +/- 322.41, N = 12SE +/- 3.50, N = 37990794469481. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: Standardc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2170340510680850SE +/- 50.92, N = 12SE +/- 0.17, N = 3SE +/- 0.17, N = 37734073221. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: Standardc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton230060090012001500SE +/- 91.51, N = 12SE +/- 0.00, N = 3SE +/- 0.17, N = 313746093341. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton28K16K24K32K40KSE +/- 160.86, N = 3SE +/- 1.96, N = 3SE +/- 0.90, N = 338136.777730.415133.891. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton260120180240300SE +/- 1.80, N = 4SE +/- 0.86, N = 3SE +/- 0.91, N = 3147.89198.22263.721. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon60120180240300SE +/- 0.15, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 3143.33215.67281.391. (CXX) g++ options: -O2 -lOpenCL

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotubec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton250100150200250SE +/- 0.08, N = 3SE +/- 0.24, N = 3SE +/- 0.13, N = 3155.18202.11215.531. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256c7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon3000M6000M9000M12000M15000MSE +/- 7739237.92, N = 3SE +/- 47755430.47, N = 3SE +/- 606684.16, N = 313722045973107231840837096993937-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_barec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton20.32670.65340.98011.30681.6335SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 31.4521.1280.7811. (CXX) g++ options: -O3

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon7M14M21M28M35MSE +/- 104795.40, N = 3SE +/- 359309.26, N = 3SE +/- 325631.00, N = 3321341232654048223746200

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 2c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton250100150200250SE +/- 0.26, N = 3SE +/- 0.11, N = 3SE +/- 0.12, N = 397.74141.70238.211. (CXX) g++ options: -O3 -fPIC -lm

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Cachec7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon1428425670SE +/- 3.64, N = 12SE +/- 0.97, N = 15SE +/- 0.30, N = 1564.3137.1917.401. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compilec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2306090120150SE +/- 0.64, N = 3SE +/- 0.69, N = 3SE +/- 0.70, N = 3115.02136.80142.28

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton220K40K60K80K100KSE +/- 833.50, N = 7SE +/- 89.82, N = 3SE +/- 578.32, N = 391746.5773546.3250077.811. (CC) gcc options: -shared -fPIC -O2

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Exhaustivec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton24080120160200SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 369.64139.38159.201. (CXX) g++ options: -O3 -flto -pthread

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standardc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2306090120150SE +/- 0.60, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 313938281. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Quantc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon9001800270036004500SE +/- 17.76, N = 3SE +/- 14.44, N = 3SE +/- 80.05, N = 121502.951980.243967.39

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: Standardc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton27001400210028003500SE +/- 1.61, N = 3SE +/- 1.86, N = 3SE +/- 1.74, N = 33450281720721. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1c7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon612182430SE +/- 0.03738, N = 3SE +/- 0.01639, N = 3SE +/- 0.04033, N = 326.3058019.721808.660311. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton220K40K60K80K100KSE +/- 335.63, N = 3SE +/- 83.83, N = 3SE +/- 276.10, N = 379830.9672719.3346629.451. (CC) gcc options: -shared -fPIC -O2

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton280K160K240K320K400KSE +/- 1727.81, N = 3SE +/- 2009.97, N = 3SE +/- 3992.58, N = 3356302.84345710.87307349.361. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton280K160K240K320K400KSE +/- 1620.39, N = 3SE +/- 1017.52, N = 3SE +/- 3783.68, N = 3351672.92346613.34310596.581. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton280K160K240K320K400KSE +/- 1582.66, N = 3SE +/- 3986.77, N = 3SE +/- 1347.28, N = 3356829.93352380.98308938.671. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton270K140K210K280K350KSE +/- 2637.25, N = 3SE +/- 1410.11, N = 3SE +/- 1677.89, N = 3347345.49346814.75308213.131. (CC) gcc options: -lcrypt -lz -O3 -march=native

Apache HTTP Server

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 200c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton220K40K60K80K100KSE +/- 615.05, N = 3SE +/- 649.31, N = 3SE +/- 112.65, N = 394458.2273676.9550059.971. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 100c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton220K40K60K80K100KSE +/- 389.13, N = 3SE +/- 38.09, N = 3SE +/- 93.03, N = 386545.5767231.8846995.351. (CC) gcc options: -shared -fPIC -O2

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvec7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 366.8275.2291.231. (CXX) g++ options: -fopenmp -O2 -march=native

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compilec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton220406080100SE +/- 0.09, N = 3SE +/- 0.11, N = 3SE +/- 0.31, N = 364.3469.4888.90

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton22004006008001000SE +/- 2.29, N = 3SE +/- 2.14, N = 3SE +/- 0.20, N = 31041.90861.57372.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon20406080100SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 338.5262.3292.551. (CC) gcc options: -lm -lpthread -O3

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: PartialTweetsc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton20.83481.66962.50443.33924.174SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.712.621.511. (CXX) g++ options: -O3

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratec7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon1.31712.63423.95135.26846.5855SE +/- 0.016350, N = 3SE +/- 0.007139, N = 3SE +/- 0.003819, N = 35.8538644.7851232.2305451. (CC) gcc options: -O3 -march=native -fopenmp

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: DistinctUserIDc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton20.96751.9352.90253.874.8375SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.302.691.531. (CXX) g++ options: -O3

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V4c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton210K20K30K40K50KSE +/- 75.14, N = 3SE +/- 210.27, N = 3SE +/- 197.89, N = 341185.741855.146793.9

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception ResNet V2c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton210K20K30K40K50KSE +/- 305.31, N = 3SE +/- 110.01, N = 3SE +/- 336.95, N = 340051.341179.745955.7

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Floatc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton25001000150020002500SE +/- 1.81, N = 3SE +/- 19.61, N = 3SE +/- 28.63, N = 31965.072156.602500.87

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: SqueezeNetc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton29001800270036004500SE +/- 3.54, N = 3SE +/- 22.07, N = 3SE +/- 37.23, N = 32983.933257.943969.35

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton240K80K120K160K200KSE +/- 82.61, N = 3SE +/- 47.94, N = 3SE +/- 3.30, N = 3178460.4140964.453951.5-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton25001000150020002500SE +/- 0.23, N = 3SE +/- 4.47, N = 3SE +/- 0.03, N = 32546.42161.3660.6-m641. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: Kostyac6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton20.55351.1071.66052.2142.7675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.461.941.191. (CXX) g++ options: -O3

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: LargeRandomc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton20.19350.3870.58050.7740.9675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.860.700.491. (CXX) g++ options: -O3

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compressionc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton21530456075SE +/- 0.34, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 341.8148.2166.15-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speedc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton27001400210028003500SE +/- 6.93, N = 3SE +/- 7.82, N = 3SE +/- 2.93, N = 33240.62666.12196.3-llzma-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speedc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2918273645SE +/- 0.23, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 339.533.831.0-llzma-llzma1. (CC) gcc options: -O3 -pthread -lz

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon612182430SE +/- 0.33, N = 12SE +/- 0.26, N = 15SE +/- 0.07, N = 313.3015.4823.511. (CXX) g++ options: -O2 -lOpenCL

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timec7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon1224364860SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.12, N = 337.8651.0552.78-march=native1. (CXX) g++ options: -pipe -O3 -ffast-math -R/usr/lib -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speedc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton27001400210028003500SE +/- 7.75, N = 3SE +/- 24.18, N = 3SE +/- 12.10, N = 33050.32582.02051.6-llzma-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speedc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2918273645SE +/- 0.00, N = 3SE +/- 0.40, N = 3SE +/- 0.06, N = 341.238.134.6-llzma-llzma1. (CC) gcc options: -O3 -pthread -lz

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon1530456075SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.14, N = 329.1341.0269.221. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton24K8K12K16K20KSE +/- 40.24, N = 3SE +/- 1.17, N = 3SE +/- 1.10, N = 320423.5711791.776244.481. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton26M12M18M24M30MSE +/- 153578.64, N = 3SE +/- 242448.39, N = 3SE +/- 292329.99, N = 3276088912208196121679245-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton210002000300040005000SE +/- 9.57, N = 3SE +/- 29.53, N = 3SE +/- 6.37, N = 34639.13440.62888.3-llzma-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedc7g.4xlargec6i.4xlarge Xeon8001600240032004000SE +/- 2.07, N = 3SE +/- 2.87, N = 33508.52996.8-llzma1. (CC) gcc options: -O3 -pthread -lz

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton25001000150020002500SE +/- 6.22, N = 3SE +/- 0.15, N = 3SE +/- 5.40, N = 32533.02512.71742.41. (CXX) g++ options: -O3 -march=native -rdynamic

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suitec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2200K400K600K800K1000KSE +/- 983.65, N = 3SE +/- 525.83, N = 3SE +/- 743.13, N = 3828186666484449855

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compilec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2918273645SE +/- 0.13, N = 3SE +/- 0.07, N = 3SE +/- 0.22, N = 327.9029.7440.33

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Stressc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton23K6K9K12K15KSE +/- 155.66, N = 3SE +/- 0.41, N = 3SE +/- 0.54, N = 312527.165029.713404.941. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2150300450600750SE +/- 0.32, N = 3SE +/- 2.00, N = 3SE +/- 0.33, N = 3675.64565.69470.391. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Stress-NG

Test: IO_uring

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: IO_uringc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2200K400K600K800K1000KSE +/- 405.56, N = 3SE +/- 614.16, N = 3SE +/- 2395.13, N = 31037943.37843015.78770521.811. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copyingc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton214002800420056007000SE +/- 3.52, N = 3SE +/- 0.94, N = 3SE +/- 3.75, N = 36693.323150.492903.001. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Cryptoc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon5K10K15K20K25KSE +/- 32.01, N = 3SE +/- 92.83, N = 3SE +/- 5.89, N = 323181.8117924.1810210.341. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Vector Mathc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton212K24K36K48K60KSE +/- 17.05, N = 3SE +/- 28.50, N = 3SE +/- 15.72, N = 355258.1740140.3037753.891. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Matrix Mathc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon20K40K60K80K100KSE +/- 3.18, N = 3SE +/- 2.76, N = 3SE +/- 35.16, N = 380088.7464084.0839878.861. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Timesc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2400800120016002000SE +/- 3.84, N = 3SE +/- 0.33, N = 3SE +/- 1.67, N = 399711851741

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression Ratingc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon16K32K48K64K80KSE +/- 12.88, N = 3SE +/- 239.68, N = 3SE +/- 35.00, N = 37305459445456531. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression Ratingc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon20K40K60K80K100KSE +/- 159.36, N = 3SE +/- 44.77, N = 3SE +/- 174.34, N = 39782471285666311. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

DaCapo Benchmark

Java Test: Tradebeans

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeansc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton29001800270036004500SE +/- 19.24, N = 20SE +/- 26.73, N = 4SE +/- 40.13, N = 4292832034344

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compilec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2816243240SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 322.5326.9434.20

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton22K4K6K8K10KSE +/- 66.44, N = 3SE +/- 17.12, N = 3SE +/- 9.95, N = 39522.826571.953520.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Losslessc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2714212835SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 321.1222.7731.08-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon90K180K270K360K450KSE +/- 3211.91, N = 3SE +/- 49.84, N = 3SE +/- 80.93, N = 3405413.86315464.34285378.841. (CC) gcc options: -O2 -lrt" -lrt

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timec6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton2612182430SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 318.8421.5423.141. (CC) gcc options: -static -fopenmp -O3 -march=native

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton280M160M240M320M400MSE +/- 400097.21, N = 3SE +/- 41633.32, N = 3SE +/- 35118.85, N = 33836066673731000002628900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Thoroughc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton248121620SE +/- 0.0001, N = 3SE +/- 0.0011, N = 3SE +/- 0.0064, N = 37.262513.924816.52221. (CXX) g++ options: -O3 -flto -pthread

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon510152025SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 310.4817.0420.451. (CXX) g++ options: -O2 -lOpenCL

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, Losslessc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon48121620SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 311.9116.5217.531. (CXX) g++ options: -O3 -fPIC -lm

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton26K12K18K24K30KSE +/- 184.24, N = 3SE +/- 4.69, N = 3SE +/- 1.39, N = 326298.8113481.616720.681. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

DaCapo Benchmark

Java Test: Tradesoap

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoapc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton210002000300040005000SE +/- 14.95, N = 4SE +/- 24.39, N = 4SE +/- 27.95, N = 4352438154506

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon48121620SE +/- 0.01401446, N = 3SE +/- 0.01351889, N = 3SE +/- 0.09619197, N = 38.0167142511.5733547017.868277201. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton23691215SE +/- 0.025, N = 3SE +/- 0.025, N = 3SE +/- 0.006, N = 39.38512.98413.0461. (CXX) g++ options: -O3 -fPIC -lm

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2c6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton29001800270036004500SE +/- 32.93, N = 4SE +/- 32.57, N = 5SE +/- 45.89, N = 4292129513964

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compressionc6i.4xlarge Xeonc7g.4xlargec6g.4xlarge Graviton23691215SE +/- 0.021, N = 3SE +/- 0.007, N = 3SE +/- 0.043, N = 38.2709.34612.248-ltiff-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2c7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon300M600M900M1200M1500MSE +/- 952437.28, N = 3SE +/- 3420043.89, N = 3SE +/- 5114517.12, N = 312588073339326529006613647671. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jythonc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton212002400360048006000SE +/- 6.99, N = 4SE +/- 24.07, N = 4SE +/- 23.29, N = 4394040135626

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 10, Losslessc7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2246810SE +/- 0.021, N = 3SE +/- 0.010, N = 3SE +/- 0.026, N = 35.7658.2658.3111. (CXX) g++ options: -O3 -fPIC -lm

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3c7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton22K4K6K8K10KSE +/- 76.73, N = 3SE +/- 14.20, N = 3SE +/- 4.88, N = 310940.948112.376016.161. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Proteinc7g.4xlargec6g.4xlarge Graviton2c6i.4xlarge Xeon3691215SE +/- 0.060, N = 3SE +/- 0.014, N = 3SE +/- 0.009, N = 311.2917.9356.2201. (CXX) g++ options: -O3 -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performancec7g.4xlargec6i.4xlarge Xeonc6g.4xlarge Graviton2300K600K900K1200K1500KSE +/- 0.00, N = 5SE +/- 1099.67, N = 5SE +/- 338.27, N = 5137009412725968723131. (CC) gcc options: -O3 -march=native


Phoronix Test Suite v10.8.4