Amazon EC2 c7g.4xlarge Graviton3

Graviton3 benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2206031-NE-2206014NE30.

Amazon EC2 c7g.4xlarge Graviton3ProcessorMotherboardChipsetMemoryDiskNetworkGraphicsMonitorOSKernelCompilerFile-SystemSystem LayerDisplay ServerVulkanScreen Resolutionc7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 2Apple M1ARMv8 Neoverse-V1 (16 Cores)Amazon EC2 c7g.4xlarge (1.0 BIOS)Amazon Device 020032GB193GB Amazon Elastic Block StoreAmazon ElasticUbuntu 22.045.15.0-1004-aws (aarch64)GCC 11.2.0ext4amazonARMv8 Neoverse-N1 (16 Cores)QEMU KVM Virtual Machine (0.0.0 BIOS)Red Hat QEMU PCIe1 x 7000 MB RAM QEMU47GBRed Hat Virtio deviceUbuntu 20.045.4.0-100-generic (aarch64)X Server 1.20.131.1.182GCC 9.4.0KVM16384 MB + 15031 MB RAMApple M1 (8 Cores)Apple MacBook Air16GB229GBApple M1Color LCDmacOS 12.421.5.0 (arm64)GCC 13.0.0 + Clang 13.0.0APFS2560x1600OpenBenchmarking.orgKernel Details- c7g.4xlarge, 16 vcpu ampere Vm, 16 vcpu ampere Vm run 2: Transparent Huge Pages: madviseCompiler Details- c7g.4xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - 16 vcpu ampere Vm: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - 16 vcpu ampere Vm run 2: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Java Details- c7g.4xlarge: OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)- 16 vcpu ampere Vm: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.20.04)- 16 vcpu ampere Vm run 2: OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.20.04)- Apple M1: Please visit java for information on installing Java.Python Details- c7g.4xlarge: Python 3.10.4- 16 vcpu ampere Vm: Python 3.8.10- 16 vcpu ampere Vm run 2: Python 3.8.10- Apple M1: Python 3.9.13Security Details- c7g.4xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - 16 vcpu ampere Vm: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected - 16 vcpu ampere Vm run 2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected Environment Details- Apple M1: XPC_FLAGS=0x0

Amazon EC2 c7g.4xlarge Graviton3quantlib: hpcg: npb: BT.Cnpb: CG.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Cnpb: SP.Clczero: BLASlczero: Eigenrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusteramg: mrbayes: Primate Phylogeny Analysisincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionlammps: 20k Atomslammps: Rhodopsin Proteinlulesh: webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDdacapobench: H2dacapobench: Jythondacapobench: Tradesoapdacapobench: Tradebeanscompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedtscp: AI Chess Performancemt-dgemm: Sustained Floating-Point Ratecoremark: CoreMark Size 666 - Iterations Per Secondcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthavifenc: 0avifenc: 2avifenc: 6avifenc: 6, Losslessavifenc: 10, Losslessbuild-apache: Time To Compilebuild-gem5: Time To Compilebuild-imagemagick: Time To Compilebuild-llvm: Ninjabuild-nodejs: Time To Compilebuild-php: Time To Compilebuild2: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timem-queens: Time To Solven-queens: Elapsed Timengspice: C2670ngspice: C7552synthmark: VoiceMark_100securemark: SecureMark-TLSopenssl: SHA256openssl: RSA4096openssl: RSA4096liquid-dsp: 16 - 256 - 57gromacs: MPI CPU - water_GMX50_baretensorflow-lite: SqueezeNettensorflow-lite: Inception V4tensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2astcenc: Thoroughastcenc: Exhaustivestress-ng: Cryptostress-ng: IO_uringstress-ng: CPU Cachestress-ng: CPU Stressstress-ng: Matrix Mathstress-ng: Vector Mathstress-ng: Memory Copyinggpaw: Carbon Nanotubepybench: Total For Average Test Timesnginx: 100nginx: 200nginx: 500nginx: 1000onnx: GPT-2 - CPU - Standardonnx: bertsquad-12 - CPU - Standardonnx: fcn-resnet101-11 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardonnx: super-resolution-10 - CPU - Standardapache: 100apache: 200apache: 500apache: 1000phpbench: PHP Benchmark Suitec7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 2Apple M12512.726.305810339.536571.95934.7211791.771041.907730.4113481.614467.1911031189143.33410.47813.2961258807333251.3978.0167142529.125857011.42511.29110940.93922.7699.34648.2081.940.72.622.6929513940352432034639.13508.541.23050.339.53240.613700945.853864405413.86055497824730542760889132134123256.841141.6989.38511.9085.76526.940391.17127.904544.929497.57969.483115.02038.51737.86366.82221.536198.224191.286675.635183708137220459732546.4178460.43836066671.1283257.9441855.111591.92156.601502.9540051.313.9248139.379723181.81843015.7864.315029.7180088.7455258.176693.32155.1801185345710.87352380.98346613.34346814.75799040738609281767231.8873676.9573546.3272719.336664841878.46924.502651.14661.87650.951910.99.260036799.062436.26660.386338.87227.195892.126924.282009.15555485190.40027.85252.388566477900339.72317.903897458.25613025.5865.4953014.474558.69110.299125.4701.300.551.651.669907675316126244361063.32095.814.01449.18.891904.510403881.147944371146.66303340606581661597097716217696419.324223.60112.55515.0487.87755.985960.52176.1331429.6721386.105162.608235.49654.10269.75864.40012.551654.186796.446559.89314187812753706930791.064694.33191800000.6107508.8387437.750384.95496.115538.3498426.519.0718183.503020420.09236.503974.4876853.0141006.213189.31305.717142165315.0666077.4865751.8564815.2417823.7420474.1321329.8820049.524916493.040.994.194.323807.84199.023.73864.020.44063.4452.0799557.9651003.3351182.0116739.0027528.2730521.814010.4691770016.6970103.781048975.671170852.2969723.0870710.061103760.651152906.15736732OpenBenchmarking.org

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21c7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 25001000150020002500SE +/- 0.15, N = 3SE +/- 17.05, N = 15SE +/- 15.63, N = 152512.71878.41910.91. (CXX) g++ options: -O3 -march=native -rdynamic

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1c7g.4xlarge16 vcpu ampere Vm run 2612182430SE +/- 0.03738, N = 3SE +/- 0.16954, N = 1126.305809.26003-pthread1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cc7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 22K4K6K8K10KSE +/- 7.36, N = 3SE +/- 9.84, N = 3SE +/- 30.27, N = 310339.536924.506799.06-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cc7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 214002800420056007000SE +/- 17.12, N = 3SE +/- 85.05, N = 12SE +/- 101.16, N = 156571.952651.142436.26-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dc7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 22004006008001000SE +/- 0.39, N = 3SE +/- 0.34, N = 3SE +/- 1.49, N = 3934.72661.87660.38-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cc7g.4xlarge16 vcpu ampere Vm16 vcpu ampere Vm run 23K6K9K12K15KSE +/- 1.17, N = 3SE +/- 24.16, N = 9SE +/- 47.76, N = 311791.77650.956338.87-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dc7g.4xlarge16 vcpu ampere Vm run 22004006008001000SE +/- 2.29, N = 3SE +/- 2.75, N = 41041.90227.19-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cc7g.4xlarge16 vcpu ampere Vm run 217003400510068008500SE +/- 1.96, N = 3SE +/- 16.78, N = 37730.415892.12-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cc7g.4xlarge16 vcpu ampere Vm run 23K6K9K12K15KSE +/- 4.69, N = 3SE +/- 12.27, N = 313481.616924.28-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cc7g.4xlarge16 vcpu ampere Vm run 210002000300040005000SE +/- 9.61, N = 3SE +/- 8.85, N = 34467.192009.15-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASc7g.4xlarge16 vcpu ampere Vm run 22004006008001000SE +/- 6.44, N = 3SE +/- 6.05, N = 411035551. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigenc7g.4xlarge16 vcpu ampere Vm run 230060090012001500SE +/- 9.70, N = 3SE +/- 6.46, N = 911894851. (CXX) g++ options: -flto -pthread

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDc7g.4xlarge16 vcpu ampere Vm run 24080120160200SE +/- 0.15, N = 3SE +/- 0.03, N = 3143.33190.401. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc7g.4xlarge16 vcpu ampere Vm run 2714212835SE +/- 0.02, N = 3SE +/- 0.28, N = 1510.4827.851. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterc7g.4xlarge16 vcpu ampere Vm run 21224364860SE +/- 0.33, N = 12SE +/- 1.06, N = 1513.3052.391. (CXX) g++ options: -O2 -lOpenCL

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2c7g.4xlarge16 vcpu ampere Vm run 2300M600M900M1200M1500MSE +/- 952437.28, N = 3SE +/- 1444853.58, N = 31258807333566477900-pthread1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisc7g.4xlarge16 vcpu ampere Vm run 270140210280350SE +/- 0.24, N = 3SE +/- 1.14, N = 3251.40339.721. (CC) gcc options: -O3 -std=c99 -pedantic -lm

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionc7g.4xlarge16 vcpu ampere Vm run 248121620SE +/- 0.01401446, N = 3SE +/- 0.21523497, N = 158.0167142517.90389740-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionc7g.4xlarge16 vcpu ampere Vm run 21326395265SE +/- 0.03, N = 3SE +/- 0.42, N = 329.1358.26-lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz-pthread1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atomsc7g.4xlarge16 vcpu ampere Vm run 23691215SE +/- 0.067, N = 311.4255.586-pthread1. (CXX) g++ options: -O3 -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Proteinc7g.4xlarge16 vcpu ampere Vm run 23691215SE +/- 0.060, N = 3SE +/- 0.039, N = 1511.2915.495-pthread1. (CXX) g++ options: -O3 -lm

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3c7g.4xlarge16 vcpu ampere Vm run 22K4K6K8K10KSE +/- 76.73, N = 3SE +/- 45.42, N = 1510940.943014.47-pthread1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Losslessc7g.4xlarge16 vcpu ampere Vm run 21326395265SE +/- 0.09, N = 3SE +/- 0.05, N = 322.7758.69-pthread -ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compressionc7g.4xlarge16 vcpu ampere Vm run 23691215SE +/- 0.007, N = 3SE +/- 0.056, N = 39.34610.299-pthread -ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compressionc7g.4xlarge16 vcpu ampere Vm run 2306090120150SE +/- 0.01, N = 3SE +/- 0.91, N = 1148.21125.47-pthread -ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: Kostyac7g.4xlarge16 vcpu ampere Vm run 2Apple M10.6841.3682.0522.7363.42SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 31.941.303.04-pthread-arch -isysroot1. (CXX) g++ options: -O3

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: LargeRandomc7g.4xlarge16 vcpu ampere Vm run 2Apple M10.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.700.550.99-pthread-arch -isysroot1. (CXX) g++ options: -O3

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: PartialTweetsc7g.4xlarge16 vcpu ampere Vm run 2Apple M10.94281.88562.82843.77124.714SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 32.621.654.19-pthread-arch -isysroot1. (CXX) g++ options: -O3

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: DistinctUserIDc7g.4xlarge16 vcpu ampere Vm run 2Apple M10.9721.9442.9163.8884.86SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 32.691.664.32-pthread-arch -isysroot1. (CXX) g++ options: -O3

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2c7g.4xlarge16 vcpu ampere Vm run 22K4K6K8K10KSE +/- 32.57, N = 5SE +/- 58.80, N = 2029519907

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jythonc7g.4xlarge16 vcpu ampere Vm run 214002800420056007000SE +/- 6.99, N = 4SE +/- 28.16, N = 439406753

DaCapo Benchmark

Java Test: Tradesoap

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoapc7g.4xlarge16 vcpu ampere Vm run 23K6K9K12K15KSE +/- 14.95, N = 4SE +/- 114.92, N = 4352416126

DaCapo Benchmark

Java Test: Tradebeans

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeansc7g.4xlarge16 vcpu ampere Vm run 25K10K15K20K25KSE +/- 26.73, N = 4SE +/- 440.17, N = 16320324436

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M110002000300040005000SE +/- 9.57, N = 3SE +/- 19.16, N = 15SE +/- 8.27, N = 34639.11063.33807.8-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M19001800270036004500SE +/- 2.07, N = 3SE +/- 9.51, N = 9SE +/- 23.45, N = 33508.52095.84199.0-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M1918273645SE +/- 0.00, N = 3SE +/- 0.27, N = 15SE +/- 0.06, N = 341.214.023.7-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M18001600240032004000SE +/- 7.75, N = 3SE +/- 105.10, N = 15SE +/- 0.32, N = 33050.31449.13864.0-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M1918273645SE +/- 0.23, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 339.508.8920.40-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speedc7g.4xlarge16 vcpu ampere Vm run 2Apple M19001800270036004500SE +/- 6.93, N = 3SE +/- 17.95, N = 3SE +/- 0.72, N = 33240.61904.54063.4-llzma1. (CC) gcc options: -O3 -pthread -lz

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performancec7g.4xlarge16 vcpu ampere Vm run 2300K600K900K1200K1500KSE +/- 0.00, N = 5SE +/- 1332.29, N = 5137009410403881. (CC) gcc options: -O3 -march=native

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratec7g.4xlarge16 vcpu ampere Vm run 21.31712.63423.95135.26846.5855SE +/- 0.016350, N = 3SE +/- 0.004261, N = 35.8538641.1479441. (CC) gcc options: -O3 -march=native -fopenmp

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondc7g.4xlarge16 vcpu ampere Vm run 290K180K270K360K450KSE +/- 3211.91, N = 3SE +/- 755.08, N = 3405413.86371146.661. (CC) gcc options: -O2 -lrt" -lrt

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression Ratingc7g.4xlarge16 vcpu ampere Vm run 220K40K60K80K100KSE +/- 159.36, N = 3SE +/- 927.35, N = 1297824406061. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression Ratingc7g.4xlarge16 vcpu ampere Vm run 216K32K48K64K80KSE +/- 12.88, N = 373054581661. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timec7g.4xlarge16 vcpu ampere Vm run 26M12M18M24M30MSE +/- 153578.64, N = 3SE +/- 159724.07, N = 327608891159709771. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthc7g.4xlarge16 vcpu ampere Vm run 27M14M21M28M35MSE +/- 104795.40, N = 3SE +/- 76504.07, N = 33213412316217696

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 0c7g.4xlarge16 vcpu ampere Vm run 290180270360450SE +/- 0.18, N = 3SE +/- 9.46, N = 9256.84419.321. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 2c7g.4xlarge16 vcpu ampere Vm run 250100150200250SE +/- 0.11, N = 3SE +/- 1.54, N = 3141.70223.601. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6c7g.4xlarge16 vcpu ampere Vm run 23691215SE +/- 0.025, N = 3SE +/- 0.026, N = 39.38512.5551. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, Losslessc7g.4xlarge16 vcpu ampere Vm run 248121620SE +/- 0.01, N = 3SE +/- 0.17, N = 311.9115.051. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 10, Losslessc7g.4xlarge16 vcpu ampere Vm run 2246810SE +/- 0.021, N = 3SE +/- 0.021, N = 35.7657.8771. (CXX) g++ options: -O3 -fPIC -lm

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compilec7g.4xlarge16 vcpu ampere Vm run 21326395265SE +/- 0.05, N = 3SE +/- 0.71, N = 326.9455.99

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilec7g.4xlarge16 vcpu ampere Vm run 22004006008001000SE +/- 1.33, N = 3SE +/- 11.13, N = 4391.17960.52

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compilec7g.4xlarge16 vcpu ampere Vm run 2Apple M1100200300400500SE +/- 0.13, N = 3SE +/- 3.17, N = 15SE +/- 169.86, N = 927.9076.13452.08

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninjac7g.4xlarge16 vcpu ampere Vm run 230060090012001500SE +/- 5.19, N = 3SE +/- 14.14, N = 3544.931429.67

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 17.3Time To Compilec7g.4xlarge16 vcpu ampere Vm run 2Apple M12K4K6K8K10KSE +/- 2.06, N = 3SE +/- 7.93, N = 3SE +/- 1489.09, N = 3497.581386.119557.97

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compilec7g.4xlarge16 vcpu ampere Vm run 24080120160200SE +/- 0.11, N = 3SE +/- 4.68, N = 1269.48162.61

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compilec7g.4xlarge16 vcpu ampere Vm run 250100150200250SE +/- 0.64, N = 3SE +/- 3.00, N = 12115.02235.50

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelc7g.4xlarge16 vcpu ampere Vm run 2Apple M12004006008001000SE +/- 0.02, N = 3SE +/- 0.42, N = 3SE +/- 328.10, N = 938.5254.101003.341. (CC) gcc options: -lm -lpthread -O3

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timec7g.4xlarge16 vcpu ampere Vm run 21632486480SE +/- 0.01, N = 3SE +/- 0.73, N = 1537.8669.76-R/usr/lib-pthread1. (CXX) g++ options: -pipe -O3 -ffast-math -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvec7g.4xlarge16 vcpu ampere Vm run 21530456075SE +/- 0.00, N = 3SE +/- 0.00, N = 366.8264.401. (CXX) g++ options: -fopenmp -O2 -march=native

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timec7g.4xlarge16 vcpu ampere Vm run 2510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 321.5412.551. (CC) gcc options: -static -fopenmp -O3 -march=native

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670c7g.4xlarge16 vcpu ampere Vm run 2140280420560700SE +/- 0.86, N = 3SE +/- 27.85, N = 6198.22654.19-lXft -lfontconfig -lXrender -lfreetype1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552c7g.4xlarge16 vcpu ampere Vm run 22004006008001000SE +/- 1.94, N = 3SE +/- 49.48, N = 9191.29796.45-lXft -lfontconfig -lXrender -lfreetype1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100c7g.4xlarge16 vcpu ampere Vm run 2150300450600750SE +/- 0.32, N = 3SE +/- 0.12, N = 3675.64559.891. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSc7g.4xlarge16 vcpu ampere Vm run 240K80K120K160K200KSE +/- 773.26, N = 3SE +/- 99.89, N = 31837081418781. (CC) gcc options: -pedantic -O3

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256c7g.4xlarge16 vcpu ampere Vm run 23000M6000M9000M12000M15000MSE +/- 7739237.92, N = 3SE +/- 12960363.63, N = 313722045973127537069301. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlarge16 vcpu ampere Vm run 25001000150020002500SE +/- 0.23, N = 3SE +/- 0.30, N = 32546.4791.01. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlarge16 vcpu ampere Vm run 240K80K120K160K200KSE +/- 82.61, N = 3SE +/- 11.12, N = 3178460.464694.31. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57c7g.4xlarge16 vcpu ampere Vm run 280M160M240M320M400MSE +/- 400097.21, N = 3SE +/- 107857.93, N = 33836066673191800001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_barec7g.4xlarge16 vcpu ampere Vm run 20.25380.50760.76141.01521.269SE +/- 0.002, N = 3SE +/- 0.002, N = 31.1280.610-pthread1. (CXX) g++ options: -O3

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: SqueezeNetc7g.4xlarge16 vcpu ampere Vm run 216003200480064008000SE +/- 22.07, N = 3SE +/- 54.01, N = 33257.947508.83

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V4c7g.4xlarge16 vcpu ampere Vm run 220K40K60K80K100KSE +/- 210.27, N = 3SE +/- 1013.94, N = 441855.187437.7

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: NASNet Mobilec7g.4xlarge16 vcpu ampere Vm run 211K22K33K44K55KSE +/- 121.56, N = 15SE +/- 645.70, N = 1511591.950384.9

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Floatc7g.4xlarge16 vcpu ampere Vm run 212002400360048006000SE +/- 19.61, N = 3SE +/- 51.89, N = 62156.605496.11

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Quantc7g.4xlarge16 vcpu ampere Vm run 212002400360048006000SE +/- 17.76, N = 3SE +/- 44.20, N = 151502.955538.34

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception ResNet V2c7g.4xlarge16 vcpu ampere Vm run 220K40K60K80K100KSE +/- 305.31, N = 3SE +/- 824.24, N = 1540051.398426.5

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Thoroughc7g.4xlarge16 vcpu ampere Vm run 2510152025SE +/- 0.00, N = 3SE +/- 0.16, N = 913.9219.071. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Exhaustivec7g.4xlarge16 vcpu ampere Vm run 24080120160200SE +/- 0.01, N = 3SE +/- 0.22, N = 3139.38183.501. (CXX) g++ options: -O3 -flto -pthread

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Cryptoc7g.4xlarge16 vcpu ampere Vm run 25K10K15K20K25KSE +/- 32.01, N = 3SE +/- 43.41, N = 323181.8120420.091. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: IO_uring

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: IO_uringc7g.4xlarge200K400K600K800K1000KSE +/- 614.16, N = 3843015.781. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Cachec7g.4xlarge16 vcpu ampere Vm run 2Apple M130060090012001500SE +/- 3.64, N = 12SE +/- 2.46, N = 4SE +/- 106.97, N = 1264.31236.501182.01-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt1. (CC) gcc options: -O2 -std=gnu99 -lm -lc -lz -pthread

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Stressc7g.4xlarge16 vcpu ampere Vm run 2Apple M14K8K12K16K20KSE +/- 0.41, N = 3SE +/- 22.51, N = 3SE +/- 98.75, N = 35029.713974.4816739.00-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt1. (CC) gcc options: -O2 -std=gnu99 -lm -lc -lz -pthread

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Matrix Mathc7g.4xlarge16 vcpu ampere Vm run 2Apple M120K40K60K80K100KSE +/- 3.18, N = 3SE +/- 8.70, N = 3SE +/- 181.15, N = 380088.7476853.0127528.27-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt1. (CC) gcc options: -O2 -std=gnu99 -lm -lc -lz -pthread

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Vector Mathc7g.4xlarge16 vcpu ampere Vm run 2Apple M112K24K36K48K60KSE +/- 17.05, N = 3SE +/- 7.88, N = 3SE +/- 201.08, N = 355258.1741006.2130521.81-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt1. (CC) gcc options: -O2 -std=gnu99 -lm -lc -lz -pthread

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copyingc7g.4xlarge16 vcpu ampere Vm run 2Apple M114002800420056007000SE +/- 3.52, N = 3SE +/- 7.47, N = 3SE +/- 45.80, N = 36693.323189.314010.46-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt-lapparmor -latomic -lcrypt -ldl -ljpeg -lrt1. (CC) gcc options: -O2 -std=gnu99 -lm -lc -lz -pthread

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotubec7g.4xlarge16 vcpu ampere Vm run 270140210280350SE +/- 0.08, N = 3SE +/- 3.47, N = 3155.18305.72-pthread1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Timesc7g.4xlarge16 vcpu ampere Vm run 2Apple M130060090012001500SE +/- 0.33, N = 3SE +/- 0.58, N = 311851421917

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100c7g.4xlarge16 vcpu ampere Vm run 2Apple M170K140K210K280K350KSE +/- 2009.97, N = 3SE +/- 90.74, N = 3SE +/- 703.26, N = 15345710.8765315.0670016.69-ldl -lpthread1. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200c7g.4xlarge16 vcpu ampere Vm run 2Apple M180K160K240K320K400KSE +/- 3986.77, N = 3SE +/- 315.71, N = 3SE +/- 897.80, N = 12352380.9866077.4870103.781. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500c7g.4xlarge16 vcpu ampere Vm run 2Apple M1200K400K600K800K1000KSE +/- 1017.52, N = 3SE +/- 465.70, N = 3SE +/- 16178.42, N = 15346613.3465751.851048975.671. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000c7g.4xlarge16 vcpu ampere Vm run 2Apple M1300K600K900K1200K1500KSE +/- 1410.11, N = 3SE +/- 705.50, N = 3SE +/- 28022.87, N = 9346814.7564815.241170852.291. (CC) gcc options: -lcrypt -lz -O3 -march=native

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: GPT-2 - Device: CPU - Executor: Standardc7g.4xlarge2K4K6K8K10KSE +/- 2.40, N = 379901. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: Standardc7g.4xlarge90180270360450SE +/- 0.17, N = 34071. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standardc7g.4xlarge918273645SE +/- 0.00, N = 3381. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: Standardc7g.4xlarge130260390520650SE +/- 0.00, N = 36091. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: Standardc7g.4xlarge6001200180024003000SE +/- 1.86, N = 328171. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Apache HTTP Server

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 100c7g.4xlarge16 vcpu ampere Vm run 2Apple M115K30K45K60K75KSE +/- 38.09, N = 3SE +/- 99.44, N = 3SE +/- 1068.44, N = 967231.8817823.7469723.08-pthread1. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 200c7g.4xlarge16 vcpu ampere Vm run 2Apple M116K32K48K64K80KSE +/- 649.31, N = 3SE +/- 231.23, N = 3SE +/- 1015.69, N = 973676.9520474.1370710.06-pthread1. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500c7g.4xlarge16 vcpu ampere Vm run 2Apple M1200K400K600K800K1000KSE +/- 89.82, N = 3SE +/- 197.16, N = 3SE +/- 18803.88, N = 973546.3221329.881103760.651. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000c7g.4xlarge16 vcpu ampere Vm run 2Apple M1200K400K600K800K1000KSE +/- 83.83, N = 3SE +/- 134.65, N = 3SE +/- 20338.46, N = 1072719.3320049.521152906.151. (CC) gcc options: -shared -fPIC -O2

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suitec7g.4xlarge16 vcpu ampere Vm run 2Apple M1160K320K480K640K800KSE +/- 525.83, N = 3SE +/- 1035.45, N = 3SE +/- 532.48, N = 3666484491649736732


Phoronix Test Suite v10.8.4