Amazon EC2 c7g.4xlarge Graviton3 Tests

Graviton3 benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/2205256-NE-2205240NE24&grs.

Amazon EC2 c7g.4xlarge Graviton3 TestsProcessorMotherboardChipsetMemoryDiskNetworkOSKernelCompilerFile-SystemSystem Layerc7g.4xlargea1.4xlargeARMv8 Neoverse-V1 (16 Cores)Amazon EC2 c7g.4xlarge (1.0 BIOS)Amazon Device 020032GB193GB Amazon Elastic Block StoreAmazon ElasticUbuntu 22.045.15.0-1004-aws (aarch64)GCC 11.2.0ext4amazonARMv8 Cortex-A72 (16 Cores)Amazon EC2 a1.4xlarge (1.0 BIOS)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Java Details- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)Python Details- Python 3.10.4Security Details- c7g.4xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - a1.4xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Branch predictor hardening BHB + srbds: Not affected + tsx_async_abort: Not affected

Amazon EC2 c7g.4xlarge Graviton3 Testslczero: Eigenstress-ng: Memory Copyinglczero: BLAScompress-zstd: 3 - Compression Speedhpcg: amg: incompact3d: input.i3d 129 Cells Per Directionmt-dgemm: Sustained Floating-Point Rateincompact3d: input.i3d 193 Cells Per Directionnpb: CG.Cnpb: IS.Dgpaw: Carbon Nanotubelulesh: tensorflow-lite: Mobilenet Floattensorflow-lite: Inception V4openssl: RSA4096tensorflow-lite: Inception ResNet V2npb: MG.Cnpb: FT.Clammps: 20k Atomsrodinia: OpenMP CFD Solveropenssl: RSA4096tensorflow-lite: Mobilenet Quantonnx: fcn-resnet101-11 - CPU - Standardapache: 1000onnx: super-resolution-10 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardtensorflow-lite: SqueezeNetapache: 500apache: 100gromacs: MPI CPU - water_GMX50_barebuild-nodejs: Time To Compileonnx: bertsquad-12 - CPU - Standardapache: 200lammps: Rhodopsin Proteinonnx: GPT-2 - CPU - Standardnpb: SP.Csimdjson: DistinctUserIDsimdjson: PartialTweetsbuild-imagemagick: Time To Compiledacapobench: Jythonnpb: BT.Cbuild-llvm: Ninjadacapobench: Tradesoapavifenc: 2simdjson: Kostyabuild2: Time To Compileavifenc: 6npb: LU.Ccompress-7zip: Compression Ratingavifenc: 0build-gem5: Time To Compilepybench: Total For Average Test Timesavifenc: 6, Losslessdacapobench: Tradebeansbuild-php: Time To Compilebuild-apache: Time To Compilephpbench: PHP Benchmark Suitenpb: EP.Dcompress-zstd: 19 - Decompression Speedwebp: Quality 100, Losslesstensorflow-lite: NASNet Mobilecompress-zstd: 19, Long Mode - Decompression Speedavifenc: 10, Losslesswebp: Quality 100, Lossless, Highest Compressioncompress-zstd: 3 - Decompression Speedmrbayes: Primate Phylogeny Analysistscp: AI Chess Performancestockfish: Total Timerodinia: OpenMP LavaMDngspice: C7552nginx: 1000nginx: 200nginx: 500povray: Trace Timesecuremark: SecureMark-TLScompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19 - Compression Speednginx: 100astcenc: Thoroughngspice: C2670simdjson: LargeRandliquid-dsp: 16 - 256 - 57dacapobench: H2stress-ng: CPU Stressasmfish: 1024 Hash Memory, 26 Depthsynthmark: VoiceMark_100openssl: SHA256stress-ng: Vector Mathstress-ng: Matrix Mathastcenc: Exhaustivecoremark: CoreMark Size 666 - Iterations Per Secondstress-ng: Cryptowebp: Quality 100, Highest Compressioncompress-7zip: Decompression Ratingm-queens: Time To Solven-queens: Elapsed Timestress-ng: IO_uringquantlib: stress-ng: CPU Cachec-ray: Total Time - 4K, 16 Rays Per Pixelrodinia: OpenMP Streamclusterc7g.4xlargea1.4xlarge11896693.3211034639.126.305812588073338.016714255.85386429.12585706571.951041.90155.18010940.9392156.6041855.12546.440051.313481.6111791.7711.42510.478178460.41502.953872719.3328176093257.9473546.3267231.881.128497.57940773676.9511.29179904467.192.692.6227.904394010339.53544.9293524141.6981.94115.0209.3857730.4197824256.841391.171118511.908320369.48326.940666484934.723050.322.76911591.93240.65.76548.2083508.5251.397137009427608891143.334191.286346814.75352380.98346613.3437.86318370839.541.2345710.8713.9248198.2240.738360666729515029.7132134123675.6351372204597355258.1780088.74139.3797405413.86055423181.819.3467305466.82221.536843015.782512.764.3138.51713.296128798.24135633.93.7783418671693353.77062740.891391182.5839391213.15197.57769.3462328.27249990.15188910588.31711693266.362927.162.88541.45045328.65724.661019278.6875716512014.720133.4918636.430.3161765.91011520887.583.24523121293.800.80.7893.632129973148.181784.60011182449.0220.63353.91228.7782558.1232498768.3021155.615345233.9919045196.02974.742241259339.201121.761.80130986.71213.915.209124.7081364.6644.78853850010980430360.304480.793138205.11141436.20139414.8493.801743561616.9143155.4833.5198473.9010.316551333367402366.0015331550331.070678568951727341.477356.85277.7669203869.40201711985.3817.88840891110.36832.285918172.37464.85104.76147.430OpenBenchmarking.org

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigenc7g.4xlargea1.4xlarge30060090012001500SE +/- 9.70, N = 3SE +/- 0.67, N = 311891281. (CXX) g++ options: -flto -pthread

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Memory Copyingc7g.4xlargea1.4xlarge14002800420056007000SE +/- 3.52, N = 3SE +/- 0.91, N = 36693.32798.241. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASc7g.4xlargea1.4xlarge2004006008001000SE +/- 6.44, N = 3SE +/- 0.88, N = 311031351. (CXX) g++ options: -flto -pthread

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedc7g.4xlargea1.4xlarge10002000300040005000SE +/- 9.57, N = 3SE +/- 4.47, N = 34639.1633.9-llzma1. (CC) gcc options: -O3 -pthread -lz

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1c7g.4xlargea1.4xlarge612182430SE +/- 0.03738, N = 3SE +/- 0.00065, N = 326.305803.778341. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2c7g.4xlargea1.4xlarge300M600M900M1200M1500MSE +/- 952437.28, N = 3SE +/- 176548.39, N = 312588073331867169331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionc7g.4xlargea1.4xlarge1224364860SE +/- 0.01401446, N = 3SE +/- 0.02862870, N = 38.0167142553.770627401. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Ratec7g.4xlargea1.4xlarge1.31712.63423.95135.26846.5855SE +/- 0.016350, N = 3SE +/- 0.002370, N = 35.8538640.8913911. (CC) gcc options: -O3 -march=native -fopenmp

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionc7g.4xlargea1.4xlarge4080120160200SE +/- 0.03, N = 3SE +/- 0.15, N = 329.13182.581. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cc7g.4xlargea1.4xlarge14002800420056007000SE +/- 17.12, N = 3SE +/- 11.79, N = 66571.951213.151. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dc7g.4xlargea1.4xlarge2004006008001000SE +/- 2.29, N = 3SE +/- 0.31, N = 31041.90197.571. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotubec7g.4xlargea1.4xlarge170340510680850SE +/- 0.08, N = 3SE +/- 5.37, N = 3155.18769.351. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3c7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 76.73, N = 3SE +/- 6.27, N = 310940.942328.271. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Floatc7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 19.61, N = 3SE +/- 113.94, N = 32156.609990.15

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V4c7g.4xlargea1.4xlarge40K80K120K160K200KSE +/- 210.27, N = 3SE +/- 1746.17, N = 341855.1188910.0

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlargea1.4xlarge5001000150020002500SE +/- 0.23, N = 3SE +/- 0.12, N = 32546.4588.31. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception ResNet V2c7g.4xlargea1.4xlarge40K80K120K160K200KSE +/- 305.31, N = 3SE +/- 825.35, N = 340051.3171169.0

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cc7g.4xlargea1.4xlarge3K6K9K12K15KSE +/- 4.69, N = 3SE +/- 1.64, N = 313481.613266.361. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cc7g.4xlargea1.4xlarge3K6K9K12K15KSE +/- 1.17, N = 3SE +/- 1.73, N = 311791.772927.161. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atomsc7g.4xlargea1.4xlarge3691215SE +/- 0.074, N = 311.4252.8851. (CXX) g++ options: -O3 -lm

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc7g.4xlargea1.4xlarge918273645SE +/- 0.02, N = 3SE +/- 0.08, N = 310.4841.451. (CXX) g++ options: -O2 -lOpenCL

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA4096c7g.4xlargea1.4xlarge40K80K120K160K200KSE +/- 82.61, N = 3SE +/- 63.75, N = 3178460.445328.61. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Quantc7g.4xlargea1.4xlarge12002400360048006000SE +/- 17.76, N = 3SE +/- 20.90, N = 31502.955724.66

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standardc7g.4xlargea1.4xlarge918273645SE +/- 0.00, N = 3SE +/- 0.00, N = 338101. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Apache HTTP Server

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 1000c7g.4xlargea1.4xlarge16K32K48K64K80KSE +/- 83.83, N = 3SE +/- 98.61, N = 372719.3319278.681. (CC) gcc options: -shared -fPIC -O2

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: Standardc7g.4xlargea1.4xlarge6001200180024003000SE +/- 1.86, N = 3SE +/- 0.50, N = 328177571. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: Standardc7g.4xlargea1.4xlarge130260390520650SE +/- 0.00, N = 3SE +/- 0.50, N = 36091651. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: SqueezeNetc7g.4xlargea1.4xlarge3K6K9K12K15KSE +/- 22.07, N = 3SE +/- 46.48, N = 33257.9412014.70

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 500c7g.4xlargea1.4xlarge16K32K48K64K80KSE +/- 89.82, N = 3SE +/- 93.64, N = 373546.3220133.491. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 100c7g.4xlargea1.4xlarge14K28K42K56K70KSE +/- 38.09, N = 3SE +/- 28.97, N = 367231.8818636.431. (CC) gcc options: -shared -fPIC -O2

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_barec7g.4xlargea1.4xlarge0.25380.50760.76141.01521.269SE +/- 0.002, N = 3SE +/- 0.000, N = 31.1280.3161. (CXX) g++ options: -O3

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 17.3Time To Compilec7g.4xlargea1.4xlarge400800120016002000SE +/- 2.06, N = 3SE +/- 1.80, N = 3497.581765.91

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: Standardc7g.4xlargea1.4xlarge90180270360450SE +/- 0.17, N = 3SE +/- 0.88, N = 34071151. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Apache HTTP Server

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.48Concurrent Requests: 200c7g.4xlargea1.4xlarge16K32K48K64K80KSE +/- 649.31, N = 3SE +/- 59.55, N = 373676.9520887.581. (CC) gcc options: -shared -fPIC -O2

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Proteinc7g.4xlargea1.4xlarge3691215SE +/- 0.060, N = 3SE +/- 0.040, N = 311.2913.2451. (CXX) g++ options: -O3 -lm

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: GPT-2 - Device: CPU - Executor: Standardc7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 2.40, N = 3SE +/- 2.20, N = 3799023121. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cc7g.4xlargea1.4xlarge10002000300040005000SE +/- 9.61, N = 3SE +/- 2.51, N = 34467.191293.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: DistinctUserIDc7g.4xlargea1.4xlarge0.60531.21061.81592.42123.0265SE +/- 0.00, N = 3SE +/- 0.00, N = 32.690.801. (CXX) g++ options: -O3

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: PartialTweetsc7g.4xlargea1.4xlarge0.58951.1791.76852.3582.9475SE +/- 0.00, N = 3SE +/- 0.00, N = 32.620.781. (CXX) g++ options: -O3

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compilec7g.4xlargea1.4xlarge20406080100SE +/- 0.13, N = 3SE +/- 0.27, N = 327.9093.63

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jythonc7g.4xlargea1.4xlarge3K6K9K12K15KSE +/- 6.99, N = 4SE +/- 48.38, N = 4394012997

NAS Parallel Benchmarks

Test / Class: BT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cc7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 7.36, N = 3SE +/- 3.44, N = 310339.533148.181. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninjac7g.4xlargea1.4xlarge400800120016002000SE +/- 5.19, N = 3SE +/- 0.34, N = 3544.931784.60

DaCapo Benchmark

Java Test: Tradesoap

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoapc7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 14.95, N = 4SE +/- 71.92, N = 4352411182

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 2c7g.4xlargea1.4xlarge100200300400500SE +/- 0.11, N = 3SE +/- 0.29, N = 3141.70449.021. (CXX) g++ options: -O3 -fPIC -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: Kostyac7g.4xlargea1.4xlarge0.43650.8731.30951.7462.1825SE +/- 0.00, N = 3SE +/- 0.00, N = 31.940.631. (CXX) g++ options: -O3

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compilec7g.4xlargea1.4xlarge80160240320400SE +/- 0.64, N = 3SE +/- 1.89, N = 3115.02353.91

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6c7g.4xlargea1.4xlarge714212835SE +/- 0.025, N = 3SE +/- 0.064, N = 39.38528.7781. (CXX) g++ options: -O3 -fPIC -lm

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cc7g.4xlargea1.4xlarge17003400510068008500SE +/- 1.96, N = 3SE +/- 0.15, N = 37730.412558.121. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Compression Ratingc7g.4xlargea1.4xlarge20K40K60K80K100KSE +/- 159.36, N = 3SE +/- 91.00, N = 397824324981. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 0c7g.4xlargea1.4xlarge170340510680850SE +/- 0.18, N = 3SE +/- 0.58, N = 3256.84768.301. (CXX) g++ options: -O3 -fPIC -lm

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilec7g.4xlargea1.4xlarge2004006008001000SE +/- 1.33, N = 3SE +/- 0.78, N = 3391.171155.62

PyBench

Total For Average Test Times

OpenBenchmarking.orgMilliseconds, Fewer Is BetterPyBench 2018-02-16Total For Average Test Timesc7g.4xlargea1.4xlarge7001400210028003500SE +/- 0.33, N = 3SE +/- 18.15, N = 311853452

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, Losslessc7g.4xlargea1.4xlarge816243240SE +/- 0.01, N = 3SE +/- 0.31, N = 311.9133.991. (CXX) g++ options: -O3 -fPIC -lm

DaCapo Benchmark

Java Test: Tradebeans

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradebeansc7g.4xlargea1.4xlarge2K4K6K8K10KSE +/- 26.73, N = 4SE +/- 44.35, N = 432039045

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compilec7g.4xlargea1.4xlarge4080120160200SE +/- 0.11, N = 3SE +/- 0.08, N = 369.48196.03

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compilec7g.4xlargea1.4xlarge20406080100SE +/- 0.05, N = 3SE +/- 0.01, N = 326.9474.74

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suitec7g.4xlargea1.4xlarge140K280K420K560K700KSE +/- 525.83, N = 3SE +/- 816.27, N = 3666484241259

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dc7g.4xlargea1.4xlarge2004006008001000SE +/- 0.39, N = 3SE +/- 0.24, N = 3934.72339.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speedc7g.4xlargea1.4xlarge7001400210028003500SE +/- 7.75, N = 3SE +/- 4.74, N = 33050.31121.7-llzma1. (CC) gcc options: -O3 -pthread -lz

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Losslessc7g.4xlargea1.4xlarge1428425670SE +/- 0.09, N = 3SE +/- 0.06, N = 322.7761.80-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: NASNet Mobilec7g.4xlargea1.4xlarge7K14K21K28K35KSE +/- 121.56, N = 15SE +/- 49.84, N = 311591.930986.7

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speedc7g.4xlargea1.4xlarge7001400210028003500SE +/- 6.93, N = 3SE +/- 15.28, N = 33240.61213.9-llzma1. (CC) gcc options: -O3 -pthread -lz

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 10, Losslessc7g.4xlargea1.4xlarge48121620SE +/- 0.021, N = 3SE +/- 0.010, N = 35.76515.2091. (CXX) g++ options: -O3 -fPIC -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest Compressionc7g.4xlargea1.4xlarge306090120150SE +/- 0.01, N = 3SE +/- 0.09, N = 348.21124.71-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedc7g.4xlargea1.4xlarge8001600240032004000SE +/- 2.07, N = 3SE +/- 21.59, N = 33508.51364.6-llzma1. (CC) gcc options: -O3 -pthread -lz

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisc7g.4xlargea1.4xlarge140280420560700SE +/- 0.24, N = 3SE +/- 0.49, N = 3251.40644.791. (CC) gcc options: -O3 -std=c99 -pedantic -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performancec7g.4xlargea1.4xlarge300K600K900K1200K1500KSE +/- 0.00, N = 5SE +/- 196.86, N = 513700945385001. (CC) gcc options: -O3 -march=native

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timec7g.4xlargea1.4xlarge6M12M18M24M30MSE +/- 153578.64, N = 3SE +/- 123749.22, N = 327608891109804301. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDc7g.4xlargea1.4xlarge80160240320400SE +/- 0.15, N = 3SE +/- 0.07, N = 3143.33360.301. (CXX) g++ options: -O2 -lOpenCL

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552c7g.4xlargea1.4xlarge100200300400500SE +/- 1.94, N = 3SE +/- 1.19, N = 3191.29480.791. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

nginx

Concurrent Requests: 1000

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 1000c7g.4xlargea1.4xlarge70K140K210K280K350KSE +/- 1410.11, N = 3SE +/- 66.96, N = 3346814.75138205.111. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 200

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 200c7g.4xlargea1.4xlarge80K160K240K320K400KSE +/- 3986.77, N = 3SE +/- 133.96, N = 3352380.98141436.201. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 500c7g.4xlargea1.4xlarge70K140K210K280K350KSE +/- 1017.52, N = 3SE +/- 141.15, N = 3346613.34139414.841. (CC) gcc options: -lcrypt -lz -O3 -march=native

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Timec7g.4xlargea1.4xlarge20406080100SE +/- 0.01, N = 3SE +/- 0.94, N = 1537.8693.801. (CXX) g++ options: -pipe -O3 -ffast-math -R/usr/lib -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSc7g.4xlargea1.4xlarge40K80K120K160K200KSE +/- 773.26, N = 3SE +/- 59.40, N = 3183708743561. (CC) gcc options: -pedantic -O3

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speedc7g.4xlargea1.4xlarge918273645SE +/- 0.23, N = 3SE +/- 0.00, N = 339.516.0-llzma1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speedc7g.4xlargea1.4xlarge918273645SE +/- 0.00, N = 3SE +/- 0.03, N = 341.216.9-llzma1. (CC) gcc options: -O3 -pthread -lz

nginx

Concurrent Requests: 100

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.21.1Concurrent Requests: 100c7g.4xlargea1.4xlarge70K140K210K280K350KSE +/- 2009.97, N = 3SE +/- 22.67, N = 3345710.87143155.481. (CC) gcc options: -lcrypt -lz -O3 -march=native

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Thoroughc7g.4xlargea1.4xlarge816243240SE +/- 0.00, N = 3SE +/- 0.01, N = 313.9233.521. (CXX) g++ options: -O3 -flto -pthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670c7g.4xlargea1.4xlarge100200300400500SE +/- 0.86, N = 3SE +/- 3.48, N = 3198.22473.901. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 1.0Throughput Test: LargeRandomc7g.4xlargea1.4xlarge0.15750.3150.47250.630.7875SE +/- 0.00, N = 3SE +/- 0.00, N = 30.70.31. (CXX) g++ options: -O3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57c7g.4xlargea1.4xlarge80M160M240M320M400MSE +/- 400097.21, N = 3SE +/- 8819.17, N = 33836066671655133331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H2c7g.4xlargea1.4xlarge14002800420056007000SE +/- 32.57, N = 5SE +/- 63.66, N = 429516740

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Stressc7g.4xlargea1.4xlarge11002200330044005500SE +/- 0.41, N = 3SE +/- 0.16, N = 35029.712366.001. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthc7g.4xlargea1.4xlarge7M14M21M28M35MSE +/- 104795.40, N = 3SE +/- 106812.26, N = 33213412315331550

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_100c7g.4xlargea1.4xlarge150300450600750SE +/- 0.32, N = 3SE +/- 0.00, N = 3675.64331.071. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA256c7g.4xlargea1.4xlarge3000M6000M9000M12000M15000MSE +/- 7739237.92, N = 3SE +/- 12563225.46, N = 31372204597367856895171. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Vector Mathc7g.4xlargea1.4xlarge12K24K36K48K60KSE +/- 17.05, N = 3SE +/- 0.49, N = 355258.1727341.471. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Matrix Mathc7g.4xlargea1.4xlarge20K40K60K80K100KSE +/- 3.18, N = 3SE +/- 40.28, N = 380088.747356.851. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Exhaustivec7g.4xlargea1.4xlarge60120180240300SE +/- 0.01, N = 3SE +/- 0.07, N = 3139.38277.771. (CXX) g++ options: -O3 -flto -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondc7g.4xlargea1.4xlarge90K180K270K360K450KSE +/- 3211.91, N = 3SE +/- 116.54, N = 3405413.86203869.401. (CC) gcc options: -O2 -lrt" -lrt

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Cryptoc7g.4xlargea1.4xlarge5K10K15K20K25KSE +/- 32.01, N = 3SE +/- 6.29, N = 323181.8111985.381. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest Compressionc7g.4xlargea1.4xlarge48121620SE +/- 0.007, N = 3SE +/- 0.072, N = 39.34617.888-ltiff1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 21.06Test: Decompression Ratingc7g.4xlargea1.4xlarge16K32K48K64K80KSE +/- 12.88, N = 3SE +/- 31.21, N = 373054408911. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To Solvec7g.4xlargea1.4xlarge20406080100SE +/- 0.00, N = 3SE +/- 0.01, N = 366.82110.371. (CXX) g++ options: -fopenmp -O2 -march=native

N-Queens

Elapsed Time

OpenBenchmarking.orgSeconds, Fewer Is BetterN-Queens 1.0Elapsed Timec7g.4xlargea1.4xlarge714212835SE +/- 0.00, N = 3SE +/- 0.00, N = 321.5432.291. (CC) gcc options: -static -fopenmp -O3 -march=native

Stress-NG

Test: IO_uring

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: IO_uringc7g.4xlargea1.4xlarge200K400K600K800K1000KSE +/- 614.16, N = 3SE +/- 3840.04, N = 3843015.78918172.371. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21c7g.4xlarge5001000150020002500SE +/- 0.15, N = 32512.71. (CXX) g++ options: -O3 -march=native -rdynamic

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Cachec7g.4xlargea1.4xlarge100200300400500SE +/- 3.64, N = 12SE +/- 3.73, N = 364.31464.851. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelc7g.4xlargea1.4xlarge20406080100SE +/- 0.02, N = 3SE +/- 2.00, N = 1538.52104.761. (CC) gcc options: -lm -lpthread -O3

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterc7g.4xlargea1.4xlarge1122334455SE +/- 0.33, N = 12SE +/- 0.02, N = 313.3047.431. (CXX) g++ options: -O2 -lOpenCL


Phoronix Test Suite v10.8.4