Amazon EC2 c7g.4xlarge Graviton3

Graviton3 benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2205311-NE-2205240NE87
Runs

c7g.4xlarge: run May 24 2022, test duration 8 Hours, 9 Minutes
ampere c7g.4xlarge compar: run May 25 2022, test duration 12 Hours, 34 Minutes
Average test duration: 10 Hours, 21 Minutes


Amazon EC2 c7g.4xlarge Graviton3

c7g.4xlarge
  Processor: ARMv8 Neoverse-V1 (16 Cores)
  Motherboard: Amazon EC2 c7g.4xlarge (1.0 BIOS)
  Chipset: Amazon Device 0200
  Memory: 32GB
  Disk: 193GB Amazon Elastic Block Store
  Network: Amazon Elastic
  OS: Ubuntu 22.04
  Kernel: 5.15.0-1004-aws (aarch64)
  Compiler: GCC 11.2.0
  File-System: ext4
  System Layer: amazon

ampere c7g.4xlarge compar
  Processor: Ampere ARMv8 Neoverse-N1 @ 3.00GHz (160 Cores)
  Motherboard: FOXCONN Mt. Collins (0ACOC017 SCP: 1.08.20210825 BIOS)
  Chipset: Ampere Computing LLC Device e100
  Memory: 16 x 32 GB DDR4-3200MT/s Samsung M393A4K40DB3-CWE
  Disk: 2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07
  Graphics: ASPEED
  Monitor: PL2294H
  Network: 4 x Mellanox MT27710 + 2 x Intel I350
  OS: Ubuntu 20.04
  Kernel: 5.4.0-100-generic (aarch64)
  Vulkan: 1.1.182
  Compiler: GCC 9.4.0
  Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- c7g.4xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- ampere c7g.4xlarge compar: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v

Java Details
- c7g.4xlarge: OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)
- ampere c7g.4xlarge compar: OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.20.04.1)

Python Details
- c7g.4xlarge: Python 3.10.4
- ampere c7g.4xlarge compar: Python 3.8.10

Security Details
- c7g.4xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- ampere c7g.4xlarge compar: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- ampere c7g.4xlarge compar: Scaling Governor: cppc_cpufreq performance

[Figure: c7g.4xlarge vs. ampere c7g.4xlarge compar Comparison (Phoronix Test Suite). Per-test percentage lead of the faster system, plotted on an axis running from Baseline to +4206.9%.]
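
The percentages in the comparison chart appear to be the lead of the faster system over the slower one. A minimal sketch of that arithmetic, using two results reported further down this page (OpenSSL SHA256 and TensorFlow Lite NASNet Mobile); the function name `lead_percent` is hypothetical and the formula is an assumption that happens to match the chart's 823.6% and 5609% figures:

```python
def lead_percent(results, higher_is_better):
    """Return (winner, lead in percent) for a two-system result.

    Assumption: the lead is (better / worse - 1) * 100, which is not
    documented on this page but matches the chart values checked below.
    """
    (name_a, a), (name_b, b) = results.items()
    a_wins = (a > b) if higher_is_better else (a < b)
    winner = name_a if a_wins else name_b
    lead = (max(a, b) / min(a, b) - 1.0) * 100.0
    return winner, lead


# OpenSSL 3.0, SHA256 (byte/s, More Is Better)
sha256 = {"c7g.4xlarge": 13722045973, "ampere c7g.4xlarge compar": 126737105807}
# TensorFlow Lite, NASNet Mobile (Microseconds, Fewer Is Better)
nasnet = {"c7g.4xlarge": 11591.9, "ampere c7g.4xlarge compar": 661777.0}

for label, data, more_is_better in [("SHA256", sha256, True),
                                    ("NASNet Mobile", nasnet, False)]:
    winner, lead = lead_percent(data, more_is_better)
    print(f"{label}: {winner} leads by {lead:.1f}%")
```

The direction flag only decides which system is named the winner; the size of the lead is the same ratio either way.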

Amazon EC2 c7g.4xlarge Graviton3

Tests:
lczero: Eigen
lczero: BLAS
build-nodejs: Time To Compile
build-gem5: Time To Compile
build-llvm: Ninja
asmfish: 1024 Hash Memory, 26 Depth
povray: Trace Time
tensorflow-lite: NASNet Mobile
mrbayes: Primate Phylogeny Analysis
build2: Time To Compile
securemark: SecureMark-TLS
lammps: 20k Atoms
avifenc: 0
ngspice: C7552
ngspice: C2670
mt-dgemm: Sustained Floating-Point Rate
build-php: Time To Compile
npb: SP.C
tensorflow-lite: Inception V4
tensorflow-lite: SqueezeNet
tensorflow-lite: Mobilenet Float
hpcg:
openssl: SHA256
tensorflow-lite: Inception ResNet V2
compress-zstd: 19, Long Mode - Decompression Speed
compress-zstd: 19, Long Mode - Compression Speed
npb: BT.C
npb: LU.C
avifenc: 2
dacapobench: Tradebeans
compress-7zip: Decompression Rating
compress-7zip: Compression Rating
gromacs: MPI CPU - water_GMX50_bare
compress-zstd: 19 - Decompression Speed
compress-zstd: 19 - Compression Speed
compress-zstd: 3 - Decompression Speed
compress-zstd: 3 - Compression Speed
quantlib:
gpaw: Carbon Nanotube
npb: IS.D
stress-ng: CPU Cache
rodinia: OpenMP LavaMD
npb: EP.D
astcenc: Exhaustive
simdjson: DistinctUserID
coremark: CoreMark Size 666 - Iterations Per Second
simdjson: PartialTweets
onnx: fcn-resnet101-11 - CPU - Standard
onnx: GPT-2 - CPU - Standard
onnx: bertsquad-12 - CPU - Standard
tensorflow-lite: Mobilenet Quant
onnx: ArcFace ResNet-100 - CPU - Standard
openssl: RSA4096
openssl: RSA4096
onnx: super-resolution-10 - CPU - Standard
simdjson: Kostya
simdjson: LargeRand
webp: Quality 100, Lossless, Highest Compression
rodinia: OpenMP Streamcluster
dacapobench: H2
apache: 1000
nginx: 1000
nginx: 200
apache: 200
nginx: 100
apache: 500
nginx: 500
apache: 100
m-queens: Time To Solve
phpbench: PHP Benchmark Suite
stress-ng: Crypto
stockfish: Total Time
astcenc: Thorough
stress-ng: CPU Stress
stress-ng: Memory Copying
synthmark: VoiceMark_100
stress-ng: Vector Math
stress-ng: Matrix Math
pybench: Total For Average Test Times
build-apache: Time To Compile
amg:
build-imagemagick: Time To Compile
webp: Quality 100, Lossless
incompact3d: input.i3d 193 Cells Per Direction
c-ray: Total Time - 4K, 16 Rays Per Pixel
npb: FT.C
liquid-dsp: 16 - 256 - 57
rodinia: OpenMP CFD Solver
lulesh:
n-queens: Elapsed Time
npb: CG.C
stress-ng: IO_uring
lammps: Rhodopsin Protein
avifenc: 6, Lossless
webp: Quality 100, Highest Compression
dacapobench: Jython
npb: MG.C
avifenc: 6
dacapobench: Tradesoap
incompact3d: input.i3d 129 Cells Per Direction
avifenc: 10, Lossless
tscp: AI Chess Performance

Values, c7g.4xlarge (unseparated):
11891103497.579391.171544.9293213412337.86311591.9251.397115.02018370811.425256.841191.286198.2245.85386469.4834467.1941855.13257.942156.6026.30581372204597340051.33240.639.510339.537730.41141.698320373054978241.1283050.341.23508.54639.12512.7155.1801041.9064.31143.334934.72139.37972.69405413.8605542.623879904071502.95609178460.42546.428171.940.748.20813.296295172719.33346814.75352380.9873676.95345710.8773546.32346613.3467231.8866.82266648423181.812760889113.92485029.716693.32675.63555258.1780088.74118526.940125880733327.90422.76929.125857038.51711791.7738360666710.47810940.93921.5366571.95843015.7811.29111.9089.346394013481.619.38535248.016714255.7651370094

Values, ampere c7g.4xlarge compar (unseparated):
20261978197.761280.188174.549119173049148.591661777331.96391.33514177130.713264.203235.945232.1593.04829770.00025174.4933031257204.134625.544.72321267371058074013492465.228.963065.2040864.21173.05686124351942275435.5652330.158.02806.63012.81974.865.4021116.18641.9031.3536522.9518.05351.882933123.6787731.8436866.1646811.27387.01.480.5656.88844.69557478.161487006198623.071565451348.220839188.6312885.53561.595394967.53735128.78142132.922219400800024.99726.53612.08999955.25846649.6931990666728.48431552.7261.93624449.5022.2728.35910.206540452823.394.9103.066937096.1701041565

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28, Backend: Eigen (Nodes Per Second, More Is Better)
c7g.4xlarge: 1189 (SE +/- 9.70, N = 3; Min: 1171 / Avg: 1189.33 / Max: 1204)
ampere c7g.4xlarge compar: 2026 (SE +/- 28.10, N = 9; Min: 1871 / Avg: 2026.33 / Max: 2139)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.28, Backend: BLAS (Nodes Per Second, More Is Better)
c7g.4xlarge: 1103 (SE +/- 6.44, N = 3; Min: 1090 / Avg: 1102.67 / Max: 1111)
ampere c7g.4xlarge compar: 1978 (SE +/- 29.70, N = 9; Min: 1852 / Avg: 1978.22 / Max: 2102)
1. (CXX) g++ options: -flto -pthread
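
Each result carries an "SE +/-" figure alongside Min / Avg / Max. As a rough check of what that figure means, the sketch below rebuilds the three c7g.4xlarge Eigen runs from the reported Min, Avg and Max and computes the standard error of the mean. It assumes the SE is the sample standard deviation divided by sqrt(N) and that each run reports a whole number of nodes per second; neither is stated on this page.

```python
import math
import statistics


def standard_error(samples):
    """Sample standard deviation divided by sqrt(N)."""
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Reported for c7g.4xlarge, Backend: Eigen
n, lowest, highest, average = 3, 1171, 1204, 1189.33

# Assumption: runs are whole numbers, so the rounded sum of all runs
# minus Min and Max recovers the one run that is not listed.
middle = round(n * average) - lowest - highest
samples = [lowest, middle, highest]

se = standard_error(samples)
print(samples, round(se, 2))
```

Under those assumptions the computed value agrees with the reported SE +/- 9.70. The reconstruction only works for N = 3; with more runs the individual samples cannot be recovered from Min / Avg / Max alone.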

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript runtime built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 17.3, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 497.58 (SE +/- 2.06, N = 3; Min: 493.85 / Avg: 497.58 / Max: 500.97)
ampere c7g.4xlarge compar: 197.76 (SE +/- 2.15, N = 12; Min: 181.06 / Avg: 197.76 / Max: 209.25)

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research and is widely used in both industry and academia. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation 21.2, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 391.17 (SE +/- 1.33, N = 3; Min: 389.16 / Avg: 391.17 / Max: 393.69)
ampere c7g.4xlarge compar: 280.19 (SE +/- 6.26, N = 8; Min: 257.83 / Avg: 280.19 / Max: 301.53)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 13.0, Build System: Ninja (Seconds, Fewer Is Better)
c7g.4xlarge: 544.93 (SE +/- 5.19, N = 3; Min: 535.72 / Avg: 544.93 / Max: 553.68)
ampere c7g.4xlarge compar: 174.55 (SE +/- 2.96, N = 9; Min: 159.7 / Avg: 174.55 / Max: 185.17)

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
c7g.4xlarge: 32134123 (SE +/- 104795.40, N = 3; Min: 32023095 / Avg: 32134123.33 / Max: 32343588)
ampere c7g.4xlarge compar: 119173049 (SE +/- 1911546.26, N = 9; Min: 107713180 / Avg: 119173049.33 / Max: 125128795)

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7, Trace Time (Seconds, Fewer Is Better)
c7g.4xlarge: 37.86 (SE +/- 0.01, N = 3; Min: 37.84 / Avg: 37.86 / Max: 37.89)
ampere c7g.4xlarge compar: 148.59 (SE +/- 2.69, N = 12; Min: 132.18 / Avg: 148.59 / Max: 164.68)
Additional flags: -R/usr/lib -pthread
1. (CXX) g++ options: -pipe -O3 -ffast-math -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: NASNet Mobile (Microseconds, Fewer Is Better)
c7g.4xlarge: 11591.9 (SE +/- 121.56, N = 15; Min: 10847.8 / Avg: 11591.94 / Max: 12395.4)
ampere c7g.4xlarge compar: 661777.0 (SE +/- 12477.25, N = 15; Min: 586979 / Avg: 661776.73 / Max: 737118)
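
Because this test profile reports average inference time in microseconds, a throughput figure can be easier to compare at a glance. The sketch below converts the NASNet Mobile results into inferences per second, assuming inferences run back to back in a single stream (the page does not say how the benchmark schedules them); `inferences_per_second` is a hypothetical helper name.

```python
def inferences_per_second(avg_inference_us):
    """Convert an average inference time in microseconds to a rate.

    Assumption: one inference at a time, with no gap between them.
    """
    return 1_000_000 / avg_inference_us


nasnet_mobile_us = {
    "c7g.4xlarge": 11591.9,
    "ampere c7g.4xlarge compar": 661777.0,
}

for system, micros in nasnet_mobile_us.items():
    print(f"{system}: {inferences_per_second(micros):.2f} inferences/s")
```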

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better)
c7g.4xlarge: 251.40 (SE +/- 0.24, N = 3; Min: 251.04 / Avg: 251.4 / Max: 251.85)
ampere c7g.4xlarge compar: 331.96 (SE +/- 0.61, N = 3; Min: 330.75 / Avg: 331.96 / Max: 332.65)
1. (CC) gcc options: -O3 -std=c99 -pedantic -lm

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 115.02 (SE +/- 0.64, N = 3; Min: 113.8 / Avg: 115.02 / Max: 115.97)
ampere c7g.4xlarge compar: 91.34 (SE +/- 1.87, N = 15; Min: 79.68 / Avg: 91.34 / Max: 100.09)

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

SecureMark 1.0.4, Benchmark: SecureMark-TLS (marks, More Is Better)
c7g.4xlarge: 183708 (SE +/- 773.26, N = 3; Min: 182165.75 / Avg: 183708.29 / Max: 184575.7)
ampere c7g.4xlarge compar: 141771 (SE +/- 23.89, N = 3; Min: 141734.53 / Avg: 141770.63 / Max: 141815.8)
1. (CC) gcc options: -pedantic -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: 20k Atoms (ns/day, More Is Better)
c7g.4xlarge: 11.43
ampere c7g.4xlarge compar: 30.71 (SE +/- 0.03, N = 3; Min: 30.67 / Avg: 30.71 / Max: 30.76)
Additional flags: -pthread
1. (CXX) g++ options: -O3 -lm

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10, Encoder Speed: 0 (Seconds, Fewer Is Better)
c7g.4xlarge: 256.84 (SE +/- 0.18, N = 3; Min: 256.51 / Avg: 256.84 / Max: 257.1)
ampere c7g.4xlarge compar: 264.20 (SE +/- 3.11, N = 3; Min: 261 / Avg: 264.2 / Max: 270.42)
1. (CXX) g++ options: -O3 -fPIC -lm

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better)
c7g.4xlarge: 191.29 (SE +/- 1.94, N = 3; Min: 188.31 / Avg: 191.29 / Max: 194.94)
ampere c7g.4xlarge compar: 235.95 (SE +/- 2.35, N = 3; Min: 233.42 / Avg: 235.95 / Max: 240.65)
Additional flags: -lXft -lfontconfig -lXrender -lfreetype
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better)
c7g.4xlarge: 198.22 (SE +/- 0.86, N = 3; Min: 197.24 / Avg: 198.22 / Max: 199.94)
ampere c7g.4xlarge compar: 232.16 (SE +/- 0.94, N = 3; Min: 230.86 / Avg: 232.16 / Max: 233.99)
Additional flags: -lXft -lfontconfig -lXrender -lfreetype
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, More Is Better)
c7g.4xlarge: 5.853864 (SE +/- 0.016350, N = 3; Min: 5.83 / Avg: 5.85 / Max: 5.89)
ampere c7g.4xlarge compar: 3.048297 (SE +/- 0.106638, N = 15; Min: 2.43 / Avg: 3.05 / Max: 3.73)
1. (CC) gcc options: -O3 -march=native -fopenmp

Timed PHP Compilation

This test times how long it takes to build PHP 7. Learn more via the OpenBenchmarking.org test page.

Timed PHP Compilation 7.4.2, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 69.48 (SE +/- 0.11, N = 3; Min: 69.32 / Avg: 69.48 / Max: 69.7)
ampere c7g.4xlarge compar: 70.00 (SE +/- 1.21, N = 15; Min: 62.81 / Avg: 70 / Max: 78.04)

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, More Is Better)
c7g.4xlarge: 4467.19 (SE +/- 9.61, N = 3; Min: 4449.83 / Avg: 4467.19 / Max: 4483.01)
ampere c7g.4xlarge compar: 25174.49 (SE +/- 10.76, N = 3; Min: 25158.5 / Avg: 25174.49 / Max: 25194.95)
Additional flags: -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

TensorFlow Lite

TensorFlow Lite 2022-05-18, Model: Inception V4 (Microseconds, Fewer Is Better)
c7g.4xlarge: 41855.1 (SE +/- 210.27, N = 3; Min: 41440.3 / Avg: 41855.1 / Max: 42122.5)
ampere c7g.4xlarge compar: 330312.0 (SE +/- 5364.69, N = 15; Min: 290591 / Avg: 330311.73 / Max: 365454)

TensorFlow Lite 2022-05-18, Model: SqueezeNet (Microseconds, Fewer Is Better)
c7g.4xlarge: 3257.94 (SE +/- 22.07, N = 3; Min: 3216.26 / Avg: 3257.94 / Max: 3291.38)
ampere c7g.4xlarge compar: 57204.10 (SE +/- 1420.97, N = 15; Min: 48518.2 / Avg: 57204.1 / Max: 65772.7)

TensorFlow Lite 2022-05-18, Model: Mobilenet Float (Microseconds, Fewer Is Better)
c7g.4xlarge: 2156.60 (SE +/- 19.61, N = 3; Min: 2129.52 / Avg: 2156.6 / Max: 2194.7)
ampere c7g.4xlarge compar: 34625.50 (SE +/- 545.26, N = 15; Min: 31024.9 / Avg: 34625.51 / Max: 39457.1)

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a newer scientific benchmark from Sandia National Labs that targets supercomputer testing with modern real-world workloads, in contrast to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
c7g.4xlarge: 26.31 (SE +/- 0.04, N = 3; Min: 26.26 / Avg: 26.31 / Max: 26.38)
ampere c7g.4xlarge compar: 44.72 (SE +/- 0.04, N = 3; Min: 44.65 / Avg: 44.72 / Max: 44.76)
Additional flags: -pthread
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0, Algorithm: SHA256 (byte/s, More Is Better)
c7g.4xlarge: 13722045973 (SE +/- 7739237.92, N = 3; Min: 13712096220 / Avg: 13722045973.33 / Max: 13737289210)
ampere c7g.4xlarge compar: 126737105807 (SE +/- 487835541.05, N = 3; Min: 125763149680 / Avg: 126737105806.67 / Max: 127274160240)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

TensorFlow Lite

TensorFlow Lite 2022-05-18, Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
c7g.4xlarge: 40051.3 (SE +/- 305.31, N = 3; Min: 39503.5 / Avg: 40051.33 / Max: 40558.8)
ampere c7g.4xlarge compar: 401349.0 (SE +/- 6995.70, N = 14; Min: 352522 / Avg: 401349.43 / Max: 451484)

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
c7g.4xlarge: 3240.6 (SE +/- 6.93, N = 3; Min: 3229.8 / Avg: 3240.57 / Max: 3253.5)
ampere c7g.4xlarge compar: 2465.2 (SE +/- 4.45, N = 15; Min: 2450.4 / Avg: 2465.17 / Max: 2516.8)
Additional flags: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
c7g.4xlarge: 39.5 (SE +/- 0.23, N = 3; Min: 39 / Avg: 39.47 / Max: 39.7)
ampere c7g.4xlarge compar: 28.9 (SE +/- 1.04, N = 15; Min: 21.5 / Avg: 28.91 / Max: 32.7)
Additional flags: -llzma
1. (CC) gcc options: -O3 -pthread -lz

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
c7g.4xlarge: 10339.53 (SE +/- 7.36, N = 3; Min: 10325.26 / Avg: 10339.53 / Max: 10349.81)
ampere c7g.4xlarge compar: 63065.20 (SE +/- 56.15, N = 3; Min: 62955.25 / Avg: 63065.2 / Max: 63140.01)
Additional flags: -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

NAS Parallel Benchmarks 3.4, Test / Class: LU.C (Total Mop/s, More Is Better)
c7g.4xlarge: 7730.41 (SE +/- 1.96, N = 3; Min: 7728.06 / Avg: 7730.41 / Max: 7734.31)
ampere c7g.4xlarge compar: 40864.21 (SE +/- 28.42, N = 3; Min: 40823.01 / Avg: 40864.21 / Max: 40918.73)
Additional flags: -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

libavif avifenc

libavif avifenc 0.10, Encoder Speed: 2 (Seconds, Fewer Is Better)
c7g.4xlarge: 141.70 (SE +/- 0.11, N = 3; Min: 141.5 / Avg: 141.7 / Max: 141.88)
ampere c7g.4xlarge compar: 173.06 (SE +/- 1.54, N = 3; Min: 170.9 / Avg: 173.06 / Max: 176.05)
1. (CXX) g++ options: -O3 -fPIC -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
c7g.4xlarge: 3203 (SE +/- 26.73, N = 4; Min: 3141 / Avg: 3202.5 / Max: 3264)
ampere c7g.4xlarge compar: 8612 (SE +/- 399.23, N = 20; Min: 6898 / Avg: 8611.65 / Max: 13457)

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 21.06, Test: Decompression Rating (MIPS, More Is Better)
c7g.4xlarge: 73054 (SE +/- 12.88, N = 3; Min: 73037 / Avg: 73053.67 / Max: 73079)
ampere c7g.4xlarge compar: 435194 (SE +/- 4896.50, N = 15; Min: 412578 / Avg: 435194 / Max: 480020)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 21.06, Test: Compression Rating (MIPS, More Is Better)
c7g.4xlarge: 97824 (SE +/- 159.36, N = 3; Min: 97563 / Avg: 97824.33 / Max: 98113)
ampere c7g.4xlarge compar: 227543 (SE +/- 3424.87, N = 15; Min: 207583 / Avg: 227543.27 / Max: 257768)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package using the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2022.1, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
c7g.4xlarge: 1.128 (SE +/- 0.002, N = 3; Min: 1.13 / Avg: 1.13 / Max: 1.13)
ampere c7g.4xlarge compar: 5.565 (SE +/- 0.027, N = 2; Min: 5.54 / Avg: 5.57 / Max: 5.59)
Additional flags shown on graph: -pthread
1. (CXX) g++ options: -O3

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
c7g.4xlarge: 3050.3 (SE +/- 7.75, N = 3; Min: 3042.5 / Avg: 3050.3 / Max: 3065.8)
ampere c7g.4xlarge compar: 2330.1 (SE +/- 2.68, N = 15; Min: 2313.1 / Avg: 2330.13 / Max: 2353)
Additional flags shown on graph: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
c7g.4xlarge: 41.2 (SE +/- 0.00, N = 3; Min: 41.2 / Avg: 41.2 / Max: 41.2)
ampere c7g.4xlarge compar: 58.0 (SE +/- 1.05, N = 15; Min: 50.2 / Avg: 57.99 / Max: 66.8)
Additional flags shown on graph: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression 1.5.0, Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
c7g.4xlarge: 3508.5 (SE +/- 2.07, N = 3; Min: 3504.5 / Avg: 3508.47 / Max: 3511.5)
ampere c7g.4xlarge compar: 2806.6 (SE +/- 5.11, N = 12; Min: 2785.7 / Avg: 2806.56 / Max: 2840.9)
Additional flags shown on graph: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression 1.5.0, Compression Level: 3 - Compression Speed (MB/s, More Is Better)
c7g.4xlarge: 4639.1 (SE +/- 9.57, N = 3; Min: 4620 / Avg: 4639.13 / Max: 4649.1)
ampere c7g.4xlarge compar: 3012.8 (SE +/- 207.50, N = 15; Min: 2061.8 / Avg: 3012.81 / Max: 4700.1)
Additional flags shown on graph: -llzma
1. (CC) gcc options: -O3 -pthread -lz
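
The MB/s figures above are throughput numbers: bytes processed divided by the time taken to compress or decompress. A minimal sketch of that kind of measurement follows. It uses Python's standard-library zlib as a stand-in, not Zstd, so its levels (0 to 9) do not correspond to the Zstd levels 3 and 19 in the results. The function name and the sample data are made up for illustration.

```python
import time
import zlib


def throughput_mb_s(data: bytes, level: int) -> tuple[float, float]:
    """Return (compression MB/s, decompression MB/s) for one zlib round trip."""
    start = time.perf_counter()
    packed = zlib.compress(data, level)
    mid = time.perf_counter()
    unpacked = zlib.decompress(packed)
    end = time.perf_counter()

    # A throughput number is meaningless if the round trip is not lossless.
    if unpacked != data:
        raise ValueError("round trip did not reproduce the input")

    megabytes = len(data) / 1e6
    # Guard against a zero interval on very small inputs.
    compress_time = max(mid - start, 1e-9)
    decompress_time = max(end - mid, 1e-9)
    return megabytes / compress_time, megabytes / decompress_time


# Hypothetical sample input; the real test uses a FreeBSD disk image.
sample = b"benchmark sample data " * 50_000
for level in (3, 9):
    comp, decomp = throughput_mb_s(sample, level)
    print(f"level {level}: compress {comp:.1f} MB/s, decompress {decomp:.1f} MB/s")
```

A single timed round trip like this is noisy. The result page reports an average over N runs with a standard error for that reason.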

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost, and this test uses its built-in benchmark, which reports the QuantLib Benchmark Index score. Learn more via the OpenBenchmarking.org test page.

QuantLib 1.21 (MFLOPS, More Is Better)
c7g.4xlarge: 2512.7 (SE +/- 0.15, N = 3; Min: 2512.5 / Avg: 2512.73 / Max: 2513)
ampere c7g.4xlarge compar: 1974.8 (SE +/- 13.49, N = 15; Min: 1901.8 / Avg: 1974.82 / Max: 2035.1)
1. (CXX) g++ options: -O3 -march=native -rdynamic

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 22.1, Input: Carbon Nanotube (Seconds, Fewer Is Better)
c7g.4xlarge: 155.18 (SE +/- 0.08, N = 3; Min: 155.01 / Avg: 155.18 / Max: 155.29)
ampere c7g.4xlarge compar: 65.40 (SE +/- 0.11, N = 3; Min: 65.19 / Avg: 65.4 / Max: 65.53)
Additional flags shown on graph: -pthread
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
c7g.4xlarge: 1041.90 (SE +/- 2.29, N = 3; Min: 1038.58 / Avg: 1041.9 / Max: 1046.3)
ampere c7g.4xlarge compar: 1116.18 (SE +/- 24.32, N = 15; Min: 1030.06 / Avg: 1116.18 / Max: 1424.26)
Additional flags shown on graph: -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14, Test: CPU Cache (Bogo Ops/s, More Is Better)
c7g.4xlarge: 64.31 (SE +/- 3.64, N = 12; Min: 40.19 / Avg: 64.31 / Max: 82.06)
ampere c7g.4xlarge compar: 641.90 (SE +/- 5.56, N = 7; Min: 622.2 / Avg: 641.9 / Max: 659.51)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
c7g.4xlarge: 143.33 (SE +/- 0.15, N = 3; Min: 143.14 / Avg: 143.33 / Max: 143.64)
ampere c7g.4xlarge compar: 31.35 (SE +/- 0.19, N = 3; Min: 31.06 / Avg: 31.35 / Max: 31.72)
1. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
c7g.4xlarge: 934.72 (SE +/- 0.39, N = 3; Min: 934.01 / Avg: 934.72 / Max: 935.36)
ampere c7g.4xlarge compar: 6522.95 (SE +/- 37.11, N = 3; Min: 6458.49 / Avg: 6522.95 / Max: 6587.03)
Additional flags shown on graph: -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2, Preset: Exhaustive (Seconds, Fewer Is Better)
c7g.4xlarge: 139.38 (SE +/- 0.01, N = 3; Min: 139.36 / Avg: 139.38 / Max: 139.39)
ampere c7g.4xlarge compar: 18.05 (SE +/- 0.15, N = 3; Min: 17.75 / Avg: 18.05 / Max: 18.22)
1. (CXX) g++ options: -O3 -flto -pthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 1.0, Throughput Test: DistinctUserID (GB/s, More Is Better)
c7g.4xlarge: 2.69 (SE +/- 0.00, N = 3; Min: 2.69 / Avg: 2.69 / Max: 2.69)
ampere c7g.4xlarge compar: 1.88 (SE +/- 0.00, N = 3; Min: 1.88 / Avg: 1.88 / Max: 1.88)
Additional flags shown on graph: -pthread
1. (CXX) g++ options: -O3

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
c7g.4xlarge: 405413.86 (SE +/- 3211.91, N = 3; Min: 399077.13 / Avg: 405413.86 / Max: 409495.17)
ampere c7g.4xlarge compar: 2933123.68 (SE +/- 25314.53, N = 15; Min: 2785676.98 / Avg: 2933123.68 / Max: 3115467)
1. (CC) gcc options: -O2 -lrt" -lrt

simdjson

simdjson 1.0, Throughput Test: PartialTweets (GB/s, More Is Better)
c7g.4xlarge: 2.62 (SE +/- 0.00, N = 3; Min: 2.62 / Avg: 2.62 / Max: 2.62)
ampere c7g.4xlarge compar: 1.84 (SE +/- 0.00, N = 3; Min: 1.84 / Avg: 1.84 / Max: 1.84)
Additional flags shown on graph: -pthread
1. (CXX) g++ options: -O3

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.11, Model: fcn-resnet101-11 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
c7g.4xlarge: 38 (SE +/- 0.00, N = 3; Min: 38 / Avg: 38 / Max: 38)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

ONNX Runtime 1.11, Model: GPT-2 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
c7g.4xlarge: 7990 (SE +/- 2.40, N = 3; Min: 7985.5 / Avg: 7990.17 / Max: 7993.5)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Model: GPT-2 - Device: CPU - Executor: Standard

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

ONNX Runtime 1.11, Model: bertsquad-12 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
c7g.4xlarge: 407 (SE +/- 0.17, N = 3; Min: 407 / Avg: 407.17 / Max: 407.5)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Model: bertsquad-12 - Device: CPU - Executor: Standard

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: Mobilenet Quant (Microseconds, Fewer Is Better)
c7g.4xlarge: 1502.95 (SE +/- 17.76, N = 3; Min: 1468.14 / Avg: 1502.95 / Max: 1526.49)
ampere c7g.4xlarge compar: 36866.10 (SE +/- 452.78, N = 3; Min: 36301.6 / Avg: 36866.13 / Max: 37761.6)

ONNX Runtime

ONNX Runtime 1.11, Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
c7g.4xlarge: 609 (SE +/- 0.00, N = 3; Min: 608.5 / Avg: 608.5 / Max: 608.5)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0, Algorithm: RSA4096 (verify/s, More Is Better)
c7g.4xlarge: 178460.4 (SE +/- 82.61, N = 3; Min: 178358.2 / Avg: 178460.37 / Max: 178623.9)
ampere c7g.4xlarge compar: 646811.2 (SE +/- 61.45, N = 3; Min: 646712.4 / Avg: 646811.23 / Max: 646923.9)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL 3.0, Algorithm: RSA4096 (sign/s, More Is Better)
c7g.4xlarge: 2546.4 (SE +/- 0.23, N = 3; Min: 2546 / Avg: 2546.4 / Max: 2546.8)
ampere c7g.4xlarge compar: 7387.0 (SE +/- 20.84, N = 3; Min: 7361.5 / Avg: 7387 / Max: 7428.3)
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

ONNX Runtime

ONNX Runtime 1.11, Model: super-resolution-10 - Device: CPU - Executor: Standard (Inferences Per Minute, More Is Better)
c7g.4xlarge: 2817 (SE +/- 1.86, N = 3; Min: 2815 / Avg: 2817.33 / Max: 2821)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Model: super-resolution-10 - Device: CPU - Executor: Standard

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./onnx: line 2: ./onnxruntime/build/Linux/Release/onnxruntime_perf_test: No such file or directory

simdjson

simdjson 1.0, Throughput Test: Kostya (GB/s, More Is Better)
c7g.4xlarge: 1.94 (SE +/- 0.00, N = 3; Min: 1.94 / Avg: 1.94 / Max: 1.94)
ampere c7g.4xlarge compar: 1.48 (SE +/- 0.00, N = 3; Min: 1.47 / Avg: 1.48 / Max: 1.48)
Additional flags shown on graph: -pthread
1. (CXX) g++ options: -O3

simdjson 1.0, Throughput Test: LargeRandom (GB/s, More Is Better)
c7g.4xlarge: 0.70 (SE +/- 0.00, N = 3; Min: 0.7 / Avg: 0.7 / Max: 0.7)
ampere c7g.4xlarge compar: 0.56 (SE +/- 0.00, N = 3; Min: 0.56 / Avg: 0.56 / Max: 0.57)
Additional flags shown on graph: -pthread
1. (CXX) g++ options: -O3

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, Fewer Is Better)
c7g.4xlarge: 48.21 (SE +/- 0.01, N = 3; Min: 48.2 / Avg: 48.21 / Max: 48.22)
ampere c7g.4xlarge compar: 56.89 (SE +/- 0.02, N = 3; Min: 56.86 / Avg: 56.89 / Max: 56.91)
Additional flags shown on graph: -pthread -ltiff
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Rodinia

Rodinia 3.1, Test: OpenMP Streamcluster (Seconds, Fewer Is Better)
c7g.4xlarge: 13.30 (SE +/- 0.33, N = 12; Min: 11.89 / Avg: 13.3 / Max: 14.87)
ampere c7g.4xlarge compar: 44.70 (SE +/- 0.41, N = 3; Min: 43.9 / Avg: 44.69 / Max: 45.22)
1. (CXX) g++ options: -O2 -lOpenCL

DaCapo Benchmark

DaCapo Benchmark 9.12-MR1, Java Test: H2 (msec, Fewer Is Better)
c7g.4xlarge: 2951 (SE +/- 32.57, N = 5; Min: 2868 / Avg: 2951 / Max: 3068)
ampere c7g.4xlarge compar: 5747 (SE +/- 99.63, N = 20; Min: 5032 / Avg: 5747.35 / Max: 6580)

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.48, Concurrent Requests: 1000 (Requests Per Second, More Is Better)
c7g.4xlarge: 72719.33 (SE +/- 83.83, N = 3; Min: 72567.8 / Avg: 72719.33 / Max: 72857.22)
1. (CC) gcc options: -shared -fPIC -O2

Concurrent Requests: 1000

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./apache: 2: /go/bin/bombardier: not found

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

nginx 1.21.1, Concurrent Requests: 1000 (Requests Per Second, More Is Better)
c7g.4xlarge: 346814.75 (SE +/- 1410.11, N = 3; Min: 344622.05 / Avg: 346814.75 / Max: 349447.11)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Concurrent Requests: 1000

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./nginx: 2: /go/bin/bombardier: not found

nginx 1.21.1, Concurrent Requests: 200 (Requests Per Second, More Is Better)
c7g.4xlarge: 352380.98 (SE +/- 3986.77, N = 3; Min: 344424.56 / Avg: 352380.98 / Max: 356811.55)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Concurrent Requests: 200

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./nginx: 2: /go/bin/bombardier: not found

Apache HTTP Server

Apache HTTP Server 2.4.48, Concurrent Requests: 200 (Requests Per Second, More Is Better)
c7g.4xlarge: 73676.95 (SE +/- 649.31, N = 3; Min: 72788.14 / Avg: 73676.95 / Max: 74941.3)
1. (CC) gcc options: -shared -fPIC -O2

Concurrent Requests: 200

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./apache: 2: /go/bin/bombardier: not found

nginx

nginx 1.21.1, Concurrent Requests: 100 (Requests Per Second, More Is Better)
c7g.4xlarge: 345710.87 (SE +/- 2009.97, N = 3; Min: 341701.14 / Avg: 345710.87 / Max: 347963.74)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Concurrent Requests: 100

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./nginx: 2: /go/bin/bombardier: not found

Apache HTTP Server

Apache HTTP Server 2.4.48, Concurrent Requests: 500 (Requests Per Second, More Is Better)
c7g.4xlarge: 73546.32 (SE +/- 89.82, N = 3; Min: 73405.22 / Avg: 73546.32 / Max: 73713.17)
1. (CC) gcc options: -shared -fPIC -O2

Concurrent Requests: 500

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./apache: 2: /go/bin/bombardier: not found

nginx

nginx 1.21.1, Concurrent Requests: 500 (Requests Per Second, More Is Better)
c7g.4xlarge: 346613.34 (SE +/- 1017.52, N = 3; Min: 344614.99 / Avg: 346613.34 / Max: 347945.69)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Concurrent Requests: 500

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./nginx: 2: /go/bin/bombardier: not found

Apache HTTP Server

Apache HTTP Server 2.4.48, Concurrent Requests: 100 (Requests Per Second, More Is Better)
c7g.4xlarge: 67231.88 (SE +/- 38.09, N = 3; Min: 67187.11 / Avg: 67231.88 / Max: 67307.65)
1. (CC) gcc options: -shared -fPIC -O2

Concurrent Requests: 100

ampere c7g.4xlarge compar: The test quit with a non-zero exit status. The test quit with a non-zero exit status. The test quit with a non-zero exit status. E: ./apache: 2: /go/bin/bombardier: not found

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

m-queens 1.2, Time To Solve (Seconds, Fewer Is Better)
c7g.4xlarge: 66.822 (SE +/- 0.003, N = 3; Min: 66.82 / Avg: 66.82 / Max: 66.83)
ampere c7g.4xlarge compar: 8.161 (SE +/- 0.016, N = 3; Min: 8.14 / Avg: 8.16 / Max: 8.19)
1. (CXX) g++ options: -fopenmp -O2 -march=native
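
For readers unfamiliar with the N-queens problem that m-queens solves, here is a minimal single-threaded sketch of a solution counter using bitmask backtracking. It is an illustration of the problem only, not the m-queens implementation, which is C++ parallelized with OpenMP. The function names are hypothetical.

```python
def count_n_queens(n: int) -> int:
    """Count the ways to place n non-attacking queens on an n x n board."""
    full = (1 << n) - 1  # one bit per column; all set means every row is filled

    def place(cols: int, diag_l: int, diag_r: int) -> int:
        # cols, diag_l and diag_r mark columns attacked in the current row.
        if cols == full:
            return 1
        total = 0
        free = full & ~(cols | diag_l | diag_r)
        while free:
            bit = free & -free  # lowest free column
            free ^= bit
            total += place(
                cols | bit,
                ((diag_l | bit) << 1) & full,  # diagonal attacks shift left per row
                (diag_r | bit) >> 1,           # anti-diagonal attacks shift right
            )
        return total

    return place(0, 0, 0)


print(count_n_queens(8))  # the classic 8x8 board has 92 solutions
```

Each choice of column in the first row starts an independent subtree of the search, which is what makes this kind of solver a natural fit for multi-threading.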

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1, PHP Benchmark Suite (Score, More Is Better)
c7g.4xlarge: 666484 (SE +/- 525.83, N = 3; Min: 665522 / Avg: 666484 / Max: 667333)
ampere c7g.4xlarge compar: 487006 (SE +/- 1848.20, N = 3; Min: 483377 / Avg: 487006 / Max: 489429)

Stress-NG

Stress-NG 0.14 - Test: Crypto (Bogo Ops/s, More Is Better)
c7g.4xlarge: 23181.81 (SE +/- 32.01, N = 3; Min: 23119.13 / Avg: 23181.81 / Max: 23224.4)
ampere c7g.4xlarge compar: 198623.07 (SE +/- 2482.33, N = 4; Min: 194198.56 / Avg: 198623.07 / Max: 203803.02)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 13 - Total Time (Nodes Per Second, More Is Better)
c7g.4xlarge: 27608891 (SE +/- 153578.64, N = 3; Min: 27303905 / Avg: 27608891 / Max: 27792957)
ampere c7g.4xlarge compar: 156545134 (SE +/- 742399.94, N = 3; Min: 155158705 / Avg: 156545133.67 / Max: 157698603)
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2 - Preset: Thorough (Seconds, Fewer Is Better)
c7g.4xlarge: 13.9248 (SE +/- 0.0011, N = 3; Min: 13.92 / Avg: 13.92 / Max: 13.93)
ampere c7g.4xlarge compar: 8.2208 (SE +/- 0.0795, N = 15; Min: 7.78 / Avg: 8.22 / Max: 8.82)
1. (CXX) g++ options: -O3 -flto -pthread

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: CPU Stress (Bogo Ops/s, More Is Better)
c7g.4xlarge: 5029.71 (SE +/- 0.41, N = 3; Min: 5028.91 / Avg: 5029.71 / Max: 5030.29)
ampere c7g.4xlarge compar: 39188.63 (SE +/- 393.27, N = 3; Min: 38402.35 / Avg: 39188.63 / Max: 39599.08)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG 0.14 - Test: Memory Copying (Bogo Ops/s, More Is Better)
c7g.4xlarge: 6693.32 (SE +/- 3.52, N = 3; Min: 6686.28 / Avg: 6693.32 / Max: 6696.97)
ampere c7g.4xlarge compar: 12885.53 (SE +/- 53.04, N = 3; Min: 12780.13 / Avg: 12885.53 / Max: 12948.57)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109 - Test: VoiceMark_100 (Voices, More Is Better)
c7g.4xlarge: 675.64 (SE +/- 0.32, N = 3; Min: 675.15 / Avg: 675.64 / Max: 676.25)
ampere c7g.4xlarge compar: 561.60 (SE +/- 0.18, N = 3; Min: 561.36 / Avg: 561.59 / Max: 561.94)
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: Vector Math (Bogo Ops/s, More Is Better)
c7g.4xlarge: 55258.17 (SE +/- 17.05, N = 3; Min: 55237.21 / Avg: 55258.17 / Max: 55291.94)
ampere c7g.4xlarge compar: 394967.53 (SE +/- 2465.32, N = 3; Min: 391830.53 / Avg: 394967.53 / Max: 399830.4)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Stress-NG 0.14 - Test: Matrix Math (Bogo Ops/s, More Is Better)
c7g.4xlarge: 80088.74 (SE +/- 3.18, N = 3; Min: 80082.86 / Avg: 80088.74 / Max: 80093.79)
ampere c7g.4xlarge compar: 735128.78 (SE +/- 7975.03, N = 3; Min: 719292.98 / Avg: 735128.78 / Max: 744697.04)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

PyBench 2018-02-16 - Total For Average Test Times (Milliseconds, Fewer Is Better)
c7g.4xlarge: 1185 (SE +/- 0.33, N = 3; Min: 1184 / Avg: 1184.67 / Max: 1185)
ampere c7g.4xlarge compar: 1421 (SE +/- 2.96, N = 3; Min: 1417 / Avg: 1421.33 / Max: 1427)

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.41 - Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 26.94 (SE +/- 0.05, N = 3; Min: 26.87 / Avg: 26.94 / Max: 27.04)
ampere c7g.4xlarge compar: 32.92 (SE +/- 0.25, N = 3; Min: 32.43 / Avg: 32.92 / Max: 33.2)

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)
c7g.4xlarge: 1258807333 (SE +/- 952437.28, N = 3; Min: 1256931000 / Avg: 1258807333.33 / Max: 1260030000)
ampere c7g.4xlarge compar: 2194008000 (SE +/- 3186022.65, N = 3; Min: 2187757000 / Avg: 2194008000 / Max: 2198204000)
Per-result flag annotation (system not identifiable from the extracted text): -pthread
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

Timed ImageMagick Compilation 6.9.0 - Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge: 27.90 (SE +/- 0.13, N = 3; Min: 27.67 / Avg: 27.9 / Max: 28.12)
ampere c7g.4xlarge compar: 25.00 (SE +/- 0.23, N = 3; Min: 24.53 / Avg: 25 / Max: 25.24)

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1 - Encode Settings: Quality 100, Lossless (Encode Time - Seconds, Fewer Is Better)
c7g.4xlarge: 22.77 (SE +/- 0.09, N = 3; Min: 22.67 / Avg: 22.77 / Max: 22.94)
ampere c7g.4xlarge compar: 26.54 (SE +/- 0.00, N = 3; Min: 26.53 / Avg: 26.54 / Max: 26.54)
Per-result flag annotations (system not identifiable from the extracted text): -pthread -ltiff
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
c7g.4xlarge: 29.13 (SE +/- 0.03, N = 3; Min: 29.1 / Avg: 29.13 / Max: 29.18)
ampere c7g.4xlarge compar: 12.09 (SE +/- 0.03, N = 3; Min: 12.05 / Avg: 12.09 / Max: 12.16)
Per-result flag annotations (system not identifiable from the extracted text): -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
c7g.4xlarge: 38.517 (SE +/- 0.016, N = 3; Min: 38.49 / Avg: 38.52 / Max: 38.55)
ampere c7g.4xlarge compar: 5.258 (SE +/- 0.014, N = 3; Min: 5.23 / Avg: 5.26 / Max: 5.27)
1. (CC) gcc options: -lm -lpthread -O3

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
c7g.4xlarge: 11791.77 (SE +/- 1.17, N = 3; Min: 11789.44 / Avg: 11791.77 / Max: 11792.99)
ampere c7g.4xlarge compar: 46649.69 (SE +/- 33.90, N = 2; Min: 46615.79 / Avg: 46649.69 / Max: 46683.58)
Per-result flag annotations (system not identifiable from the extracted text): -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c7g.4xlarge: 383606667 (SE +/- 400097.21, N = 3; Min: 382810000 / Avg: 383606666.67 / Max: 384070000)
ampere c7g.4xlarge compar: 319906667 (SE +/- 234828.54, N = 3; Min: 319450000 / Avg: 319906666.67 / Max: 320230000)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
c7g.4xlarge: 10.48 (SE +/- 0.02, N = 3; Min: 10.44 / Avg: 10.48 / Max: 10.51)
ampere c7g.4xlarge compar: 28.48 (SE +/- 0.28, N = 3; Min: 28.03 / Avg: 28.48 / Max: 28.98)
1. (CXX) g++ options: -O2 -lOpenCL

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
c7g.4xlarge: 10940.94 (SE +/- 76.73, N = 3; Min: 10787.69 / Avg: 10940.94 / Max: 11024.62)
ampere c7g.4xlarge compar: 31552.73 (SE +/- 31.03, N = 3; Min: 31519.42 / Avg: 31552.73 / Max: 31614.72)
Per-result flag annotation (system not identifiable from the extracted text): -pthread
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.

N-Queens 1.0 - Elapsed Time (Seconds, Fewer Is Better)
c7g.4xlarge: 21.536 (SE +/- 0.000, N = 3; Min: 21.54 / Avg: 21.54 / Max: 21.54)
ampere c7g.4xlarge compar: 1.936 (SE +/- 0.041, N = 15; Min: 1.51 / Avg: 1.94 / Max: 2.13)
1. (CC) gcc options: -static -fopenmp -O3 -march=native
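For readers unfamiliar with the workload, the following is a minimal single-threaded sketch of counting N-queens solutions with bitmask backtracking. It is not the benchmark's code: the tested program is an OpenMP C implementation, and the function name `count_n_queens` is made up here for illustration. Pure Python is far too slow for the board size of 18 used by the test, so the usage line sticks to a small board.

```python
def count_n_queens(n: int) -> int:
    """Count all placements of n mutually non-attacking queens on an n x n board."""
    full = (1 << n) - 1  # one bit per column; all set means every row is filled

    def place(cols: int, diag1: int, diag2: int) -> int:
        # cols / diag1 / diag2 mark the columns attacked in the current row
        # by queens already placed (vertically and along both diagonals).
        if cols == full:
            return 1
        total = 0
        free = full & ~(cols | diag1 | diag2)
        while free:
            bit = free & -free  # lowest free column
            free ^= bit
            total += place(
                cols | bit,
                ((diag1 | bit) << 1) & full,  # diagonal attacks shift left per row
                (diag2 | bit) >> 1,           # anti-diagonal attacks shift right
            )
        return total

    return place(0, 0, 0)


print(count_n_queens(8))  # 92
```

The top-level branches (one per column in the first row) are independent, which is what makes the problem easy to spread across threads, as the OpenMP versions benchmarked here do.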

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
c7g.4xlarge: 6571.95 (SE +/- 17.12, N = 3; Min: 6551.05 / Avg: 6571.95 / Max: 6605.88)
ampere c7g.4xlarge compar: 24449.50 (SE +/- 224.01, N = 3; Min: 24050.07 / Avg: 24449.5 / Max: 24824.95)
Per-result flag annotations (system not identifiable from the extracted text): -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: IO_uring (Bogo Ops/s, More Is Better)
c7g.4xlarge: 843015.78 (SE +/- 614.16, N = 3; Min: 841810.62 / Avg: 843015.78 / Max: 843823.92)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Test: IO_uring

ampere c7g.4xlarge compar: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
c7g.4xlarge: 11.29 (SE +/- 0.06, N = 3; Min: 11.17 / Avg: 11.29 / Max: 11.36)
ampere c7g.4xlarge compar: 22.27 (SE +/- 0.22, N = 15; Min: 20.72 / Avg: 22.27 / Max: 24.03)
Per-result flag annotation (system not identifiable from the extracted text): -pthread
1. (CXX) g++ options: -O3 -lm

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
c7g.4xlarge: 11.908 (SE +/- 0.011, N = 3; Min: 11.89 / Avg: 11.91 / Max: 11.92)
ampere c7g.4xlarge compar: 8.359 (SE +/- 0.039, N = 3; Min: 8.3 / Avg: 8.36 / Max: 8.43)
1. (CXX) g++ options: -O3 -fPIC -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1 - Encode Settings: Quality 100, Highest Compression (Encode Time - Seconds, Fewer Is Better)
c7g.4xlarge: 9.346 (SE +/- 0.007, N = 3; Min: 9.34 / Avg: 9.35 / Max: 9.36)
ampere c7g.4xlarge compar: 10.206 (SE +/- 0.103, N = 3; Min: 10.1 / Avg: 10.21 / Max: 10.41)
Per-result flag annotations (system not identifiable from the extracted text): -pthread -ltiff
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1 - Java Test: Jython (msec, Fewer Is Better)
c7g.4xlarge: 3940 (SE +/- 6.99, N = 4; Min: 3927 / Avg: 3940.25 / Max: 3960)
ampere c7g.4xlarge compar: 5404 (SE +/- 31.65, N = 4; Min: 5339 / Avg: 5404 / Max: 5475)

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
c7g.4xlarge: 13481.61 (SE +/- 4.69, N = 3; Min: 13472.59 / Avg: 13481.61 / Max: 13488.33)
ampere c7g.4xlarge compar: 52823.39 (SE +/- 24.22, N = 3; Min: 52798.49 / Avg: 52823.39 / Max: 52871.83)
Per-result flag annotations (system not identifiable from the extracted text): -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 6 (Seconds, Fewer Is Better)
c7g.4xlarge: 9.385 (SE +/- 0.025, N = 3; Min: 9.34 / Avg: 9.38 / Max: 9.41)
ampere c7g.4xlarge compar: 4.910 (SE +/- 0.061, N = 4; Min: 4.81 / Avg: 4.91 / Max: 5.09)
1. (CXX) g++ options: -O3 -fPIC -lm

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1 - Java Test: Tradesoap (msec, Fewer Is Better)
c7g.4xlarge: 3524 (SE +/- 14.95, N = 4; Min: 3487 / Avg: 3523.75 / Max: 3551)

Java Test: Tradesoap

ampere c7g.4xlarge compar: The test run did not produce a result. The test run did not produce a result. The test run did not produce a result. The test run did not produce a result.

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
c7g.4xlarge: 8.01671425 (SE +/- 0.01401446, N = 3; Min: 8 / Avg: 8.02 / Max: 8.04)
ampere c7g.4xlarge compar: 3.06693709 (SE +/- 0.00831997, N = 2; Min: 3.06 / Avg: 3.07 / Max: 3.08)
Per-result flag annotations (system not identifiable from the extracted text): -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz; -pthread
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 10, Lossless (Seconds, Fewer Is Better)
c7g.4xlarge: 5.765 (SE +/- 0.021, N = 3; Min: 5.73 / Avg: 5.76 / Max: 5.8)
ampere c7g.4xlarge compar: 6.170 (SE +/- 0.015, N = 3; Min: 6.14 / Avg: 6.17 / Max: 6.2)
1. (CXX) g++ options: -O3 -fPIC -lm

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81 - AI Chess Performance (Nodes Per Second, More Is Better)
c7g.4xlarge: 1370094 (SE +/- 0.00, N = 5; Min: 1370094 / Avg: 1370094 / Max: 1370094)
ampere c7g.4xlarge compar: 1041565 (SE +/- 966.51, N = 5; Min: 1039203 / Avg: 1041565.2 / Max: 1045119)
1. (CC) gcc options: -O3 -march=native
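Because some results are "More Is Better" and others "Fewer Is Better", comparing the two systems across tests requires normalizing each result to a ratio first. The sketch below does that for three of the results above and then takes a geometric mean of the ratios. The helper names and the choice of three tests are illustrative assumptions, so the output is not the page's own overall geometric mean, which would cover every result in the file.

```python
import math

# (test, c7g.4xlarge value, "ampere c7g.4xlarge compar" value, higher_is_better)
RESULTS = [
    ("TSCP 1.81 (nodes/s)", 1370094, 1041565, True),
    ("libavif avifenc 0.10, speed 10 lossless (s)", 5.765, 6.170, False),
    ("Xcompact3d Incompact3d, 129 cells (s)", 8.01671425, 3.06693709, False),
]


def relative_performance(baseline, other, higher_is_better):
    """How many times faster `other` is than `baseline` (values above 1 favor `other`)."""
    return other / baseline if higher_is_better else baseline / other


def geometric_mean(values):
    """Geometric mean computed via logs to avoid overflow on long lists."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


ratios = [relative_performance(a, b, hib) for _, a, b, hib in RESULTS]
for (name, *_), ratio in zip(RESULTS, ratios):
    print(f"{name}: {ratio:.3f}x relative to c7g.4xlarge")
print(f"geometric mean of these three ratios: {geometric_mean(ratios):.3f}x")
```

A geometric mean is used rather than an arithmetic one so that a 2x win and a 2x loss cancel out to 1x, regardless of which system is chosen as the baseline.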

102 Results Shown
