Amazon EC2 Graviton3 Benchmark Comparison

Amazon AWS Graviton3 benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2205260-PTS-GRAVITON42

Run Management

Result Identifier        Date Run       Test Duration
a1.4xlarge Graviton      May 25 2022    17 Hours, 42 Minutes
c6g.4xlarge Graviton2    May 25 2022    9 Hours, 58 Minutes
c7g.4xlarge Graviton3    May 24 2022    7 Hours, 43 Minutes
c6a.4xlarge EPYC         May 26 2022    11 Hours, 43 Minutes
c6i.4xlarge Xeon         May 26 2022    9 Hours, 40 Minutes
Average                                 11 Hours, 21 Minutes


Amazon EC2 Graviton3 Benchmark Comparison: System Details (OpenBenchmarking.org)

c7g.4xlarge Graviton3
  Processor: ARMv8 Neoverse-V1 (16 Cores)
  Motherboard: Amazon EC2 c7g.4xlarge (1.0 BIOS)
  Chipset: Amazon Device 0200
  Memory: 32GB
  Disk: 193GB Amazon Elastic Block Store
  Network: Amazon Elastic
  OS: Ubuntu 22.04
  Kernel: 5.15.0-1004-aws (aarch64)
  Compiler: GCC 11.2.0
  File-System: ext4
  System Layer: amazon

The remaining systems list only the fields that differ from the system before them.

a1.4xlarge Graviton
  Processor: ARMv8 Cortex-A72 (16 Cores)
  Motherboard: Amazon EC2 a1.4xlarge (1.0 BIOS)

c6g.4xlarge Graviton2
  Processor: ARMv8 Neoverse-N1 (16 Cores)
  Motherboard: Amazon EC2 c6g.4xlarge (1.0 BIOS)

c6a.4xlarge EPYC
  Processor: AMD EPYC 7R13 (8 Cores / 16 Threads)
  Motherboard: Amazon EC2 c6a.4xlarge (1.0 BIOS)
  Chipset: Intel 440FX 82441FX PMC
  Kernel: 5.15.0-1004-aws (x86_64)

c6i.4xlarge Xeon
  Processor: Intel Xeon Platinum 8375C (8 Cores / 16 Threads)
  Motherboard: Amazon EC2 c6i.4xlarge (1.0 BIOS)

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- c7g.4xlarge Graviton3, a1.4xlarge Graviton, c6g.4xlarge Graviton2 (identical configuration): --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v
- c6a.4xlarge EPYC, c6i.4xlarge Xeon (identical configuration): --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Java Details
- OpenJDK Runtime Environment (build 11.0.15+10-Ubuntu-0ubuntu0.22.04.1)

Python Details
- Python 3.10.4

Security Details
- c7g.4xlarge Graviton3: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- a1.4xlarge Graviton: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Not affected + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Branch predictor hardening BHB + srbds: Not affected + tsx_async_abort: Not affected
- c6g.4xlarge Graviton2: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- c6a.4xlarge EPYC: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
- c6i.4xlarge Xeon: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Processor Details
- c6a.4xlarge EPYC: CPU Microcode: 0xa001144
- c6i.4xlarge Xeon: CPU Microcode: 0xd000331

[Figure: Logarithmic Result Overview (Phoronix Test Suite), comparing c7g.4xlarge Graviton3, a1.4xlarge Graviton, c6g.4xlarge Graviton2, c6a.4xlarge EPYC, and c6i.4xlarge Xeon across the benchmarks in this result file.]

[Table: Amazon EC2 Graviton3 Benchmark Comparison (OpenBenchmarking.org), a one-row-per-test summary of the averages for c7g.4xlarge Graviton3, a1.4xlarge Graviton, c6g.4xlarge Graviton2, c6a.4xlarge EPYC, and c6i.4xlarge Xeon. The per-test results are listed in the sections that follow.]

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14, Test: Memory Copying (Bogo Ops/s, more is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min        Max        N
c7g.4xlarge Graviton3    6693.32    3.52     6686.28    6696.97    3
a1.4xlarge Graviton      798.24     0.91     796.83     799.93     3
c6g.4xlarge Graviton2    2903.00    3.75     2895.63    2907.88    3
c6a.4xlarge EPYC         3551.80    11.57    3528.68    3564.29    3
c6i.4xlarge Xeon         3150.49    0.94     3148.99    3152.22    3

(CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
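
The SE figures can be sanity-checked from the Min / Avg / Max values whenever N = 3, because the third run is then fixed by the mean. The Python sketch below does this for the c7g.4xlarge Graviton3 Memory Copying result. It assumes the reported SE is the sample standard deviation (n - 1 denominator) divided by sqrt(N); the result file does not state the formula, and the helper names `third_run` and `standard_error` are made up for this illustration.

```python
import math

def third_run(minimum, average, maximum):
    """With exactly three runs, the middle value follows from the mean."""
    return 3 * average - minimum - maximum

def standard_error(samples):
    """Sample standard deviation (n - 1 denominator) divided by sqrt(n).

    Assumption: this is the formula behind the reported "SE +/-" values.
    """
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance / n)

# c7g.4xlarge Graviton3, Stress-NG Memory Copying: Min / Avg / Max from the result above
lowest, average, highest = 6686.28, 6693.32, 6696.97
runs = [lowest, third_run(lowest, average, highest), highest]
se = standard_error(runs)

print("reconstructed runs:", [round(r, 2) for r in runs])
print(f"standard error: {se:.2f}")
```

Under that assumption the reconstructed value lands close to the reported SE of 3.52. Because the published average is rounded to two decimals, small deviations are to be expected for other rows.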

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and allows selecting among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: MG.C (Total Mop/s, more is better; OpenBenchmarking.org)

Instance                 Average     SE +/-    Min         Max         N
c7g.4xlarge Graviton3    13481.61    4.69      13472.59    13488.33    3
a1.4xlarge Graviton      3266.36     1.64      3263.37     3269.01     3
c6g.4xlarge Graviton2    6720.68     1.39      6719.25     6723.47     3
c6a.4xlarge EPYC         16826.43    30.62     16791.6     16887.46    3
c6i.4xlarge Xeon         26298.81    184.24    26062.84    26661.9     3

1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: CG.C (Total Mop/s, more is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min        Max        N
c7g.4xlarge Graviton3    6571.95    17.12    6551.05    6605.88    3
a1.4xlarge Graviton      1213.15    11.79    1179       1248.52    6
c6g.4xlarge Graviton2    3520.86    9.95     3501.43    3534.33    3
c6a.4xlarge EPYC         6169.22    81.25    6043       6320.96    3
c6i.4xlarge Xeon         9522.82    66.44    9400.77    9629.37    3

1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

NAS Parallel Benchmarks 3.4, Test / Class: SP.C (Total Mop/s, more is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min        Max        N
c7g.4xlarge Graviton3    4467.19    9.61     4449.83    4483.01    3
a1.4xlarge Graviton      1293.80    2.51     1288.84    1296.88    3
c6g.4xlarge Graviton2    2356.16    0.57     2355.3     2357.24    3
c6a.4xlarge EPYC         8094.79    24.63    8057.91    8141.52    3
c6i.4xlarge Xeon         9563.22    73.65    9433.84    9688.88    3

1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2
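
To condense several "more is better" results into one generational figure, a common approach is the geometric mean of per-test ratios. The sketch below applies it to the MG.C, CG.C and SP.C averages listed above, comparing Graviton3 against Graviton2. The dictionary layout and the `geometric_mean` helper are illustrative choices, and three NPB kernels are not a stand-in for the overall geometric mean of the full result file.

```python
import math

# Total Mop/s averages (more is better) from the MG.C, CG.C and SP.C results above
npb = {
    "MG.C": {"Graviton3": 13481.61, "Graviton2": 6720.68},
    "CG.C": {"Graviton3": 6571.95, "Graviton2": 3520.86},
    "SP.C": {"Graviton3": 4467.19, "Graviton2": 2356.16},
}

def geometric_mean(values):
    """Geometric mean computed via logarithms to avoid overflow on long inputs."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Per-test speedup of Graviton3 over Graviton2
ratios = {test: r["Graviton3"] / r["Graviton2"] for test, r in npb.items()}
geomean = geometric_mean(ratios.values())

for test, ratio in ratios.items():
    print(f"{test}: {ratio:.2f}x")
print(f"geometric mean: {geomean:.2f}x")
```

The geometric mean is used rather than the arithmetic mean because it treats ratios symmetrically: swapping the baseline simply inverts the result.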

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 3 - Compression Speed (MB/s, more is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min       Max       N
c7g.4xlarge Graviton3    4639.13    9.57     4620      4649.1    3
a1.4xlarge Graviton      633.93     4.47     626.2     641.7     3
c6g.4xlarge Graviton2    2888.33    6.37     2876.8    2898.8    3
c6a.4xlarge EPYC         2783.97    2.65     2779.8    2788.9    3
c6i.4xlarge Xeon         3440.6     29.53    3408.7    3499.6    3

(CC) gcc options: -O3 -pthread -lz; -llzma is additionally listed for four of the five systems.

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4, Test / Class: FT.C (Total Mop/s, more is better; OpenBenchmarking.org)

Instance                 Average     SE +/-   Min         Max         N
c7g.4xlarge Graviton3    11791.77    1.17     11789.44    11792.99    3
a1.4xlarge Graviton      2927.16     1.73     2923.87     2929.76     3
c6g.4xlarge Graviton2    6244.48     1.10     6242.29     6245.7      3
c6a.4xlarge EPYC         18299.96    45.90    18217.55    18376.18    3
c6i.4xlarge Xeon         20423.57    40.24    20343.5     20470.62    3

1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient benchmark, a newer scientific benchmark from Sandia National Labs intended for supercomputer testing with modern real-world workloads, in contrast to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, more is better; OpenBenchmarking.org)

Instance                 Average     SE +/-     Min      Max      N
c7g.4xlarge Graviton3    26.30580    0.03738    26.26    26.38    3
a1.4xlarge Graviton      3.77834     0.00065    3.78     3.78     3
c6g.4xlarge Graviton2    19.72180    0.01639    19.7     19.75    3
c6a.4xlarge EPYC         5.06042     0.00225    5.06     5.06     3
c6i.4xlarge Xeon         8.66031     0.04033    8.58     8.7      3

(CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, more is better; OpenBenchmarking.org)

Instance                 Average          SE +/-        Min           Max           N
c7g.4xlarge Graviton3    1258807333.33    952437.28     1256931000    1260030000    3
a1.4xlarge Graviton      186716933.33     176548.39     186372300     186955800     3
c6g.4xlarge Graviton2    932652900        3420043.89    927742900     939232100     3
c6a.4xlarge EPYC         267670700        103921.81     267514000     267867300     3
c6i.4xlarge Xeon         661364766.67     5114517.12    654941100     671470600     3

(CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite-difference, high-performance code for solving the incompressible Navier-Stokes equations along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 129 Cells Per Direction (Seconds, fewer is better; OpenBenchmarking.org)

Instance                 Average        SE +/-        Min      Max      N
c7g.4xlarge Graviton3    8.01671425     0.01401446    8        8.04     3
a1.4xlarge Graviton      53.77062740    0.02862870    53.73    53.83    3
c6g.4xlarge Graviton2    11.57335470    0.01351889    11.55    11.59    3
c6a.4xlarge EPYC         28.27976610    0.03718674    28.22    28.35    3
c6i.4xlarge Xeon         17.86827720    0.09619197    17.69    18.01    3

(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

ACES DGEMM

This is a multi-threaded DGEMM benchmark. Learn more via the OpenBenchmarking.org test page.

ACES DGEMM 1.0, Sustained Floating-Point Rate (GFLOP/s, more is better; OpenBenchmarking.org)

Instance                 Average     SE +/-      Min     Max     N
c7g.4xlarge Graviton3    5.853864    0.016350    5.83    5.89    3
a1.4xlarge Graviton      0.891391    0.002370    0.89    0.9     3
c6g.4xlarge Graviton2    4.785123    0.007139    4.77    4.8     3
c6a.4xlarge EPYC         2.432432    0.023324    2.32    2.48    6
c6i.4xlarge Xeon         2.230545    0.003819    2.22    2.24    3

(CC) gcc options: -O3 -march=native -fopenmp

Xcompact3d Incompact3d

Xcompact3d Incompact3d 2021-03-11, Input: input.i3d 193 Cells Per Direction (Seconds, fewer is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min       Max       N
c7g.4xlarge Graviton3    29.13      0.03     29.1      29.18     3
a1.4xlarge Graviton      182.58     0.15     182.36    182.88    3
c6g.4xlarge Graviton2    41.02      0.01     41.01     41.05     3
c6a.4xlarge EPYC         110.77     0.12     110.58    111.01    3
c6i.4xlarge Xeon         69.22      0.14     68.94     69.38     3

(F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
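
Results reported in seconds have to be inverted before they can be compared with throughput-style results. The sketch below normalizes the 193-cell Incompact3d averages to the c6g.4xlarge Graviton2 run so that values above 1.0 mean "faster than the baseline". The function name `relative_performance` and the choice of Graviton2 as baseline are assumptions made for this example, not something the result file prescribes.

```python
def relative_performance(result, baseline, higher_is_better):
    """Return a ratio that is > 1.0 whenever `result` beats `baseline`."""
    if higher_is_better:
        return result / baseline
    # For time-based results a smaller number is the better one, so invert.
    return baseline / result

# Average seconds (fewer is better), input.i3d 193 Cells Per Direction, from the result above
seconds = {
    "c7g.4xlarge Graviton3": 29.13,
    "a1.4xlarge Graviton": 182.58,
    "c6g.4xlarge Graviton2": 41.02,
    "c6a.4xlarge EPYC": 110.77,
    "c6i.4xlarge Xeon": 69.22,
}

baseline = seconds["c6g.4xlarge Graviton2"]
rel = {
    name: relative_performance(value, baseline, higher_is_better=False)
    for name, value in seconds.items()
}

# Print fastest first
for name, ratio in sorted(rel.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:24s} {ratio:.2f}x")
```

Keeping the direction as an explicit argument avoids the easy mistake of dividing the wrong way round when a result file mixes "more is better" and "fewer is better" tests.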

Stress-NG

Stress-NG 0.14, Test: CPU Stress (Bogo Ops/s, more is better; OpenBenchmarking.org)

Instance                 Average     SE +/-    Min         Max         N
c7g.4xlarge Graviton3    5029.71     0.41      5028.91     5030.29     3
a1.4xlarge Graviton      2366.00     0.16      2365.79     2366.31     3
c6g.4xlarge Graviton2    3404.94     0.54      3404.14     3405.97     3
c6a.4xlarge EPYC         13304.50    37.60     13246.57    13374.99    3
c6i.4xlarge Xeon         12527.16    155.66    12365.65    12838.41    3

(CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 1.0, Throughput Test: DistinctUserID (GB/s, more is better; OpenBenchmarking.org)

Instance                 Average    SE +/-   Min     Max     N
c7g.4xlarge Graviton3    2.69       0.00     2.69    2.69    3
a1.4xlarge Graviton      0.80       0.00     0.8     0.8     3
c6g.4xlarge Graviton2    1.53       0.00     1.53    1.53    3
c6a.4xlarge EPYC         4.30       0.01     4.29    4.31    3
c6i.4xlarge Xeon         4.30       0.00     4.29    4.3     3

(CXX) g++ options: -O3

Timed MrBayes Analysis

This test performs a Bayesian analysis of a set of primate genome sequences in order to estimate their phylogeny. Learn more via the OpenBenchmarking.org test page.

Timed MrBayes Analysis 3.2.7, Primate Phylogeny Analysis (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 251.40 (SE +/- 0.24, N = 3; Min: 251.04 / Avg: 251.4 / Max: 251.85)
a1.4xlarge Graviton: 644.79 (SE +/- 0.49, N = 3; Min: 644.13 / Avg: 644.79 / Max: 645.74)
c6g.4xlarge Graviton2: 384.75 (SE +/- 0.11, N = 3; Min: 384.53 / Avg: 384.75 / Max: 384.88)
c6a.4xlarge EPYC: 120.64 (SE +/- 0.35, N = 3; Min: 120.17 / Avg: 120.64 / Max: 121.31)
c6i.4xlarge Xeon: 134.92 (SE +/- 1.43, N = 3; Min: 132.1 / Avg: 134.92 / Max: 136.68)
c6a.4xlarge EPYC additional flags: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -mabm
c6i.4xlarge Xeon additional flags: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mabm
1. (CC) gcc options: -O3 -std=c99 -pedantic -lm
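
Because some results on this page are "More Is Better" and others "Fewer Is Better", comparing two instances means flipping the ratio depending on the metric's direction. The following is a minimal Python sketch of that arithmetic, not part of the Phoronix Test Suite; the function name `speedup` and its parameters are hypothetical, and the input numbers are the averages reported above for simdjson DistinctUserID and Timed MrBayes Analysis.

```python
def speedup(new_value, old_value, higher_is_better):
    """Return how many times faster the new system is than the old one.

    For throughput-style metrics (More Is Better) the ratio is new/old;
    for time-style metrics (Fewer Is Better) it is old/new.
    """
    if new_value <= 0 or old_value <= 0:
        raise ValueError("benchmark results must be positive")
    if higher_is_better:
        return new_value / old_value
    return old_value / new_value


# Graviton3 vs Graviton2, simdjson DistinctUserID (GB/s, More Is Better)
simdjson_gain = speedup(2.69, 1.53, higher_is_better=True)

# Graviton3 vs Graviton2, Timed MrBayes Analysis (Seconds, Fewer Is Better)
mrbayes_gain = speedup(251.40, 384.75, higher_is_better=False)

print(f"simdjson DistinctUserID: {simdjson_gain:.2f}x")
print(f"Timed MrBayes Analysis:  {mrbayes_gain:.2f}x")
```

A value above 1.0 means the first system is ahead for that test, regardless of which direction the underlying metric runs.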

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: IS.D (Total Mop/s, More Is Better)
c7g.4xlarge Graviton3: 1041.90 (SE +/- 2.29, N = 3; Min: 1038.58 / Avg: 1041.9 / Max: 1046.3)
a1.4xlarge Graviton: 197.57 (SE +/- 0.31, N = 3; Min: 197.2 / Avg: 197.57 / Max: 198.19)
c6g.4xlarge Graviton2: 372.76 (SE +/- 0.20, N = 3; Min: 372.52 / Avg: 372.76 / Max: 373.15)
c6a.4xlarge EPYC: 541.35 (SE +/- 0.47, N = 3; Min: 540.42 / Avg: 541.35 / Max: 541.83)
c6i.4xlarge Xeon: 861.57 (SE +/- 2.14, N = 3; Min: 857.51 / Avg: 861.57 / Max: 864.76)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: Mobilenet Float (Microseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 2156.60 (SE +/- 19.61, N = 3; Min: 2129.52 / Avg: 2156.6 / Max: 2194.7)
a1.4xlarge Graviton: 9990.15 (SE +/- 113.94, N = 3; Min: 9831.11 / Avg: 9990.15 / Max: 10211)
c6g.4xlarge Graviton2: 2500.87 (SE +/- 28.63, N = 3; Min: 2462.74 / Avg: 2500.87 / Max: 2556.93)
c6a.4xlarge EPYC: 2159.72 (SE +/- 1.03, N = 3; Min: 2157.69 / Avg: 2159.72 / Max: 2161.04)
c6i.4xlarge Xeon: 1965.07 (SE +/- 1.81, N = 3; Min: 1961.49 / Avg: 1965.07 / Max: 1967.34)

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 22.1, Input: Carbon Nanotube (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 155.18 (SE +/- 0.08, N = 3; Min: 155.01 / Avg: 155.18 / Max: 155.29)
a1.4xlarge Graviton: 769.35 (SE +/- 5.37, N = 3; Min: 763.32 / Avg: 769.35 / Max: 780.07)
c6g.4xlarge Graviton2: 215.53 (SE +/- 0.13, N = 3; Min: 215.37 / Avg: 215.53 / Max: 215.79)
c6a.4xlarge EPYC: 302.96 (SE +/- 0.17, N = 3; Min: 302.63 / Avg: 302.96 / Max: 303.2)
c6i.4xlarge Xeon: 202.11 (SE +/- 0.24, N = 3; Min: 201.8 / Avg: 202.11 / Max: 202.58)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10, Encoder Speed: 2 (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 141.70 (SE +/- 0.11, N = 3; Min: 141.5 / Avg: 141.7 / Max: 141.88)
a1.4xlarge Graviton: 449.02 (SE +/- 0.29, N = 3; Min: 448.45 / Avg: 449.02 / Max: 449.4)
c6g.4xlarge Graviton2: 238.21 (SE +/- 0.12, N = 3; Min: 237.98 / Avg: 238.21 / Max: 238.4)
c6a.4xlarge EPYC: 93.95 (SE +/- 0.44, N = 3; Min: 93.42 / Avg: 93.95 / Max: 94.82)
c6i.4xlarge Xeon: 97.74 (SE +/- 0.26, N = 3; Min: 97.23 / Avg: 97.74 / Max: 98.12)
1. (CXX) g++ options: -O3 -fPIC -lm

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 1.0, Throughput Test: PartialTweets (GB/s, More Is Better)
c7g.4xlarge Graviton3: 2.62 (SE +/- 0.00, N = 3; Min: 2.62 / Avg: 2.62 / Max: 2.62)
a1.4xlarge Graviton: 0.78 (SE +/- 0.00, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.78)
c6g.4xlarge Graviton2: 1.51 (SE +/- 0.00, N = 3; Min: 1.51 / Avg: 1.51 / Max: 1.51)
c6a.4xlarge EPYC: 3.64 (SE +/- 0.00, N = 3; Min: 3.64 / Avg: 3.64 / Max: 3.65)
c6i.4xlarge Xeon: 3.71 (SE +/- 0.00, N = 3; Min: 3.7 / Avg: 3.71 / Max: 3.71)
1. (CXX) g++ options: -O3

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

LULESH 2.0.3 (z/s, More Is Better)
c7g.4xlarge Graviton3: 10940.94 (SE +/- 76.73, N = 3; Min: 10787.69 / Avg: 10940.94 / Max: 11024.62)
a1.4xlarge Graviton: 2328.27 (SE +/- 6.27, N = 3; Min: 2316.57 / Avg: 2328.27 / Max: 2338.01)
c6g.4xlarge Graviton2: 6016.16 (SE +/- 4.88, N = 3; Min: 6009.82 / Avg: 6016.16 / Max: 6025.75)
c6a.4xlarge EPYC: 5452.11 (SE +/- 5.52, N = 3; Min: 5441.17 / Avg: 5452.11 / Max: 5458.81)
c6i.4xlarge Xeon: 8112.37 (SE +/- 14.20, N = 3; Min: 8085.86 / Avg: 8112.37 / Max: 8134.43)
1. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi
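
Each result carries an "SE +/-" figure alongside N, the number of runs. When N = 3, the Min, Avg, and Max values are enough to recover the one unreported run, so the standard error can be re-derived as a sanity check. The sketch below assumes SE is the sample standard deviation divided by the square root of N; that formula is an assumption about how the figure is produced, not something this page states, and the helper names `third_sample` and `standard_error` are hypothetical. The inputs are the c7g.4xlarge Graviton3 LULESH values.

```python
import math


def third_sample(minimum, average, maximum):
    """Recover the middle run of three from the reported min, average, and max."""
    return 3 * average - minimum - maximum


def standard_error(samples):
    """Sample standard deviation (n - 1 denominator) divided by sqrt(n)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean = sum(samples) / n
    variance = sum((x - mean) ** 2 for x in samples) / (n - 1)
    return math.sqrt(variance / n)


# LULESH 2.0.3, c7g.4xlarge Graviton3: Min 10787.69 / Avg 10940.94 / Max 11024.62
runs = [10787.69, third_sample(10787.69, 10940.94, 11024.62), 11024.62]
se = standard_error(runs)
print(f"reconstructed runs: {[round(r, 2) for r in runs]}")
print(f"standard error: {se:.2f}")
```

Under that assumption the result should land close to the SE +/- 76.73 reported for this run; small differences are expected because the published average is rounded to two decimals.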

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.48, Concurrent Requests: 100 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 67231.88 (SE +/- 38.09, N = 3; Min: 67187.11 / Avg: 67231.88 / Max: 67307.65)
a1.4xlarge Graviton: 18636.43 (SE +/- 28.97, N = 3; Min: 18584 / Avg: 18636.43 / Max: 18683.99)
c6g.4xlarge Graviton2: 46995.35 (SE +/- 93.03, N = 3; Min: 46816.56 / Avg: 46995.35 / Max: 47129.33)
c6a.4xlarge EPYC: 77567.69 (SE +/- 211.56, N = 3; Min: 77148.29 / Avg: 77567.69 / Max: 77825.94)
c6i.4xlarge Xeon: 86545.57 (SE +/- 389.13, N = 3; Min: 85770.92 / Avg: 86545.57 / Max: 86997.78)
1. (CC) gcc options: -shared -fPIC -O2

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2, Preset: Thorough (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 13.9248 (SE +/- 0.0011, N = 3; Min: 13.92 / Avg: 13.92 / Max: 13.93)
a1.4xlarge Graviton: 33.5198 (SE +/- 0.0061, N = 3; Min: 33.51 / Avg: 33.52 / Max: 33.53)
c6g.4xlarge Graviton2: 16.5222 (SE +/- 0.0064, N = 3; Min: 16.51 / Avg: 16.52 / Max: 16.53)
c6a.4xlarge EPYC: 7.9818 (SE +/- 0.0154, N = 3; Min: 7.96 / Avg: 7.98 / Max: 8.01)
c6i.4xlarge Xeon: 7.2625 (SE +/- 0.0001, N = 3; Min: 7.26 / Avg: 7.26 / Max: 7.26)
1. (CXX) g++ options: -O3 -flto -pthread

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2022.1, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
c7g.4xlarge Graviton3: 1.128 (SE +/- 0.002, N = 3; Min: 1.13 / Avg: 1.13 / Max: 1.13)
a1.4xlarge Graviton: 0.316 (SE +/- 0.000, N = 3; Min: 0.32 / Avg: 0.32 / Max: 0.32)
c6g.4xlarge Graviton2: 0.781 (SE +/- 0.001, N = 3; Min: 0.78 / Avg: 0.78 / Max: 0.78)
c6a.4xlarge EPYC: 1.004 (SE +/- 0.002, N = 3; Min: 1 / Avg: 1 / Max: 1.01)
c6i.4xlarge Xeon: 1.452 (SE +/- 0.001, N = 3; Min: 1.45 / Avg: 1.45 / Max: 1.45)
1. (CXX) g++ options: -O3

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: Inception V4 (Microseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 41855.1 (SE +/- 210.27, N = 3; Min: 41440.3 / Avg: 41855.1 / Max: 42122.5)
a1.4xlarge Graviton: 188910.0 (SE +/- 1746.17, N = 3; Min: 185496 / Avg: 188910 / Max: 191254)
c6g.4xlarge Graviton2: 46793.9 (SE +/- 197.89, N = 3; Min: 46548.4 / Avg: 46793.9 / Max: 47185.5)
c6a.4xlarge EPYC: 44920.6 (SE +/- 53.95, N = 3; Min: 44825.6 / Avg: 44920.63 / Max: 45012.4)
c6i.4xlarge Xeon: 41185.7 (SE +/- 75.14, N = 3; Min: 41107.5 / Avg: 41185.67 / Max: 41335.9)

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.48, Concurrent Requests: 500 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 73546.32 (SE +/- 89.82, N = 3; Min: 73405.22 / Avg: 73546.32 / Max: 73713.17)
a1.4xlarge Graviton: 20133.49 (SE +/- 93.64, N = 3; Min: 19971.62 / Avg: 20133.49 / Max: 20295.99)
c6g.4xlarge Graviton2: 50077.81 (SE +/- 578.32, N = 3; Min: 48925.08 / Avg: 50077.81 / Max: 50736.49)
c6a.4xlarge EPYC: 81995.64 (SE +/- 636.46, N = 13; Min: 74657.82 / Avg: 81995.64 / Max: 83405.1)
c6i.4xlarge Xeon: 91746.57 (SE +/- 833.50, N = 7; Min: 86751.45 / Avg: 91746.57 / Max: 92771.95)
1. (CC) gcc options: -shared -fPIC -O2

Apache HTTP Server 2.4.48, Concurrent Requests: 200 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 73676.95 (SE +/- 649.31, N = 3; Min: 72788.14 / Avg: 73676.95 / Max: 74941.3)
a1.4xlarge Graviton: 20887.58 (SE +/- 59.55, N = 3; Min: 20769.87 / Avg: 20887.58 / Max: 20962.15)
c6g.4xlarge Graviton2: 50059.97 (SE +/- 112.65, N = 3; Min: 49842.68 / Avg: 50059.97 / Max: 50220.18)
c6a.4xlarge EPYC: 83070.00 (SE +/- 644.29, N = 3; Min: 82007.59 / Avg: 83070 / Max: 84232.7)
c6i.4xlarge Xeon: 94458.22 (SE +/- 615.05, N = 3; Min: 93478.69 / Avg: 94458.22 / Max: 95592.38)
1. (CC) gcc options: -shared -fPIC -O2

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 1.0, Throughput Test: Kostya (GB/s, More Is Better)
c7g.4xlarge Graviton3: 1.94 (SE +/- 0.00, N = 3; Min: 1.94 / Avg: 1.94 / Max: 1.94)
a1.4xlarge Graviton: 0.63 (SE +/- 0.00, N = 3; Min: 0.63 / Avg: 0.63 / Max: 0.63)
c6g.4xlarge Graviton2: 1.19 (SE +/- 0.00, N = 3; Min: 1.19 / Avg: 1.19 / Max: 1.19)
c6a.4xlarge EPYC: 2.80 (SE +/- 0.00, N = 3; Min: 2.8 / Avg: 2.8 / Max: 2.81)
c6i.4xlarge Xeon: 2.46 (SE +/- 0.00, N = 3; Min: 2.46 / Avg: 2.46 / Max: 2.47)
1. (CXX) g++ options: -O3

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: BT.C (Total Mop/s, More Is Better)
c7g.4xlarge Graviton3: 10339.53 (SE +/- 7.36, N = 3; Min: 10325.26 / Avg: 10339.53 / Max: 10349.81)
a1.4xlarge Graviton: 3148.18 (SE +/- 3.44, N = 3; Min: 3141.34 / Avg: 3148.18 / Max: 3152.19)
c6g.4xlarge Graviton2: 6449.11 (SE +/- 3.20, N = 3; Min: 6444.55 / Avg: 6449.11 / Max: 6455.29)
c6a.4xlarge EPYC: 13134.46 (SE +/- 98.45, N = 3; Min: 12949.99 / Avg: 13134.46 / Max: 13286.34)
c6i.4xlarge Xeon: 13888.40 (SE +/- 22.04, N = 3; Min: 13862.56 / Avg: 13888.4 / Max: 13932.24)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0, Algorithm: RSA4096 (sign/s, More Is Better)
c7g.4xlarge Graviton3: 2546.4 (SE +/- 0.23, N = 3; Min: 2546 / Avg: 2546.4 / Max: 2546.8)
a1.4xlarge Graviton: 588.3 (SE +/- 0.12, N = 3; Min: 588.1 / Avg: 588.3 / Max: 588.5)
c6g.4xlarge Graviton2: 660.6 (SE +/- 0.03, N = 3; Min: 660.5 / Avg: 660.57 / Max: 660.6)
c6a.4xlarge EPYC: 2088.9 (SE +/- 1.40, N = 3; Min: 2086.1 / Avg: 2088.9 / Max: 2090.3)
c6i.4xlarge Xeon: 2161.3 (SE +/- 4.47, N = 3; Min: 2152.4 / Avg: 2161.33 / Max: 2165.9)
c6a.4xlarge EPYC and c6i.4xlarge Xeon additional flags: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 40051.3 (SE +/- 305.31, N = 3; Min: 39503.5 / Avg: 40051.33 / Max: 40558.8)
a1.4xlarge Graviton: 171169.0 (SE +/- 825.35, N = 3; Min: 170102 / Avg: 171168.67 / Max: 172793)
c6g.4xlarge Graviton2: 45955.7 (SE +/- 336.95, N = 3; Min: 45554.1 / Avg: 45955.73 / Max: 46625.2)
c6a.4xlarge EPYC: 41366.6 (SE +/- 27.66, N = 3; Min: 41314.3 / Avg: 41366.57 / Max: 41408.4)
c6i.4xlarge Xeon: 41179.7 (SE +/- 110.01, N = 3; Min: 41054 / Avg: 41179.67 / Max: 41398.9)

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.48, Concurrent Requests: 1000 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 72719.33 (SE +/- 83.83, N = 3; Min: 72567.8 / Avg: 72719.33 / Max: 72857.22)
a1.4xlarge Graviton: 19278.68 (SE +/- 98.61, N = 3; Min: 19082.25 / Avg: 19278.68 / Max: 19392.14)
c6g.4xlarge Graviton2: 46629.45 (SE +/- 276.10, N = 3; Min: 46348.82 / Avg: 46629.45 / Max: 47181.62)
c6a.4xlarge EPYC: 71537.11 (SE +/- 397.88, N = 3; Min: 70744.64 / Avg: 71537.11 / Max: 71995.87)
c6i.4xlarge Xeon: 79830.96 (SE +/- 335.63, N = 3; Min: 79188.28 / Avg: 79830.96 / Max: 80320.15)
1. (CC) gcc options: -shared -fPIC -O2

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: SqueezeNet (Microseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 3257.94 (SE +/- 22.07, N = 3; Min: 3216.26 / Avg: 3257.94 / Max: 3291.38)
a1.4xlarge Graviton: 12014.70 (SE +/- 46.48, N = 3; Min: 11923.3 / Avg: 12014.7 / Max: 12075.1)
c6g.4xlarge Graviton2: 3969.35 (SE +/- 37.23, N = 3; Min: 3901.04 / Avg: 3969.35 / Max: 4029.15)
c6a.4xlarge EPYC: 3103.12 (SE +/- 1.37, N = 3; Min: 3100.56 / Avg: 3103.12 / Max: 3105.26)
c6i.4xlarge Xeon: 2983.93 (SE +/- 3.54, N = 3; Min: 2978.25 / Avg: 2983.93 / Max: 2990.42)

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2, Preset: Exhaustive (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 139.38 (SE +/- 0.01, N = 3; Min: 139.36 / Avg: 139.38 / Max: 139.39)
a1.4xlarge Graviton: 277.77 (SE +/- 0.07, N = 3; Min: 277.66 / Avg: 277.77 / Max: 277.89)
c6g.4xlarge Graviton2: 159.20 (SE +/- 0.00, N = 3; Min: 159.2 / Avg: 159.2 / Max: 159.21)
c6a.4xlarge EPYC: 72.39 (SE +/- 0.03, N = 3; Min: 72.36 / Avg: 72.39 / Max: 72.44)
c6i.4xlarge Xeon: 69.64 (SE +/- 0.04, N = 3; Min: 69.58 / Avg: 69.64 / Max: 69.71)
1. (CXX) g++ options: -O3 -flto -pthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP CFD Solver (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 10.48 (SE +/- 0.02, N = 3; Min: 10.44 / Avg: 10.48 / Max: 10.51)
a1.4xlarge Graviton: 41.45 (SE +/- 0.08, N = 3; Min: 41.31 / Avg: 41.45 / Max: 41.6)
c6g.4xlarge Graviton2: 17.04 (SE +/- 0.05, N = 3; Min: 16.97 / Avg: 17.03 / Max: 17.12)
c6a.4xlarge EPYC: 21.79 (SE +/- 0.08, N = 3; Min: 21.62 / Avg: 21.79 / Max: 21.89)
c6i.4xlarge Xeon: 20.45 (SE +/- 0.02, N = 3; Min: 20.42 / Avg: 20.45 / Max: 20.49)
1. (CXX) g++ options: -O2 -lOpenCL

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0, Algorithm: RSA4096 (verify/s, More Is Better)
c7g.4xlarge Graviton3: 178460.4 (SE +/- 82.61, N = 3; Min: 178358.2 / Avg: 178460.37 / Max: 178623.9)
a1.4xlarge Graviton: 45328.6 (SE +/- 63.75, N = 3; Min: 45201.2 / Avg: 45328.63 / Max: 45396)
c6g.4xlarge Graviton2: 53951.5 (SE +/- 3.30, N = 3; Min: 53945.2 / Avg: 53951.53 / Max: 53956.3)
c6a.4xlarge EPYC: 136784.2 (SE +/- 74.60, N = 3; Min: 136644.7 / Avg: 136784.2 / Max: 136899.8)
c6i.4xlarge Xeon: 140964.4 (SE +/- 47.94, N = 3; Min: 140874.2 / Avg: 140964.37 / Max: 141037.7)
c6a.4xlarge EPYC and c6i.4xlarge Xeon additional flags: -m64
1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10, Encoder Speed: 0 (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 256.84 (SE +/- 0.18, N = 3; Min: 256.51 / Avg: 256.84 / Max: 257.1)
a1.4xlarge Graviton: 768.30 (SE +/- 0.58, N = 3; Min: 767.54 / Avg: 768.3 / Max: 769.45)
c6g.4xlarge Graviton2: 406.94 (SE +/- 0.13, N = 3; Min: 406.75 / Avg: 406.94 / Max: 407.19)
c6a.4xlarge EPYC: 195.53 (SE +/- 0.62, N = 3; Min: 194.74 / Avg: 195.53 / Max: 196.75)
c6i.4xlarge Xeon: 204.99 (SE +/- 0.33, N = 3; Min: 204.36 / Avg: 204.99 / Max: 205.49)
1. (CXX) g++ options: -O3 -fPIC -lm

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript runtime built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 17.3, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 497.58 (SE +/- 2.06, N = 3; Min: 493.85 / Avg: 497.58 / Max: 500.97)
a1.4xlarge Graviton: 1765.91 (SE +/- 1.80, N = 3; Min: 1762.78 / Avg: 1765.91 / Max: 1769.01)
c6g.4xlarge Graviton2: 628.40 (SE +/- 0.37, N = 3; Min: 627.82 / Avg: 628.4 / Max: 629.09)
c6a.4xlarge EPYC: 664.35 (SE +/- 0.26, N = 3; Min: 664.08 / Avg: 664.35 / Max: 664.86)
c6i.4xlarge Xeon: 604.62 (SE +/- 0.42, N = 3; Min: 603.88 / Avg: 604.62 / Max: 605.34)

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020, Model: Rhodopsin Protein (ns/day, More Is Better)
c7g.4xlarge Graviton3: 11.291 (SE +/- 0.060, N = 3; Min: 11.17 / Avg: 11.29 / Max: 11.36)
a1.4xlarge Graviton: 3.245 (SE +/- 0.040, N = 3; Min: 3.17 / Avg: 3.25 / Max: 3.3)
c6g.4xlarge Graviton2: 7.935 (SE +/- 0.014, N = 3; Min: 7.91 / Avg: 7.93 / Max: 7.96)
c6a.4xlarge EPYC: 5.067 (SE +/- 0.039, N = 12; Min: 4.64 / Avg: 5.07 / Max: 5.13)
c6i.4xlarge Xeon: 6.220 (SE +/- 0.009, N = 3; Min: 6.21 / Avg: 6.22 / Max: 6.24)
1. (CXX) g++ options: -O3 -lm

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

PyBench 2018-02-16, Total For Average Test Times (Milliseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 1185 (SE +/- 0.33, N = 3; Min: 1184 / Avg: 1184.67 / Max: 1185)
a1.4xlarge Graviton: 3452 (SE +/- 18.15, N = 3; Min: 3416 / Avg: 3452 / Max: 3474)
c6g.4xlarge Graviton2: 1741 (SE +/- 1.67, N = 3; Min: 1739 / Avg: 1740.67 / Max: 1744)
c6a.4xlarge EPYC: 1961 (SE +/- 1.53, N = 3; Min: 1959 / Avg: 1961 / Max: 1964)
c6i.4xlarge Xeon: 997 (SE +/- 3.84, N = 3; Min: 993 / Avg: 997.33 / Max: 1005)

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1, PHP Benchmark Suite (Score, More Is Better)
c7g.4xlarge Graviton3: 666484 (SE +/- 525.83, N = 3; Min: 665522 / Avg: 666484 / Max: 667333)
a1.4xlarge Graviton: 241259 (SE +/- 816.27, N = 3; Min: 239636 / Avg: 241259.33 / Max: 242221)
c6g.4xlarge Graviton2: 449855 (SE +/- 743.13, N = 3; Min: 448715 / Avg: 449855.33 / Max: 451251)
c6a.4xlarge EPYC: 480741 (SE +/- 2681.41, N = 3; Min: 475521 / Avg: 480741.33 / Max: 484415)
c6i.4xlarge Xeon: 828186 (SE +/- 983.65, N = 3; Min: 826631 / Avg: 828185.67 / Max: 830007)

Timed ImageMagick Compilation

This test times how long it takes to build ImageMagick. Learn more via the OpenBenchmarking.org test page.

Timed ImageMagick Compilation 6.9.0, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 27.90 (SE +/- 0.13, N = 3; Min: 27.67 / Avg: 27.9 / Max: 28.12)
a1.4xlarge Graviton: 93.63 (SE +/- 0.27, N = 3; Min: 93.22 / Avg: 93.63 / Max: 94.13)
c6g.4xlarge Graviton2: 40.33 (SE +/- 0.22, N = 3; Min: 39.91 / Avg: 40.33 / Max: 40.64)
c6a.4xlarge EPYC: 32.63 (SE +/- 0.06, N = 3; Min: 32.52 / Avg: 32.63 / Max: 32.72)
c6i.4xlarge Xeon: 29.74 (SE +/- 0.07, N = 3; Min: 29.63 / Avg: 29.74 / Max: 29.86)

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18, Model: NASNet Mobile (Microseconds, Fewer Is Better)
c7g.4xlarge Graviton3: 11591.90 (SE +/- 121.56, N = 15; Min: 10847.8 / Avg: 11591.94 / Max: 12395.4)
a1.4xlarge Graviton: 30986.70 (SE +/- 49.84, N = 3; Min: 30906.7 / Avg: 30986.67 / Max: 31078.2)
c6g.4xlarge Graviton2: 14985.40 (SE +/- 203.15, N = 15; Min: 13965.4 / Avg: 14985.42 / Max: 16307.5)
c6a.4xlarge EPYC: 9266.86 (SE +/- 23.44, N = 3; Min: 9235.81 / Avg: 9266.86 / Max: 9312.81)
c6i.4xlarge Xeon: 10900.60 (SE +/- 166.62, N = 14; Min: 10663.9 / Avg: 10900.6 / Max: 13062.1)
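
With five TensorFlow Lite models measured, one way to summarize the Graviton2 to Graviton3 change is a geometric mean of the per-model speedups, which avoids letting the models with the largest absolute inference times dominate. The sketch below is only an illustration of that arithmetic using the averages reported on this page; the variable and function names are hypothetical, and it is not claimed to reproduce how OpenBenchmarking.org computes its own geometric means.

```python
import math

# Average inference time in microseconds (Fewer Is Better), taken from the
# TensorFlow Lite results on this page: (c6g.4xlarge Graviton2, c7g.4xlarge Graviton3)
tflite_results = {
    "Mobilenet Float": (2500.87, 2156.60),
    "Inception V4": (46793.9, 41855.1),
    "Inception ResNet V2": (45955.7, 40051.3),
    "SqueezeNet": (3969.35, 3257.94),
    "NASNet Mobile": (14985.40, 11591.90),
}


def geometric_mean(values):
    """Geometric mean computed via logarithms to avoid overflow on long lists."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# For a time metric, speedup is old time divided by new time.
speedups = {model: g2 / g3 for model, (g2, g3) in tflite_results.items()}
overall = geometric_mean(speedups.values())

for model, ratio in speedups.items():
    print(f"{model}: {ratio:.3f}x")
print(f"Geometric mean speedup: {overall:.3f}x")
```

The same helper could be applied to any other pair of instances, provided each ratio is oriented so that values above 1.0 mean the newer system is faster.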

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.41, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 26.94 (SE +/- 0.05, N = 3; Min: 26.87 / Avg: 26.94 / Max: 27.04)
a1.4xlarge Graviton: 74.74 (SE +/- 0.01, N = 3; Min: 74.72 / Avg: 74.74 / Max: 74.76)
c6g.4xlarge Graviton2: 34.20 (SE +/- 0.01, N = 3; Min: 34.18 / Avg: 34.2 / Max: 34.23)
c6a.4xlarge EPYC: 23.53 (SE +/- 0.07, N = 3; Min: 23.45 / Avg: 23.53 / Max: 23.67)
c6i.4xlarge Xeon: 22.53 (SE +/- 0.05, N = 3; Min: 22.42 / Avg: 22.53 / Max: 22.59)

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Jython (msec, Fewer Is Better)
c7g.4xlarge Graviton3: 3940 (SE +/- 6.99, N = 4; Min: 3927 / Avg: 3940.25 / Max: 3960)
a1.4xlarge Graviton: 12997 (SE +/- 48.38, N = 4; Min: 12878 / Avg: 12997.25 / Max: 13093)
c6g.4xlarge Graviton2: 5626 (SE +/- 23.29, N = 4; Min: 5587 / Avg: 5625.5 / Max: 5693)
c6a.4xlarge EPYC: 4616 (SE +/- 23.33, N = 4; Min: 4574 / Avg: 4616.25 / Max: 4677)
c6i.4xlarge Xeon: 4013 (SE +/- 24.07, N = 4; Min: 3955 / Avg: 4013 / Max: 4063)

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 13.0, Build System: Ninja (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 544.93 (SE +/- 5.19, N = 3; Min 535.72, Max 553.68)
a1.4xlarge Graviton: 1784.60 (SE +/- 0.34, N = 3; Min 1784.16, Max 1785.27)
c6g.4xlarge Graviton2: 682.98 (SE +/- 0.49, N = 3; Min 682.05, Max 683.7)
c6a.4xlarge EPYC: 760.34 (SE +/- 0.05, N = 3; Min 760.25, Max 760.43)
c6i.4xlarge Xeon: 685.70 (SE +/- 0.12, N = 3; Min 685.49, Max 685.91)
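To read results like the LLVM build times as relative performance, it helps to turn each pair of averages into a "times faster" ratio, flipping the division for "Fewer Is Better" tests. The following is a minimal sketch using the LLVM averages from the result above; the function name and its `lower_is_better` flag are illustrative and not part of the Phoronix Test Suite.

```python
def speedup(candidate, baseline, lower_is_better):
    """Return how many times faster `candidate` is than `baseline`."""
    if lower_is_better:
        return baseline / candidate
    return candidate / baseline

# Timed LLVM Compilation 13.0 (seconds, fewer is better)
graviton3 = 544.93
graviton2 = 682.98
xeon = 685.70

print(f"Graviton3 vs Graviton2: {speedup(graviton3, graviton2, True):.2f}x")
print(f"Graviton3 vs Xeon:      {speedup(graviton3, xeon, True):.2f}x")
```

For "More Is Better" results such as the nginx requests per second, pass `lower_is_better=False` so that a ratio above 1 still means the candidate is ahead.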

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and lets you select among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
c7g.4xlarge Graviton3: 934.72 (SE +/- 0.39, N = 3; Min 934.01, Max 935.36)
a1.4xlarge Graviton: 339.20 (SE +/- 0.24, N = 3; Min 338.94, Max 339.67)
c6g.4xlarge Graviton2: 558.88 (SE +/- 0.23, N = 3; Min 558.51, Max 559.3)
c6a.4xlarge EPYC: 466.21 (SE +/- 0.06, N = 3; Min 466.09, Max 466.3)
c6i.4xlarge Xeon: 1103.22 (SE +/- 19.93, N = 9; Min 1030.08, Max 1180.25)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.2

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Ngspice 34, Circuit: C2670 (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 198.22 (SE +/- 0.86, N = 3; Min 197.24, Max 199.94)
a1.4xlarge Graviton: 473.90 (SE +/- 3.48, N = 3; Min 467.68, Max 479.7)
c6g.4xlarge Graviton2: 263.72 (SE +/- 0.91, N = 3; Min 262.1, Max 265.25)
c6a.4xlarge EPYC: 245.89 (SE +/- 1.17, N = 3; Min 244.38, Max 248.18)
c6i.4xlarge Xeon: 147.89 (SE +/- 1.80, N = 4; Min 143.57, Max 152.37)
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradesoap (msec, Fewer Is Better)
c7g.4xlarge Graviton3: 3524 (SE +/- 14.95, N = 4; Min 3487, Max 3551)
a1.4xlarge Graviton: 11182 (SE +/- 71.92, N = 4; Min 10986, Max 11320)
c6g.4xlarge Graviton2: 4506 (SE +/- 27.95, N = 4; Min 4428, Max 4560)
c6a.4xlarge EPYC: 4052 (SE +/- 16.15, N = 4; Min 4012, Max 4084)
c6i.4xlarge Xeon: 3815 (SE +/- 24.39, N = 4; Min 3747, Max 3863)

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 1.0, Throughput Test: LargeRandom (GB/s, More Is Better)
c7g.4xlarge Graviton3: 0.70 (SE +/- 0.00, N = 3; Min 0.7, Max 0.7)
a1.4xlarge Graviton: 0.30 (SE +/- 0.00, N = 3; Min 0.3, Max 0.3)
c6g.4xlarge Graviton2: 0.49 (SE +/- 0.00, N = 3; Min 0.48, Max 0.49)
c6a.4xlarge EPYC: 0.95 (SE +/- 0.00, N = 3; Min 0.95, Max 0.95)
c6i.4xlarge Xeon: 0.86 (SE +/- 0.00, N = 3; Min 0.86, Max 0.86)
1. (CXX) g++ options: -O3

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

SecureMark 1.0.4, Benchmark: SecureMark-TLS (marks, More Is Better)
c7g.4xlarge Graviton3: 183708 (SE +/- 773.26, N = 3; Min 182165.75, Max 184575.7)
a1.4xlarge Graviton: 74356 (SE +/- 59.40, N = 3; Min 74239.26, Max 74432.21)
c6g.4xlarge Graviton2: 120301 (SE +/- 23.07, N = 3; Min 120260.21, Max 120340.04)
c6a.4xlarge EPYC: 213288 (SE +/- 3310.19, N = 9; Min 198187.25, Max 230349.17)
c6i.4xlarge Xeon: 230549 (SE +/- 864.34, N = 3; Min 229225.86, Max 232173.88)
1. (CC) gcc options: -pedantic -O3

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradebeans (msec, Fewer Is Better)
c7g.4xlarge Graviton3: 3203 (SE +/- 26.73, N = 4; Min 3141, Max 3264)
a1.4xlarge Graviton: 9045 (SE +/- 44.35, N = 4; Min 8913, Max 9102)
c6g.4xlarge Graviton2: 4344 (SE +/- 40.13, N = 4; Min 4257, Max 4428)
c6a.4xlarge EPYC: 3167 (SE +/- 23.53, N = 11; Min 2979, Max 3277)
c6i.4xlarge Xeon: 2928 (SE +/- 19.24, N = 20; Min 2827, Max 3106)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31, Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, More Is Better)
c7g.4xlarge Graviton3: 383606667 (SE +/- 400097.21, N = 3; Min 382810000, Max 384070000)
a1.4xlarge Graviton: 165513333 (SE +/- 8819.17, N = 3; Min 165500000, Max 165530000)
c6g.4xlarge Graviton2: 262890000 (SE +/- 35118.85, N = 3; Min 262820000, Max 262930000)
c6a.4xlarge EPYC: 509746667 (SE +/- 489364.67, N = 3; Min 508770000, Max 510290000)
c6i.4xlarge Xeon: 373100000 (SE +/- 41633.32, N = 3; Min 373020000, Max 373160000)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code that offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 115.02 (SE +/- 0.64, N = 3; Min 113.8, Max 115.97)
a1.4xlarge Graviton: 353.91 (SE +/- 1.89, N = 3; Min 351.57, Max 357.65)
c6g.4xlarge Graviton2: 142.28 (SE +/- 0.70, N = 3; Min 140.89, Max 143.17)
c6a.4xlarge EPYC: 150.99 (SE +/- 0.87, N = 3; Min 149.64, Max 152.62)
c6i.4xlarge Xeon: 136.80 (SE +/- 0.69, N = 3; Min 135.42, Max 137.5)

Timed PHP Compilation

This test times how long it takes to build PHP 7. Learn more via the OpenBenchmarking.org test page.

Timed PHP Compilation 7.4.2, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 69.48 (SE +/- 0.11, N = 3; Min 69.32, Max 69.7)
a1.4xlarge Graviton: 196.03 (SE +/- 0.08, N = 3; Min 195.88, Max 196.15)
c6g.4xlarge Graviton2: 88.90 (SE +/- 0.31, N = 3; Min 88.57, Max 89.52)
c6a.4xlarge EPYC: 67.08 (SE +/- 0.05, N = 3; Min 67.01, Max 67.18)
c6i.4xlarge Xeon: 64.34 (SE +/- 0.09, N = 3; Min 64.18, Max 64.49)

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 21.06, Test: Compression Rating (MIPS, More Is Better)
c7g.4xlarge Graviton3: 97824 (SE +/- 159.36, N = 3; Min 97563, Max 98113)
a1.4xlarge Graviton: 32498 (SE +/- 91.00, N = 3; Min 32380, Max 32677)
c6g.4xlarge Graviton2: 71285 (SE +/- 44.77, N = 3; Min 71213, Max 71367)
c6a.4xlarge EPYC: 62562 (SE +/- 16.02, N = 3; Min 62532, Max 62587)
c6i.4xlarge Xeon: 66631 (SE +/- 174.34, N = 3; Min 66455, Max 66980)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

Ngspice 34, Circuit: C7552 (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 191.29 (SE +/- 1.94, N = 3; Min 188.31, Max 194.94)
a1.4xlarge Graviton: 480.79 (SE +/- 1.19, N = 3; Min 478.62, Max 482.72)
c6g.4xlarge Graviton2: 255.21 (SE +/- 2.40, N = 7; Min 242.65, Max 261.23)
c6a.4xlarge EPYC: 180.36 (SE +/- 0.66, N = 3; Min 179.41, Max 181.63)
c6i.4xlarge Xeon: 161.08 (SE +/- 0.33, N = 3; Min 160.42, Max 161.44)
1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless, Highest Compression (Encode Time - Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 48.21 (SE +/- 0.01, N = 3; Min 48.2, Max 48.22)
a1.4xlarge Graviton: 124.71 (SE +/- 0.09, N = 3; Min 124.55, Max 124.84)
c6g.4xlarge Graviton2: 66.15 (SE +/- 0.01, N = 3; Min 66.13, Max 66.16)
c6a.4xlarge EPYC: 48.68 (SE +/- 0.06, N = 3; Min 48.61, Max 48.79)
c6i.4xlarge Xeon: 41.81 (SE +/- 0.34, N = 3; Min 41.46, Max 42.48)
Per-system extra flag (shown for four of the five systems): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation 21.2, Time To Compile (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 391.17 (SE +/- 1.33, N = 3; Min 389.16, Max 393.69)
a1.4xlarge Graviton: 1155.62 (SE +/- 0.78, N = 3; Min 1154.47, Max 1157.1)
c6g.4xlarge Graviton2: 488.81 (SE +/- 0.53, N = 3; Min 487.79, Max 489.55)
c6a.4xlarge EPYC: 515.20 (SE +/- 0.79, N = 3; Min 514.12, Max 516.74)
c6i.4xlarge Xeon: 469.94 (SE +/- 0.59, N = 3; Min 469.21, Max 471.11)

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

WebP Image Encode 1.1, Encode Settings: Quality 100, Lossless (Encode Time - Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 22.77 (SE +/- 0.09, N = 3; Min 22.67, Max 22.94)
a1.4xlarge Graviton: 61.80 (SE +/- 0.06, N = 3; Min 61.68, Max 61.88)
c6g.4xlarge Graviton2: 31.08 (SE +/- 0.02, N = 3; Min 31.05, Max 31.11)
c6a.4xlarge EPYC: 26.71 (SE +/- 0.17, N = 15; Min 25.92, Max 27.99)
c6i.4xlarge Xeon: 21.12 (SE +/- 0.03, N = 3; Min 21.07, Max 21.17)
Per-system extra flag (shown for four of the five systems): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O2 -lm -ljpeg -lpng16

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10, Encoder Speed: 6, Lossless (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 11.91 (SE +/- 0.01, N = 3; Min 11.89, Max 11.92)
a1.4xlarge Graviton: 33.99 (SE +/- 0.31, N = 3; Min 33.65, Max 34.6)
c6g.4xlarge Graviton2: 16.52 (SE +/- 0.17, N = 3; Min 16.18, Max 16.7)
c6a.4xlarge EPYC: 16.39 (SE +/- 0.12, N = 3; Min 16.17, Max 16.57)
c6i.4xlarge Xeon: 17.53 (SE +/- 0.03, N = 3; Min 17.49, Max 17.58)
1. (CXX) g++ options: -O3 -fPIC -lm

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile uses the Golang "Bombardier" program to issue HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

nginx 1.21.1, Concurrent Requests: 1000 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 346814.75 (SE +/- 1410.11, N = 3; Min 344622.05, Max 349447.11)
a1.4xlarge Graviton: 138205.11 (SE +/- 66.96, N = 3; Min 138094.88, Max 138326.1)
c6g.4xlarge Graviton2: 308213.13 (SE +/- 1677.89, N = 3; Min 306510.05, Max 311568.78)
c6a.4xlarge EPYC: 388657.76 (SE +/- 781.49, N = 3; Min 387384.75, Max 390079.6)
c6i.4xlarge Xeon: 347345.49 (SE +/- 2637.25, N = 3; Min 344132.5, Max 352574.53)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx 1.21.1, Concurrent Requests: 500 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 346613.34 (SE +/- 1017.52, N = 3; Min 344614.99, Max 347945.69)
a1.4xlarge Graviton: 139414.84 (SE +/- 141.15, N = 3; Min 139196.57, Max 139679.01)
c6g.4xlarge Graviton2: 310596.58 (SE +/- 3783.68, N = 3; Min 303433, Max 316290.47)
c6a.4xlarge EPYC: 389030.11 (SE +/- 771.95, N = 3; Min 387669.66, Max 390342.46)
c6i.4xlarge Xeon: 351672.92 (SE +/- 1620.39, N = 3; Min 349589.65, Max 354864.44)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

nginx 1.21.1, Concurrent Requests: 200 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 352380.98 (SE +/- 3986.77, N = 3; Min 344424.56, Max 356811.55)
a1.4xlarge Graviton: 141436.20 (SE +/- 133.96, N = 3; Min 141169.18, Max 141588.71)
c6g.4xlarge Graviton2: 308938.67 (SE +/- 1347.28, N = 3; Min 306245.77, Max 310367.17)
c6a.4xlarge EPYC: 390932.79 (SE +/- 1242.81, N = 3; Min 388925.58, Max 393206.06)
c6i.4xlarge Xeon: 356829.93 (SE +/- 1582.66, N = 3; Min 353665.2, Max 358465.71)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 19 - Decompression Speed (MB/s, More Is Better)
c7g.4xlarge Graviton3: 3050.3 (SE +/- 7.75, N = 3; Min 3042.5, Max 3065.8)
a1.4xlarge Graviton: 1121.7 (SE +/- 4.74, N = 3; Min 1115.1, Max 1130.9)
c6g.4xlarge Graviton2: 2051.6 (SE +/- 12.10, N = 3; Min 2035.9, Max 2075.4)
c6a.4xlarge EPYC: 2907.5 (SE +/- 3.25, N = 3; Min 2901.2, Max 2912.1)
c6i.4xlarge Xeon: 2582.0 (SE +/- 24.18, N = 3; Min 2533.7, Max 2608.7)
Per-system extra flag (shown for four of the five systems): -llzma
1. (CC) gcc options: -O3 -pthread -lz

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile uses the Golang "Bombardier" program to issue HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

nginx 1.21.1, Concurrent Requests: 100 (Requests Per Second, More Is Better)
c7g.4xlarge Graviton3: 345710.87 (SE +/- 2009.97, N = 3; Min 341701.14, Max 347963.74)
a1.4xlarge Graviton: 143155.48 (SE +/- 22.67, N = 3; Min 143113.1, Max 143190.63)
c6g.4xlarge Graviton2: 307349.36 (SE +/- 3992.58, N = 3; Min 302113.55, Max 315188.56)
c6a.4xlarge EPYC: 388010.76 (SE +/- 436.72, N = 3; Min 387337.49, Max 388829.26)
c6i.4xlarge Xeon: 356302.84 (SE +/- 1727.81, N = 3; Min 354138.78, Max 359718.03)
1. (CC) gcc options: -lcrypt -lz -O3 -march=native
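With four nginx concurrency levels reported (1000, 500, 200 and 100), one way to summarize a generational comparison is the geometric mean of the per-level ratios. The sketch below does this for Graviton3 against Graviton2 using the averages from the nginx results; the variable names are illustrative, and using a geometric mean for this summary is a choice made here rather than something the result page prescribes for these four graphs.

```python
from statistics import geometric_mean

# Requests per second (more is better), keyed by concurrent requests.
graviton3 = {1000: 346814.75, 500: 346613.34, 200: 352380.98, 100: 345710.87}
graviton2 = {1000: 308213.13, 500: 310596.58, 200: 308938.67, 100: 307349.36}

# Per-level ratio: values above 1 mean Graviton3 served more requests.
ratios = [graviton3[level] / graviton2[level] for level in graviton3]
summary = geometric_mean(ratios)

for level, ratio in zip(graviton3, ratios):
    print(f"{level:>5} concurrent: {ratio:.3f}x")
print(f"geometric mean: {summary:.3f}x")
```

The geometric mean is preferred over the arithmetic mean for ratios because it gives the same summary whichever system is chosen as the baseline (the result is simply inverted).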

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

TSCP 1.81, AI Chess Performance (Nodes Per Second, More Is Better)
c7g.4xlarge Graviton3: 1370094 (SE +/- 0.00, N = 5; Min 1370094, Max 1370094)
a1.4xlarge Graviton: 538500 (SE +/- 196.86, N = 5; Min 537869, Max 538921)
c6g.4xlarge Graviton2: 872313 (SE +/- 338.27, N = 5; Min 871484, Max 872865)
c6a.4xlarge EPYC: 1442631 (SE +/- 4180.17, N = 5; Min 1426886, Max 1449415)
c6i.4xlarge Xeon: 1272596 (SE +/- 1099.67, N = 5; Min 1269073, Max 1274949)
1. (CC) gcc options: -O3 -march=native

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Decompression Speed (MB/s, More Is Better)
c7g.4xlarge Graviton3: 3240.6 (SE +/- 6.93, N = 3; Min 3229.8, Max 3253.5)
a1.4xlarge Graviton: 1213.9 (SE +/- 15.28, N = 3; Min 1184, Max 1234.3)
c6g.4xlarge Graviton2: 2196.3 (SE +/- 2.93, N = 3; Min 2193.3, Max 2202.2)
c6a.4xlarge EPYC: 2826.0 (SE +/- 6.53, N = 3; Min 2815, Max 2837.6)
c6i.4xlarge Xeon: 2666.1 (SE +/- 7.82, N = 3; Min 2657.8, Max 2681.7)
Per-system extra flag (shown for four of the five systems): -llzma
1. (CC) gcc options: -O3 -pthread -lz

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 13, Total Time (Nodes Per Second, More Is Better)
c7g.4xlarge Graviton3: 27608891 (SE +/- 153578.64, N = 3; Min 27303905, Max 27792957)
a1.4xlarge Graviton: 10980430 (SE +/- 123749.22, N = 3; Min 10738520, Max 11146676)
c6g.4xlarge Graviton2: 21679245 (SE +/- 292329.99, N = 3; Min 21327596, Max 22259579)
c6a.4xlarge EPYC: 23857623 (SE +/- 149731.77, N = 3; Min 23609163, Max 24126627)
c6i.4xlarge Xeon: 22081961 (SE +/- 242448.39, N = 3; Min 21829335, Max 22566713)
Per-system extra flags (shown for two of the five systems):
-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2
-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2
1. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1, Test: OpenMP LavaMD (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 143.33 (SE +/- 0.15, N = 3; Min 143.14, Max 143.64)
a1.4xlarge Graviton: 360.30 (SE +/- 0.07, N = 3; Min 360.17, Max 360.37)
c6g.4xlarge Graviton2: 215.67 (SE +/- 0.01, N = 3; Min 215.65, Max 215.69)
c6a.4xlarge EPYC: 224.33 (SE +/- 0.03, N = 3; Min 224.3, Max 224.38)
c6i.4xlarge Xeon: 281.39 (SE +/- 0.14, N = 3; Min 281.12, Max 281.58)
1. (CXX) g++ options: -O2 -lOpenCL

POV-Ray

This is a test of POV-Ray, the Persistence of Vision Raytracer. POV-Ray is used to create 3D graphics using ray-tracing. Learn more via the OpenBenchmarking.org test page.

POV-Ray 3.7.0.7, Trace Time (Seconds, Fewer Is Better)
c7g.4xlarge Graviton3: 37.86 (SE +/- 0.01, N = 3; Min 37.84, Max 37.89)
a1.4xlarge Graviton: 93.80 (SE +/- 0.94, N = 15; Min 89.04, Max 100.81)
c6g.4xlarge Graviton2: 51.05 (SE +/- 0.00, N = 3; Min 51.04, Max 51.05)
c6a.4xlarge EPYC: 49.44 (SE +/- 0.18, N = 3; Min 49.24, Max 49.8)
c6i.4xlarge Xeon: 52.78 (SE +/- 0.12, N = 3; Min 52.63, Max 53.01)
Per-system extra flag (shown for two of the five systems): -march=native
1. (CXX) g++ options: -pipe -O3 -ffast-math -R/usr/lib -lXpm -lSM -lICE -lX11 -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0, Compression Level: 19, Long Mode - Compression Speed (MB/s, More Is Better)
c7g.4xlarge Graviton3: 39.5 (SE +/- 0.23, N = 3; Min 39, Max 39.7)
a1.4xlarge Graviton: 16.0 (SE +/- 0.00, N = 3; Min 16, Max 16)
c6g.4xlarge Graviton2: 31.0 (SE +/- 0.03, N = 3; Min 30.9, Max 31)
c6a.4xlarge EPYC: 25.9 (SE +/- 0.27, N = 3; Min 25.4, Max 26.3)
c6i.4xlarge Xeon: 33.8 (SE +/- 0.10, N = 3; Min 33.6, Max 33.9)
Per-system extra flag (shown for four of the five systems): -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression 1.5.0, Compression Level: 19 - Compression Speed (MB/s, More Is Better)
c7g.4xlarge Graviton3: 41.2 (SE +/- 0.00, N = 3; Min 41.2, Max 41.2)
a1.4xlarge Graviton: 16.9 (SE +/- 0.03, N = 3; Min 16.9, Max 17)
c6g.4xlarge Graviton2: 34.6 (SE +/- 0.06, N = 3; Min 34.5, Max 34.7)
c6a.4xlarge EPYC: 30.0 (SE +/- 0.21, N = 3; Min 29.6, Max 30.3)
c6i.4xlarge Xeon: 38.1 (SE +/- 0.40, N = 3; Min 37.3, Max 38.5)
Per-system extra flag (shown for four of the five systems): -llzma
1. (CC) gcc options: -O3 -pthread -lz

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: H2 (msec, Fewer Is Better)
c7g.4xlarge Graviton3: 2951 (SE +/- 32.57, N = 5; Min 2868, Max 3068)
a1.4xlarge Graviton: 6740 (SE +/- 63.66, N = 4; Min 6626, Max 6920)
c6g.4xlarge Graviton2: 3964 (SE +/- 45.89, N = 4; Min 3843, Max 4056)
c6a.4xlarge EPYC: 3019 (SE +/- 27.42, N = 4; Min 2941, Max 3070)
c6i.4xlarge Xeon: 2921 (SE +/- 32.93, N = 4; Min 2835, Max 2986)

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14, Test: Crypto (Bogo Ops/s, More Is Better)
c7g.4xlarge Graviton3: 23181.81 (SE +/- 32.01, N = 3; Min 23119.13, Max 23224.4)
a1.4xlarge Graviton: 11985.38 (SE +/- 6.29, N = 3; Min 11977.75, Max 11997.86)
c6g.4xlarge Graviton2: 17924.18 (SE +/- 92.83, N = 3; Min 17748.62, Max 18064.26)
c6a.4xlarge EPYC: 13556.06 (SE +/- 3.93, N = 3; Min 13551.02, Max 13563.8)
c6i.4xlarge Xeon: 10210.34 (SE +/- 5.89, N = 3; Min 10199.14, Max 10219.1)
1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23, 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
c7g.4xlarge Graviton3: 32134123 (SE +/- 104795.40, N = 3; Min 32023095, Max 32343588)
a1.4xlarge Graviton: 15331550 (SE +/- 106812.26, N = 3; Min 15140045, Max 15509284)
c6g.4xlarge Graviton2: 26540482 (SE +/- 359309.26, N = 3; Min 26061970, Max 27244043)
c6a.4xlarge EPYC: 26187688 (SE +/- 303648.79, N = 3; Min 25653010, Max 26704421)
c6i.4xlarge Xeon: 23746200 (SE +/- 325631.00, N = 3; Min 23100009, Max 24139540)

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

Google SynthMark 20201109 - Test: VoiceMark_100
Voices, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 675.64 (Min 675.15, Max 676.25; SE +/- 0.32, N = 3)
  a1.4xlarge Graviton: Avg 331.07 (Min 331.07, Max 331.07; SE +/- 0.00, N = 3)
  c6g.4xlarge Graviton2: Avg 470.39 (Min 470.05, Max 471.05; SE +/- 0.33, N = 3)
  c6a.4xlarge EPYC: Avg 663.07 (Min 650.2, Max 674.63; SE +/- 7.09, N = 3)
  c6i.4xlarge Xeon: Avg 565.69 (Min 563.66, Max 569.69; SE +/- 2.00, N = 3)
  1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.
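
To give a feel for what a byte/s hashing figure measures, here is a minimal single-threaded sketch using Python's `hashlib`. It is not the test profile and not how `openssl speed` is implemented; the function name, the 16 KiB block size, and the 0.2 second duration are all illustrative assumptions. The figures in this result file come from `openssl speed` itself.

```python
import hashlib
import time


def sha256_throughput(block_size=16384, duration=0.2):
    """Hash a fixed buffer repeatedly for about `duration` seconds; return bytes/s."""
    buf = bytes(block_size)  # zero-filled buffer; content does not affect SHA256 speed
    hasher = hashlib.sha256()
    hashed = 0
    start = time.perf_counter()
    while time.perf_counter() - start < duration:
        hasher.update(buf)
        hashed += block_size
    elapsed = time.perf_counter() - start
    return hashed / elapsed


rate = sha256_throughput()
print(f"SHA256: {rate / 1e6:.0f} MB/s on one thread")
```

A single Python thread will not be comparable to the numbers reported here; the sketch only illustrates the unit, total bytes hashed divided by elapsed time.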

OpenSSL 3.0 - Algorithm: SHA256
byte/s, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 13722045973.33 (Min 13712096220, Max 13737289210; SE +/- 7739237.92, N = 3)
  a1.4xlarge Graviton: Avg 6785689516.67 (Min 6760580260, Max 6799049020; SE +/- 12563225.46, N = 3)
  c6g.4xlarge Graviton2: Avg 10723184083.33 (Min 10627684700, Max 10772216060; SE +/- 47755430.47, N = 3)
  c6a.4xlarge EPYC: Avg 11691403353.33 (Min 11674247470, Max 11701387090; SE +/- 8616254.20, N = 3)
  c6i.4xlarge Xeon: Avg 7096993936.67 (Min 7096258150, Max 7098197390; SE +/- 606684.16, N = 3)
  1. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl
  The graph additionally lists -m64 next to two of the results.

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: Vector Math
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 55258.17 (Min 55237.21, Max 55291.94; SE +/- 17.05, N = 3)
  a1.4xlarge Graviton: Avg 27341.47 (Min 27340.68, Max 27342.37; SE +/- 0.49, N = 3)
  c6g.4xlarge Graviton2: Avg 37753.89 (Min 37737.89, Max 37785.32; SE +/- 15.72, N = 3)
  c6a.4xlarge EPYC: Avg 53787.61 (Min 53783.01, Max 53791.41; SE +/- 2.46, N = 3)
  c6i.4xlarge Xeon: Avg 40140.3 (Min 40101.89, Max 40195.98; SE +/- 28.50, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C
Total Mop/s, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 7730.41 (Min 7728.06, Max 7734.31; SE +/- 1.96, N = 3)
  a1.4xlarge Graviton: Avg 2558.12 (Min 2557.84, Max 2558.37; SE +/- 0.15, N = 3)
  c6g.4xlarge Graviton2: Avg 5133.89 (Min 5132.39, Max 5135.49; SE +/- 0.90, N = 3)
  c6a.4xlarge EPYC: Avg 25140.55 (Min 25107.3, Max 25169.4; SE +/- 18.06, N = 3)
  c6i.4xlarge Xeon: Avg 38136.77 (Min 37926.17, Max 38452.69; SE +/- 160.86, N = 3)
  1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
  2. Open MPI 4.1.2

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28 - Backend: Eigen
Nodes Per Second, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 1189.33 (Min 1171, Max 1204; SE +/- 9.70, N = 3)
  a1.4xlarge Graviton: Avg 128.33 (Min 127, Max 129; SE +/- 0.67, N = 3)
  c6g.4xlarge Graviton2: Avg 834.33 (Min 819, Max 858; SE +/- 12.00, N = 3)
  c6a.4xlarge EPYC: Avg 1001.11 (Min 943, Max 1052; SE +/- 11.74, N = 9)
  c6i.4xlarge Xeon: Avg 1466.33 (Min 1447, Max 1492; SE +/- 13.37, N = 3)
  1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.28 - Backend: BLAS
Nodes Per Second, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 1102.67 (Min 1090, Max 1111; SE +/- 6.44, N = 3)
  a1.4xlarge Graviton: Avg 135.33 (Min 134, Max 137; SE +/- 0.88, N = 3)
  c6g.4xlarge Graviton2: Avg 864.25 (Min 840, Max 890; SE +/- 10.22, N = 4)
  c6a.4xlarge EPYC: Avg 1091.11 (Min 1024, Max 1144; SE +/- 12.82, N = 9)
  c6i.4xlarge Xeon: Avg 1396.67 (Min 1345, Max 1452; SE +/- 12.41, N = 9)
  1. (CXX) g++ options: -flto -pthread

Coremark

This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second
Iterations/Sec, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 405413.86 (Min 399077.13, Max 409495.17; SE +/- 3211.91, N = 3)
  a1.4xlarge Graviton: Avg 203869.4 (Min 203704.88, Max 204094.65; SE +/- 116.54, N = 3)
  c6g.4xlarge Graviton2: Avg 315464.34 (Min 315395.23, Max 315561.11; SE +/- 49.84, N = 3)
  c6a.4xlarge EPYC: Avg 345133.44 (Min 342961.26, Max 349459.43; SE +/- 2163.00, N = 3)
  c6i.4xlarge Xeon: Avg 285378.84 (Min 285243.13, Max 285523.09; SE +/- 80.93, N = 3)
  1. (CC) gcc options: -O2 -lrt" -lrt

N-Queens

This is a test of the OpenMP version of a test that solves the N-queens problem. The board problem size is 18. Learn more via the OpenBenchmarking.org test page.
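
For readers unfamiliar with the problem, the following is a small serial sketch of counting N-queens solutions with bitmasks. It is not the benchmark's code: the benchmark uses OpenMP and a board size of 18, while this illustration is single-threaded Python meant for small boards, and the function name is hypothetical.

```python
def count_n_queens(n):
    """Count all placements of n non-attacking queens on an n x n board."""
    full = (1 << n) - 1  # bitmask with one bit per column

    def place(cols, diag_l, diag_r):
        # cols, diag_l, diag_r mark the squares attacked in the current row
        if cols == full:
            return 1  # every column is occupied, so every row holds a queen
        total = 0
        free = full & ~(cols | diag_l | diag_r)
        while free:
            bit = free & -free  # lowest free square in this row
            free ^= bit
            total += place(cols | bit,
                           ((diag_l | bit) << 1) & full,
                           (diag_r | bit) >> 1)
        return total

    return place(0, 0, 0)


print(count_n_queens(8))  # 92 solutions on the standard 8 x 8 board
```

The solution count grows very quickly with board size, which is why a size-18 board is enough to keep a 16-vCPU instance busy for tens of seconds.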

N-Queens 1.0 - Elapsed Time
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 21.54 (Min 21.54, Max 21.54; SE +/- 0.00, N = 3)
  a1.4xlarge Graviton: Avg 32.29 (Min 32.28, Max 32.29; SE +/- 0.00, N = 3)
  c6g.4xlarge Graviton2: Avg 23.14 (Min 23.13, Max 23.14; SE +/- 0.00, N = 3)
  c6a.4xlarge EPYC: Avg 16.38 (Min 16.37, Max 16.39; SE +/- 0.00, N = 3)
  c6i.4xlarge Xeon: Avg 18.84 (Min 18.84, Max 18.84; SE +/- 0.00, N = 3)
  1. (CC) gcc options: -static -fopenmp -O3 -march=native

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

7-Zip Compression 21.06 - Test: Decompression Rating
MIPS, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 73053.67 (Min 73037, Max 73079; SE +/- 12.88, N = 3)
  a1.4xlarge Graviton: Avg 40891 (Min 40833, Max 40940; SE +/- 31.21, N = 3)
  c6g.4xlarge Graviton2: Avg 59445.33 (Min 58966, Max 59689; SE +/- 239.68, N = 3)
  c6a.4xlarge EPYC: Avg 57318.33 (Min 57036, Max 57494; SE +/- 142.56, N = 3)
  c6i.4xlarge Xeon: Avg 45653.33 (Min 45595, Max 45716; SE +/- 35.00, N = 3)
  1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

m-queens 1.2 - Time To Solve
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 66.82 (Min 66.82, Max 66.83; SE +/- 0.00, N = 3)
  a1.4xlarge Graviton: Avg 110.37 (Min 110.36, Max 110.38; SE +/- 0.01, N = 3)
  c6g.4xlarge Graviton2: Avg 75.22 (Min 75.22, Max 75.23; SE +/- 0.00, N = 3)
  c6a.4xlarge EPYC: Avg 72.33 (Min 72.3, Max 72.37; SE +/- 0.02, N = 3)
  c6i.4xlarge Xeon: Avg 91.23 (Min 91.17, Max 91.34; SE +/- 0.06, N = 3)
  1. (CXX) g++ options: -fopenmp -O2 -march=native

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: IO_uring
Bogo Ops/s, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 843015.78 (Min 841810.62, Max 843823.92; SE +/- 614.16, N = 3)
  a1.4xlarge Graviton: Avg 918172.37 (Min 912809.75, Max 925614.93; SE +/- 3840.04, N = 3)
  c6g.4xlarge Graviton2: Avg 770521.81 (Min 767318.01, Max 775207.81; SE +/- 2395.13, N = 3)
  c6a.4xlarge EPYC: Avg 768723.46 (Min 767641.15, Max 770068.79; SE +/- 713.03, N = 3)
  c6i.4xlarge Xeon: Avg 1037943.37 (Min 1037132.46, Max 1038364.88; SE +/- 405.56, N = 3)
  1. (CC) gcc options: -O2 -std=gnu99 -lm -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.11 - Model: super-resolution-10 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 2817.33 (Min 2815, Max 2821; SE +/- 1.86, N = 3)
  a1.4xlarge Graviton: Avg 757 (Min 756.5, Max 758; SE +/- 0.50, N = 3)
  c6g.4xlarge Graviton2: Avg 2072.33 (Min 2069.5, Max 2075.5; SE +/- 1.74, N = 3)
  c6a.4xlarge EPYC: Avg 3696.04 (Min 3124.5, Max 4905; SE +/- 234.97, N = 12)
  c6i.4xlarge Xeon: Avg 3449.5 (Min 3446.5, Max 3452; SE +/- 1.61, N = 3)
  1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 608.5 (Min 608.5, Max 608.5; SE +/- 0.00, N = 3)
  a1.4xlarge Graviton: Avg 164.5 (Min 163.5, Max 165; SE +/- 0.50, N = 3)
  c6g.4xlarge Graviton2: Avg 333.83 (Min 333.5, Max 334; SE +/- 0.17, N = 3)
  c6a.4xlarge EPYC: Avg 1192.42 (Min 924.5, Max 1524.5; SE +/- 82.60, N = 12)
  c6i.4xlarge Xeon: Avg 1373.75 (Min 1191, Max 1918.5; SE +/- 91.51, N = 12)
  1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: fcn-resnet101-11 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 38 (Min 38, Max 38; SE +/- 0.00, N = 3)
  a1.4xlarge Graviton: Avg 9.5 (Min 9.5, Max 9.5; SE +/- 0.00, N = 3)
  c6g.4xlarge Graviton2: Avg 27.5 (Min 27.5, Max 27.5; SE +/- 0.00, N = 3)
  c6a.4xlarge EPYC: Avg 65.08 (Min 46.5, Max 84; SE +/- 5.55, N = 12)
  c6i.4xlarge Xeon: Avg 138.83 (Min 138, Max 140; SE +/- 0.60, N = 3)
  1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: bertsquad-12 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 407.17 (Min 407, Max 407.5; SE +/- 0.17, N = 3)
  a1.4xlarge Graviton: Avg 115.17 (Min 113.5, Max 116.5; SE +/- 0.88, N = 3)
  c6g.4xlarge Graviton2: Avg 321.67 (Min 321.5, Max 322; SE +/- 0.17, N = 3)
  c6a.4xlarge EPYC: Avg 487.5 (Min 486.5, Max 488.5; SE +/- 0.58, N = 3)
  c6i.4xlarge Xeon: Avg 773.25 (Min 632.5, Max 1004; SE +/- 50.92, N = 12)
  1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: GPT-2 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 7990.17 (Min 7985.5, Max 7993.5; SE +/- 2.40, N = 3)
  a1.4xlarge Graviton: Avg 2312.17 (Min 2308, Max 2315.5; SE +/- 2.20, N = 3)
  c6g.4xlarge Graviton2: Avg 6947.5 (Min 6944, Max 6954.5; SE +/- 3.50, N = 3)
  c6a.4xlarge EPYC: Avg 5617 (Min 5386, Max 5986.5; SE +/- 75.29, N = 12)
  c6i.4xlarge Xeon: Avg 7944.42 (Min 6856, Max 9074.5; SE +/- 322.41, N = 12)
  1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18 - Model: Mobilenet Quant
Microseconds, Fewer Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 1502.95 (Min 1468.14, Max 1526.49; SE +/- 17.76, N = 3)
  a1.4xlarge Graviton: Avg 5724.66 (Min 5686.62, Max 5758.7; SE +/- 20.90, N = 3)
  c6g.4xlarge Graviton2: Avg 1980.24 (Min 1956.49, Max 2006.34; SE +/- 14.44, N = 3)
  c6a.4xlarge EPYC: Avg 3847.96 (Min 3543.81, Max 4160; SE +/- 53.31, N = 15)
  c6i.4xlarge Xeon: Avg 3967.39 (Min 3442.24, Max 4294.78; SE +/- 80.05, N = 12)

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 38.52 (Min 38.49, Max 38.55; SE +/- 0.02, N = 3)
  a1.4xlarge Graviton: Avg 104.76 (Min 97.09, Max 116.7; SE +/- 2.00, N = 15)
  c6g.4xlarge Graviton2: Avg 62.32 (Min 62.29, Max 62.38; SE +/- 0.03, N = 3)
  c6a.4xlarge EPYC: Avg 69.35 (Min 68.48, Max 72.44; SE +/- 0.77, N = 5)
  c6i.4xlarge Xeon: Avg 92.55 (Min 92.49, Max 92.63; SE +/- 0.04, N = 3)
  1. (CC) gcc options: -lm -lpthread -O3

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

Rodinia 3.1 - Test: OpenMP Streamcluster
Seconds, Fewer Is Better (OpenBenchmarking.org)
  c7g.4xlarge Graviton3: Avg 13.3 (Min 11.89, Max 14.87; SE +/- 0.33, N = 12)
  a1.4xlarge Graviton: Avg 47.43 (Min 47.4, Max 47.47; SE +/- 0.02, N = 3)
  c6g.4xlarge Graviton2: Avg 15.48 (Min 14.26, Max 17.08; SE +/- 0.26, N = 15)
  c6a.4xlarge EPYC: Avg 18.38 (Min 18.28, Max 18.45; SE +/- 0.05, N = 3)
  c6i.4xlarge Xeon: Avg 23.51 (Min 23.39, Max 23.63; SE +/- 0.07, N = 3)
  1. (CXX) g++ options: -O2 -lOpenCL
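
Because some results are "More Is Better" and others "Fewer Is Better", comparing two instances across tests needs each result normalized into a speedup first. The sketch below does that for Graviton3 versus Graviton2 using five averages from the results above, then takes their geometric mean. The choice of these five tests is purely illustrative, so the outcome is not the page's own overall geometric mean; the variable and function names are hypothetical.

```python
from statistics import geometric_mean

# (test, higher_is_better, c7g.4xlarge Graviton3 avg, c6g.4xlarge Graviton2 avg)
results = [
    ("Stress-NG Crypto (Bogo Ops/s)", True, 23181.81, 17924.18),
    ("asmFish (Nodes/second)", True, 32134123.33, 26540482),
    ("C-Ray (Seconds)", False, 38.52, 62.32),
    ("Rodinia OpenMP Streamcluster (Seconds)", False, 13.30, 15.48),
    ("N-Queens (Seconds)", False, 21.54, 23.14),
]


def speedup(higher_is_better, new, old):
    """Return how many times faster `new` is than `old` (above 1.0 means faster)."""
    return new / old if higher_is_better else old / new


ratios = {name: speedup(hib, g3, g2) for name, hib, g3, g2 in results}
overall = geometric_mean(list(ratios.values()))

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
print(f"Geometric mean over these five tests: {overall:.2f}x")
```

For this subset the geometric mean works out to roughly 1.26x in favor of Graviton3. The geometric mean is used rather than the arithmetic mean so that a single large ratio, such as C-Ray here, does not dominate the summary.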

94 Results Shown
