Tau T2A 8 16 32 vCPU Scaling

Benchmarks by Michael Larabel for a future article

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2208120-PTS-2208123N58
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Comparison
Transpose Comparison

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Tau T2A: 8 vCPUs
August 10 2022
  1 Day, 5 Hours, 56 Minutes
Tau T2A: 16 vCPUs
August 10 2022
  1 Day, 1 Hour, 45 Minutes
Tau T2A: 32 vCPUs
August 11 2022
  1 Day, 6 Hours, 6 Minutes
Invert Behavior (Only Show Selected Data)
  1 Day, 4 Hours, 35 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


ProcessorMotherboardMemoryDiskNetworkOSKernelCompilerFile-SystemSystem LayerTau T2A 8 vCPUs 16 vCPUs 32 vCPUsARMv8 Neoverse-N1 (8 Cores)KVM Google Compute Engine32GB215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 22.045.15.0-1013-gcp (aarch64)GCC 12.0.1 20220319ext4KVMARMv8 Neoverse-N1 (16 Cores)64GBARMv8 Neoverse-N1 (32 Cores)128GB5.15.0-1016-gcp (aarch64)OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Java Details- OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04)Python Details- Python 3.10.4Security Details- Tau T2A: 8 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - Tau T2A: 16 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - Tau T2A: 32 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

spec-jbb2015: SPECjbb2015-Composite critical-jOPSspec-jbb2015: SPECjbb2015-Composite max-jOPSlammps: 20k Atomsrenaissance: Savina Reactors.IOopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timegraph500: 26graph500: 26graph500: 26graph500: 26spark: 40000000 - 2000 - Broadcast Inner Join Test Timespark: 40000000 - 2000 - Inner Join Test Timespark: 40000000 - 2000 - Repartition Test Timespark: 40000000 - 2000 - Group By Test Timespark: 40000000 - 2000 - Calculate Pi Benchmark Using Dataframespark: 40000000 - 2000 - Calculate Pi Benchmarkspark: 40000000 - 2000 - SHA-512 Benchmark Timespark: 40000000 - 100 - Broadcast Inner Join Test Timespark: 40000000 - 100 - Inner Join Test Timespark: 40000000 - 100 - Repartition Test Timespark: 40000000 - 100 - Group By Test Timespark: 40000000 - 100 - Calculate Pi Benchmark Using Dataframespark: 40000000 - 100 - Calculate Pi Benchmarkspark: 40000000 - 100 - SHA-512 Benchmark Timeblender: Classroom - CPU-Onlybuild-gem5: Time To Compilespark: 1000000 - 100 - Broadcast Inner Join Test Timespark: 1000000 - 100 - Inner Join Test Timespark: 1000000 - 100 - Repartition Test Timespark: 1000000 - 100 - Calculate Pi Benchmark Using Dataframespark: 1000000 - 100 - Calculate Pi Benchmarkspark: 1000000 - 100 - SHA-512 Benchmark Timeblender: Fishy Cat - CPU-Onlypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Onlyspark: 1000000 - 2000 - Broadcast Inner Join Test Timespark: 1000000 - 2000 - Inner Join Test Timespark: 1000000 - 2000 - Repartition Test Timespark: 1000000 - 2000 - Group By Test Timespark: 1000000 - 2000 - Calculate Pi Benchmark Using Dataframespark: 1000000 - 2000 - Calculate Pi Benchmarkspark: 1000000 - 2000 - SHA-512 Benchmark Timeavifenc: 0rocksdb: Read While Writingblender: BMW27 - CPU-Onlygpaw: Carbon Nanotubetnn: CPU - DenseNetgromacs: MPI CPU - water_GMX50_barecassandra: Writesrocksdb: Read Rand Write Randavifenc: 2openssl: SHA256askap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingastcenc: Exhaustivepgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlyaircrack-ng: renaissance: Apache Spark Bayesaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingvpxenc: Speed 0 - Bosphorus 1080phpcg: npb: SP.Cstress-ng: Futexnpb: BT.Cnpb: EP.Dtensorflow-lite: SqueezeNetvpxenc: Speed 5 - Bosphorus 4Ksysbench: CPUbuild-ffmpeg: Time To Compiletensorflow-lite: Inception V4tensorflow-lite: Inception ResNet V2tensorflow-lite: Mobilenet Floatopenssl: RSA4096openssl: RSA4096rocksdb: Rand Readbuild-mplayer: Time To Compilevpxenc: Speed 5 - Bosphorus 1080pdacapobench: Tradesoapnpb: CG.Cnpb: LU.Cnpb: IS.Dstress-ng: CPU Stressstress-ng: CPU Cachestress-ng: Vector Mathstress-ng: Matrix Mathstress-ng: System V Message Passingnpb: SP.Btnn: CPU - MobileNet v2astcenc: Thoroughaskap: Hogbom Clean OpenMPcoremark: CoreMark Size 666 - Iterations Per Secondavifenc: 6, Losslessaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingnpb: FT.Cavifenc: 6avifenc: 10, Losslessastcenc: Mediumnpb: MG.Clammps: Rhodopsin ProteinTau T2A 8 vCPUs 16 vCPUs 32 vCPUs392191584.66226456.42426.16425.9574.7178.0066.2745.6115.71277.8389.2380.2680.0268.6850.8715.65278.36684567793.671016.66917.3932.863.484.5815.92277.896.28841.185.045496285.185.985.678.8515.81278.4725272298.09456.237594702447.71381.1963842.1230.4517862548353245.60164560835071977.891325.06276.87981.844542378308.5802249.82196.742360.634.6511.09697115.28937451.9914368.29820.946618.326.1127237.28113.45597646.191592.64395.7332136.8393.73105568988.83811.2780406855.8132029.141104.262065.53436.3124633.9938215.954507844.177338.98331.34429.0505371.295175037.76514323.3122421.432296.1018574.2320.2739.9979.050527703.334.8129207180928.49915981.51534.72303.71262563000952655002574780007075020042.7544.3935.4530.708.34137.04020772451.5545.2944.6237.2235.648.40136.84961301951.40506.10495.5411.792.222.558.39137.7612851904.93426.041.9001316072.653.653.367.438.29137.1739761995.91328.9701264826226.26208.9653358.7280.88039296884700194.774129264115273343.252585.31137.62180.63315789416697.9241262.04083.163789.014.8417.098419710.901198681.8749125.931634.993955.896.6854317.4261.98346113.445445.92481.9664247.0786.76204896747.62111.78559812171.9555447.311498.454116.96551.2549102.3076177.565475267.3619552.45328.88614.2146645.161351562.54315814.7025023.73631.8132644.8511.1687.6586.944933309.768.861229553507516.55010705.9994.53206.450837200016954200047737700012470200026.5528.6622.2222.844.7869.7939.2231.9830.3224.3627.644.7669.5746.30249.89312.1201.682.132.014.7969.774.79214.410.8033122392.122.872.606.724.8069.924.96266.3372610992112.47130.3533056.8971.718878191321827169.639257889199133899.283962.0868.65570.30432953933647.548766.45522.074456.554.9922.093026843.581437660.6269530.643265.683853.906.99108241.6138.95831657.333994.92093.25128273.11570.212470420128.92812.11501521433.9287702.301822.778209.47566.9197749.08151792.836128517.1034381.91322.7687.1619996.700700917.94473710.3419181.247262.7452309.816.6826.7755.982550939.0516.596OpenBenchmarking.org

SPECjbb 2015

This is a benchmark of SPECjbb 2015. For this test profile to work, you must have a valid license/copy of the SPECjbb 2015 ISO (SPECjbb2015-1.02.iso) in your Phoronix Test Suite download cache. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgjOPS, More Is BetterSPECjbb 2015SPECjbb2015-Composite critical-jOPS32 vCPUs16 vCPUs8 vCPUs5K10K15K20K25K2295592073921

OpenBenchmarking.orgjOPS, More Is BetterSPECjbb 2015SPECjbb2015-Composite max-jOPS32 vCPUs16 vCPUs8 vCPUs8K16K24K32K40K35075180929158

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atoms32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.004, N = 3SE +/- 0.112, N = 3SE +/- 0.025, N = 316.5508.4994.6621. (CXX) g++ options: -O3 -march=native -ldl

Renaissance

Renaissance is a suite of benchmarks designed to test the Java JVM from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.14Test: Savina Reactors.IO32 vCPUs16 vCPUs8 vCPUs6K12K18K24K30KSE +/- 131.70, N = 4SE +/- 583.25, N = 12SE +/- 435.60, N = 910705.915981.526456.4MIN: 10505.49 / MAX: 14847.21MIN: 12776.53 / MAX: 36273.51MIN: 13667.16 / MAX: 42318.14

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Execution Time32 vCPUs16 vCPUs8 vCPUs5001000150020002500994.531534.722426.16-lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats-lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats-ltransportModels -lspecie -lfiniteVolume -lfvModels -lmeshTools -lsampling1. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 9Input: drivaerFastback, Medium Mesh Size - Mesh Time32 vCPUs16 vCPUs8 vCPUs90180270360450206.40303.71425.95-lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats-lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats-ltransportModels -lspecie -lfiniteVolume -lfvModels -lmeshTools -lsampling1. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lgenericPatchFields -lOpenFOAM -ldl -lm

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbfs max_TEPS, More Is BetterGraph500 3.0Scale: 2632 vCPUs16 vCPUs110M220M330M440M550M5083720002625630001. (CC) gcc options: -fcommon -O3 -march=native -lpthread -lm -lmpi

Scale: 26

Tau T2A: 8 vCPUs: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node instance-2 exited on signal 9 (Killed).

OpenBenchmarking.orgsssp max_TEPS, More Is BetterGraph500 3.0Scale: 2632 vCPUs16 vCPUs40M80M120M160M200M169542000952655001. (CC) gcc options: -fcommon -O3 -march=native -lpthread -lm -lmpi

Scale: 26

Tau T2A: 8 vCPUs: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node instance-2 exited on signal 9 (Killed).

OpenBenchmarking.orgbfs median_TEPS, More Is BetterGraph500 3.0Scale: 2632 vCPUs16 vCPUs100M200M300M400M500M4773770002574780001. (CC) gcc options: -fcommon -O3 -march=native -lpthread -lm -lmpi

Scale: 26

Tau T2A: 8 vCPUs: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node instance-2 exited on signal 9 (Killed).

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 2632 vCPUs16 vCPUs30M60M90M120M150M124702000707502001. (CC) gcc options: -fcommon -O3 -march=native -lpthread -lm -lmpi

Scale: 26

Tau T2A: 8 vCPUs: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node instance-2 exited on signal 9 (Killed).

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Broadcast Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.17, N = 12SE +/- 0.40, N = 3SE +/- 0.33, N = 326.5542.7574.71

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.19, N = 12SE +/- 0.73, N = 3SE +/- 1.11, N = 328.6644.3978.00

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Repartition Test Time32 vCPUs16 vCPUs8 vCPUs1530456075SE +/- 0.24, N = 12SE +/- 0.20, N = 3SE +/- 0.36, N = 322.2235.4566.27

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Group By Test Time32 vCPUs16 vCPUs8 vCPUs1020304050SE +/- 0.32, N = 12SE +/- 0.23, N = 3SE +/- 0.57, N = 322.8430.7045.61

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Calculate Pi Benchmark Using Dataframe32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.02, N = 12SE +/- 0.03, N = 3SE +/- 0.00, N = 34.788.3415.71

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - Calculate Pi Benchmark32 vCPUs16 vCPUs8 vCPUs60120180240300SE +/- 0.11, N = 12SE +/- 0.17, N = 3SE +/- 0.13, N = 369.79137.04277.83

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 2000 - SHA-512 Benchmark Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.55, N = 12SE +/- 0.08, N = 3SE +/- 0.52, N = 339.2251.5589.23

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Broadcast Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.26, N = 9SE +/- 0.44, N = 3SE +/- 0.22, N = 331.9845.2980.26

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.44, N = 9SE +/- 0.72, N = 3SE +/- 0.19, N = 330.3244.6280.02

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Repartition Test Time32 vCPUs16 vCPUs8 vCPUs1530456075SE +/- 0.12, N = 9SE +/- 0.92, N = 3SE +/- 0.25, N = 324.3637.2268.68

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Group By Test Time32 vCPUs16 vCPUs8 vCPUs1122334455SE +/- 0.16, N = 9SE +/- 0.24, N = 3SE +/- 0.53, N = 327.6435.6450.87

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.01, N = 9SE +/- 0.01, N = 3SE +/- 0.05, N = 34.768.4015.65

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - Calculate Pi Benchmark32 vCPUs16 vCPUs8 vCPUs60120180240300SE +/- 0.08, N = 9SE +/- 0.06, N = 3SE +/- 0.14, N = 369.57136.85278.37

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 40000000 - Partitions: 100 - SHA-512 Benchmark Time32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.45, N = 9SE +/- 0.22, N = 3SE +/- 0.85, N = 346.3051.4093.67

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlenderBlend File: Classroom - Compute: CPU-Only32 vCPUs16 vCPUs8 vCPUs2004006008001000SE +/- 0.07, N = 3SE +/- 0.22, N = 3SE +/- 1.99, N = 3249.89506.101016.66

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compile32 vCPUs16 vCPUs8 vCPUs2004006008001000SE +/- 1.96, N = 3SE +/- 1.16, N = 3SE +/- 0.66, N = 3312.12495.54917.39

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Broadcast Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs0.64351.2871.93052.5743.2175SE +/- 0.03, N = 15SE +/- 0.04, N = 12SE +/- 0.03, N = 31.681.792.86

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs0.7831.5662.3493.1323.915SE +/- 0.02, N = 15SE +/- 0.03, N = 12SE +/- 0.04, N = 32.132.223.48

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Repartition Test Time32 vCPUs16 vCPUs8 vCPUs1.03052.0613.09154.1225.1525SE +/- 0.03, N = 15SE +/- 0.03, N = 12SE +/- 0.02, N = 32.012.554.58

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark Using Dataframe32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.01, N = 15SE +/- 0.02, N = 12SE +/- 0.02, N = 34.798.3915.92

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark32 vCPUs16 vCPUs8 vCPUs60120180240300SE +/- 0.06, N = 15SE +/- 0.11, N = 12SE +/- 0.17, N = 369.77137.76277.89

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 100 - SHA-512 Benchmark Time32 vCPUs16 vCPUs8 vCPUs246810SE +/- 0.11, N = 15SE +/- 0.05, N = 12SE +/- 0.03, N = 34.794.936.28

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlenderBlend File: Fishy Cat - Compute: CPU-Only32 vCPUs16 vCPUs8 vCPUs2004006008001000SE +/- 0.42, N = 3SE +/- 0.85, N = 3SE +/- 1.74, N = 3214.41426.04841.18

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency32 vCPUs16 vCPUs8 vCPUs1.13512.27023.40534.54045.6755SE +/- 0.012, N = 12SE +/- 0.021, N = 3SE +/- 0.060, N = 120.8031.9005.0451. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only32 vCPUs16 vCPUs8 vCPUs70K140K210K280K350KSE +/- 4561.68, N = 12SE +/- 1418.81, N = 3SE +/- 588.20, N = 12312239131607496281. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmars the Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating of test data and various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Broadcast Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs1.16552.3313.49654.6625.8275SE +/- 0.02, N = 15SE +/- 0.05, N = 3SE +/- 0.07, N = 32.122.655.18

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Inner Join Test Time32 vCPUs16 vCPUs8 vCPUs1.34552.6914.03655.3826.7275SE +/- 0.04, N = 15SE +/- 0.11, N = 3SE +/- 0.09, N = 32.873.655.98

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Repartition Test Time32 vCPUs16 vCPUs8 vCPUs1.27582.55163.82745.10326.379SE +/- 0.03, N = 15SE +/- 0.01, N = 3SE +/- 0.01, N = 32.603.365.67

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Group By Test Time32 vCPUs16 vCPUs8 vCPUs246810SE +/- 0.05, N = 15SE +/- 0.10, N = 3SE +/- 0.13, N = 36.727.438.85

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Calculate Pi Benchmark Using Dataframe32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.01, N = 15SE +/- 0.03, N = 3SE +/- 0.07, N = 34.808.2915.81

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - Calculate Pi Benchmark32 vCPUs16 vCPUs8 vCPUs60120180240300SE +/- 0.06, N = 15SE +/- 0.20, N = 3SE +/- 0.35, N = 369.92137.17278.47

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark 3.3Row Count: 1000000 - Partitions: 2000 - SHA-512 Benchmark Time32 vCPUs16 vCPUs8 vCPUs246810SE +/- 0.04, N = 15SE +/- 0.05, N = 3SE +/- 0.08, N = 34.965.918.09

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 032 vCPUs16 vCPUs8 vCPUs100200300400500SE +/- 0.65, N = 3SE +/- 0.80, N = 3SE +/- 0.90, N = 3266.34328.97456.241. (CXX) g++ options: -O3 -fPIC -march=native -lm

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 7.0.1Test: Read While Writing32 vCPUs16 vCPUs8 vCPUs600K1200K1800K2400K3000KSE +/- 32390.32, N = 12SE +/- 20446.66, N = 15SE +/- 9067.95, N = 15261099212648265947021. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlenderBlend File: BMW27 - Compute: CPU-Only32 vCPUs16 vCPUs8 vCPUs100200300400500SE +/- 0.10, N = 3SE +/- 0.50, N = 3SE +/- 0.04, N = 3112.47226.26447.71

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotube32 vCPUs16 vCPUs8 vCPUs80160240320400SE +/- 0.30, N = 3SE +/- 0.03, N = 3SE +/- 0.63, N = 3130.35208.97381.201. (CC) gcc options: -shared -fwrapv -O2 -O3 -march=native -lxc -lblas -lmpi

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: DenseNet32 vCPUs16 vCPUs8 vCPUs8001600240032004000SE +/- 6.90, N = 3SE +/- 12.43, N = 3SE +/- 9.75, N = 33056.903358.733842.12MIN: 2928.19 / MAX: 3237.58MIN: 3163.2 / MAX: 3575.85MIN: 3619.38 / MAX: 4060.161. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare32 vCPUs16 vCPUs8 vCPUs0.38660.77321.15981.54641.933SE +/- 0.010, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 31.7180.8800.4501. (CXX) g++ options: -O3 -march=native

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.0Test: Writes32 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 777.36, N = 3SE +/- 256.55, N = 3SE +/- 136.95, N = 10878193929617862

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 7.0.1Test: Read Random Write Random32 vCPUs16 vCPUs8 vCPUs300K600K900K1200K1500KSE +/- 9643.50, N = 15SE +/- 1701.74, N = 3SE +/- 4976.00, N = 1513218278847005483531. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 232 vCPUs16 vCPUs8 vCPUs50100150200250SE +/- 0.13, N = 3SE +/- 0.47, N = 3SE +/- 0.32, N = 3169.64194.77245.601. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.0Algorithm: SHA25632 vCPUs16 vCPUs8 vCPUs6000M12000M18000M24000M30000MSE +/- 119493320.18, N = 3SE +/- 19283388.31, N = 3SE +/- 19026629.44, N = 3257889199131292641152764560835071. (CC) gcc options: -pthread -O3 -march=native -lssl -lcrypto -ldl

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding32 vCPUs16 vCPUs8 vCPUs8001600240032004000SE +/- 42.99, N = 15SE +/- 32.26, N = 3SE +/- 23.42, N = 153899.283343.251977.891. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding32 vCPUs16 vCPUs8 vCPUs8001600240032004000SE +/- 54.84, N = 15SE +/- 12.80, N = 3SE +/- 24.91, N = 153962.082585.311325.061. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Exhaustive32 vCPUs16 vCPUs8 vCPUs60120180240300SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 3.08, N = 368.66137.62276.881. (CXX) g++ options: -O3 -march=native -flto -pthread

PostgreSQL pgbench

This is a benchmark of PostgreSQL using pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency32 vCPUs16 vCPUs8 vCPUs0.41490.82981.24471.65962.0745SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.023, N = 30.3040.6331.8441. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only32 vCPUs16 vCPUs8 vCPUs70K140K210K280K350KSE +/- 1811.74, N = 3SE +/- 697.61, N = 3SE +/- 663.61, N = 3329539157894542371. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.732 vCPUs16 vCPUs8 vCPUs7K14K21K28K35KSE +/- 287.54, N = 15SE +/- 192.97, N = 15SE +/- 103.85, N = 1533647.5516697.928308.58-lpcre-lpcre1. (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread

Renaissance

Renaissance is a suite of benchmarks designed to test the Java JVM from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterRenaissance 0.14Test: Apache Spark Bayes32 vCPUs16 vCPUs8 vCPUs5001000150020002500SE +/- 9.73, N = 3SE +/- 7.45, N = 3SE +/- 42.77, N = 15766.41262.02249.8MIN: 495.95 / MAX: 1178.88MIN: 877.37 / MAX: 1398.23MIN: 1478.18 / MAX: 2434.18

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding32 vCPUs16 vCPUs8 vCPUs12002400360048006000SE +/- 80.56, N = 15SE +/- 2.61, N = 3SE +/- 7.87, N = 35522.074083.162196.741. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding32 vCPUs16 vCPUs8 vCPUs10002000300040005000SE +/- 35.89, N = 15SE +/- 5.95, N = 3SE +/- 5.73, N = 34456.553789.012360.631. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 0 - Input: Bosphorus 1080p32 vCPUs16 vCPUs8 vCPUs1.12282.24563.36844.49125.614SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.994.844.651. (CXX) g++ options: -lm -lpthread -O3 -march=native -march=armv8-a -fPIC -U_FORTIFY_SOURCE -std=gnu++11

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.132 vCPUs16 vCPUs8 vCPUs510152025SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 322.0917.1011.101. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C32 vCPUs16 vCPUs8 vCPUs6K12K18K24K30KSE +/- 31.60, N = 3SE +/- 112.56, N = 3SE +/- 39.91, N = 326843.5819710.907115.281. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Futex32 vCPUs16 vCPUs8 vCPUs300K600K900K1200K1500KSE +/- 15026.23, N = 3SE +/- 30917.20, N = 15SE +/- 36001.77, N = 151437660.621198681.87937451.991. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.C32 vCPUs16 vCPUs8 vCPUs15K30K45K60K75KSE +/- 272.46, N = 3SE +/- 18.18, N = 3SE +/- 23.11, N = 369530.6449125.9314368.291. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D32 vCPUs16 vCPUs8 vCPUs7001400210028003500SE +/- 2.04, N = 3SE +/- 1.03, N = 3SE +/- 0.56, N = 33265.681634.99820.941. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: SqueezeNet32 vCPUs16 vCPUs8 vCPUs14002800420056007000SE +/- 31.57, N = 8SE +/- 11.05, N = 3SE +/- 9.96, N = 33853.903955.896618.32

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 4K32 vCPUs16 vCPUs8 vCPUs246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.996.686.111. (CXX) g++ options: -lm -lpthread -O3 -march=native -march=armv8-a -fPIC -U_FORTIFY_SOURCE -std=gnu++11

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU32 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 23.77, N = 3SE +/- 12.70, N = 3SE +/- 6.95, N = 3108241.6154317.4227237.281. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.4Time To Compile32 vCPUs16 vCPUs8 vCPUs306090120150SE +/- 0.16, N = 3SE +/- 0.09, N = 3SE +/- 0.18, N = 338.9661.98113.46

TensorFlow Lite

This is a benchmark of the TensorFlow Lite implementation focused on TensorFlow machine learning for mobile, IoT, edge, and other cases. The current Linux support is limited to running on CPUs. This test profile is measuring the average inference time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception V432 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 149.01, N = 3SE +/- 84.24, N = 3SE +/- 49.87, N = 331657.346113.497646.1

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Inception ResNet V232 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 379.42, N = 3SE +/- 16.18, N = 3SE +/- 70.02, N = 333994.945445.991592.6

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2022-05-18Model: Mobilenet Float32 vCPUs16 vCPUs8 vCPUs9001800270036004500SE +/- 17.55, N = 3SE +/- 4.32, N = 3SE +/- 3.06, N = 32093.252481.964395.73

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.0Algorithm: RSA409632 vCPUs16 vCPUs8 vCPUs30K60K90K120K150KSE +/- 29.86, N = 3SE +/- 8.35, N = 3SE +/- 10.26, N = 3128273.164247.032136.81. (CC) gcc options: -pthread -O3 -march=native -lssl -lcrypto -ldl

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.0Algorithm: RSA409632 vCPUs16 vCPUs8 vCPUs30060090012001500SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 31570.2786.7393.71. (CC) gcc options: -pthread -O3 -march=native -lssl -lcrypto -ldl

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterFacebook RocksDB 7.0.1Test: Random Read32 vCPUs16 vCPUs8 vCPUs30M60M90M120M150MSE +/- 376574.31, N = 3SE +/- 735054.27, N = 3SE +/- 252880.06, N = 312470420162048967310556891. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Timed MPlayer Compilation

This test times how long it takes to build the MPlayer open-source media player program. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile32 vCPUs16 vCPUs8 vCPUs20406080100SE +/- 0.35, N = 4SE +/- 0.26, N = 3SE +/- 0.03, N = 328.9347.6288.84

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 1080p32 vCPUs16 vCPUs8 vCPUs3691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 312.1111.7811.271. (CXX) g++ options: -lm -lpthread -O3 -march=native -march=armv8-a -fPIC -U_FORTIFY_SOURCE -std=gnu++11

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Tradesoap32 vCPUs16 vCPUs8 vCPUs2K4K6K8K10KSE +/- 95.95, N = 20SE +/- 52.52, N = 4SE +/- 68.63, N = 4501555988040

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C32 vCPUs16 vCPUs8 vCPUs5K10K15K20K25KSE +/- 35.67, N = 3SE +/- 171.15, N = 3SE +/- 49.63, N = 1521433.9212171.956855.811. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C32 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 137.48, N = 3SE +/- 701.24, N = 3SE +/- 50.76, N = 387702.3055447.3132029.141. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D32 vCPUs16 vCPUs8 vCPUs400800120016002000SE +/- 0.86, N = 3SE +/- 14.70, N = 3SE +/- 1.14, N = 31822.771498.451104.261. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Stress32 vCPUs16 vCPUs8 vCPUs2K4K6K8K10KSE +/- 4.23, N = 3SE +/- 2.80, N = 3SE +/- 1.28, N = 38209.474116.962065.531. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: CPU Cache32 vCPUs16 vCPUs8 vCPUs120240360480600SE +/- 0.28, N = 3SE +/- 2.05, N = 3SE +/- 2.30, N = 3566.91551.25436.311. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Vector Math32 vCPUs16 vCPUs8 vCPUs20K40K60K80K100KSE +/- 190.70, N = 3SE +/- 27.43, N = 3SE +/- 6.31, N = 397749.0849102.3024633.991. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: Matrix Math32 vCPUs16 vCPUs8 vCPUs30K60K90K120K150KSE +/- 9.80, N = 3SE +/- 10.44, N = 3SE +/- 25.04, N = 3151792.8376177.5638215.951. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.14Test: System V Message Passing32 vCPUs16 vCPUs8 vCPUs1.3M2.6M3.9M5.2M6.5MSE +/- 7551.56, N = 3SE +/- 15929.45, N = 3SE +/- 12538.93, N = 36128517.105475267.364507844.171. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.B32 vCPUs16 vCPUs8 vCPUs7K14K21K28K35KSE +/- 38.20, N = 3SE +/- 244.58, N = 3SE +/- 17.11, N = 334381.9119552.457338.981. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.3Target: CPU - Model: MobileNet v232 vCPUs16 vCPUs8 vCPUs70140210280350SE +/- 0.05, N = 3SE +/- 1.36, N = 3SE +/- 0.78, N = 3322.77328.89331.34MIN: 319.63 / MAX: 326.43MIN: 322.15 / MAX: 373.8MIN: 327.36 / MAX: 339.941. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Thorough32 vCPUs16 vCPUs8 vCPUs714212835SE +/- 0.0033, N = 3SE +/- 0.0106, N = 3SE +/- 0.0316, N = 37.161914.214629.05051. (CXX) g++ options: -O3 -march=native -flto -pthread

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP32 vCPUs16 vCPUs8 vCPUs2004006008001000SE +/- 3.30, N = 3SE +/- 0.00, N = 3SE +/- 1.21, N = 3996.70645.16371.301. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second32 vCPUs16 vCPUs8 vCPUs150K300K450K600K750KSE +/- 385.56, N = 3SE +/- 87.09, N = 3SE +/- 85.46, N = 3700917.94351562.54175037.771. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 6, Lossless32 vCPUs16 vCPUs8 vCPUs612182430SE +/- 0.00, N = 3SE +/- 0.19, N = 3SE +/- 0.11, N = 310.3414.7023.311. (CXX) g++ options: -O3 -fPIC -march=native -lm

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding32 vCPUs16 vCPUs8 vCPUs2K4K6K8K10KSE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 33.24, N = 39181.245023.702421.431. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding32 vCPUs16 vCPUs8 vCPUs16003200480064008000SE +/- 66.63, N = 3SE +/- 43.40, N = 3SE +/- 29.91, N = 37262.743631.812296.101. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C32 vCPUs16 vCPUs8 vCPUs11K22K33K44K55KSE +/- 41.18, N = 3SE +/- 300.01, N = 3SE +/- 15.96, N = 352309.8132644.8518574.231. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 632 vCPUs16 vCPUs8 vCPUs510152025SE +/- 0.020, N = 3SE +/- 0.037, N = 3SE +/- 0.130, N = 36.68211.16820.2731. (CXX) g++ options: -O3 -fPIC -march=native -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.10Encoder Speed: 10, Lossless32 vCPUs16 vCPUs8 vCPUs3691215SE +/- 0.072, N = 3SE +/- 0.065, N = 8SE +/- 0.096, N = 36.7757.6589.9971. (CXX) g++ options: -O3 -fPIC -march=native -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.2Preset: Medium32 vCPUs16 vCPUs8 vCPUs3691215SE +/- 0.0035, N = 3SE +/- 0.0194, N = 3SE +/- 0.0253, N = 35.98256.94499.05051. (CXX) g++ options: -O3 -march=native -flto -pthread

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C32 vCPUs16 vCPUs8 vCPUs11K22K33K44K55KSE +/- 31.40, N = 3SE +/- 102.49, N = 3SE +/- 46.23, N = 350939.0533309.7627703.331. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Protein32 vCPUs16 vCPUs8 vCPUs48121620SE +/- 0.012, N = 3SE +/- 0.037, N = 3SE +/- 0.011, N = 316.5968.8614.8121. (CXX) g++ options: -O3 -march=native -ldl

102 Results Shown

SPECjbb 2015:
  SPECjbb2015-Composite critical-jOPS
  SPECjbb2015-Composite max-jOPS
LAMMPS Molecular Dynamics Simulator
Renaissance
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Execution Time
  drivaerFastback, Medium Mesh Size - Mesh Time
Graph500:
  26:
    bfs max_TEPS
    sssp max_TEPS
    bfs median_TEPS
    sssp median_TEPS
Apache Spark:
  40000000 - 2000 - Broadcast Inner Join Test Time
  40000000 - 2000 - Inner Join Test Time
  40000000 - 2000 - Repartition Test Time
  40000000 - 2000 - Group By Test Time
  40000000 - 2000 - Calculate Pi Benchmark Using Dataframe
  40000000 - 2000 - Calculate Pi Benchmark
  40000000 - 2000 - SHA-512 Benchmark Time
  40000000 - 100 - Broadcast Inner Join Test Time
  40000000 - 100 - Inner Join Test Time
  40000000 - 100 - Repartition Test Time
  40000000 - 100 - Group By Test Time
  40000000 - 100 - Calculate Pi Benchmark Using Dataframe
  40000000 - 100 - Calculate Pi Benchmark
  40000000 - 100 - SHA-512 Benchmark Time
Blender
Timed Gem5 Compilation
Apache Spark:
  1000000 - 100 - Broadcast Inner Join Test Time
  1000000 - 100 - Inner Join Test Time
  1000000 - 100 - Repartition Test Time
  1000000 - 100 - Calculate Pi Benchmark Using Dataframe
  1000000 - 100 - Calculate Pi Benchmark
  1000000 - 100 - SHA-512 Benchmark Time
Blender
PostgreSQL pgbench:
  100 - 250 - Read Only - Average Latency
  100 - 250 - Read Only
Apache Spark:
  1000000 - 2000 - Broadcast Inner Join Test Time
  1000000 - 2000 - Inner Join Test Time
  1000000 - 2000 - Repartition Test Time
  1000000 - 2000 - Group By Test Time
  1000000 - 2000 - Calculate Pi Benchmark Using Dataframe
  1000000 - 2000 - Calculate Pi Benchmark
  1000000 - 2000 - SHA-512 Benchmark Time
libavif avifenc
Facebook RocksDB
Blender
GPAW
TNN
GROMACS
Apache Cassandra
Facebook RocksDB
libavif avifenc
OpenSSL
ASKAP:
  tConvolve MPI - Gridding
  tConvolve MPI - Degridding
ASTC Encoder
PostgreSQL pgbench:
  100 - 100 - Read Only - Average Latency
  100 - 100 - Read Only
Aircrack-ng
Renaissance
ASKAP:
  tConvolve MT - Degridding
  tConvolve MT - Gridding
VP9 libvpx Encoding
High Performance Conjugate Gradient
NAS Parallel Benchmarks
Stress-NG
NAS Parallel Benchmarks:
  BT.C
  EP.D
TensorFlow Lite
VP9 libvpx Encoding
Sysbench
Timed FFmpeg Compilation
TensorFlow Lite:
  Inception V4
  Inception ResNet V2
  Mobilenet Float
OpenSSL:
  RSA4096:
    verify/s
    sign/s
Facebook RocksDB
Timed MPlayer Compilation
VP9 libvpx Encoding
DaCapo Benchmark
NAS Parallel Benchmarks:
  CG.C
  LU.C
  IS.D
Stress-NG:
  CPU Stress
  CPU Cache
  Vector Math
  Matrix Math
  System V Message Passing
NAS Parallel Benchmarks
TNN
ASTC Encoder
ASKAP
Coremark
libavif avifenc
ASKAP:
  tConvolve OpenMP - Degridding
  tConvolve OpenMP - Gridding
NAS Parallel Benchmarks
libavif avifenc:
  6
  10, Lossless
ASTC Encoder
NAS Parallel Benchmarks
LAMMPS Molecular Dynamics Simulator