Tau T2A 8 16 32 vCPU Scaling

Benchmarks by Michael Larabel for a future article

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2208120-PTS-2208123N58
Result Identifier      Date Run          Test Duration
Tau T2A: 8 vCPUs       August 10 2022    1 Day, 5 Hours, 56 Minutes
Tau T2A: 16 vCPUs      August 10 2022    1 Day, 1 Hour, 45 Minutes
Tau T2A: 32 vCPUs      August 11 2022    1 Day, 6 Hours, 6 Minutes
Average                                  1 Day, 4 Hours, 35 Minutes
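
The fourth duration listed above appears to be the mean of the three run durations. The short Python sketch below checks that arithmetic; the dictionary keys and the describe() helper are illustrative names, not part of the Phoronix Test Suite.

```python
from datetime import timedelta

# Durations transcribed from the run listing above.
durations = {
    "Tau T2A: 8 vCPUs": timedelta(days=1, hours=5, minutes=56),
    "Tau T2A: 16 vCPUs": timedelta(days=1, hours=1, minutes=45),
    "Tau T2A: 32 vCPUs": timedelta(days=1, hours=6, minutes=6),
}

# Arithmetic mean of the three durations.
average = sum(durations.values(), timedelta()) / len(durations)


def describe(delta):
    """Format a timedelta as days, hours, and whole minutes (seconds dropped)."""
    total_minutes = int(delta.total_seconds() // 60)
    days, rem = divmod(total_minutes, 24 * 60)
    hours, minutes = divmod(rem, 60)
    return f"{days} Day(s), {hours} Hour(s), {minutes} Minute(s)"


print(describe(average))
```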



System Details

Common to all three Tau T2A configurations:
Motherboard:   KVM Google Compute Engine
Disk:          215GB nvme_card-pd
Network:       Google Compute Engine Virtual
OS:            Ubuntu 22.04
Compiler:      GCC 12.0.1 20220319
File-System:   ext4
System Layer:  KVM

Per configuration:
                 8 vCPUs                        16 vCPUs                       32 vCPUs
Processor        ARMv8 Neoverse-N1 (8 Cores)    ARMv8 Neoverse-N1 (16 Cores)   ARMv8 Neoverse-N1 (32 Cores)
Memory           32GB                           64GB                           128GB
Kernel           5.15.0-1013-gcp (aarch64)      5.15.0-1013-gcp (aarch64)      5.15.0-1016-gcp (aarch64)

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v

Java Details
- OpenJDK Runtime Environment (build 11.0.16+8-post-Ubuntu-0ubuntu122.04)

Python Details
- Python 3.10.4

Security Details
- Tau T2A: 8 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- Tau T2A: 16 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected
- Tau T2A: 32 vCPUs: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

Result overview table: one column per configuration (Tau T2A 8, 16, and 32 vCPUs), covering every test from aircrack-ng through vpxenc. The individual results follow below.

Aircrack-ng

Aircrack-ng is a tool for assessing WiFi/WLAN network security. Learn more via the OpenBenchmarking.org test page.

Aircrack-ng 1.7 (k/s, More Is Better)
8 vCPUs:    8308.58 (SE +/- 103.85, N = 15)
16 vCPUs:  16697.92 (SE +/- 192.97, N = 15)
32 vCPUs:  33647.55 (SE +/- 287.54, N = 15)
1. (CXX) g++ options: -std=gnu++17 -O3 -fvisibility=hidden -fcommon -rdynamic -lnl-3 -lnl-genl-3 -lpthread -lz -lssl -lcrypto -lhwloc -ldl -lm -pthread (-lpcre is additionally listed for two of the runs)
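
For a "More Is Better" throughput result such as this one, scaling can be summarized as speedup over the 8 vCPU run and as efficiency relative to ideal linear scaling. The following is a minimal Python sketch using the three Aircrack-ng values above; the function name scaling() and the choice of 8 vCPUs as the baseline are assumptions made for illustration.

```python
def scaling(results, baseline_vcpus=8):
    """Return {vcpus: (speedup, efficiency)} for a more-is-better metric.

    speedup    = value / baseline value
    efficiency = speedup / (vcpus / baseline_vcpus), so 1.0 means linear scaling
    """
    base = results[baseline_vcpus]
    summary = {}
    for vcpus, value in sorted(results.items()):
        speedup = value / base
        ideal = vcpus / baseline_vcpus
        summary[vcpus] = (speedup, speedup / ideal)
    return summary


# Aircrack-ng 1.7 results in k/s, keyed by vCPU count.
aircrack = {8: 8308.58, 16: 16697.92, 32: 33647.55}

for vcpus, (speedup, efficiency) in scaling(aircrack).items():
    print(f"{vcpus:>2} vCPUs: speedup {speedup:.2f}x, efficiency {efficiency:.1%}")
```

The same helper applies to any other throughput result in this file; for "Fewer Is Better" timings the ratio has to be inverted.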

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

Apache Cassandra 4.0, Test: Writes (Op/s, More Is Better)
8 vCPUs:   17862 (SE +/- 136.95, N = 10)
16 vCPUs:  39296 (SE +/- 256.55, N = 3)
32 vCPUs:  87819 (SE +/- 777.36, N = 3)
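
Each result is reported with a standard error (SE) and a run count (N). One rough way to read these is as a 95% confidence interval around the mean. The Python sketch below assumes the SE is the standard error of the mean and that run-to-run variation is roughly normal; neither is stated in the result file. It uses Student's t critical values because N is small, and the lookup table covers only the two sample sizes seen in the Cassandra result.

```python
# Two-sided 95% critical values of Student's t, keyed by degrees of freedom.
T_CRIT_95 = {2: 4.303, 9: 2.262}


def ci95(mean, se, n):
    """Return (low, high) of an approximate 95% confidence interval."""
    t = T_CRIT_95[n - 1]  # degrees of freedom = N - 1
    margin = t * se
    return mean - margin, mean + margin


# Apache Cassandra 4.0, Writes: (mean Op/s, SE, N) per configuration.
cassandra = {
    "8 vCPUs": (17862, 136.95, 10),
    "16 vCPUs": (39296, 256.55, 3),
    "32 vCPUs": (87819, 777.36, 3),
}

for name, (mean, se, n) in cassandra.items():
    low, high = ci95(mean, se, n)
    print(f"{name}: {low:.0f} to {high:.0f} Op/s")
```

With only three runs the interval is wide, since the t critical value at two degrees of freedom is more than double the familiar 1.96.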

Apache Spark

This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating test data and running various Apache Spark operations. Learn more via the OpenBenchmarking.org test page.

Apache Spark 3.3 (Seconds, Fewer Is Better). Each cell is the mean +/- SE with the run count N.

Row Count: 1000000 - Partitions: 100
Test                                      8 vCPUs                   16 vCPUs                  32 vCPUs
SHA-512 Benchmark Time                    6.28 +/- 0.03 (N=3)       4.93 +/- 0.05 (N=12)      4.79 +/- 0.11 (N=15)
Calculate Pi Benchmark                    277.89 +/- 0.17 (N=3)     137.76 +/- 0.11 (N=12)    69.77 +/- 0.06 (N=15)
Calculate Pi Benchmark Using Dataframe    15.92 +/- 0.02 (N=3)      8.39 +/- 0.02 (N=12)      4.79 +/- 0.01 (N=15)
Repartition Test Time                     4.58 +/- 0.02 (N=3)       2.55 +/- 0.03 (N=12)      2.01 +/- 0.03 (N=15)
Inner Join Test Time                      3.48 +/- 0.04 (N=3)       2.22 +/- 0.03 (N=12)      2.13 +/- 0.02 (N=15)
Broadcast Inner Join Test Time            2.86 +/- 0.03 (N=3)       1.79 +/- 0.04 (N=12)      1.68 +/- 0.03 (N=15)

Row Count: 1000000 - Partitions: 2000
Test                                      8 vCPUs                   16 vCPUs                  32 vCPUs
SHA-512 Benchmark Time                    8.09 +/- 0.08 (N=3)       5.91 +/- 0.05 (N=3)       4.96 +/- 0.04 (N=15)
Calculate Pi Benchmark                    278.47 +/- 0.35 (N=3)     137.17 +/- 0.20 (N=3)     69.92 +/- 0.06 (N=15)
Calculate Pi Benchmark Using Dataframe    15.81 +/- 0.07 (N=3)      8.29 +/- 0.03 (N=3)       4.80 +/- 0.01 (N=15)
Group By Test Time                        8.85 +/- 0.13 (N=3)       7.43 +/- 0.10 (N=3)       6.72 +/- 0.05 (N=15)
Repartition Test Time                     5.67 +/- 0.01 (N=3)       3.36 +/- 0.01 (N=3)       2.60 +/- 0.03 (N=15)
Inner Join Test Time                      5.98 +/- 0.09 (N=3)       3.65 +/- 0.11 (N=3)       2.87 +/- 0.04 (N=15)
Broadcast Inner Join Test Time            5.18 +/- 0.07 (N=3)       2.65 +/- 0.05 (N=3)       2.12 +/- 0.02 (N=15)

Row Count: 40000000 - Partitions: 100
Test                                      8 vCPUs                   16 vCPUs                  32 vCPUs
SHA-512 Benchmark Time                    93.67 +/- 0.85 (N=3)      51.40 +/- 0.22 (N=3)      46.30 +/- 0.45 (N=9)
Calculate Pi Benchmark                    278.37 +/- 0.14 (N=3)     136.85 +/- 0.06 (N=3)     69.57 +/- 0.08 (N=9)
Calculate Pi Benchmark Using Dataframe    15.65 +/- 0.05 (N=3)      8.40 +/- 0.01 (N=3)       4.76 +/- 0.01 (N=9)
Group By Test Time                        50.87 +/- 0.53 (N=3)      35.64 +/- 0.24 (N=3)      27.64 +/- 0.16 (N=9)
Repartition Test Time                     68.68 +/- 0.25 (N=3)      37.22 +/- 0.92 (N=3)      24.36 +/- 0.12 (N=9)
Inner Join Test Time                      80.02 +/- 0.19 (N=3)      44.62 +/- 0.72 (N=3)      30.32 +/- 0.44 (N=9)
Broadcast Inner Join Test Time            80.26 +/- 0.22 (N=3)      45.29 +/- 0.44 (N=3)      31.98 +/- 0.26 (N=9)

Row Count: 40000000 - Partitions: 2000
Test                                      8 vCPUs                   16 vCPUs                  32 vCPUs
SHA-512 Benchmark Time                    89.23 +/- 0.52 (N=3)      51.55 +/- 0.08 (N=3)      39.22 +/- 0.55 (N=12)
Calculate Pi Benchmark                    277.83 +/- 0.13 (N=3)     137.04 +/- 0.17 (N=3)     69.79 +/- 0.11 (N=12)
Calculate Pi Benchmark Using Dataframe    15.71 +/- 0.00 (N=3)      8.34 +/- 0.03 (N=3)       4.78 +/- 0.02 (N=12)
Group By Test Time                        45.61 +/- 0.57 (N=3)      30.70 +/- 0.23 (N=3)      22.84 +/- 0.32 (N=12)
Repartition Test Time                     66.27 +/- 0.36 (N=3)      35.45 +/- 0.20 (N=3)      22.22 +/- 0.24 (N=12)
Inner Join Test Time                      78.00 +/- 1.11 (N=3)      44.39 +/- 0.73 (N=3)      28.66 +/- 0.19 (N=12)
Broadcast Inner Join Test Time            74.71 +/- 0.33 (N=3)      42.75 +/- 0.40 (N=3)      26.55 +/- 0.17 (N=12)
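
The Spark timings scale very differently from test to test. One hedged way to quantify that is an Amdahl-style estimate of how much of the 8 vCPU runtime does not shrink as vCPUs are added. The Python sketch below assumes a simple model in which the runtime is a fixed part plus a part that divides perfectly by the vCPU ratio. That model is an assumption for illustration only, not something the result file claims, and the function name is hypothetical.

```python
def non_scaling_fraction(t_base, t_scaled, vcpu_ratio):
    """Estimate the share of the baseline runtime that does not scale.

    Model: t_scaled / t_base = s + (1 - s) / vcpu_ratio, solved for s.
    """
    return (t_scaled / t_base - 1 / vcpu_ratio) / (1 - 1 / vcpu_ratio)


# (8 vCPU seconds, 32 vCPU seconds) for Row Count 1000000, Partitions 100.
spark = {
    "Calculate Pi Benchmark": (277.89, 69.77),
    "SHA-512 Benchmark Time": (6.28, 4.79),
}

for test, (t8, t32) in spark.items():
    s = non_scaling_fraction(t8, t32, vcpu_ratio=32 / 8)
    print(f"{test}: speedup {t8 / t32:.2f}x, non-scaling share about {s:.0%}")
```

Under this model the Calculate Pi test is almost entirely parallel work, while the small-row-count SHA-512 test is dominated by time that extra vCPUs do not reduce.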

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve). Some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 (More Is Better). Each cell is the mean +/- SE with the run count N.

Test                           Unit                              8 vCPUs                     16 vCPUs                    32 vCPUs
tConvolve MT - Gridding        Million Grid Points Per Second    2360.63 +/- 5.73 (N=3)      3789.01 +/- 5.95 (N=3)      4456.55 +/- 35.89 (N=15)
tConvolve MT - Degridding      Million Grid Points Per Second    2196.74 +/- 7.87 (N=3)      4083.16 +/- 2.61 (N=3)      5522.07 +/- 80.56 (N=15)
tConvolve MPI - Degridding     Mpix/sec                          1325.06 +/- 24.91 (N=15)    2585.31 +/- 12.80 (N=3)     3962.08 +/- 54.84 (N=15)
tConvolve MPI - Gridding       Mpix/sec                          1977.89 +/- 23.42 (N=15)    3343.25 +/- 32.26 (N=3)     3899.28 +/- 42.99 (N=15)
tConvolve OpenMP - Gridding    Million Grid Points Per Second    2296.10 +/- 29.91 (N=3)     3631.81 +/- 43.40 (N=3)     7262.74 +/- 66.63 (N=3)
tConvolve OpenMP - Degridding  Million Grid Points Per Second    2421.43 +/- 33.24 (N=3)     5023.70 +/- 0.00 (N=3)      9181.24 +/- 0.00 (N=3)
Hogbom Clean OpenMP            Iterations Per Second             371.30 +/- 1.21 (N=3)       645.16 +/- 0.00 (N=3)       996.70 +/- 3.30 (N=3)

1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2 (Seconds, Fewer Is Better). Each cell is the mean +/- SE; N = 3 for every result.

Preset        8 vCPUs                16 vCPUs               32 vCPUs
Medium        9.0505 +/- 0.0253      6.9449 +/- 0.0194      5.9825 +/- 0.0035
Thorough      29.0505 +/- 0.0316     14.2146 +/- 0.0106     7.1619 +/- 0.0033
Exhaustive    276.88 +/- 3.08        137.62 +/- 0.04        68.66 +/- 0.08

1. (CXX) g++ options: -O3 -march=native -flto -pthread

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing is supported. This system/blender test profile makes use of the system-supplied Blender. Use pts/blender if wishing to stick to a fixed version of Blender. Learn more via the OpenBenchmarking.org test page.

Blender, Compute: CPU-Only (Seconds, Fewer Is Better). Each cell is the mean +/- SE; N = 3 for every result.

Blend File    8 vCPUs               16 vCPUs             32 vCPUs
BMW27         447.71 +/- 0.04       226.26 +/- 0.50      112.47 +/- 0.10
Classroom     1016.66 +/- 1.99      506.10 +/- 0.22      249.89 +/- 0.07
Fishy Cat     841.18 +/- 1.74       426.04 +/- 0.85      214.41 +/- 0.42

Coremark

This is a test of the EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0, CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
8 vCPUs:   175037.77 (SE +/- 85.46, N = 3)
16 vCPUs:  351562.54 (SE +/- 87.09, N = 3)
32 vCPUs:  700917.94 (SE +/- 385.56, N = 3)
1. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1, Java Test: Tradesoap (msec, Fewer Is Better)
8 vCPUs:   8040 (SE +/- 68.63, N = 4)
16 vCPUs:  5598 (SE +/- 52.52, N = 4)
32 vCPUs:  5015 (SE +/- 95.95, N = 20)

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 7.0.1 (Op/s, More Is Better). Each cell is the mean +/- SE with the run count N.

Test                        8 vCPUs                          16 vCPUs                         32 vCPUs
Random Read                 31055689 +/- 252880.06 (N=3)     62048967 +/- 735054.27 (N=3)     124704201 +/- 376574.31 (N=3)
Read While Writing          594702 +/- 9067.95 (N=15)        1264826 +/- 20446.66 (N=15)      2610992 +/- 32390.32 (N=12)
Read Random Write Random    548353 +/- 4976.00 (N=15)        884700 +/- 1701.74 (N=3)         1321827 +/- 9643.50 (N=15)

1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

GPAW

GPAW is a density-functional theory (DFT) Python code based on the projector-augmented wave (PAW) method and the atomic simulation environment (ASE). Learn more via the OpenBenchmarking.org test page.

GPAW 22.1, Input: Carbon Nanotube (Seconds, Fewer Is Better)
8 vCPUs:   381.20 (SE +/- 0.63, N = 3)
16 vCPUs:  208.97 (SE +/- 0.03, N = 3)
32 vCPUs:  130.35 (SE +/- 0.30, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -O3 -march=native -lxc -lblas -lmpi

Graph500

This is a benchmark of the reference implementation of Graph500, an HPC benchmark focused on data intensive loads and commonly tested on supercomputers for complex data problems. Graph500 primarily stresses the communication subsystem of the hardware under test. Learn more via the OpenBenchmarking.org test page.

Graph500 3.0, Scale: 26 (More Is Better)

Metric               16 vCPUs      32 vCPUs
bfs median_TEPS      257478000     477377000
bfs max_TEPS         262563000     508372000
sssp median_TEPS     70750200      124702000
sssp max_TEPS        95265500      169542000

1. (CC) gcc options: -fcommon -O3 -march=native -lpthread -lm -lmpi

Tau T2A: 8 vCPUs produced no result for any of the four metrics: The test quit with a non-zero exit status. E: mpirun noticed that process rank 2 with PID 0 on node instance-2 exited on signal 9 (Killed).

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package using the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2022.1, Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)
8 vCPUs:   0.450 (SE +/- 0.000, N = 3)
16 vCPUs:  0.880 (SE +/- 0.001, N = 3)
32 vCPUs:  1.718 (SE +/- 0.010, N = 3)
1. (CXX) g++ options: -O3 -march=native

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient, a new scientific benchmark from Sandia National Labs focused on supercomputer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
8 vCPUs:   11.10 (SE +/- 0.00, N = 3)
16 vCPUs:  17.10 (SE +/- 0.03, N = 3)
32 vCPUs:  22.09 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 23Jun2022 (ns/day, More Is Better). Each cell is the mean +/- SE; N = 3 for every result.

Model                8 vCPUs              16 vCPUs             32 vCPUs
20k Atoms            4.662 +/- 0.025      8.499 +/- 0.112      16.550 +/- 0.004
Rhodopsin Protein    4.812 +/- 0.011      8.861 +/- 0.037      16.596 +/- 0.012

1. (CXX) g++ options: -O3 -march=native -ldl

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 (Seconds, Fewer Is Better). Each cell is the mean +/- SE with the run count N.

Encoder Speed    8 vCPUs                    16 vCPUs                   32 vCPUs
0                456.24 +/- 0.90 (N=3)      328.97 +/- 0.80 (N=3)      266.34 +/- 0.65 (N=3)
2                245.60 +/- 0.32 (N=3)      194.77 +/- 0.47 (N=3)      169.64 +/- 0.13 (N=3)
6                20.273 +/- 0.130 (N=3)     11.168 +/- 0.037 (N=3)     6.682 +/- 0.020 (N=3)
6, Lossless      23.31 +/- 0.11 (N=3)       14.70 +/- 0.19 (N=3)       10.34 +/- 0.00 (N=3)
10, Lossless     9.997 +/- 0.096 (N=3)      7.658 +/- 0.065 (N=8)      6.775 +/- 0.072 (N=3)

1. (CXX) g++ options: -O3 -fPIC -march=native -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better). Each cell is the mean +/- SE with the run count N.

Test / Class    8 vCPUs                      16 vCPUs                     32 vCPUs
BT.C            14368.29 +/- 23.11 (N=3)     49125.93 +/- 18.18 (N=3)     69530.64 +/- 272.46 (N=3)
CG.C            6855.81 +/- 49.63 (N=15)     12171.95 +/- 171.15 (N=3)    21433.92 +/- 35.67 (N=3)
EP.D            820.94 +/- 0.56 (N=3)        1634.99 +/- 1.03 (N=3)       3265.68 +/- 2.04 (N=3)
FT.C            18574.23 +/- 15.96 (N=3)     32644.85 +/- 300.01 (N=3)    52309.81 +/- 41.18 (N=3)
IS.D            1104.26 +/- 1.14 (N=3)       1498.45 +/- 14.70 (N=3)      1822.77 +/- 0.86 (N=3)
LU.C            32029.14 +/- 50.76 (N=3)     55447.31 +/- 701.24 (N=3)    87702.30 +/- 137.48 (N=3)
MG.C            27703.33 +/- 46.23 (N=3)     33309.76 +/- 102.49 (N=3)    50939.05 +/- 31.40 (N=3)
SP.B            7338.98 +/- 17.11 (N=3)      19552.45 +/- 244.58 (N=3)    34381.91 +/- 38.20 (N=3)
SP.C            7115.28 +/- 39.91 (N=3)      19710.90 +/- 112.56 (N=3)    26843.58 +/- 31.60 (N=3)

1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
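
To condense the nine NPB results into a single scaling figure, one option is the geometric mean of the per-test speedups, which is the usual choice for averaging ratios. The Python sketch below uses the 8 and 32 vCPU values listed above. Treating every NPB kernel as equally weighted is an assumption of this example, and statistics.geometric_mean requires Python 3.8 or newer.

```python
from statistics import geometric_mean

# NAS Parallel Benchmarks 3.4, Total Mop/s: (8 vCPUs, 32 vCPUs) per test.
npb = {
    "BT.C": (14368.29, 69530.64),
    "CG.C": (6855.81, 21433.92),
    "EP.D": (820.94, 3265.68),
    "FT.C": (18574.23, 52309.81),
    "IS.D": (1104.26, 1822.77),
    "LU.C": (32029.14, 87702.30),
    "MG.C": (27703.33, 50939.05),
    "SP.B": (7338.98, 34381.91),
    "SP.C": (7115.28, 26843.58),
}

# Speedup of the 32 vCPU run over the 8 vCPU run for each test.
speedups = {test: v32 / v8 for test, (v8, v32) in npb.items()}

for test, speedup in speedups.items():
    print(f"{test}: {speedup:.2f}x")

overall = geometric_mean(speedups.values())
print(f"Geometric mean speedup, 8 to 32 vCPUs: {overall:.2f}x")
```

The per-test figures matter as much as the summary: the spread between the best and worst scaling kernels is wide, and a single mean hides it.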

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenFOAM 9 - Input: drivaerFastback, Medium Mesh Size - Mesh Time
Seconds, Fewer Is Better
  8 vCPUs: 425.95
  16 vCPUs: 303.71
  32 vCPUs: 206.40

OpenFOAM 9 - Input: drivaerFastback, Medium Mesh Size - Execution Time
Seconds, Fewer Is Better
  8 vCPUs: 2426.16
  16 vCPUs: 1534.72
  32 vCPUs: 994.53

Additional linker flags listed per result, in the order shown:
  -ltransportModels -lspecie -lfiniteVolume -lfvModels -lmeshTools -lsampling
  -lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats
  -lfoamToVTK -ldynamicMesh -llagrangian -lfileFormats
1. (CXX) g++ options: -std=c++14 -O3 -mcpu=native -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.0 - Algorithm: SHA256
byte/s, More Is Better
  8 vCPUs: 6456083507 (SE +/- 19026629.44, N = 3)
  16 vCPUs: 12926411527 (SE +/- 19283388.31, N = 3)
  32 vCPUs: 25788919913 (SE +/- 119493320.18, N = 3)

OpenSSL 3.0 - Algorithm: RSA4096
sign/s, More Is Better
  8 vCPUs: 393.7 (SE +/- 0.07, N = 3)
  16 vCPUs: 786.7 (SE +/- 0.06, N = 3)
  32 vCPUs: 1570.2 (SE +/- 0.06, N = 3)

OpenSSL 3.0 - Algorithm: RSA4096
verify/s, More Is Better
  8 vCPUs: 32136.8 (SE +/- 10.26, N = 3)
  16 vCPUs: 64247.0 (SE +/- 8.35, N = 3)
  32 vCPUs: 128273.1 (SE +/- 29.86, N = 3)

1. (CC) gcc options: -pthread -O3 -march=native -lssl -lcrypto -ldl

PostgreSQL pgbench

This is a benchmark of PostgreSQL that uses pgbench to drive the database workloads. Learn more via the OpenBenchmarking.org test page.

PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 100 - Mode: Read Only
TPS, More Is Better
  8 vCPUs: 54237 (SE +/- 663.61, N = 3)
  16 vCPUs: 157894 (SE +/- 697.61, N = 3)
  32 vCPUs: 329539 (SE +/- 1811.74, N = 3)

PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency
ms, Fewer Is Better
  8 vCPUs: 1.844 (SE +/- 0.023, N = 3)
  16 vCPUs: 0.633 (SE +/- 0.003, N = 3)
  32 vCPUs: 0.304 (SE +/- 0.002, N = 3)

PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only
TPS, More Is Better
  8 vCPUs: 49628 (SE +/- 588.20, N = 12)
  16 vCPUs: 131607 (SE +/- 1418.81, N = 3)
  32 vCPUs: 312239 (SE +/- 4561.68, N = 12)

PostgreSQL pgbench 14.0 - Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency
ms, Fewer Is Better
  8 vCPUs: 5.045 (SE +/- 0.060, N = 12)
  16 vCPUs: 1.900 (SE +/- 0.021, N = 3)
  32 vCPUs: 0.803 (SE +/- 0.012, N = 12)

1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lm
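
The pgbench throughput and latency figures above appear to be two views of the same measurement. As a rough cross-check, assuming every client always has one query in flight, Little's law gives an average latency of about clients / TPS. The sketch below applies that to the reported numbers; the helper name `expected_latency_ms` is illustrative and not part of pgbench or the Phoronix Test Suite.

```python
# Cross-check pgbench average latency against throughput.
# Assumption: each client always has exactly one query in flight, so
# average latency is roughly clients / TPS (Little's law).
# All figures are copied from the pgbench results above.

def expected_latency_ms(clients: int, tps: float) -> float:
    """Average latency in ms implied by `clients` busy clients at `tps`."""
    return clients / tps * 1000.0

# (clients, vCPUs) -> (reported TPS, reported average latency in ms)
REPORTED = {
    (100, 8): (54237, 1.844),
    (100, 16): (157894, 0.633),
    (100, 32): (329539, 0.304),
    (250, 8): (49628, 5.045),
    (250, 16): (131607, 1.900),
    (250, 32): (312239, 0.803),
}

for (clients, vcpus), (tps, latency) in REPORTED.items():
    estimate = expected_latency_ms(clients, tps)
    print(f"{clients} clients, {vcpus} vCPUs: "
          f"estimated {estimate:.3f} ms, reported {latency:.3f} ms")
```

Small differences between the estimate and the reported latency are expected, since each published figure is an average over several runs.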

Renaissance

Renaissance is a suite of benchmarks designed to test the JVM with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

Renaissance 0.14 - Test: Apache Spark Bayes
ms, Fewer Is Better
  8 vCPUs: 2249.8 (SE +/- 42.77, N = 15; MIN: 1478.18 / MAX: 2434.18)
  16 vCPUs: 1262.0 (SE +/- 7.45, N = 3; MIN: 877.37 / MAX: 1398.23)
  32 vCPUs: 766.4 (SE +/- 9.73, N = 3; MIN: 495.95 / MAX: 1178.88)

Renaissance 0.14 - Test: Savina Reactors.IO
ms, Fewer Is Better
  8 vCPUs: 26456.4 (SE +/- 435.60, N = 9; MIN: 13667.16 / MAX: 42318.14)
  16 vCPUs: 15981.5 (SE +/- 583.25, N = 12; MIN: 12776.53 / MAX: 36273.51)
  32 vCPUs: 10705.9 (SE +/- 131.70, N = 4; MIN: 10505.49 / MAX: 14847.21)

SPECjbb 2015

This is a benchmark of SPECjbb 2015. For this test profile to work, you must have a valid license/copy of the SPECjbb 2015 ISO (SPECjbb2015-1.02.iso) in your Phoronix Test Suite download cache. Learn more via the OpenBenchmarking.org test page.

SPECjbb 2015 - SPECjbb2015-Composite max-jOPS
jOPS, More Is Better
  8 vCPUs: 9158
  16 vCPUs: 18092
  32 vCPUs: 35075

SPECjbb 2015 - SPECjbb2015-Composite critical-jOPS
jOPS, More Is Better
  8 vCPUs: 3921
  16 vCPUs: 9207
  32 vCPUs: 22955

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 - Test: Futex
Bogo Ops/s, More Is Better
  8 vCPUs: 937451.99 (SE +/- 36001.77, N = 15)
  16 vCPUs: 1198681.87 (SE +/- 30917.20, N = 15)
  32 vCPUs: 1437660.62 (SE +/- 15026.23, N = 3)

Stress-NG 0.14 - Test: CPU Cache
Bogo Ops/s, More Is Better
  8 vCPUs: 436.31 (SE +/- 2.30, N = 3)
  16 vCPUs: 551.25 (SE +/- 2.05, N = 3)
  32 vCPUs: 566.91 (SE +/- 0.28, N = 3)

Stress-NG 0.14 - Test: CPU Stress
Bogo Ops/s, More Is Better
  8 vCPUs: 2065.53 (SE +/- 1.28, N = 3)
  16 vCPUs: 4116.96 (SE +/- 2.80, N = 3)
  32 vCPUs: 8209.47 (SE +/- 4.23, N = 3)

Stress-NG 0.14 - Test: Matrix Math
Bogo Ops/s, More Is Better
  8 vCPUs: 38215.95 (SE +/- 25.04, N = 3)
  16 vCPUs: 76177.56 (SE +/- 10.44, N = 3)
  32 vCPUs: 151792.83 (SE +/- 9.80, N = 3)

Stress-NG 0.14 - Test: Vector Math
Bogo Ops/s, More Is Better
  8 vCPUs: 24633.99 (SE +/- 6.31, N = 3)
  16 vCPUs: 49102.30 (SE +/- 27.43, N = 3)
  32 vCPUs: 97749.08 (SE +/- 190.70, N = 3)

Stress-NG 0.14 - Test: System V Message Passing
Bogo Ops/s, More Is Better
  8 vCPUs: 4507844.17 (SE +/- 12538.93, N = 3)
  16 vCPUs: 5475267.36 (SE +/- 15929.45, N = 3)
  32 vCPUs: 6128517.10 (SE +/- 7551.56, N = 3)

1. (CC) gcc options: -O3 -march=native -O2 -std=gnu99 -lm -laio -lapparmor -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

Sysbench 1.0.20 - Test: CPU
Events Per Second, More Is Better
  8 vCPUs: 27237.28 (SE +/- 6.95, N = 3)
  16 vCPUs: 54317.42 (SE +/- 12.70, N = 3)
  32 vCPUs: 108241.61 (SE +/- 23.77, N = 3)
1. (CC) gcc options: -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm
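
Since this result file is about vCPU scaling, a throughput result such as the Sysbench CPU test can be summarized as a scaling efficiency: the measured speedup over the 8 vCPU instance divided by the ideal speedup from the added vCPUs. The following is a minimal sketch using the Sysbench figures above; the function name `scaling_efficiency` is an illustrative assumption and not something the Phoronix Test Suite provides.

```python
# Scaling efficiency relative to the smallest instance.
# Figures are the Sysbench CPU events per second reported above.

def scaling_efficiency(base_score: float, base_vcpus: int,
                       score: float, vcpus: int) -> float:
    """Measured speedup divided by ideal (linear) speedup; 1.0 is perfect scaling."""
    speedup = score / base_score
    ideal = vcpus / base_vcpus
    return speedup / ideal

SYSBENCH_CPU = {8: 27237.28, 16: 54317.42, 32: 108241.61}

for vcpus in (16, 32):
    eff = scaling_efficiency(SYSBENCH_CPU[8], 8, SYSBENCH_CPU[vcpus], vcpus)
    print(f"{vcpus} vCPUs: {eff:.1%} of linear scaling from 8 vCPUs")
```

The same helper applies to any "More Is Better" result in this file; for "Fewer Is Better" results the ratio would need to be inverted first.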

TensorFlow Lite

This is a benchmark of TensorFlow Lite, the TensorFlow implementation for machine learning on mobile, IoT, edge, and similar devices. The current Linux support is limited to running on CPUs. This test profile measures the average inference time. Learn more via the OpenBenchmarking.org test page.

TensorFlow Lite 2022-05-18 - Model: SqueezeNet
Microseconds, Fewer Is Better
  8 vCPUs: 6618.32 (SE +/- 9.96, N = 3)
  16 vCPUs: 3955.89 (SE +/- 11.05, N = 3)
  32 vCPUs: 3853.90 (SE +/- 31.57, N = 8)

TensorFlow Lite 2022-05-18 - Model: Inception V4
Microseconds, Fewer Is Better
  8 vCPUs: 97646.1 (SE +/- 49.87, N = 3)
  16 vCPUs: 46113.4 (SE +/- 84.24, N = 3)
  32 vCPUs: 31657.3 (SE +/- 149.01, N = 3)

TensorFlow Lite 2022-05-18 - Model: Mobilenet Float
Microseconds, Fewer Is Better
  8 vCPUs: 4395.73 (SE +/- 3.06, N = 3)
  16 vCPUs: 2481.96 (SE +/- 4.32, N = 3)
  32 vCPUs: 2093.25 (SE +/- 17.55, N = 3)

TensorFlow Lite 2022-05-18 - Model: Inception ResNet V2
Microseconds, Fewer Is Better
  8 vCPUs: 91592.6 (SE +/- 70.02, N = 3)
  16 vCPUs: 45445.9 (SE +/- 16.18, N = 3)
  32 vCPUs: 33994.9 (SE +/- 379.42, N = 3)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.4 - Time To Compile
Seconds, Fewer Is Better
  8 vCPUs: 113.46 (SE +/- 0.18, N = 3)
  16 vCPUs: 61.98 (SE +/- 0.09, N = 3)
  32 vCPUs: 38.96 (SE +/- 0.16, N = 3)

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research that is widely used in industry, academia, and elsewhere. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation 21.2 - Time To Compile
Seconds, Fewer Is Better
  8 vCPUs: 917.39 (SE +/- 0.66, N = 3)
  16 vCPUs: 495.54 (SE +/- 1.16, N = 3)
  32 vCPUs: 312.12 (SE +/- 1.96, N = 3)

Timed MPlayer Compilation

This test times how long it takes to build the MPlayer open-source media player program. Learn more via the OpenBenchmarking.org test page.

Timed MPlayer Compilation 1.5 - Time To Compile
Seconds, Fewer Is Better
  8 vCPUs: 88.84 (SE +/- 0.03, N = 3)
  16 vCPUs: 47.62 (SE +/- 0.26, N = 3)
  32 vCPUs: 28.93 (SE +/- 0.35, N = 4)
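
None of the three timed compilation tests above gets four times faster when going from 8 to 32 vCPUs. One way to put a number on that, assuming a simple Amdahl's law model, is to estimate the fraction of the 8 vCPU build time that does not speed up with more cores (a Karp-Flatt style metric, here taken relative to the 8 vCPU baseline rather than a single core). The sketch below is illustrative only: the function name is an assumption, and the estimate lumps together serial build steps, I/O, and any other contention.

```python
# Estimate the non-scaling share of build time under an Amdahl's law model.
# Model: T(k) = T_base * (f + (1 - f) / k), where k is the core ratio and
# f is the fraction of baseline time that does not benefit from more cores.
# Solving for f gives f = (1/S - 1/k) / (1 - 1/k), with S = T_base / T(k).

def non_scaling_fraction(time_base: float, time_scaled: float,
                         core_ratio: float) -> float:
    """Fraction of the baseline runtime that did not speed up."""
    speedup = time_base / time_scaled
    return (1.0 / speedup - 1.0 / core_ratio) / (1.0 - 1.0 / core_ratio)

# Build -> (seconds on 8 vCPUs, seconds on 32 vCPUs), from the results above.
COMPILE_SECONDS = {
    "FFmpeg 4.4": (113.46, 38.96),
    "Gem5 21.2": (917.39, 312.12),
    "MPlayer 1.5": (88.84, 28.93),
}

FRACTIONS = {
    name: non_scaling_fraction(t8, t32, 32 / 8)
    for name, (t8, t32) in COMPILE_SECONDS.items()
}

for name, fraction in FRACTIONS.items():
    print(f"{name}: about {fraction:.1%} of the 8 vCPU build time does not scale")
```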

TNN

TNN is an open-source deep learning inference framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.3 - Target: CPU - Model: DenseNet
ms, Fewer Is Better
  8 vCPUs: 3842.12 (SE +/- 9.75, N = 3; MIN: 3619.38 / MAX: 4060.16)
  16 vCPUs: 3358.73 (SE +/- 12.43, N = 3; MIN: 3163.2 / MAX: 3575.85)
  32 vCPUs: 3056.90 (SE +/- 6.90, N = 3; MIN: 2928.19 / MAX: 3237.58)

TNN 0.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better
  8 vCPUs: 331.34 (SE +/- 0.78, N = 3; MIN: 327.36 / MAX: 339.94)
  16 vCPUs: 328.89 (SE +/- 1.36, N = 3; MIN: 322.15 / MAX: 373.8)
  32 vCPUs: 322.77 (SE +/- 0.05, N = 3; MIN: 319.63 / MAX: 326.43)

1. (CXX) g++ options: -O3 -march=native -fopenmp -pthread -fvisibility=hidden -fvisibility=default -rdynamic -ldl
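
When two results are as close as the TNN MobileNet v2 numbers above, the reported standard errors help judge whether the gap is more than run-to-run noise. A rough check, assuming independent runs and an approximately normal error, is to divide the difference of the means by the combined standard error. This is only a sketch: with N = 3 the approximation is coarse, and the function name `z_score` is an illustrative assumption.

```python
import math

# Rough significance check for two benchmark means using their standard errors.
# Assumption: runs are independent and errors are approximately normal,
# which is only a coarse approximation at N = 3.

def z_score(mean_a: float, se_a: float, mean_b: float, se_b: float) -> float:
    """Absolute difference of means in units of the combined standard error."""
    return abs(mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)

# vCPUs -> (mean ms, standard error), from the TNN MobileNet v2 result above.
MOBILENET_V2_MS = {8: (331.34, 0.78), 16: (328.89, 1.36), 32: (322.77, 0.05)}

z_8_vs_16 = z_score(*MOBILENET_V2_MS[8], *MOBILENET_V2_MS[16])
z_16_vs_32 = z_score(*MOBILENET_V2_MS[16], *MOBILENET_V2_MS[32])

print(f"8 vs 16 vCPUs: {z_8_vs_16:.2f} combined standard errors apart")
print(f"16 vs 32 vCPUs: {z_16_vs_32:.2f} combined standard errors apart")
```

A value below roughly 2 suggests the two results are hard to distinguish from noise; larger values suggest a real, if small, difference.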

VP9 libvpx Encoding

This is a standard video encoding performance test of Google's libvpx library and the vpxenc command for the VP9 video format. Learn more via the OpenBenchmarking.org test page.

VP9 libvpx Encoding 1.10.0 - Speed: Speed 5 - Input: Bosphorus 4K
Frames Per Second, More Is Better
  8 vCPUs: 6.11 (SE +/- 0.01, N = 3)
  16 vCPUs: 6.68 (SE +/- 0.01, N = 3)
  32 vCPUs: 6.99 (SE +/- 0.02, N = 3)

VP9 libvpx Encoding 1.10.0 - Speed: Speed 0 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
  8 vCPUs: 4.65 (SE +/- 0.01, N = 3)
  16 vCPUs: 4.84 (SE +/- 0.01, N = 3)
  32 vCPUs: 4.99 (SE +/- 0.01, N = 3)

VP9 libvpx Encoding 1.10.0 - Speed: Speed 5 - Input: Bosphorus 1080p
Frames Per Second, More Is Better
  8 vCPUs: 11.27 (SE +/- 0.01, N = 3)
  16 vCPUs: 11.78 (SE +/- 0.02, N = 3)
  32 vCPUs: 12.11 (SE +/- 0.01, N = 3)

1. (CXX) g++ options: -lm -lpthread -O3 -march=native -march=armv8-a -fPIC -U_FORTIFY_SOURCE -std=gnu++11

102 Results Shown

Aircrack-ng
Apache Cassandra
Apache Spark:
  1000000 - 100 - SHA-512 Benchmark Time
  1000000 - 100 - Calculate Pi Benchmark
  1000000 - 100 - Calculate Pi Benchmark Using Dataframe
  1000000 - 100 - Repartition Test Time
  1000000 - 100 - Inner Join Test Time
  1000000 - 100 - Broadcast Inner Join Test Time
  1000000 - 2000 - SHA-512 Benchmark Time
  1000000 - 2000 - Calculate Pi Benchmark
  1000000 - 2000 - Calculate Pi Benchmark Using Dataframe
  1000000 - 2000 - Group By Test Time
  1000000 - 2000 - Repartition Test Time
  1000000 - 2000 - Inner Join Test Time
  1000000 - 2000 - Broadcast Inner Join Test Time
  40000000 - 100 - SHA-512 Benchmark Time
  40000000 - 100 - Calculate Pi Benchmark
  40000000 - 100 - Calculate Pi Benchmark Using Dataframe
  40000000 - 100 - Group By Test Time
  40000000 - 100 - Repartition Test Time
  40000000 - 100 - Inner Join Test Time
  40000000 - 100 - Broadcast Inner Join Test Time
  40000000 - 2000 - SHA-512 Benchmark Time
  40000000 - 2000 - Calculate Pi Benchmark
  40000000 - 2000 - Calculate Pi Benchmark Using Dataframe
  40000000 - 2000 - Group By Test Time
  40000000 - 2000 - Repartition Test Time
  40000000 - 2000 - Inner Join Test Time
  40000000 - 2000 - Broadcast Inner Join Test Time
ASKAP:
  tConvolve MT - Gridding
  tConvolve MT - Degridding
  tConvolve MPI - Degridding
  tConvolve MPI - Gridding
  tConvolve OpenMP - Gridding
  tConvolve OpenMP - Degridding
  Hogbom Clean OpenMP
ASTC Encoder:
  Medium
  Thorough
  Exhaustive
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
Coremark
DaCapo Benchmark
Facebook RocksDB:
  Rand Read
  Read While Writing
  Read Rand Write Rand
GPAW
Graph500:
  26:
    bfs median_TEPS
    bfs max_TEPS
    sssp median_TEPS
    sssp max_TEPS
GROMACS
High Performance Conjugate Gradient
LAMMPS Molecular Dynamics Simulator:
  20k Atoms
  Rhodopsin Protein
libavif avifenc:
  0
  2
  6
  6, Lossless
  10, Lossless
NAS Parallel Benchmarks:
  BT.C
  CG.C
  EP.D
  FT.C
  IS.D
  LU.C
  MG.C
  SP.B
  SP.C
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
OpenSSL:
  SHA256
  RSA4096 (sign/s)
  RSA4096 (verify/s)
PostgreSQL pgbench:
  100 - 100 - Read Only
  100 - 100 - Read Only - Average Latency
  100 - 250 - Read Only
  100 - 250 - Read Only - Average Latency
Renaissance:
  Apache Spark Bayes
  Savina Reactors.IO
SPECjbb 2015:
  SPECjbb2015-Composite max-jOPS
  SPECjbb2015-Composite critical-jOPS
Stress-NG:
  Futex
  CPU Cache
  CPU Stress
  Matrix Math
  Vector Math
  System V Message Passing
Sysbench
TensorFlow Lite:
  SqueezeNet
  Inception V4
  Mobilenet Float
  Inception ResNet V2
Timed FFmpeg Compilation
Timed Gem5 Compilation
Timed MPlayer Compilation
TNN:
  CPU - DenseNet
  CPU - MobileNet v2
VP9 libvpx Encoding:
  Speed 5 - Bosphorus 4K
  Speed 0 - Bosphorus 1080p
  Speed 5 - Bosphorus 1080p