CloudSuite Data Analytics

CloudSuite Data Analytics is a Docker-based benchmark and runs a Naive Bayes classifier on a Wikimedia dataset with Hadoop and Mahout.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark cloudsuite-da.

Project Site

github.com

Source Repository

github.com

Test Created

1 November 2019

Last Updated

3 November 2022

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

1 Minute, 22 Seconds

Average Run Time

23 Minutes, 5 Seconds

Accolades

10k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page Views ***OpenBenchmarking.orgEventsCloudSuite Data Analytics Popularity Statisticspts/cloudsuite-da2019.112019.122020.012020.022020.032020.042020.052020.062020.072020.082020.102020.112020.122021.022021.032021.042021.052021.062021.072021.082021.092021.102021.112021.122022.012022.022022.032022.042022.052022.062022.072022.082022.092022.102022.11400800120016002000
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data current as of 1 December 2022.
828.6%428.6%128.6%3214.3%Hadoop Slaves Option PopularityOpenBenchmarking.org

Revision History

pts/cloudsuite-da-1.1.0   [View Source]   Thu, 03 Nov 2022 17:52:22 GMT
Update Docker locations, allow Hadoop slave count configuration. This test though doesn't seem too useful/good...

pts/cloudsuite-da-1.0.0   [View Source]   Fri, 01 Nov 2019 17:55:39 GMT
Initial commit of CloudSuite Data Analytics benchmark.


Performance Metrics

Analyze Test Configuration:

CloudSuite Data Analytics

Hadoop Slaves: 8

OpenBenchmarking.org metrics for this test profile configuration based on 20 public results since 3 November 2022 with the latest data as of 4 November 2022.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
ms (Average)
96th
3
553651 +/- 2615
76th
3
566372 +/- 1662
Mid-Tier
75th
> 568198
56th
3
827596 +/- 834
Median
50th
831411
Low-Tier
25th
> 902999
16th
4
905892 +/- 5240
11th
3
921784 +/- 45505
OpenBenchmarking.orgDistribution Of Public Results - Hadoop Slaves: 820 Results Range From 550884 To 957991 ms550884559027567170575313583456591599599742607885616028624171632314640457648600656743664886673029681172689315697458705601713744721887730030738173746316754459762602770745778888787031795174803317811460819603827746835889844032852175860318868461876604884747892890901033909176917319925462933605941748949891958034246810

Based on OpenBenchmarking.org data, the selected test / test configuration (CloudSuite Data Analytics - Hadoop Slaves: 8) has an average run-time of 44 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkHadoop Slaves: 8Run-Time1122334455Min: 33 / Avg: 42.88 / Max: 53

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsHadoop Slaves: 8Deviation246810Min: 0 / Avg: 0.16 / Max: 1

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 6 Benchmark Results

AMD Ryzen 9 3900X 12-Core - ASUS ROG STRIX X570-E GAMING - AMD Starship

Ubuntu 20.04 - 5.4.0-132-generic - NVIDIA

4 Systems - 8 Benchmark Results

2 x Intel Xeon Platinum 8380 - Intel M50CYP2SB2U - Intel Ice Lake IEH

Ubuntu 22.10 - 6.0.0-060000rc3daily20220904-generic - GNOME Shell

3 Systems - 7 Benchmark Results

Intel Core i9-13900K - ASUS PRIME Z790-P WIFI - Intel Device 7a27

Ubuntu 22.10 - 5.19.0-23-generic - GNOME Shell 43.0

3 Systems - 9 Benchmark Results

AMD Ryzen Threadripper 3990X 64-Core - Gigabyte TRX40 AORUS PRO WIFI - AMD Starship

Ubuntu 22.10 - 6.0.0-060000rc7daily20220927-generic - GNOME Shell 43.0

3 Systems - 4 Benchmark Results

AMD Ryzen 7 7700X 8-Core - ASRock X670E PG Lightning - AMD Device 14d8

Ubuntu 22.10 - 5.19.0-23-generic - GNOME Shell 43.0

3 Systems - 9 Benchmark Results

AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD Device 14b5

Ubuntu 22.10 - 6.1.0-060100rc2daily20221028-generic - GNOME Shell 43.0

3 Systems - 6 Benchmark Results

AMD Ryzen 9 5900HX - ASUS G513QY v1.0 - AMD Renoir

Ubuntu 22.10 - 5.19.0-23-generic - GNOME Shell 43.0

Find More Test Results