Apache Hadoop

This is a benchmark of the Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark).

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark hadoop.

Project Site

hadoop.apache.org

Source Repository

github.com

Test Created

7 September 2023

Test Maintainer

Michael Larabel 

Test Type

System

Average Install Time

13 Seconds

Average Run Time

19 Minutes, 28 Seconds

Test Dependencies

Java

Accolades

10k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page ViewsOpenBenchmarking.orgEventsApache Hadoop Popularity Statisticspts/hadoop2023.092023.102023.112023.122024.012024.022024.032024.042024.052024.062024.072024.082024.092024.102024.1110002000300040005000
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 22 November 2024.
Rename19.0%Open21.7%Delete20.3%File Status14.2%Create24.7%Operation Option PopularityOpenBenchmarking.org
50010.7%2029.3%5032.6%10027.4%Threads Option PopularityOpenBenchmarking.org
100000037.7%100000007.8%10000054.5%Files Option PopularityOpenBenchmarking.org

Revision History

pts/hadoop-1.0.0   [View Source]   Thu, 07 Sep 2023 14:37:09 GMT
Initial commit of Hadoop benchmark.

Suites Using This Test

Database Test Suite

Server


Performance Metrics

Analyze Test Configuration:

Apache Hadoop 3.3.6

Operation: Create - Threads: 20 - Files: 100000

OpenBenchmarking.org metrics for this test profile configuration based on 94 public results since 7 September 2023 with the latest data as of 13 July 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
Ops per sec (Average)
99th
4
73882 +/- 1150
86th
10
64007 +/- 7202
80th
4
56025 +/- 435
Mid-Tier
75th
< 55036
74th
6
53973 +/- 1164
53rd
6
39262 +/- 178
Median
50th
39124
43rd
4
30488 +/- 85
36th
3
27927 +/- 1670
26th
4
18966 +/- 187
Low-Tier
25th
< 18804
17th
3
7733 +/- 672
OpenBenchmarking.orgDistribution Of Public Results - Operation: Create - Threads: 20 - Files: 10000094 Results Range From 3567 To 75327 Ops per sec3567500364397875931110747121831361915055164911792719363207992223523671251072654327979294153085132287337233515936595380313946740903423394377545211466474808349519509555239153827552635669958135595716100762443638796531566751681876962371059724957393175367246810

Based on OpenBenchmarking.org data, the selected test / test configuration (Apache Hadoop 3.3.6 - Operation: Create - Threads: 20 - Files: 100000) has an average run-time of 4 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkOperation: Create - Threads: 20 - Files: 100000Run-Time3691215Min: 1 / Avg: 3.35 / Max: 10

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsOperation: Create - Threads: 20 - Files: 100000Deviation246810Min: 0 / Avg: 0.2 / Max: 2

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 242 Benchmark Results

2 x QEMU Virtual 2.5+ - QEMU Standard PC - 48GB

Alpine Linux v3.20 3.20.2 - 6.6.44-0-virt - GCC 13.2.1 20240309

1 System - 389 Benchmark Results

AMD Ryzen Threadripper 7970X 32-Cores - ASRock TRX50 WS - AMD Device 14a4

Debian 12 - 6.1.0-22-amd64 - GNOME Shell 43.9

Find More Test Results