Apache Spark TPC-H

This is a benchmark of Apache Spark using TPC-H data-set. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks the Apache Spark in a single-system configuration using spark-submit. The test makes use of https://github.com/ssavvides/tpch-spark/ for facilitating the TPC-H benchmark.


Apache Spark TPC-H 3.5

Scale Factor: 10 - Q07

OpenBenchmarking.org metrics for this test profile configuration based on 79 public results since 4 December 2023 with the latest data as of 18 February 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Details
Percentile Rank
# Compatible Public Results
Seconds (Average)
Raptor Lake [24 Cores / 32 Threads]
95th
5
5.5 +/- 0.1
Ice Lake [40 Cores / 80 Threads]
87th
6
5.8 +/- 0.1
Zen 4 [64 Cores / 128 Threads]
79th
4
6.3
Mid-Tier
75th
> 6.5
Zen 4 [96 Cores / 192 Threads]
75th
3
6.7 +/- 0.2
Zen 4 [16 Cores / 32 Threads]
73rd
4
6.8
Ice Lake [80 Cores / 160 Threads]
63rd
4
7.2 +/- 0.2
Sapphire Rapids [120 Cores / 240 Threads]
57th
4
8.1 +/- 0.1
Zen 4 [6 Cores / 12 Threads]
52nd
4
8.1
Median
50th
8.2
Raptor Lake [14 Cores / 20 Threads]
50th
3
8.4
Cascade Lake [18 Cores / 36 Threads]
42nd
3
10.1 +/- 0.3
Cascade Lake [16 Cores / 32 Threads]
37th
4
10.5 +/- 0.3
Zen 2 [64 Cores / 128 Threads]
33rd
4
10.6 +/- 0.1
Low-Tier
25th
> 11.8
Zen 2 [8 Cores / 16 Threads]
25th
4
11.9 +/- 0.2
Emerald Rapids [128 Cores / 256 Threads]
21st
4
13.0 +/- 1.1
Raptor Lake [4 Cores / 8 Threads]
14th
3
14.2
Zen 4 [192 Cores / 384 Threads]
12th
4
14.8 +/- 0.1
Zen 3 [8 Cores / 16 Threads]
7th
3
16.9 +/- 0.1
Ice Lake [4 Cores / 8 Threads]
3rd
4
55.2 +/- 0.5