Apache Spark TPC-DS

This is a benchmark of Apache Spark using the TPC-DS data-set. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks the Apache Spark in a single-system configuration and leverages the https://github.com/databricks/tpcds-kit and https://github.com/IBM/spark-tpc-ds-performance-test/ projects for testing.


Apache Spark TPC-DS 3.5

Scale Factor: 10 - Q66

OpenBenchmarking.org metrics for this test profile configuration based on 64 public results since 3 January 2024 with the latest data as of 14 April 2024.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Details
Percentile Rank
# Compatible Public Results
Seconds (Average)
Zen 4 [16 Cores / 32 Threads]
91st
6
3.20 +/- 0.11
Zen 4 [6 Cores / 12 Threads]
76th
4
3.98
Mid-Tier
75th
> 3.98
Raptor Lake [4 Cores / 8 Threads]
69th
3
4.14
Raptor Lake [14 Cores / 20 Threads]
65th
3
4.26
Zen 4 [64 Cores / 128 Threads]
60th
10
4.36 +/- 0.52
Median
50th
4.61
Zen 4 [8 Cores / 16 Threads]
49th
3
4.72 +/- 0.21
Zen 4 [96 Cores / 192 Threads]
32nd
8
5.81 +/- 0.69
Meteor Lake [16 Cores / 22 Threads]
32nd
4
5.86 +/- 0.21
Zen 2 [8 Cores / 16 Threads]
26th
4
6.11 +/- 0.13
Low-Tier
25th
> 6.22
Zen 2 [64 Cores / 128 Threads]
15th
4
7.36 +/- 0.33