This is a benchmark of Apache Spark with its PySpark interface. Apache Spark is an open-source unified analytics engine for large-scale data processing. This test profile benchmarks Apache Spark in a single-system configuration using spark-submit. The test makes use of DIYBigData's pyspark-benchmark (https://github.com/DIYBigData/pyspark-benchmark/) for generating test data and running various Apache Spark operations.
To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark spark.
OpenBenchmarking.org metrics for this test profile configuration are based on 412 public results since 4 August 2022, with the latest data as of 24 November 2023.
Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. Keep in mind that, particularly in the Linux/open-source space, OS configurations can differ vastly, so this overview is intended to offer only general guidance as to performance expectations.
Based on OpenBenchmarking.org data, the selected test / test configuration (Apache Spark 3.3 - Row Count: 1000000 - Partitions: 100 - Calculate Pi Benchmark) has an average run-time of 14 minutes. By default this test profile is set to run at least 3 times, but the number of runs may increase if the standard deviation exceeds pre-defined defaults or if other calculations deem additional runs necessary for greater statistical accuracy of the result.
Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.1%.
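The run-count behavior and the deviation figure described above can be sketched as follows. This is only an illustration of the general idea, not the Phoronix Test Suite's actual logic: the function names, the `threshold_percent` and `max_runs` values, and the use of the sample standard deviation relative to the mean are all assumptions made for the example.

```python
import statistics
from typing import Callable, List


def relative_std_dev_percent(samples: List[float]) -> float:
    """Sample standard deviation expressed as a percentage of the mean.

    Assumes positive measurements (e.g. run times), so the mean is non-zero.
    """
    if len(samples) < 2:
        return 0.0
    return statistics.stdev(samples) / statistics.mean(samples) * 100.0


def run_until_stable(
    run_once: Callable[[], float],
    min_runs: int = 3,               # the profile's stated minimum
    max_runs: int = 15,              # hypothetical upper bound
    threshold_percent: float = 2.5,  # hypothetical deviation limit
) -> List[float]:
    """Run at least min_runs times, then keep adding runs while the
    relative standard deviation stays above the threshold."""
    samples = [run_once() for _ in range(min_runs)]
    while (
        len(samples) < max_runs
        and relative_std_dev_percent(samples) > threshold_percent
    ):
        samples.append(run_once())
    return samples
```

Here `run_once` stands in for whatever executes one benchmark run and returns its measurement. A result set with a deviation as low as the 0.1% average quoted above would stop at the minimum of 3 runs under any reasonable threshold.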
Based on the automated analysis of the collected public benchmark data, this test / test configuration generally scales well with increasing CPU core counts. The data is based on publicly available results for this test / test configuration, separated by vendor, with each result divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU having a sufficient number of test samples and statistically significant data.
This benchmark has been successfully tested on the architectures listed below. The CPU architectures listed are those where successful OpenBenchmarking.org result uploads occurred, which helps to determine whether a given test is compatible with various alternative CPU architectures.
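The normalization steps named above (divide by clock speed, group by vendor and core count, normalize against each vendor's smallest core count) can be sketched roughly as below. The `Sample` type, the field names, and the choice of a plain mean per group are assumptions for illustration; the filtering for "a sufficient number of test samples" is omitted, and how OpenBenchmarking.org actually implements this analysis is not stated in the text.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List, NamedTuple, Tuple


class Sample(NamedTuple):
    """One uploaded result (hypothetical record layout)."""
    vendor: str
    cores: int        # physical CPU core count
    clock_ghz: float  # reference CPU clock speed
    result: float     # benchmark result in the test's own unit


def normalized_scaling(samples: List[Sample]) -> Dict[str, Dict[int, float]]:
    # Step 1: divide each result by the clock speed and group the
    # per-clock values by (vendor, core count).
    grouped: Dict[Tuple[str, int], List[float]] = defaultdict(list)
    for s in samples:
        grouped[(s.vendor, s.cores)].append(s.result / s.clock_ghz)

    # Step 2: average each group, keeping vendors separate.
    per_vendor: Dict[str, Dict[int, float]] = defaultdict(dict)
    for (vendor, cores), values in grouped.items():
        per_vendor[vendor][cores] = mean(values)

    # Step 3: normalize against the smallest core count of each vendor.
    scaled: Dict[str, Dict[int, float]] = {}
    for vendor, by_cores in per_vendor.items():
        baseline = by_cores[min(by_cores)]
        scaled[vendor] = {c: v / baseline for c, v in sorted(by_cores.items())}
    return scaled
```

Each vendor's smallest core count maps to 1.0 and every other core count is expressed relative to it. Whether a ratio below or above 1.0 indicates good scaling depends on whether the result unit is lower-is-better or higher-is-better, which this sketch deliberately leaves to the caller.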
2 Systems - 452 Benchmark Results |
ARMv8 Neoverse-N1 - GIGABYTE MP32-AR2-00 v01000100 - Ampere Computing LLC Altra PCI Root Complex A Ubuntu 22.04 - 5.15.0-89-generic - 1.3.238 |
1 System - 484 Benchmark Results |
ARMv8 Neoverse-N1 - GIGABYTE MP32-AR2-00 v01000100 - Ampere Computing LLC Device e100 Ubuntu 20.04 - 5.4.0-167-generic - 1.1.182 |
1 System - 112 Benchmark Results |
AMD Ryzen 9 5950X 16-Core - ASUS TUF GAMING B550-PLUS WIFI II - AMD Starship Fedora Linux 39 - 6.5.11-300.fc39.x86_64 - KDE Plasma 5.27.9 |
1 System - 7 Benchmark Results |
ARMv8 Neoverse-V1 - Amazon EC2 r7gd.16xlarge - 496GB Ubuntu 20.04 - 5.15.0-1036-aws - 1.1.182 |
2 Systems - 23 Benchmark Results |
ARMv8 Neoverse-N1 - Amazon EC2 r6gd.16xlarge - 512GB Ubuntu 20.04 - 5.15.0-1036-aws - 1.1.182 |
1 System - 260 Benchmark Results |
Intel Xeon Platinum 8375C - Amazon EC2 m6i.4xlarge - Intel 440FX 82441FX PMC Ubuntu 20.04 - 5.15.0-1049-aws - GNOME Shell 3.36.9 |
1 System - 251 Benchmark Results |
Intel Xeon Platinum 8358 - QEMU Standard PC - Intel 440FX 82441FX PMC Ubuntu 20.04 - 5.15.0-1042-oracle - 1.1.182 |
1 System - 256 Benchmark Results |
AMD EPYC 7J13 64-Core - QEMU Standard PC - Intel 440FX 82441FX PMC Ubuntu 20.04 - 5.15.0-1042-oracle - 1.1.182 |
1 System - 241 Benchmark Results |
Intel Xeon Platinum 8375C - Amazon EC2 m6i.4xlarge - Intel 440FX 82441FX PMC Ubuntu 20.04 - 5.15.0-1049-aws - GNOME Shell 3.36.9 |
1 System - 275 Benchmark Results |
AMD EPYC 7R13 - Amazon EC2 m6a.4xlarge - Intel 440FX 82441FX PMC Ubuntu 20.04 - 5.15.0-1048-aws - 1.1.182 |
1 System - 96 Benchmark Results |
2 x Intel Xeon Platinum 8474C - Inspur NF5280M7 - Intel Device 1bce Ubuntu 22.04 - 6.2.0-34-generic - X Server |
1 System - 7 Benchmark Results |
2 x Intel Xeon Platinum 8474C - Inspur NF5280M7 - Intel Device 1bce Ubuntu 22.04 - 6.2.0-34-generic - X Server |
1 System - 352 Benchmark Results |
2 x Intel Xeon Platinum 8452Y - Lenovo SB27A92818 v06 - 8 x 32 GB DDR5-4800MT AlmaLinux 9.2 - 5.14.0-284.30.1.el9_2.x86_64 - GCC 11.3.1 20221121 |
1 System - 115 Benchmark Results |
2 x Intel Xeon Silver 4416+ - Dell PowerEdge R760 [0NH8MJ] - Intel Device 1bce Ubuntu 18.04 - 5.4.0-150-generic - GNOME Shell 3.28.4 |
1 System - 105 Benchmark Results |
2 x Intel Xeon Silver 4416+ - Dell PowerEdge R760 [0NH8MJ] - Intel Device 1bce Ubuntu 18.04 - 5.4.0-150-generic - GNOME Shell 3.28.4 |
4 Systems - 134 Benchmark Results |
AMD Ryzen 9 5950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship Ubuntu 22.04 - 5.15.0-40-generic - GNOME Shell 42.2 |
Featured Graphics Comparison |
AMD Ryzen 5 7600X 6-Core - ASUS ROG CROSSHAIR X670E HERO - AMD Device 14d8 Ubuntu 22.04 - 6.0.0-060000rc7daily20221001-generic - GNOME Shell 42.4 |
2 Systems - 156 Benchmark Results |
2 x AMD EPYC 75F3 32-Core - ASRockRack ROME2D16-2T - AMD Starship Ubuntu 21.10 - 5.19.0-rc2-phx-mglru-v12 - GNOME Shell 40.5 |
3 Systems - 311 Benchmark Results |
Intel Core i5-12600K - ASUS PRIME Z690-P WIFI D4 - Intel Device 7aa7 Ubuntu 22.04 - 5.19.0-051900rc6daily20220716-generic - GNOME Shell 42.1 |
2 Systems - 219 Benchmark Results |
Apple M1 - Apple Mac mini - 8GB Arch Linux ARM - 5.19.0-rc7-asahi-2-1-ARCH - KDE Plasma 5.25.4 |
2 Systems - 384 Benchmark Results |
Intel Core i9-13900K - ASUS PRIME Z790-P WIFI - Intel Device 7a27 Ubuntu 22.10 - 5.19.0-23-generic - GNOME Shell 43.0 |
2 Systems - 112 Benchmark Results |
AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD Device 14b5 Ubuntu 22.04 - 5.15.0-41-generic - GNOME Shell 42.2 |
2 Systems - 218 Benchmark Results |
Intel Core i7-1280P - MSI MS-14C6 - Intel Alder Lake PCH Arch Linux - 5.18.16-arch1-1 - KDE Plasma 5.25.4 |
3 Systems - 190 Benchmark Results |
AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD Device 14b5 Arch rolling - 5.18.16-arch1-1 - KDE Plasma 5.25.4 |
2 Systems - 130 Benchmark Results |
Intel Xeon Silver 4216 - TYAN S7100AG2NR - Intel Sky Lake-E DMI3 Registers Debian 11 - 5.10.0-10-amd64 - X Server |
Featured Kernel Comparison |
AMD Ryzen Threadripper PRO 5965WX 24-Cores - ASUS Pro WS WRX80E-SAGE SE WIFI - AMD Starship Ubuntu 22.04 - 5.19.0-051900daily20220809-generic - GNOME Shell 42.2 |
2 Systems - 112 Benchmark Results |
AMD Ryzen 7 PRO 5850U - HP 8A78 - AMD Renoir Ubuntu 22.04 - 5.19.0-051900rc7-generic - GNOME Shell 42.2 |
Featured Graphics Comparison |
AMD Ryzen 9 5900HX - ASUS G513QY v1.0 - AMD Renoir Ubuntu 22.10 - 5.15.0-27-generic - GNOME Shell 42.3.1 |
2 Systems - 361 Benchmark Results |
AMD Ryzen Threadripper 3960X 24-Core - MSI Creator TRX40 - AMD Starship Ubuntu 22.04 - 5.19.0-051900rc7-generic - GNOME Shell 42.2 |
Featured Graphics Comparison |
AMD Ryzen 5 4500U - LENOVO LNVNB161216 - AMD Renoir Pop 22.04 - 5.17.5-76051705-generic - GNOME Shell 42.1 |