HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing.

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark heffte.

Project Site

icl.utk.edu

Source Repository

github.com

Test Created

18 June 2023

Last Updated

27 October 2023

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

49 Seconds

Average Run Time

1 Minute, 9 Seconds

Test Dependencies

C/C++ Compiler Toolchain + Fortran + OpenMPI + CMake + FFTW

Accolades

10k+ Downloads

Supported Platforms


Public Result Uploads *Reported Installs **Reported Test Completions **Test Profile Page ViewsOpenBenchmarking.orgEventsHeFFTe - Highly Efficient FFT for Exascale Popularity Statisticspts/heffte2023.062023.072023.082023.092023.102023.112023.122024.012024.022024.032024.048001600240032004000
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
Data updated weekly as of 21 April 2024.
c2c49.5%r2c50.5%Test Option PopularityOpenBenchmarking.org
Stock48.7%FFTW51.3%Backend Option PopularityOpenBenchmarking.org
double25.6%float27.6%double-long23.0%float-long23.8%Precision Option PopularityOpenBenchmarking.org
12832.1%25631.6%10247.1%51229.3%X Y Z Option PopularityOpenBenchmarking.org

Revision History

pts/heffte-1.1.0   [View Source]   Fri, 27 Oct 2023 15:16:02 GMT
Update against HeFFTe 2.4 upstream.

pts/heffte-1.0.0   [View Source]   Sun, 18 Jun 2023 09:51:50 GMT
Initial commit of HeFFTe benchmark.


Performance Metrics

Analyze Test Configuration:

HeFFTe - Highly Efficient FFT for Exascale 2.4

Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128

OpenBenchmarking.org metrics for this test profile configuration based on 73 public results since 27 October 2023 with the latest data as of 26 November 2023.

Below is an overview of the generalized performance for components where there is sufficient statistically significant data based upon user-uploaded results. It is important to keep in mind particularly in the Linux/open-source space there can be vastly different OS configurations, with this overview intended to offer just general guidance as to the performance expectations.

Component
Percentile Rank
# Compatible Public Results
GFLOP/s (Average)
Mid-Tier
75th
< 119
70th
8
118 +/- 1
Median
50th
87
Low-Tier
25th
< 26
18th
5
12 +/- 1
OpenBenchmarking.orgDistribution Of Public Results - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 12873 Results Range From 4 To 235 GFLOP/s419344964799410912413915416918419921422924448121620

Based on OpenBenchmarking.org data, the selected test / test configuration (HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128) has an average run-time of 2 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.

OpenBenchmarking.orgMinutesTime Required To Complete BenchmarkTest: c2c - Backend: FFTW - Precision: float - X Y Z: 128Run-Time246810Min: 1 / Avg: 1 / Max: 1

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 0.2%.

OpenBenchmarking.orgPercent, Fewer Is BetterAverage Deviation Between RunsTest: c2c - Backend: FFTW - Precision: float - X Y Z: 128Deviation246810Min: 0 / Avg: 0.21 / Max: 2

Tested CPU Architectures

This benchmark has been successfully tested on the below mentioned architectures. The CPU architectures listed is where successful OpenBenchmarking.org result uploads occurred, namely for helping to determine if a given test is compatible with various alternative CPU architectures.

CPU Architecture
Kernel Identifier
Verified On
Intel / AMD x86 64-bit
x86_64
(Many Processors)
Loongson LoongArch 64-bit
loongarch64
Loongson-3A6000

Recent Test Results

OpenBenchmarking.org Results Compare

1 System - 1 Benchmark Result

2 x Intel Xeon E5-2699 v4 - Supermicro X10DRL-i v1.01 - Intel Xeon E7 v4

Ubuntu 22.04 - 6.5.0-21-generic - X Server 1.21.1.4

1 System - 494 Benchmark Results

1 System - 4 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

1 System - 7 Benchmark Results

Loongson-3A6000 - Loongson Loongson-LS3A6000-7A2000-1w-EVB-V1.21 - Loongson LLC Hyper Transport Bridge

Loongnix 20 - 4.19.0-19-loongson-3 - X Server 1.20.4

1 System - 64 Benchmark Results

AMD EPYC 7R13 48-Core - Supermicro H12SSL-I v1.02 - AMD Starship

EndeavourOS rolling - 6.6.1-zen1-1-zen - Xfce 4.18

3 Systems - 97 Benchmark Results

2 x AMD EPYC 9684X 96-Core - AMD Titanite_4G - AMD Device 14a4

Ubuntu 23.10 - 6.6.0-rc5-phx-patched - GNOME Shell 45.0

2 Systems - 142 Benchmark Results

Intel Core i9-10980XE - ASRock X299 Steel Legend - Intel Sky Lake-E DMI3 Registers

Ubuntu 22.04 - 6.2.0-33-generic - GNOME Shell 42.2

14 Systems - 159 Benchmark Results

2 x Intel Xeon Platinum 8490H - Quanta Cloud S6Q-MB-MPS - Intel Device 1bce

Ubuntu 23.10 - 6.6.0-rc5-phx-patched - GNOME Shell 45.0

2 Systems - 123 Benchmark Results

AMD Ryzen 7 7840U - PHX Ray_PEU - AMD Device 14e8

Ubuntu 23.10 - 6.5.0-with-patch2 - GNOME Shell 45.0

2 Systems - 84 Benchmark Results

AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD 17h-19h PCIe Root Complex

Fedora Linux 39 - 6.5.7-300.fc39.x86_64 - GNOME Shell 45.0

2 Systems - 125 Benchmark Results

2 x Intel Xeon Max 9480 - Supermicro X13DEM v1.10 - Intel Device 1bce

Fedora Linux 38 - 6.2.15-300.fc38.x86_64 - GCC 13.1.1 20230511 + Clang 16.0.3 + LLVM 16.0.3

4 Systems - 134 Benchmark Results

AMD EPYC 9334 32-Core - Supermicro H13SSW - 12 x 64 GB DDR5-4800MT

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

1 System - 650 Benchmark Results

4 Systems - 134 Benchmark Results

AMD EPYC 9334 32-Core - Supermicro H13SSW - 12 x 64 GB DDR5-4800MT

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

Most Popular Test Results

OpenBenchmarking.org Results Compare

4 Systems - 98 Benchmark Results

2 x Intel Xeon Platinum 8490H - Quanta Cloud S6Q-MB-MPS - Intel Device 1bce

Ubuntu 23.10 - 6.6.0-rc5-phx-patched - GNOME Shell 45.0

4 Systems - 134 Benchmark Results

AMD EPYC 9334 32-Core - Supermicro H13SSW - 12 x 64 GB DDR5-4800MT

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

14 Systems - 159 Benchmark Results

Intel Xeon Max 9468 - Quanta Cloud S6Q-MB-MPS - Intel Device 1bce

Ubuntu 23.10 - 6.6.0-rc5-phx-patched - GNOME Shell 45.0

2 Systems - 84 Benchmark Results

AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD 17h-19h PCIe Root Complex

Fedora Linux 39 - 6.5.7-300.fc39.x86_64 - GNOME Shell 45.0

4 Systems - 98 Benchmark Results

AMD EPYC 9334 32-Core - Supermicro H13SSW - 12 x 64 GB DDR5-4800MT

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

3 Systems - 67 Benchmark Results

AMD Ryzen Threadripper 3990X 64-Core - Gigabyte TRX40 AORUS PRO WIFI - AMD Starship

Ubuntu 23.04 - 6.2.0-34-generic - GNOME Shell 44.3

3 Systems - 48 Benchmark Results

AMD Ryzen 7 7840U - Framework FRANMDCP07 - AMD Device 14e8

Ubuntu 23.10 - 6.5.0-5-generic - GNOME Shell 45.0

2 Systems - 123 Benchmark Results

AMD Ryzen 7 7840U - PHX Ray_PEU - AMD Device 14e8

Ubuntu 23.10 - 6.5.0-with-patch2 - GNOME Shell 45.0

2 Systems - 84 Benchmark Results

AMD Ryzen 7 PRO 6850U - LENOVO 21CM0001US - AMD 17h-19h PCIe Root Complex

Fedora Linux 39 - 6.5.7-300.fc39.x86_64 - GNOME Shell 45.0

3 Systems - 48 Benchmark Results

AMD Ryzen 9 7950X 16-Core - ASUS ROG STRIX X670E-E GAMING WIFI - AMD Device 14d8

Ubuntu 23.10 - 6.6.0-060600rc5-generic - GNOME Shell 45.0

2 Systems - 142 Benchmark Results

Intel Core i9-10980XE - ASRock X299 Steel Legend - Intel Sky Lake-E DMI3 Registers

Ubuntu 22.04 - 6.2.0-33-generic - GNOME Shell 42.2

4 Systems - 134 Benchmark Results

AMD EPYC 9334 32-Core - Supermicro H13SSW - 12 x 64 GB DDR5-4800MT

AlmaLinux 9.2 - 5.14.0-284.25.1.el9_2.x86_64 - GCC 11.3.1 20221121

Find More Test Results