FFTW

FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions.
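
For readers unfamiliar with the transform being benchmarked, the following is a minimal sketch of a one-dimensional DFT written directly from its definition. It is not how FFTW computes the result (FFTW uses fast O(n log n) algorithms); the function name `dft` is illustrative only. The sketch assumes the unnormalized forward transform with a negative exponent, which matches FFTW's forward-transform convention.

```python
import cmath

def dft(x):
    """Naive O(n^2) DFT: X[k] = sum over j of x[j] * exp(-2*pi*i*j*k/n)."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]

# A unit impulse has a flat spectrum: every output bin is (approximately) 1.
signal = [1.0, 0.0, 0.0, 0.0]
spectrum = dft(signal)
```

A two-dimensional transform, as used by the "2D FFT Size" options, applies the same one-dimensional transform along each row and then along each column.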

To run this test with the Phoronix Test Suite, the basic command is: phoronix-test-suite benchmark fftw.

Project Site

fftw.org

Test Created

22 January 2015

Last Updated

16 August 2017

Test Maintainer

Michael Larabel 

Test Type

Processor

Average Install Time

1 Minute, 50 Seconds

Average Run Time

2 Minutes, 54 Seconds

Test Dependencies

C/C++ Compiler Toolchain

Accolades

100k+ Downloads

Supported Platforms


[Chart: FFTW Popularity Statistics (pts/fftw), 2015.01 to 2024.05. Series: Public Result Uploads *, Reported Installs **, Reported Test Completions **, Test Profile Page Views ***, Events. Source: OpenBenchmarking.org]
* Uploading of benchmark result data to OpenBenchmarking.org is always optional (opt-in) via the Phoronix Test Suite for users wishing to share their results publicly.
** Data based on those opting to upload their test results to OpenBenchmarking.org and users enabling the opt-in anonymous statistics reporting while running benchmarks from an Internet-connected platform.
*** Test profile page view reporting began March 2021.
Data updated weekly as of 22 June 2024.
Build Option Popularity (OpenBenchmarking.org)

Float + SSE: 49.1%
Stock: 50.9%

Size Option Popularity (OpenBenchmarking.org)

1D FFT Size 32: 8.5%
1D FFT Size 64: 5.1%
1D FFT Size 128: 5.2%
1D FFT Size 256: 5.1%
1D FFT Size 512: 5.3%
1D FFT Size 1024: 5.5%
1D FFT Size 2048: 5.4%
1D FFT Size 4096: 9.2%
2D FFT Size 32: 6.4%
2D FFT Size 64: 5.1%
2D FFT Size 128: 5.4%
2D FFT Size 256: 5.1%
2D FFT Size 512: 5.5%
2D FFT Size 1024: 5.7%
2D FFT Size 2048: 5.6%
2D FFT Size 4096: 11.8%

Revision History

pts/fftw-1.2.0   Wed, 16 Aug 2017 10:29:55 GMT
Update against fftw 3.3.6, add AVX2/AVX512 enables

pts/fftw-1.1.0   Sat, 24 Jan 2015 12:28:44 GMT
Switch to using Mflops as a scale.

pts/fftw-1.0.0   Thu, 22 Jan 2015 11:35:11 GMT
Initial commit of fftw.
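
The Mflops scale adopted in pts/fftw-1.1.0 can be illustrated with a short sketch. FFTW's own benchmark methodology defines "mflops" as a scaled speed figure rather than a measured operation count: 5 N log2(N) divided by the time for one transform in microseconds for complex data, and half of that for real data, where N is the total number of points (the product of the dimensions for a 2D transform). The sketch assumes this test profile reports that same figure; the function name and the 0.1 second timing are hypothetical.

```python
import math

def fftw_mflops(total_points, seconds_per_transform, real_data=False):
    """Scaled speed figure in the style of FFTW's benchmark convention.

    Assumption: 5 * N * log2(N) nominal operations for complex input,
    2.5 * N * log2(N) for real input, divided by the time in microseconds.
    """
    nominal_ops = (2.5 if real_data else 5.0) * total_points * math.log2(total_points)
    microseconds = seconds_per_transform * 1e6
    return nominal_ops / microseconds

# Hypothetical timing: a 4096 x 4096 complex transform taking 0.1 seconds.
example = fftw_mflops(4096 * 4096, 0.1)  # about 20132.66
```

Because the numerator is a fixed nominal count, the figure is best read as a normalized inverse of run time, which makes results comparable across transform sizes.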

Suites Using This Test

C/C++ Compiler Tests

HPC - High Performance Computing

CPU Massive

Scientific Computing


Performance Metrics

Analyze Test Configuration:

FFTW 3.3.6

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.org metrics for this test profile configuration based on 1,577 public results since 16 August 2017 with the latest data as of 25 June 2024.

Below is an overview of generalized performance for components with sufficient statistically significant data from user-uploaded results. Keep in mind that OS configurations can differ vastly, particularly in the Linux/open-source space, so this overview is intended to offer only general guidance on performance expectations.

Component   Percentile Rank   # Compatible Public Results   Mflops (Average)
-           100th             8                             42591 +/- 1527
-           100th             9                             40273 +/- 3112
-           100th             6                             36856 +/- 1015
-           99th              3                             33260 +/- 1940
-           97th              13                            31447 +/- 478
-           97th              29                            31355 +/- 945
-           95th              19                            30643 +/- 2585
-           94th              9                             30085 +/- 530
-           94th              3                             29469 +/- 290
-           93rd              8                             28495 +/- 3090
-           92nd              22                            27469 +/- 1236
-           92nd              12                            27355 +/- 1686
-           91st              6                             26974 +/- 1193
-           91st              6                             26537 +/- 2254
-           90th              3                             26366 +/- 586
-           88th              16                            24865 +/- 1852
-           88th              12                            24498 +/- 1979
-           88th              8                             24188 +/- 2629
-           87th              6                             23759 +/- 1005
-           84th              4                             22290 +/- 3181
-           82nd              4                             21070 +/- 1668
-           82nd              4                             20881 +/- 1536
-           81st              6                             20769 +/- 539
-           80th              4                             20530 +/- 746
-           80th              5                             20526 +/- 2273
-           80th              29                            20358 +/- 1844
-           79th              8                             20085 +/- 637
-           79th              6                             19907 +/- 1868
-           78th              6                             19556 +/- 2026
-           76th              3                             19208 +/- 1178
-           76th              16                            19078 +/- 1669
Mid-Tier    75th              -                             < 19043
-           75th              14                            18895 +/- 1080
-           74th              5                             18780 +/- 474
-           74th              3                             18734 +/- 456
-           72nd              30                            18276 +/- 1066
-           69th              4                             17830 +/- 1639
-           68th              3                             17576 +/- 1101
-           67th              7                             17353 +/- 535
-           66th              7                             17295 +/- 1320
-           66th              9                             17161 +/- 1602
-           65th              3                             17084 +/- 239
-           65th              4                             16990 +/- 278
-           64th              16                            16944 +/- 1200
-           61st              6                             16332 +/- 1325
-           59th              22                            16162 +/- 110
-           58th              8                             16037 +/- 1416
-           56th              12                            15832 +/- 1326
-           56th              5                             15759 +/- 1279
-           56th              11                            15736 +/- 986
-           55th              4                             15703 +/- 621
-           55th              18                            15594 +/- 1013
-           55th              3                             15565 +/- 1340
-           55th              4                             15494 +/- 948
-           55th              6                             15423 +/- 1088
-           54th              6                             15401 +/- 1117
-           54th              6                             15311 +/- 872
-           54th              6                             15286 +/- 787
-           53rd              6                             15256 +/- 1243
-           53rd              7                             15211 +/- 1062
-           52nd              6                             15142 +/- 1061
-           52nd              6                             15100 +/- 1205
-           51st              4                             15039 +/- 1188
Median      50th              -                             14979
-           47th              5                             14747 +/- 371
-           45th              3                             14473 +/- 840
-           44th              7                             14342 +/- 476
-           43rd              6                             14294 +/- 205
-           43rd              11                            14218 +/- 1596
-           37th              17                            13107 +/- 313
-           36th              12                            13000 +/- 1904
-           35th              22                            12849 +/- 520
-           35th              3                             12832 +/- 380
-           34th              3                             12719 +/- 1487
-           34th              4                             12710 +/- 999
-           32nd              3                             12344 +/- 492
-           32nd              6                             12245 +/- 334
-           30th              5                             11882 +/- 98
-           29th              3                             11169 +/- 393
-           28th              5                             10693 +/- 268
-           27th              13                            10636 +/- 148
Low-Tier    25th              -                             < 10297
-           24th              12                            10038 +/- 316
-           23rd              3                             9085 +/- 116
-           22nd              3                             8666 +/- 143
-           20th              11                            6695 +/- 17
-           18th              4                             6188 +/- 88
-           17th              5                             5192 +/- 387
-           14th              3                             4810 +/- 106
-           10th              29                            3558 +/- 23
-           6th               28                            2551 +/- 125
-           3rd               3                             1156 +/- 8
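
To make the Percentile Rank column concrete, here is a minimal sketch of one common definition: the percentage of results in the population that are at or below a given value. The exact method OpenBenchmarking.org uses is not stated on this page, so this definition, the function name, and the sample figures are all assumptions for illustration.

```python
def percentile_rank(value, population):
    """Percentage of population entries that are less than or equal to value."""
    at_or_below = sum(1 for v in population if v <= value)
    return 100.0 * at_or_below / len(population)

# Hypothetical Mflops results, not taken from the public data set.
sample_results = [1000, 2500, 5000, 10000, 12500, 15000, 17500, 20000, 30000, 40000]
rank = percentile_rank(15000, sample_results)  # 6 of 10 results are <= 15000
```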
[Chart: Distribution Of Public Results - Build: Float + SSE - Size: 2D FFT Size 4096. 1577 Results Range From 95 To 52279 Mflops. Source: OpenBenchmarking.org]

Based on OpenBenchmarking.org data, the selected test / test configuration (FFTW 3.3.6 - Build: Float + SSE - Size: 2D FFT Size 4096) has an average run-time of 24 minutes. By default this test profile is set to run at least 3 times but may increase if the standard deviation exceeds pre-defined defaults or other calculations deem additional runs necessary for greater statistical accuracy of the result.
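
The run-count behavior described above can be sketched as a loop that keeps adding runs while the relative standard deviation stays above a threshold. This is only an illustration of the idea, not the Phoronix Test Suite's implementation: the function name, the 3.5% threshold, and the cap of 15 runs are hypothetical values, and the sketch assumes positive results.

```python
import statistics

def run_until_stable(run_once, min_runs=3, max_runs=15, max_rel_stdev_pct=3.5):
    """Run at least min_runs times; add runs while the deviation is too high.

    All default values are hypothetical, chosen only for illustration.
    """
    results = [run_once() for _ in range(min_runs)]
    while len(results) < max_runs:
        mean = statistics.mean(results)
        rel_stdev_pct = 100.0 * statistics.stdev(results) / mean
        if rel_stdev_pct <= max_rel_stdev_pct:
            break  # results agree closely enough; stop adding runs
        results.append(run_once())
    return results

# Three closely agreeing (made-up) Mflops figures: no extra runs are needed.
samples = iter([15000.0, 15100.0, 14900.0])
stable = run_until_stable(lambda: next(samples))
```

With noisier input the loop continues until the deviation falls under the threshold or the cap is reached, which is one reason run times for the same configuration can vary widely.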

[Chart: Time Required To Complete Benchmark - Build: Float + SSE - Size: 2D FFT Size 4096. Run-Time in Minutes: Min: 3 / Avg: 23.33 / Max: 92. Source: OpenBenchmarking.org]

Based on public OpenBenchmarking.org results, the selected test / test configuration has an average standard deviation of 1%.

[Chart: Average Deviation Between Runs - Build: Float + SSE - Size: 2D FFT Size 4096. Deviation in Percent, Fewer Is Better: Min: 0 / Avg: 1.01 / Max: 7. Source: OpenBenchmarking.org]

Does It Scale Well With Increasing Cores?

No. Based on automated analysis of the collected public benchmark data, this test configuration does not generally scale well with increasing CPU core counts. The analysis uses publicly available results for this configuration: results are separated by vendor, divided by the reference CPU clock speed, grouped by matching physical CPU core count, and normalized against the smallest core count tested from each vendor, for each CPU having a sufficient number of test samples and statistically significant data.
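
The normalization steps described above can be sketched as follows. This is an assumed reading of the description, not OpenBenchmarking.org's actual code: the record layout, the function name, and the sample figures are hypothetical, and the filtering for sample size and statistical significance is omitted.

```python
from collections import defaultdict
from statistics import mean

def relative_core_scaling(records):
    """records: iterable of (vendor, physical_cores, clock_ghz, result)."""
    per_group = defaultdict(list)
    for vendor, cores, clock_ghz, result in records:
        # Divide each result by the reference clock speed, grouped by core count.
        per_group[(vendor, cores)].append(result / clock_ghz)
    averaged = {key: mean(values) for key, values in per_group.items()}

    scaling = {}
    for vendor in {v for v, _ in averaged}:
        counts = sorted(c for v, c in averaged if v == vendor)
        base = averaged[(vendor, counts[0])]  # smallest core count for this vendor
        scaling[vendor] = {c: averaged[(vendor, c)] / base for c in counts}
    return scaling

# Made-up sample data: doubling the cores yields only a 1.25x relative result.
sample = [
    ("Intel", 4, 4.0, 16000.0),
    ("Intel", 8, 4.0, 20000.0),
    ("AMD", 8, 3.0, 12000.0),
    ("AMD", 16, 3.0, 15000.0),
]
scaling = relative_core_scaling(sample)
```

A value that stays close to 1.0 as the core count grows indicates that the configuration gains little from additional cores.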

[Chart: FFTW CPU Core Scaling - Build: Float + SSE - Size: 2D FFT Size 4096. Relative Core Scaling To Base by physical core count (2 to 64) for Intel and AMD. Source: OpenBenchmarking.org]

Tested CPU Architectures

This benchmark has been successfully tested on the architectures listed below. The listed CPU architectures are those from which successful OpenBenchmarking.org result uploads occurred; the list is intended to help determine whether a given test is compatible with various alternative CPU architectures.

CPU Architecture             Kernel Identifier   Verified On
Intel / AMD x86 64-bit       x86_64              (Many Processors)
SPARC64                      sparc64             (Many Processors)
IBM Z                        s390x               (Many Processors)
IBM POWER (PowerPC) 64-bit   ppc64le             POWER9 16-Core
MIPS 64-bit                  mips64              ICT Loongson-3A R3
Intel / AMD x86 32-bit       i686                (Many Processors)
ARMv7 32-bit                 armv7l              ARMv7 rev 4 4-Core
DEC Alpha                    alpha               Alpha
ARMv8 64-bit                 aarch64             ARMv8, ARMv8 Cortex-A76 4-Core, ARMv8 Neoverse-N1 160-Core

Recent Test Results


1 System - 32 Benchmark Results

2 x AMD EPYC 7763 64-Core - Supermicro Super Server H12DSi-NT6 v1.02 - AMD Starship

Ubuntu 24.04 - 6.8.0-35-generic - GNOME Shell 46.0

9 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

8 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

7 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

6 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

5 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

4 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

3 Systems - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

1 System - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

1 System - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-437.el9.x86_64 - X Server

1 System - 34 Benchmark Results

AMD Ryzen 7 2700X Eight-Core - Gigabyte B450 AORUS M - AMD 17h

Pop 22.04 - 6.8.0-76060800daily20240311-generic - GNOME Shell 42.5

1 System - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-435.el9.x86_64 - X Server

1 System - 32 Benchmark Results

2 x Intel Xeon E5-2620 v2 - ASUS Z9PE-D8 WS - Intel Xeon E7 v2

CentOS Stream 9 - 5.14.0-432.el9.x86_64 - X Server

76 Systems - 921 Benchmark Results

Intel Core i9-14900K - ASUS PRIME Z790-P - Intel Device 7a27

SystemRescue 10.01 - 6.1.30-1-lts - X Server 1.21.1.8

1 System - 98 Benchmark Results

Intel Core i9-14900K - ASUS PRIME Z790-P - Intel Device 7a27

SystemRescue 10.01 - 6.1.30-1-lts - X Server 1.21.1.8

Most Popular Test Results


6 Systems - 1421 Benchmark Results

Unknown - Marvell Armada 3720 Board - 2048MB

Ubuntu 16.04 - 4.4.52-armada-17.06.2-g12feccb - GCC 5.4.0 20160609

12 Systems - 593 Benchmark Results

AMD Ryzen 9 3950X 16-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 20.04 - 5.8.0-050800daily20200622-generic - GNOME Shell 3.36.2

4 Systems - 131 Benchmark Results

Intel Xeon E-2278GEL - Logic Supply RXM-181 TBD by OEM - Intel

FreeBSD - 13.0-BETA1 - Clang 11.0.1

2 Systems - 123 Benchmark Results

Intel Core i7-8700K - ASUS TUF Z370-PLUS GAMING - Intel 8th Gen Core

Clear Linux OS 29920 - 5.1.9-781.native - GNOME Shell 3.32.2

8 Systems - 439 Benchmark Results

AMD Ryzen 7 5800X 8-Core - ASUS ROG CROSSHAIR VIII HERO - AMD Starship

Ubuntu 21.04 - 5.12.0-051200rc3daily20210315-generic - GNOME Shell 3.38.3

3 Systems - 56 Benchmark Results

AMD Ryzen 5 2400G with Radeon Vega - Gigabyte AX370-Gaming 5 - AMD Device 15d0

Ubuntu 17.10 - 4.15.1-041501-generic - GNOME Shell 3.26.2

3 Systems - 376 Benchmark Results

2 x AMD EPYC 7F72 24-Core - Supermicro H11DSi-NT v2.00 - AMD Starship

Ubuntu 20.10 - 5.11.0-rc4-max-boost-inv-patch - GNOME Shell 3.38.1

1 System - 62 Benchmark Results

ICT Loongson-3A R3 - Unknown - AMD RS780 + SB7x0

Loongnix 1.0 - 3.10.84-16.fc21.loongson.mips64el - MATE 1.8.1

1 System - 748 Benchmark Results

Intel Core i7-7700K - MSI Z270 GAMING M7 - Intel Intel Kaby Lake + Z270

Ubuntu 18.04 - 4.15.0-23-generic - GNOME Shell 3.28.1

3 Systems - 301 Benchmark Results

Intel Core i5-7600K - Gigabyte Z270M-D3H-CF - Intel Xeon E3-1200 v6

Ubuntu 20.04 - 5.4.0-40-generic - GNOME Shell 3.36.3
